{"id":150896,"date":"2022-11-23T15:23:05","date_gmt":"2022-11-23T21:23:05","guid":{"rendered":"https:\/\/lifeboat.com\/blog\/2022\/11\/a-simpler-path-to-better-computer-vision"},"modified":"2022-11-23T15:23:05","modified_gmt":"2022-11-23T21:23:05","slug":"a-simpler-path-to-better-computer-vision","status":"publish","type":"post","link":"https:\/\/lifeboat.com\/blog\/2022\/11\/a-simpler-path-to-better-computer-vision","title":{"rendered":"A simpler path to better computer vision"},"content":{"rendered":"<p><a class=\"aligncenter blog-photo\" href=\"https:\/\/lifeboat.com\/blog.images\/a-simpler-path-to-better-computer-vision2.jpg\"><\/a><\/p>\n<p>Before a machine-learning model can complete a task, such as identifying cancer in medical images, the model must be trained. Training image classification models typically involves showing the model millions of example images gathered into a massive dataset.<\/p>\n<p>However, using real image data can raise practical and <a href=\"https:\/\/techxplore.com\/tags\/ethical+concerns\/\" rel=\"tag\" class=\"\">ethical concerns<\/a>: The images could run afoul of copyright laws, violate people\u2019s privacy, or be biased against a certain racial or ethnic group. To avoid these pitfalls, researchers can use image generation programs to create <a href=\"https:\/\/techxplore.com\/tags\/synthetic+data\/\" rel=\"tag\" class=\"\">synthetic data<\/a> for model training. But these techniques are limited because expert knowledge is often needed to hand-design an image generation program that can create effective training data.<\/p>\n<p>Researchers from MIT, the MIT-IBM Watson AI Lab, and elsewhere took a different approach. Instead of designing customized image generation programs for a particular training task, they gathered a dataset of 21,000 publicly available programs from the internet. Then they used this large collection of basic image generation programs to train a computer vision model.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Before a machine-learning model can complete a task, such as identifying cancer in medical images, the model must be trained. Training image classification models typically involves showing the model millions of example images gathered into a massive dataset. However, using real image data can raise practical and ethical concerns: The images could run afoul of [\u2026]<\/p>\n","protected":false},"author":359,"featured_media":0,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[11,418,6],"tags":[],"class_list":["post-150896","post","type-post","status-publish","format-standard","hentry","category-biotech-medical","category-internet","category-robotics-ai"],"_links":{"self":[{"href":"https:\/\/lifeboat.com\/blog\/wp-json\/wp\/v2\/posts\/150896","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/lifeboat.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/lifeboat.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/lifeboat.com\/blog\/wp-json\/wp\/v2\/users\/359"}],"replies":[{"embeddable":true,"href":"https:\/\/lifeboat.com\/blog\/wp-json\/wp\/v2\/comments?post=150896"}],"version-history":[{"count":0,"href":"https:\/\/lifeboat.com\/blog\/wp-json\/wp\/v2\/posts\/150896\/revisions"}],"wp:attachment":[{"href":"https:\/\/lifeboat.com\/blog\/wp-json\/wp\/v2\/media?parent=150896"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/lifeboat.com\/blog\/wp-json\/wp\/v2\/categories?post=150896"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/lifeboat.com\/blog\/wp-json\/wp\/v2\/tags?post=150896"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}