{"id":218498,"date":"2025-07-22T20:11:03","date_gmt":"2025-07-23T01:11:03","guid":{"rendered":"https:\/\/lifeboat.com\/blog\/2025\/07\/ai-vision-reinvented-vision-language-models-gain-clearer-sight-through-synthetic-training-data"},"modified":"2025-07-22T20:11:03","modified_gmt":"2025-07-23T01:11:03","slug":"ai-vision-reinvented-vision-language-models-gain-clearer-sight-through-synthetic-training-data","status":"publish","type":"post","link":"https:\/\/lifeboat.com\/blog\/2025\/07\/ai-vision-reinvented-vision-language-models-gain-clearer-sight-through-synthetic-training-data","title":{"rendered":"AI vision, reinvented: Vision-language models gain clearer sight through synthetic training data"},"content":{"rendered":"<p><a class=\"aligncenter blog-photo\" href=\"https:\/\/lifeboat.com\/blog.images\/ai-vision-reinvented-vision-language-models-gain-clearer-sight-through-synthetic-training-data2.jpg\"><\/a><\/p>\n<p>In the race to develop AI that understands complex images like financial forecasts, medical diagrams and nutrition labels\u2014essential for AI to operate independently in everyday settings\u2014closed-source systems like ChatGPT and Claude currently set the pace. But no one outside their makers knows how those models were trained or what data they used, leaving open-source alternatives scrambling to catch up.<\/p>\n<p>Now, researchers at Penn Engineering and the Allen Institute for AI (Ai2) have developed a new approach to train open-source models: using AI to create scientific figures, charts and tables that teach other AI systems how to interpret complex visual information.<\/p>\n<p>Their tool, CoSyn (short for Code-Guided Synthesis), taps open-source AI models\u2019 coding skills to render text-rich images and generate relevant questions and answers, giving other AI systems the data they need to learn how to \u201csee\u201d and understand scientific figures.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>In the race to develop AI that understands complex images like financial forecasts, medical diagrams and nutrition labels\u2014essential for AI to operate independently in everyday settings\u2014closed-source systems like ChatGPT and Claude currently set the pace. But no one outside their makers knows how those models were trained or what data they used, leaving open-source alternatives [\u2026]<\/p>\n","protected":false},"author":718,"featured_media":0,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[11,45,6],"tags":[],"class_list":["post-218498","post","type-post","status-publish","format-standard","hentry","category-biotech-medical","category-finance","category-robotics-ai"],"_links":{"self":[{"href":"https:\/\/lifeboat.com\/blog\/wp-json\/wp\/v2\/posts\/218498","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/lifeboat.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/lifeboat.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/lifeboat.com\/blog\/wp-json\/wp\/v2\/users\/718"}],"replies":[{"embeddable":true,"href":"https:\/\/lifeboat.com\/blog\/wp-json\/wp\/v2\/comments?post=218498"}],"version-history":[{"count":0,"href":"https:\/\/lifeboat.com\/blog\/wp-json\/wp\/v2\/posts\/218498\/revisions"}],"wp:attachment":[{"href":"https:\/\/lifeboat.com\/blog\/wp-json\/wp\/v2\/media?parent=218498"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/lifeboat.com\/blog\/wp-json\/wp\/v2\/categories?post=218498"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/lifeboat.com\/blog\/wp-json\/wp\/v2\/tags?post=218498"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}