{"id":150030,"date":"2022-11-13T00:02:54","date_gmt":"2022-11-13T06:02:54","guid":{"rendered":"https:\/\/lifeboat.com\/blog\/2022\/11\/boston-university-and-google-researchers-introduce-an-artificial-intelligence-ai-based-method-to-illustrate-articles-with-visual-summarizes"},"modified":"2022-11-13T00:02:54","modified_gmt":"2022-11-13T06:02:54","slug":"boston-university-and-google-researchers-introduce-an-artificial-intelligence-ai-based-method-to-illustrate-articles-with-visual-summarizes","status":"publish","type":"post","link":"https:\/\/lifeboat.com\/blog\/2022\/11\/boston-university-and-google-researchers-introduce-an-artificial-intelligence-ai-based-method-to-illustrate-articles-with-visual-summarizes","title":{"rendered":"Boston University and Google Researchers Introduce An Artificial Intelligence (AI) Based Method To Illustrate Articles With Visual Summarizes"},"content":{"rendered":"<p><a class=\"aligncenter blog-photo\" href=\"https:\/\/lifeboat.com\/blog.images\/boston-university-and-google-researchers-introduce-an-artificial-intelligence-ai-based-method-to-illustrate-articles-with-visual-summarizes3.jpg\"><\/a><\/p>\n<p>Recent progress in generative models has paved the way to a manifold of tasks that some years ago were only imaginable. With the help of large-scale image-text datasets, generative models can learn powerful representations exploited in fields such as text-to-image or image-to-text translation.<\/p>\n<p>The recent release of Stable Diffusion and the DALL-E API led to great excitement around text-to-image generative models capable of generating complex and stunning novel images from an input descriptive text, similar to performing a search on the internet.<\/p>\n<p>With the rising interest in the reverse task, i.e., image-to-text translation, several studies tried to generate captions from input images. These methods often presume a one-to-one correspondence between pictures and their captions. However, multiple images can be connected to and paired with a long text narrative, such as photos in a news article. Therefore, the need for illustrative correspondences (e.g., \u201ctravel\u201d or \u201cvacation\u201d) rather than literal one-to-one captions (e.g., \u201cairplane flying\u201d).<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Recent progress in generative models has paved the way to a manifold of tasks that some years ago were only imaginable. With the help of large-scale image-text datasets, generative models can learn powerful representations exploited in fields such as text-to-image or image-to-text translation. The recent release of Stable Diffusion and the DALL-E API led to [\u2026]<\/p>\n","protected":false},"author":556,"featured_media":0,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[6,1491],"tags":[],"class_list":["post-150030","post","type-post","status-publish","format-standard","hentry","category-robotics-ai","category-transportation"],"_links":{"self":[{"href":"https:\/\/lifeboat.com\/blog\/wp-json\/wp\/v2\/posts\/150030","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/lifeboat.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/lifeboat.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/lifeboat.com\/blog\/wp-json\/wp\/v2\/users\/556"}],"replies":[{"embeddable":true,"href":"https:\/\/lifeboat.com\/blog\/wp-json\/wp\/v2\/comments?post=150030"}],"version-history":[{"count":0,"href":"https:\/\/lifeboat.com\/blog\/wp-json\/wp\/v2\/posts\/150030\/revisions"}],"wp:attachment":[{"href":"https:\/\/lifeboat.com\/blog\/wp-json\/wp\/v2\/media?parent=150030"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/lifeboat.com\/blog\/wp-json\/wp\/v2\/categories?post=150030"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/lifeboat.com\/blog\/wp-json\/wp\/v2\/tags?post=150030"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}