{"id":173331,"date":"2023-10-02T18:35:12","date_gmt":"2023-10-02T23:35:12","guid":{"rendered":"https:\/\/lifeboat.com\/blog\/2023\/10\/generative-ai-will-far-surpass-what-chatgpt-can-do-heres-everything-on-how-the-tech-advances"},"modified":"2023-10-02T18:35:12","modified_gmt":"2023-10-02T23:35:12","slug":"generative-ai-will-far-surpass-what-chatgpt-can-do-heres-everything-on-how-the-tech-advances","status":"publish","type":"post","link":"https:\/\/lifeboat.com\/blog\/2023\/10\/generative-ai-will-far-surpass-what-chatgpt-can-do-heres-everything-on-how-the-tech-advances","title":{"rendered":"Generative AI will far surpass what ChatGPT can do. Here\u2019s everything on how the tech advances"},"content":{"rendered":"<p><a class=\"aligncenter blog-photo\" href=\"https:\/\/lifeboat.com\/blog.images\/generative-ai-will-far-surpass-what-chatgpt-can-do-heres-everything-on-how-the-tech-advances2.jpg\"><\/a><\/p>\n<p>Scholars at Carnegie Mellon University <a href=\"https:\/\/openreview.net\/forum?id=ttzypy3kT7\" target=\"_blank\" rel=\"noopener noreferrer nofollow\" class=\"\">recently offered<\/a> what they call a \u201cHigh-Modality Multimodal Transformer,\u201d which combines not just text, image, video, and speech but also database table information and time series data. Lead author Paul Pu Liang and colleagues report that they observed \u201ca crucial scaling behavior\u201d of the 10-mode neural network. \u201cPerformance continues to improve with each modality added, and it transfers to entirely new modalities and tasks.\u201d<\/p>\n<p>Scholars Yiyuan Zhang and colleagues at the Multimedia Lab of The Chinese University of Hong Kong boosted the number of modalities to a dozen in their Meta-Transformer. Its point clouds model 3D vision, while its hyper-spectral sensing data represents electromagnetic energy reflected back from the ground to fly-over images of landscapes.<\/p>\n<p>The immediate payoff of multi-modality will simply be to enrich the output of a thing such as ChatGPT in ways that go far beyond the \u201cdemo\u201d mode. A children\u2019s storybook, a book with text passages combined with pictures illustrating the text, is one immediate example. By combining the language and image attributes, the kinds of pictures created by the diffusion process can be more subtly controlled from picture to picture.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Scholars at Carnegie Mellon University recently offered what they call a \u201cHigh-Modality Multimodal Transformer,\u201d which combines not just text, image, video, and speech but also database table information and time series data. Lead author Paul Pu Liang and colleagues report that they observed \u201ca crucial scaling behavior\u201d of the 10-mode neural network. \u201cPerformance continues to [\u2026]<\/p>\n","protected":false},"author":396,"featured_media":0,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[6],"tags":[],"class_list":["post-173331","post","type-post","status-publish","format-standard","hentry","category-robotics-ai"],"_links":{"self":[{"href":"https:\/\/lifeboat.com\/blog\/wp-json\/wp\/v2\/posts\/173331","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/lifeboat.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/lifeboat.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/lifeboat.com\/blog\/wp-json\/wp\/v2\/users\/396"}],"replies":[{"embeddable":true,"href":"https:\/\/lifeboat.com\/blog\/wp-json\/wp\/v2\/comments?post=173331"}],"version-history":[{"count":0,"href":"https:\/\/lifeboat.com\/blog\/wp-json\/wp\/v2\/posts\/173331\/revisions"}],"wp:attachment":[{"href":"https:\/\/lifeboat.com\/blog\/wp-json\/wp\/v2\/media?parent=173331"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/lifeboat.com\/blog\/wp-json\/wp\/v2\/categories?post=173331"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/lifeboat.com\/blog\/wp-json\/wp\/v2\/tags?post=173331"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}