{"id":178280,"date":"2023-12-14T10:27:30","date_gmt":"2023-12-14T16:27:30","guid":{"rendered":"https:\/\/lifeboat.com\/blog\/2023\/12\/googles-new-ai-gemini-beats-chatgpt-in-30-of-32-test-categories"},"modified":"2023-12-14T10:27:30","modified_gmt":"2023-12-14T16:27:30","slug":"googles-new-ai-gemini-beats-chatgpt-in-30-of-32-test-categories","status":"publish","type":"post","link":"https:\/\/lifeboat.com\/blog\/2023\/12\/googles-new-ai-gemini-beats-chatgpt-in-30-of-32-test-categories","title":{"rendered":"Google\u2019s New AI, Gemini, Beats ChatGPT In 30 Of 32 Test Categories"},"content":{"rendered":"<p><a class=\"aligncenter blog-photo\" href=\"https:\/\/lifeboat.com\/blog.images\/googles-new-ai-gemini-beats-chatgpt-in-30-of-32-test-categories.jpg\"><\/a><\/p>\n<p>Google has released a new Pro model of its latest AI, Gemini, and company sources say it has outperformed GPT-3.5 (the free version of ChatGPT) in widespread testing. According to performance reports, Gemini Ultra <a href=\"https:\/\/blog.google\/technology\/ai\/google-gemini-ai\/#performance\" target=\"_blank\" class=\"\" title=\"https:\/\/blog.google\/technology\/ai\/google-gemini-ai\/#performance\" rel=\"nofollow noopener noreferrer\" aria-label=\"exceeds current state-of-the-art results\">exceeds current state-of-the-art results<\/a> on 30 of the 32 widely-used academic benchmarks used in large language model (LLM) research and development. Google has been accused of lagging behind OpenAI\u2019s ChatGPT, widely regarded as <a href=\"https:\/\/www.forbes.com\/sites\/chriswestfall\/2023\/11\/16\/new-research-shows-chatgpt-reigns-supreme-in-ai-tool-sector\/\" target=\"_self\" class=\"\" title=\"https:\/\/www.forbes.com\/sites\/chriswestfall\/2023\/11\/16\/new-research-shows-chatgpt-reigns-supreme-in-ai-tool-sector\/\" aria-label=\"the most popular\">the most popular<\/a> and powerful in the AI space. Google says Gemini was trained to be multimodal, meaning it can process different types of media such as text, pictures, video, and audio.<\/p>\n<p><em>Insider<\/em> also reports that, with a score of 90.0%, Gemini Ultra is the first model to outperform human experts on <a href=\"https:\/\/arxiv.org\/abs\/2009.03300\" target=\"_blank\" class=\"\" title=\"https:\/\/arxiv.org\/abs\/2009.03300\" rel=\"nofollow noopener noreferrer\" aria-label=\"MMLU\">MMLU<\/a> (massive multitask language understanding), which uses a combination of 57 subjects such as math, physics, history, law, medicine and ethics for testing both world knowledge and <a href=\"https:\/\/www.businessinsider.com\/google-gemini-ai-model-release-ultra-pro-bard-openai-tpu-2023-12\" target=\"_blank\" class=\"\" title=\"https:\/\/www.businessinsider.com\/google-gemini-ai-model-release-ultra-pro-bard-openai-tpu-2023-12\" rel=\"nofollow noopener noreferrer\" aria-label=\"problem-solving abilities\">problem-solving abilities<\/a>.<\/p>\n<p>The Google-based AI comes in three sizes, or stages, for the Gemini platform: Ultra, which is the flagship model, Pro and <a href=\"https:\/\/techcrunch.com\/2023\/12\/06\/googles-gemini-isnt-the-generative-ai-model-we-expected\/\" target=\"_blank\" class=\"\" title=\"https:\/\/techcrunch.com\/2023\/12\/06\/googles-gemini-isnt-the-generative-ai-model-we-expected\/\" rel=\"nofollow noopener noreferrer\" aria-label=\"Nano (designed for mobile devices)\">Nano (designed for mobile devices)<\/a>. According to reports from TechCrunch, the company says it\u2019s making Gemini Pro available to enterprise customers through its Vertex AI program, and for developers in AI Studio, on December 13. Reports indicate that the Pro version can also be accessed via Bard, the company\u2019s chatbot interface.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Google has released a new Pro model of its latest AI, Gemini, and company sources say it has outperformed GPT-3.5 (the free version of ChatGPT) in widespread testing. According to performance reports, Gemini Ultra exceeds current state-of-the-art results on 30 of the 32 widely-used academic benchmarks used in large language model (LLM) research and development. [\u2026]<\/p>\n","protected":false},"author":578,"featured_media":0,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[11,30,1496,2229,6],"tags":[],"class_list":["post-178280","post","type-post","status-publish","format-standard","hentry","category-biotech-medical","category-ethics","category-law","category-mathematics","category-robotics-ai"],"_links":{"self":[{"href":"https:\/\/lifeboat.com\/blog\/wp-json\/wp\/v2\/posts\/178280","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/lifeboat.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/lifeboat.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/lifeboat.com\/blog\/wp-json\/wp\/v2\/users\/578"}],"replies":[{"embeddable":true,"href":"https:\/\/lifeboat.com\/blog\/wp-json\/wp\/v2\/comments?post=178280"}],"version-history":[{"count":0,"href":"https:\/\/lifeboat.com\/blog\/wp-json\/wp\/v2\/posts\/178280\/revisions"}],"wp:attachment":[{"href":"https:\/\/lifeboat.com\/blog\/wp-json\/wp\/v2\/media?parent=178280"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/lifeboat.com\/blog\/wp-json\/wp\/v2\/categories?post=178280"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/lifeboat.com\/blog\/wp-json\/wp\/v2\/tags?post=178280"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}