{"id":204886,"date":"2025-01-29T23:07:12","date_gmt":"2025-01-30T05:07:12","guid":{"rendered":"https:\/\/lifeboat.com\/blog\/2025\/01\/hold-off-on-your-panic-until-ai-passes-this-test"},"modified":"2025-01-29T23:07:12","modified_gmt":"2025-01-30T05:07:12","slug":"hold-off-on-your-panic-until-ai-passes-this-test","status":"publish","type":"post","link":"https:\/\/lifeboat.com\/blog\/2025\/01\/hold-off-on-your-panic-until-ai-passes-this-test","title":{"rendered":"Hold off on your panic \u2014 until AI passes this test"},"content":{"rendered":"<p><a class=\"aligncenter blog-photo\" href=\"https:\/\/lifeboat.com\/blog.images\/hold-off-on-your-panic-until-ai-passes-this-test.jpg\"><\/a><\/p>\n<p>While DeepSeek makes AI cheaper, seemingly without cutting corners on quality, a group is trying to figure out how to make tests for AI models that are hard enough. It\u2019s \u2018Humanity\u2019s Last Exam\u2019<\/p>\n<p>If you\u2019re looking for a new reason to be nervous about artificial intelligence, try this: Some of the smartest humans in the world are struggling to create tests that AI systems can\u2019t pass.<\/p>\n<p>For years, AI systems were measured by giving new models a variety of standardized benchmark tests. Many of these tests consisted of challenging, SAT-calibre problems in areas like math, science and logic. Comparing the models\u2019 scores over time served as a rough measure of AI progress.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>While DeepSeek makes AI cheaper, seemingly without cutting corners on quality, a group is trying to figure out how to make tests for AI models that are hard enough. It\u2019s \u2018Humanity\u2019s Last Exam\u2019 If you\u2019re looking for a new reason to be nervous about artificial intelligence, try this: Some of the smartest humans in the [\u2026]<\/p>\n","protected":false},"author":662,"featured_media":0,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[2229,6],"tags":[],"class_list":["post-204886","post","type-post","status-publish","format-standard","hentry","category-mathematics","category-robotics-ai"],"_links":{"self":[{"href":"https:\/\/lifeboat.com\/blog\/wp-json\/wp\/v2\/posts\/204886","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/lifeboat.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/lifeboat.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/lifeboat.com\/blog\/wp-json\/wp\/v2\/users\/662"}],"replies":[{"embeddable":true,"href":"https:\/\/lifeboat.com\/blog\/wp-json\/wp\/v2\/comments?post=204886"}],"version-history":[{"count":0,"href":"https:\/\/lifeboat.com\/blog\/wp-json\/wp\/v2\/posts\/204886\/revisions"}],"wp:attachment":[{"href":"https:\/\/lifeboat.com\/blog\/wp-json\/wp\/v2\/media?parent=204886"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/lifeboat.com\/blog\/wp-json\/wp\/v2\/categories?post=204886"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/lifeboat.com\/blog\/wp-json\/wp\/v2\/tags?post=204886"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}