{"id":224879,"date":"2025-11-11T05:04:06","date_gmt":"2025-11-11T11:04:06","guid":{"rendered":"https:\/\/lifeboat.com\/blog\/2025\/11\/ai-evaluates-texts-without-bias-until-the-source-is-revealed"},"modified":"2025-11-11T05:04:06","modified_gmt":"2025-11-11T11:04:06","slug":"ai-evaluates-texts-without-bias-until-the-source-is-revealed","status":"publish","type":"post","link":"https:\/\/lifeboat.com\/blog\/2025\/11\/ai-evaluates-texts-without-bias-until-the-source-is-revealed","title":{"rendered":"AI evaluates texts without bias\u2014until the source is revealed"},"content":{"rendered":"<p><a class=\"aligncenter blog-photo\" href=\"https:\/\/lifeboat.com\/blog.images\/ai-evaluates-texts-without-bias-until-the-source-is-revealed2.jpg\"><\/a><\/p>\n<p>Large language models (LLMs) are increasingly used not only to generate content but also to evaluate it. They are asked to grade essays, moderate social media content, summarize reports, screen job applications and much more.<\/p>\n<p>However, there are heated discussions\u2014in the media as well as in academia\u2014about whether such evaluations are consistent and unbiased. Some LLMs are under suspicion of promoting certain political agendas. For example, Deepseek is often characterized as having a pro-Chinese perspective and Open AI as being \u201cwoke.\u201d<\/p>\n<p>Although these beliefs are widely discussed, they are so far unsubstantiated. UZH-researchers Federico Germani and Giovanni Spitale have now investigated whether LLMs really exhibit systematic biases when evaluating texts. Their results, <a href=\"https:\/\/www.science.org\/doi\/10.1126\/sciadv.adz2924\" target=\"_blank\">published<\/a> in <i>Science Advances<\/i>, show that LLMs indeed deliver biased judgments\u2014but only when information about the source or author of the evaluated message is revealed.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Large language models (LLMs) are increasingly used not only to generate content but also to evaluate it. They are asked to grade essays, moderate social media content, summarize reports, screen job applications and much more. However, there are heated discussions\u2014in the media as well as in academia\u2014about whether such evaluations are consistent and unbiased. Some [\u2026]<\/p>\n","protected":false},"author":662,"featured_media":0,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[6],"tags":[],"class_list":["post-224879","post","type-post","status-publish","format-standard","hentry","category-robotics-ai"],"_links":{"self":[{"href":"https:\/\/lifeboat.com\/blog\/wp-json\/wp\/v2\/posts\/224879","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/lifeboat.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/lifeboat.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/lifeboat.com\/blog\/wp-json\/wp\/v2\/users\/662"}],"replies":[{"embeddable":true,"href":"https:\/\/lifeboat.com\/blog\/wp-json\/wp\/v2\/comments?post=224879"}],"version-history":[{"count":0,"href":"https:\/\/lifeboat.com\/blog\/wp-json\/wp\/v2\/posts\/224879\/revisions"}],"wp:attachment":[{"href":"https:\/\/lifeboat.com\/blog\/wp-json\/wp\/v2\/media?parent=224879"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/lifeboat.com\/blog\/wp-json\/wp\/v2\/categories?post=224879"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/lifeboat.com\/blog\/wp-json\/wp\/v2\/tags?post=224879"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}