{"id":1458,"date":"2025-01-22T05:06:10","date_gmt":"2025-01-22T05:06:10","guid":{"rendered":"https:\/\/serisec.com\/index.php\/2025\/01\/22\/ai-mistakes-are-very-different-from-human-mistakes-html\/"},"modified":"2025-01-22T05:06:10","modified_gmt":"2025-01-22T05:06:10","slug":"ai-mistakes-are-very-different-from-human-mistakes-html","status":"publish","type":"post","link":"https:\/\/serisec.com\/index.php\/2025\/01\/22\/ai-mistakes-are-very-different-from-human-mistakes-html\/","title":{"rendered":"AI Mistakes Are Very Different from Human Mistakes"},"content":{"rendered":"\n<div>AI Mistakes Are Very Different from Human Mistakes<\/div>\n<p> \t<BR><br \/>\n<BR><\/BR><br \/>\n    <!-- no image --><br \/>\n \t<BR><br \/>\n<BR><\/BR><\/p>\n<div>\n<p>Humans make mistakes all the time. All of us do, every day, in tasks both new and routine. Some of our mistakes are minor and some are catastrophic. Mistakes can break trust with our friends, lose the confidence of our bosses, and sometimes be the difference between life and death.<\/p>\n<p>Over the millennia, we have created security systems to deal with the sorts of mistakes humans commonly make. These days, casinos rotate their dealers regularly, because they make mistakes if they do the same task for too long. Hospital personnel write on limbs before surgery so that doctors operate on the correct body part, and they count surgical instruments to make sure none were left inside the body. From copyediting to double-entry bookkeeping to appellate courts, we humans have gotten really good at correcting human mistakes.<\/p>\n<p>Humanity is now rapidly integrating a wholly different kind of mistake-maker into society: AI. Technologies like <a href=\"https:\/\/spectrum.ieee.org\/tag\/llms\">large language models<\/a> (LLMs) can perform many cognitive tasks traditionally fulfilled by humans, but they make plenty of mistakes. It seems <a href=\"https:\/\/www.buzzfeed.com\/carleysuthers\/weird-and-wrong-ai-responses\">ridiculous<\/a> when chatbots tell you to eat rocks or add glue to pizza. But it\u2019s not the frequency or severity of AI systems\u2019 mistakes that differentiates them from human mistakes. It\u2019s their weirdness. AI systems do not make mistakes in the same ways that humans do.<\/p>\n<p>Much of the friction\u2014and risk\u2014associated with our use of AI arise from that difference. We need to invent new <a href=\"https:\/\/spectrum.ieee.org\/tag\/security\">security<\/a> systems that adapt to these differences and prevent harm from AI mistakes.<\/p>\n<h3>Human Mistakes vs AI Mistakes<\/h3>\n<p>Life experience makes it fairly easy for each of us to guess when and where humans will make mistakes. Human errors tend to come at the edges of someone\u2019s knowledge: Most of us would make mistakes solving calculus problems. We expect human mistakes to be clustered: A single calculus mistake is likely to be accompanied by others. We expect mistakes to wax and wane, predictably depending on factors such as fatigue and distraction. And mistakes are often accompanied by ignorance: Someone who makes calculus mistakes is also likely to respond \u201cI don\u2019t know\u201d to calculus-related questions.<\/p>\n<p>To the extent that AI systems make these human-like mistakes, we can bring all of our mistake-correcting systems to bear on their output. But the current crop of AI models\u2014particularly LLMs\u2014make mistakes differently.<\/p>\n<p>AI errors come at seemingly random times, without any clustering around particular topics. LLM mistakes tend to be more evenly distributed through the knowledge space. A model might be equally likely to make a mistake on a calculus question as it is to propose that <a href=\"https:\/\/arxiv.org\/html\/2405.19616v1\">cabbages<\/a> eat goats.<\/p>\n<p>And AI mistakes aren\u2019t accompanied by ignorance. A LLM will be <a href=\"https:\/\/spectrum.ieee.org\/chatgpt-reliability\">just as confident<\/a> when saying something completely wrong\u2014and obviously so, to a human\u2014as it will be when saying something true. The seemingly random <a href=\"https:\/\/arxiv.org\/pdf\/2305.14279\">inconsistency<\/a> of LLMs makes it hard to trust their reasoning in complex, multi-step problems. If you want to use an AI model to help with a business problem, it\u2019s not enough to see that it understands what factors make a product profitable; you need to be sure it won\u2019t forget what money is.<\/p>\n<h3>How to Deal with AI Mistakes<\/h3>\n<p>This situation indicates two possible areas of research. The first is to engineer LLMs that make more human-like mistakes. The second is to build new mistake-correcting systems that deal with the specific sorts of mistakes that LLMs tend to make.<\/p>\n<p>We already have some tools to lead LLMs to act in more human-like ways. Many of these arise from the field of \u201c<a href=\"https:\/\/arxiv.org\/abs\/2406.18346\">alignment<\/a>\u201d research, which aims to make models <a href=\"https:\/\/spectrum.ieee.org\/the-alignment-problem-openai\">act in accordance<\/a> with the goals and motivations of their human developers. One example is the technique that was <a href=\"https:\/\/venturebeat.com\/ai\/how-reinforcement-learning-with-human-feedback-is-unlocking-the-power-of-generative-ai\/\">arguably<\/a> responsible for the breakthrough success of <a href=\"https:\/\/spectrum.ieee.org\/tag\/chatgpt\">ChatGPT<\/a>: <a href=\"https:\/\/arxiv.org\/abs\/2203.02155\">reinforcement learning with human feedback<\/a>. In this method, an AI model is (figuratively) rewarded for producing responses that get a thumbs-up from human evaluators. Similar approaches could be used to induce AI systems to make more human-like mistakes, particularly by penalizing them more for mistakes that are less intelligible.<\/p>\n<p>When it comes to catching AI mistakes, some of the systems that we use to prevent human mistakes will help. To an extent, forcing LLMs to <a href=\"https:\/\/arxiv.org\/pdf\/2308.00436\">double-check<\/a> their own work can help prevent errors. But LLMs can also <a href=\"https:\/\/arxiv.org\/pdf\/2406.02061\">confabulate<\/a> seemingly plausible, but truly ridiculous, explanations for their flights from reason.<\/p>\n<p>Other mistake mitigation systems for AI are unlike anything we use for humans. Because machines can\u2019t get fatigued or frustrated in the way that humans do, it can help to ask an LLM the same question repeatedly in slightly different ways and then <a href=\"https:\/\/arxiv.org\/abs\/2210.02441\">synthesize<\/a> its multiple responses. Humans won\u2019t put up with that kind of annoying repetition, but machines will.<\/p>\n<h3>Understanding Similarities and Differences<\/h3>\n<p>Researchers are still struggling to understand where LLM mistakes diverge from human ones. Some of the weirdness of AI is actually more human-like than it first appears. Small changes to a query to an LLM can result in wildly different responses, a problem known as <a href=\"https:\/\/arxiv.org\/pdf\/2311.07230\">prompt sensitivity<\/a>. But, as any survey researcher can tell you, humans behave this way, too. The phrasing of a question in an opinion poll can have drastic <a href=\"https:\/\/psycnet.apa.org\/record\/1992-97329-001\">impacts<\/a> on the answers.<\/p>\n<p>LLMs also seem to have a bias towards <a href=\"http:\/\/proceedings.mlr.press\/v139\/zhao21c\/zhao21c.pdf\">repeating<\/a> the words that were most common in their training data; for example, guessing familiar place names like \u201cAmerica\u201d even when asked about more exotic locations. Perhaps this is an example of the human \u201c<a href=\"https:\/\/arxiv.org\/pdf\/2305.04400\">availability heuristic<\/a>\u201d manifesting in LLMs, with machines spitting out the first thing that comes to mind rather than reasoning through the question. And like humans, perhaps, some LLMs seem to get <a href=\"https:\/\/arxiv.org\/html\/2404.08865v1\">distracted<\/a> in the middle of long documents; they\u2019re better able to remember facts from the beginning and end. There is already progress on improving this error mode, as researchers have found that LLMs trained on <a href=\"https:\/\/www.anthropic.com\/news\/claude-2-1-prompting\">more examples<\/a> of retrieving information from long texts seem to do better at retrieving information uniformly.<\/p>\n<p>In some cases, what\u2019s bizarre about LLMs is that they act more like humans than we think they should. For example, some researchers have tested the <a href=\"https:\/\/minimaxir.com\/2024\/02\/chatgpt-tips-analysis\/\">hypothesis<\/a> that LLMs perform better when offered a cash reward or threatened with death. It also turns out that some of the best ways to \u201c<a href=\"https:\/\/www.usenix.org\/system\/files\/sec24fall-prepub-1500-yu-zhiyuan.pdf\">jailbreak<\/a>\u201d LLMs (getting them to disobey their creators\u2019 explicit instructions) look a lot like the kinds of social engineering tricks that humans use on each other: for example, pretending to be someone else or saying that the request is just a joke. But other effective jailbreaking techniques are things no human would ever fall for. One group <a href=\"https:\/\/arxiv.org\/abs\/2402.11753\">found<\/a> that if they used <a href=\"https:\/\/en.wikipedia.org\/wiki\/ASCII_art\">ASCII art<\/a> (constructions of symbols that look like words or pictures) to pose dangerous questions, like how to build a bomb, the LLM would answer them willingly.<\/p>\n<p>Humans may occasionally make seemingly random, incomprehensible, and inconsistent mistakes, but such occurrences are rare and often indicative of more serious problems. We also tend not to put people exhibiting these behaviors in decision-making positions. Likewise, we should confine AI decision-making systems to applications that suit their actual abilities\u2014while keeping the potential ramifications of their mistakes firmly in mind.<\/p>\n<p><em>This essay was written with Nathan E. Sanders, and originally appeared in <a href=\"https:\/\/spectrum.ieee.org\/ai-mistakes-schneier\">IEEE Spectrum<\/a>.<\/em><\/p>\n<\/div>\n<p> \t<BR><br \/>\n <BR><\/BR><br \/>\n    Bruce Schneier<br \/>\n \t<BR><br \/>\n<BR><\/BR><br \/>\n<a href=\"https:\/\/www.schneier.com\/blog\/archives\/2025\/01\/ai-mistakes-are-very-different-from-human-mistakes.html\">Go to bruce schneier<\/a><br \/>\n \t<BR><br \/>\n <BR><\/BR><\/p>\n","protected":false},"excerpt":{"rendered":"<p>AI Mistakes Are Very Different from Human Mistakes Humans make mistakes all the time. All of us do, every day, in tasks both new and routine. Some of our mistakes are minor and some are catastrophic. Mistakes can break trust with our friends, lose the confidence of our bosses, and sometimes be the difference between [&hellip;]<\/p>\n","protected":false},"author":2,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[167,57,268,1],"tags":[87],"class_list":["post-1458","post","type-post","status-publish","format-standard","hentry","category-ai","category-bruce-schneier","category-llm","category-uncategorized","tag-bruce-schneier"],"_links":{"self":[{"href":"https:\/\/serisec.com\/index.php\/wp-json\/wp\/v2\/posts\/1458"}],"collection":[{"href":"https:\/\/serisec.com\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/serisec.com\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/serisec.com\/index.php\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/serisec.com\/index.php\/wp-json\/wp\/v2\/comments?post=1458"}],"version-history":[{"count":0,"href":"https:\/\/serisec.com\/index.php\/wp-json\/wp\/v2\/posts\/1458\/revisions"}],"wp:attachment":[{"href":"https:\/\/serisec.com\/index.php\/wp-json\/wp\/v2\/media?parent=1458"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/serisec.com\/index.php\/wp-json\/wp\/v2\/categories?post=1458"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/serisec.com\/index.php\/wp-json\/wp\/v2\/tags?post=1458"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}