{"id":185,"date":"2023-11-07T13:25:24","date_gmt":"2023-11-07T13:25:24","guid":{"rendered":"https:\/\/ozer.gt\/log\/?p=185"},"modified":"2024-01-27T07:53:22","modified_gmt":"2024-01-27T07:53:22","slug":"llms-are-most-useful-for-experts-on-a-topic","status":"publish","type":"post","link":"https:\/\/ozer.gt\/log\/2023\/11\/07\/llms-are-most-useful-for-experts-on-a-topic\/","title":{"rendered":"LLMs are most useful for experts on a topic&#8230;"},"content":{"rendered":"<p>&#8230;because experts are more likely to know what they don&#8217;t know. When users don&#8217;t know what they don&#8217;t know, so-called &#8220;hallucinations&#8221; are less likely to be detected and this seems to be a growing problem, likely exacerbated by the Dunning-Kruger effect. Well, this is my take on them.<\/p>\n<p>In the study cited in the article, several LLM models are asked to summarize news articles to measure how often they &#8220;hallucinated&#8221; or made up facts.<\/p>\n<p>The LLM models showed different rates of &#8220;hallucination&#8221;, with OpenAI having the lowest (about 3%), followed by Meta (about 5%), Anthropic&#8217;s Claude 2 system (over 8%), and Google&#8217;s Palm chat with the highest (27%).<\/p>\n<p><a href=\"https:\/\/www.nytimes.com\/2023\/11\/06\/technology\/chatbots-hallucination-rates.html?unlocked_article_code=1.8kw.0-P_.JiW8-EQYqPXx&amp;smid=url-share\">Source<\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>&#8230;because experts are more likely to know what they don&#8217;t know. When users don&#8217;t know what they don&#8217;t know, so-called &#8220;hallucinations&#8221; are less likely to be detected and this seems to be a growing problem, likely exacerbated by the Dunning-Kruger effect. Well, this is my take on them. In the study cited in the article, [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"cybocfi_hide_featured_image":"","footnotes":""},"categories":[1],"tags":[],"class_list":["post-185","post","type-post","status-publish","format-standard","hentry","category-uncategorized"],"_links":{"self":[{"href":"https:\/\/ozer.gt\/log\/wp-json\/wp\/v2\/posts\/185","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/ozer.gt\/log\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/ozer.gt\/log\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/ozer.gt\/log\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/ozer.gt\/log\/wp-json\/wp\/v2\/comments?post=185"}],"version-history":[{"count":2,"href":"https:\/\/ozer.gt\/log\/wp-json\/wp\/v2\/posts\/185\/revisions"}],"predecessor-version":[{"id":187,"href":"https:\/\/ozer.gt\/log\/wp-json\/wp\/v2\/posts\/185\/revisions\/187"}],"wp:attachment":[{"href":"https:\/\/ozer.gt\/log\/wp-json\/wp\/v2\/media?parent=185"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/ozer.gt\/log\/wp-json\/wp\/v2\/categories?post=185"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/ozer.gt\/log\/wp-json\/wp\/v2\/tags?post=185"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}