{"id":957,"date":"2015-03-09T15:55:17","date_gmt":"2015-03-09T20:55:17","guid":{"rendered":"http:\/\/blog.richmond.edu\/physicsbunn\/?p=957"},"modified":"2015-03-09T15:55:17","modified_gmt":"2015-03-09T20:55:17","slug":"p-values-arent-wrong-theyre-just-uninteresting","status":"publish","type":"post","link":"https:\/\/blog.richmond.edu\/physicsbunn\/2015\/03\/09\/p-values-arent-wrong-theyre-just-uninteresting\/","title":{"rendered":"p-values aren&#8217;t wrong; they&#8217;re just uninteresting"},"content":{"rendered":"<p>The journal <em>Basic and Applied Social Psychology<\/em> has come out with a <a href=\"http:\/\/www.tandfonline.com\/doi\/pdf\/10.1080\/01973533.2015.1012991\">ban<\/a> on <em>p<\/em>-values. To be precise, they&#8217;ve banned the &#8220;null hypothesis significance testing procedure&#8221; from articles published in the journal. This ban means that authors in the journal can&#8217;t claim that an effect they see in their data is &#8220;statistically significant&#8221; in the usual way that we&#8217;re all accustomed to reading.<\/p>\n<p>Like all right-thinking people, I believe that the only coherent way to think about statistical questions is the Bayesian way, and that there are serious problems with the alternative &#8220;frequentist&#8221; approach. Moreover, the sort of (frequentist) significance testing banned by this journal can\u00a0indeed <a href=\"http:\/\/blog.richmond.edu\/physicsbunn\/2014\/02\/18\/nature-on-p-values\/\">lead to serious problems<\/a>. It&#8217;s a large part of the reason that a strong argument can be made that (at least in some scientific disciplines)\u00a0<a href=\"http:\/\/journals.plos.org\/plosmedicine\/article?id=10.1371\/journal.pmed.0020124\">most published research findings are false<\/a>.<\/p>\n<p>All of that suggests that I ought to applaud this decision, but at the risk of seeming disloyal to my fellow Bayesians, I don&#8217;t. In fact, the journal editors&#8217; decision to impose this ban makes me trust the quality of the journal less than I otherwise would, not more.<\/p>\n<p>I was pleased to see that my old friend Allen Downey, always a voice of sanity on matters of this sort, <a href=\"http:\/\/allendowney.blogspot.com\/2015\/03\/statistical-inference-is-only-mostly.html\">is of the same opinion<\/a>. I won&#8217;t rehash everything he says in his post, but I heartily endorse it.<\/p>\n<p>The main thing to realize is that the techniques in question aren&#8217;t actually wrong. On the contrary, they correctly answer the questions they&#8217;re supposed to answer.<\/p>\n<p>If your data allow you to reject the null hypothesis with a significance (&#8220;<em>p<\/em>-value&#8221;) of 5%, that means that,\u00a0<em>if the null hypothesis were true, there&#8217;d be only a 5% chance of getting data that look like the data you actually got.\u00a0<\/em><\/p>\n<p>Some people &#8212; or so I&#8217;ve been told &#8212; labor under the misconception that the <em>p<\/em>-value tells you the probability that the null hypothesis is true, but it doesn&#8217;t. I&#8217;m going to rehash the old story here; skip ahead if you&#8217;ve heard it before.<\/p>\n<p>Suppose that a pregnancy test yields correct results 95% of the time. Pat takes the test, which comes out positive. That means that the &#8220;null hypothesis&#8221; that Pat is not pregnant can be ruled out with a significance (<em>p<\/em>-value) of 5%. But it does not mean that there&#8217;s a 95% chance that Pat is pregnant. The probability that Pat is pregnant depends both on the result of the test and on any additional information you have about Pat &#8212; that is, the\u00a0<em>prior probability<\/em> that Pat is pregnant. For example, if Pat is anatomically male, then the probability of pregnancy is zero, regardless of the test result.<\/p>\n<p>Needless to say, Pat doesn&#8217;t care about the\u00a0<em>p-<\/em>value; Pat cares about whether Pat is pregnant. The\u00a0<em>p<\/em>-value is not wrong; it&#8217;s just uninteresting. As I&#8217;ve said <a href=\"http:\/\/blog.richmond.edu\/physicsbunn\/2014\/02\/18\/nature-on-p-values\/\">before<\/a>, this can be summarized with a convenient Venn diagram:<\/p>\n<p><a href=\"http:\/\/blog.richmond.edu\/physicsbunn\/files\/2015\/03\/Slide11.gif\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-full wp-image-958\" src=\"http:\/\/blog.richmond.edu\/physicsbunn\/files\/2015\/03\/Slide11.gif\" alt=\"Slide11\" width=\"491\" height=\"290\" \/><\/a><\/p>\n<p>So if\u00a0<em>p<\/em>-values are uninteresting, why shouldn&#8217;t the journal ban them?<\/p>\n<p>The main reason is because they can be an ingredient in drawing useful conclusions.\u00a0You can combine the\u00a0<em>p<\/em>-value with your prior knowledge to get an answer to the question you&#8217;re actually interested in (&#8220;How likely is the hypothesis to be true?&#8221;).\u00a0Just because the\u00a0<em>p<\/em>-value doesn&#8217;t directly answer Pat&#8217;s question about whether she&#8217;s pregnant, that doesn&#8217;t mean that it&#8217;s not valid and useful information about the reliability of the pregnancy test, which she can (and should) use in drawing conclusions.<\/p>\n<p>As far as I can tell, the argument in favor of banning <em>p<\/em>-values is that people sometimes misinterpret them, but that&#8217;s a weak argument for a ban. It&#8217;s worth distinguishing two possibilities here:<\/p>\n<ol>\n<li>The article describes the statistical procedures and results accurately, but statistically illiterate readers misunderstand them. In this situation, I&#8217;m perfectly happy to go all libertarian and\u00a0<em>caveat emptor.\u00a0<\/em>Why should an intelligent reader be denied the right to hear about a\u00a0<em>p-<\/em>value just because someone else might be misled due to his own ignorance?<\/li>\n<li>The article describes the statistical procedures and results in a misleading way. Obviously, the journal should not allow this. But a ban shouldn&#8217;t be necessary to enforce this. The whole point of a peer-reviewed journal is that experts are evaluating the article to make sure it doesn&#8217;t contain incorrect or misleading statements. If the editors feel the need for a ban, then they are in effect admitting that they and referees cannot effectively evaluate the validity of an article&#8217;s statistical claims.<\/li>\n<\/ol>\n<p>The editorial describing the reasons for the ban states, incorrectly, that the banned technique is &#8220;invalid.&#8221; Moreover, the perceived need for a ban seems to me to arise from the editors&#8217; lack of confidence in their own ability to weed out good statistics from bad. That&#8217;s why I say that, although I have no great love for <em>p<\/em>-values, this ban reduces my willingness to trust any results published in this journal.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>The journal Basic and Applied Social Psychology has come out with a ban on p-values. To be precise, they&#8217;ve banned the &#8220;null hypothesis significance testing procedure&#8221; from articles published in the journal. This ban means that authors in the journal can&#8217;t claim that an effect they see in their data is &#8220;statistically significant&#8221; in the &hellip; <a href=\"https:\/\/blog.richmond.edu\/physicsbunn\/2015\/03\/09\/p-values-arent-wrong-theyre-just-uninteresting\/\" class=\"more-link\">Continue reading <span class=\"screen-reader-text\">p-values aren&#8217;t wrong; they&#8217;re just uninteresting<\/span><\/a><\/p>\n","protected":false},"author":12,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[1],"tags":[],"class_list":["post-957","post","type-post","status-publish","format-standard","hentry","category-uncategorized"],"jetpack_featured_media_url":"","_links":{"self":[{"href":"https:\/\/blog.richmond.edu\/physicsbunn\/wp-json\/wp\/v2\/posts\/957","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/blog.richmond.edu\/physicsbunn\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/blog.richmond.edu\/physicsbunn\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/blog.richmond.edu\/physicsbunn\/wp-json\/wp\/v2\/users\/12"}],"replies":[{"embeddable":true,"href":"https:\/\/blog.richmond.edu\/physicsbunn\/wp-json\/wp\/v2\/comments?post=957"}],"version-history":[{"count":0,"href":"https:\/\/blog.richmond.edu\/physicsbunn\/wp-json\/wp\/v2\/posts\/957\/revisions"}],"wp:attachment":[{"href":"https:\/\/blog.richmond.edu\/physicsbunn\/wp-json\/wp\/v2\/media?parent=957"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/blog.richmond.edu\/physicsbunn\/wp-json\/wp\/v2\/categories?post=957"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/blog.richmond.edu\/physicsbunn\/wp-json\/wp\/v2\/tags?post=957"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}