{"id":2860,"date":"2018-12-05T07:00:17","date_gmt":"2018-12-05T12:00:17","guid":{"rendered":"http:\/\/datacolada.org\/?p=2860"},"modified":"2020-11-18T23:15:28","modified_gmt":"2020-11-19T04:15:28","slug":"74-in-press-at-psychological-science-a-new-nudge-supported-by-implausible-data","status":"publish","type":"post","link":"https:\/\/datacolada.org\/74","title":{"rendered":"[74] In Press at Psychological Science: A New 'Nudge' Supported by Implausible Data"},"content":{"rendered":"<p>Today <em>Psychological Science<\/em> issued a Corrigendum (.<a href=\"https:\/\/www.doi.org\/10.1177\/0956797618761374\">htm<\/a>) and an expression of concern (<a href=\"http:\/\/doi.org\/10.1177\/0956797618816068\">htm<\/a>) for a paper originally posted online in May 2018 (.<a href=\"https:\/\/journals.sagepub.com\/doi\/10.1177\/0956797618761374\">htm<\/a>). This post will spell out the data irregularities we uncovered that eventually led to the two postings from the journal today. We are not convinced that those postings are sufficient.<\/p>\n<p>It is important to say at the outset we have not identified who is responsible for the problems. In the correction, for example, the authors themselves make clear that they \"do not have an explanation\" for some peculiarities, in part because many other people handled the data between collection and reporting. This post is therefore not about who caused the problems [<a href=\"#footnote_0_2860\" id=\"identifier_0_2860\" class=\"footnote-link footnote-identifier-link\" title=\"It is also worth noting that&nbsp; this post is possible because the authors elected to post their data.\">1<\/a>].<\/p>\n<p><a href=\"https:\/\/datacolada.org\/appendix\/74\/74%20-%20DataColada%20-%20Decoy%20Sanitizer%20-%202018%2008%2023.R\">R Code<\/a> to reproduce all calculations and figures.<\/p>\n<p><strong>Background<\/strong><br \/>\nThe history of the correction starts back in May, in a Shanghai journal club discussion Leif participated in while on sabbatical in China. Puzzled by a few oddities, four members of the group \u2013 Frank Yu (.<a href=\"https:\/\/web.archive.org\/web\/20180824150115\/http:\/\/www.ceibs.edu\/yu-frank\">htm<\/a>), Leif, and two other anonymous researchers \u2013 went on to consider the original data posted by the authors (.<a href=\"https:\/\/osf.io\/k8tv2\/\">htm<\/a>) and identified several patterns that were objectively, instead of merely intuitively, problematic.<\/p>\n<p>Most notably, the posted data has two classic markers of data implausibility:<\/p>\n<p style=\"margin: 0in; margin-bottom: .0001pt; text-align: justify; text-justify: inter-ideograph;\">(i) anomalous distribution of last digits, and<\/p>\n<p style=\"margin: 0in; margin-bottom: .0001pt; text-align: justify; text-justify: inter-ideograph;\">(ii) means are excessively similar.<\/p>\n<p>Leif and his team first went to Uri for his independent assessment of the data, who concurred that the problems looked significant and added new analyses. Then, back in June, they contacted Steve Lindsay (.<a href=\"https:\/\/web.archive.org\/web\/20180711025850\/https:\/\/www.uvic.ca\/socialsciences\/psychology\/people\/faculty-directory\/lindsaysteve.php\">html<\/a>), the editor of <em>Psychological Science.<\/em>\u00a0 In consultation with the editor, the authors then wrote a correction. We deemed this correction to be insufficient and we drafted a blog post. We shared it with the authors and the editor. They asked us to wait while they considered our arguments further. We promised we would, and we did.\u00a0Eventually they wrote an expression of concern, to be published alongside the Corrigendum, and they shared it with us. Today, six months after we first contacted the editor, we publish this post, in part because these responses (1) seem insufficient given the gravity of the irregularities, and (2) do not convey the irregularities clearly enough for readers to understand their gravity.<\/p>\n<p><strong>The basic design in the original paper.<\/strong><br \/>\nLi (<a href=\"https:\/\/web.archive.org\/web\/20181120152002\/https:\/\/clas.ucdenver.edu\/hbsc\/meng-li\">htm<\/a>), Sun (<a href=\"https:\/\/web.archive.org\/web\/20181120151910\/http:\/\/sourcedb.psych.cas.cn\/en\/epsychexpert\/201203\/t20120327_3518891.html\">htm<\/a>), &amp; Chen, report three field experiments showing that the Decoy Effect \u2013 a classic finding from decision research [<a href=\"#footnote_1_2860\" id=\"identifier_1_2860\" class=\"footnote-link footnote-identifier-link\" title=\"Basic background for the intrigued: The original demonstration is Huber, Payne, and Puto (1982 .htm). Heath &amp; Chaterjee (1995 .htm) provide a&nbsp;good review of several studies\">2<\/a>] \u2013 can be used as a nudge to increase the use of hand\u2011sanitizer by food factory workers.<\/p>\n<p>In the experiments, the authors manipulate the set of sanitizer dispensers available, and measure the amount of sanitizer used, by weighing the dispensers at the end of each day. There is one observation per worker-day.<\/p>\n<p>For example, in Experiment 1, some workers only had a spray dispenser, while others had two dispensers, both the spray dispenser and a squeeze-bottle:<\/p>\n<p><a href=\"https:\/\/datacolada.org\/wp-content\/uploads\/2018\/08\/Hand-sanitizer.png\"><img decoding=\"async\" class=\"wp-image-2866 size-full aligncenter\" style=\"border: 1px solid #000000;\" src=\"https:\/\/datacolada.org\/wp-content\/uploads\/2018\/08\/Hand-sanitizer.png\" alt=\"\" width=\"200\" srcset=\"https:\/\/datacolada.org\/wp-content\/uploads\/2018\/08\/Hand-sanitizer.png 853w, https:\/\/datacolada.org\/wp-content\/uploads\/2018\/08\/Hand-sanitizer-300x201.png 300w\" sizes=\"(max-width: 853px) 100vw, 853px\" \/><\/a><br \/>\nThe authors postulated that the squeeze-bottle sanitizer was objectively inferior to the spray, and that it would serve as a decoy. Thus, the authors predicted that workers would be more likely to use the spray dispenser when it was next to the squeeze bottle than when it was the only dispenser available [<a href=\"#footnote_2_2860\" id=\"identifier_2_2860\" class=\"footnote-link footnote-identifier-link\" title=\"Study 2 used a soaking basin as a decoy instead\">3<\/a>].<\/p>\n<p><strong>Original results: Huge effects.<br \/>\n<\/strong>Across three studies, the presence of a decoy dispenser increased the use of the spray dispenser by more than 1\u00a0standard deviation on average (<em>d<\/em> = 1.06).\u00a0That's a large effect. Notably, in Study 2, only <span style=\"text-decoration: underline;\">one<\/span> participant in the control condition increased sanitizer use more than the participant who increased <span style=\"text-decoration: underline;\">the least<\/span> in the treatment. Almost non-overlapping distributions.<\/p>\n<p><strong>Problem 1. Inconsistency in scale precision.<\/strong><br \/>\nThe original article indicated that the experimenters used <em>\"an electronic scale accurate to 5 grams\"<\/em> (p.4). Such a scale could measure 15 grams, or 20 grams, but not 17 grams. Contradicting this description, the posted data has many observations (8.4% of them) that were not multiples of 5.<\/p>\n<p>The correction states that scales accurate to 1, 2 and 3 grams may have been used sometimes, instead of scales precise to 5 grams (We do not believe scales precise to 3 grams exist). \u00a0[<a href=\"#footnote_3_2860\" id=\"identifier_3_2860\" class=\"footnote-link footnote-identifier-link\" title=\"Footnote 1 in the correction reads:\n\">4<\/a>].<\/p>\n<p><strong>Problem 2. Last digit in Experiment 1<br \/>\n<\/strong>But there is another odd thing about the data purportedly obtained with the more precise scales. The problem involves the frequency of the last digit in the number of grams (by last digit we mean, for example, the 8, in 201<u>8<\/u>).<\/p>\n<p>In particular, the problem with those observations draws on the generalization of something called \u201cBenford\u2019s Law\u201d, \u00a0which tells us the last digit should be distributed (nearly) uniformly: there should be just about as many workers using sanitizer amounts that end in 3 grams (e.g, 23 or 43 grams), as in 4 (e.g., 24 or 44 grams), etc. But as we see below, the data looks nothing like the uniform distribution. (If you are not familiar with Benford's law, read this footnote: [<a href=\"#footnote_4_2860\" id=\"identifier_4_2860\" class=\"footnote-link footnote-identifier-link\" title=\"About 80 years ago, Benford (.htm) noticed that with collections of numbers, the leading digit (the one furthest to the left) had a predictable pattern of occurrence: 1&rsquo;s were more common than 2&rsquo;s, which were more common than 3&rsquo;s etc. A mathematical formula generalizing Benford&#039;s law applies to digits further to the right in a different way: as one moves right, to the 2nd, 3rd, 4th digit, etc., those numbers should be distributed closer and closer to uniformly (i.e., 1 is just as common as 2, 3, 4, etc.). Because those predictions are derived mathematically, and observed empirically, violations of Benford&#039;s law are a signal that something is wrong. Benford&#039;s law has, for first and last digits, been used to detect fraud in accounting, elections, and science. See Wikipedia.\">5<\/a>]).<\/p>\n<p><strong><a href=\"https:\/\/datacolada.org\/wp-content\/uploads\/2018\/08\/F1-Histogram-last-digit-for-Study-1-1.png\"><img decoding=\"async\" class=\"alignnone wp-image-2891 size-full\" style=\"border: 1px solid #000000;\" src=\"https:\/\/datacolada.org\/wp-content\/uploads\/2018\/08\/F1-Histogram-last-digit-for-Study-1-1.png\" alt=\"\" width=\"400\" srcset=\"https:\/\/datacolada.org\/wp-content\/uploads\/2018\/08\/F1-Histogram-last-digit-for-Study-1-1.png 800w, https:\/\/datacolada.org\/wp-content\/uploads\/2018\/08\/F1-Histogram-last-digit-for-Study-1-1-300x150.png 300w\" sizes=\"(max-width: 800px) 100vw, 800px\" \/><\/a><br \/>\n<\/strong><strong>Fig 1<\/strong>.<em> Histogram for last digit in Study 1 <\/em>[<a href=\"#footnote_5_2860\" id=\"identifier_5_2860\" class=\"footnote-link footnote-identifier-link\" title=\"In this appendix (.pdf) we document that the uniform is indeed what you&#039;d expect for these data, even though values on this variable have just 2 digits.\">6<\/a>].<\/p>\n<p>About this problem, the expression of concern reads:<\/p>\n<p><img decoding=\"async\" class=\"alignnone size-full wp-image-4990\" src=\"https:\/\/datacolada.org\/wp-content\/uploads\/expression-of-concern-about-3-and-7.png\" alt=\"\" width=\"350\" srcset=\"https:\/\/datacolada.org\/wp-content\/uploads\/expression-of-concern-about-3-and-7.png 701w, https:\/\/datacolada.org\/wp-content\/uploads\/expression-of-concern-about-3-and-7-300x151.png 300w\" sizes=\"(max-width: 701px) 100vw, 701px\" \/><\/p>\n<p>This speculated behavior, one scale precise to 5 grams used in the morning, another precise to 1 or 2 grams in the afternoon, or vice versa, cannot explain the posted data. A uniform distribution of last digits is anyway expected, not the bizarre prevalence of 3s and 7s that we see (<a href=\"https:\/\/datacolada.org\/appendix\/74\/74%20-%20Different%20scales%20lead%20to%20uniform%20anyway.R\">R Code<\/a>).<\/p>\n<p><strong>Problem 3. Last digit in Experiment 3.<br \/>\n<\/strong>Let's look at the last digit again. In this study sanitizer use was measured for 80 participants over 40 days, and with a scale sensitive to the 100<sup>th<\/sup> of a gram. The expectation of the last digit being uniformly distributed here is more obvious.<\/p>\n<p><a href=\"https:\/\/datacolada.org\/wp-content\/uploads\/2018\/08\/F2-Histogram-last-digit-for-Study-3-1.png\"><img decoding=\"async\" class=\"alignnone wp-image-2892 size-full\" style=\"border: 1px solid #000000;\" src=\"https:\/\/datacolada.org\/wp-content\/uploads\/2018\/08\/F2-Histogram-last-digit-for-Study-3-1.png\" alt=\"\" width=\"400\" srcset=\"https:\/\/datacolada.org\/wp-content\/uploads\/2018\/08\/F2-Histogram-last-digit-for-Study-3-1.png 800w, https:\/\/datacolada.org\/wp-content\/uploads\/2018\/08\/F2-Histogram-last-digit-for-Study-3-1-300x150.png 300w\" sizes=\"(max-width: 800px) 100vw, 800px\" \/><\/a><br \/>\n<strong>Fig 2.<\/strong> <em>Last digit for Study 3<\/em><\/p>\n<p>To appreciate how implausible <strong>Fig 2<\/strong> is, consider that it implies, for example, that workers would be 3 times as likely to use 45.5<u>6<\/u> grams of sanitizer, as they would be to use 45.5<u>3<\/u> grams\u00a0[<a href=\"#footnote_6_2860\" id=\"identifier_6_2860\" class=\"footnote-link footnote-identifier-link\" title=\"As an extra precaution, we analyzed other datasets with grams as the dependent variable. We found studies on (i) soup consumption, (ii) brood carcass, (iii) American bullfrog size, and (iv) decomposing bags. Last digit was uniform across the board (See details: .pdf). \">7<\/a>].<\/p>\n<p>About this problem, the expression of concern reads:<br \/>\n<img decoding=\"async\" class=\"alignnone size-full wp-image-4991\" src=\"https:\/\/datacolada.org\/wp-content\/uploads\/last-digit.png\" alt=\"\" width=\"350\" srcset=\"https:\/\/datacolada.org\/wp-content\/uploads\/last-digit.png 671w, https:\/\/datacolada.org\/wp-content\/uploads\/last-digit-300x110.png 300w\" sizes=\"(max-width: 671px) 100vw, 671px\" \/><br \/>\n<strong>Problem 4. Implausibly similar means in Experiment 2<\/strong><br \/>\nIn Experiment 2, sanitizer use was measured daily for 40 participants for 40 days (20 days of baseline, 20 of treatment), all with a scale sensitive to 100<sup>th<\/sup> of a gram.<\/p>\n<p>Recall that the manipulation was done at the room level. This figure, which was in the original article, shows the daily average use of sanitizer across the two rooms.<\/p>\n<p><a href=\"https:\/\/datacolada.org\/wp-content\/uploads\/2018\/08\/similar-means-posted-data.png\"><img decoding=\"async\" class=\"alignnone wp-image-2875 size-full\" style=\"border: 1px solid #000000;\" src=\"https:\/\/datacolada.org\/wp-content\/uploads\/2018\/08\/similar-means-posted-data.png\" alt=\"\" width=\"400\" srcset=\"https:\/\/datacolada.org\/wp-content\/uploads\/2018\/08\/similar-means-posted-data.png 719w, https:\/\/datacolada.org\/wp-content\/uploads\/2018\/08\/similar-means-posted-data-300x202.png 300w\" sizes=\"(max-width: 719px) 100vw, 719px\" \/><\/a><br \/>\nTreatment started on day 21. In days 1-20 the two rooms had extraordinarily similar means. Average sanitizer usage differed, on average, by just .19 grams across rooms. Moreover, across days, average sanitizer use was correlated at <em>r<\/em> = .94 across rooms.<\/p>\n<p>To quantify how surprisingly similar the conditions were in the \"before treatment\" period, we conducted the following resampling test: we shuffle all 40 participants into two new groups of 20 (keeping all observations per worker fixed). We then compute daily means for each of the two groups ('rooms'). We did this one million times and asked \"How often do simulation results look as extreme as the paper's?\" The answer is \"almost never\":<\/p>\n<p><a href=\"https:\/\/datacolada.org\/wp-content\/uploads\/2018\/08\/F3-Excessive-similarity-in-Study-2.png\"><img decoding=\"async\" class=\"alignnone wp-image-2876 size-full\" style=\"border: 1px solid #000000;\" src=\"https:\/\/datacolada.org\/wp-content\/uploads\/2018\/08\/F3-Excessive-similarity-in-Study-2.png\" alt=\"\" width=\"600\" srcset=\"https:\/\/datacolada.org\/wp-content\/uploads\/2018\/08\/F3-Excessive-similarity-in-Study-2.png 1200w, https:\/\/datacolada.org\/wp-content\/uploads\/2018\/08\/F3-Excessive-similarity-in-Study-2-300x150.png 300w, https:\/\/datacolada.org\/wp-content\/uploads\/2018\/08\/F3-Excessive-similarity-in-Study-2-1024x512.png 1024w\" sizes=\"(max-width: 1200px) 100vw, 1200px\" \/><\/a><br \/>\nSo, for example, the figure on the right shows that the correlation between means is on average about r=.7 rather than the r=.94 that's reported in the paper. Only 96 times in a million, would we expect it to be .94 or higher.<\/p>\n<p>We don't think readers of the expression of concern would come away with sufficient information to appreciate the impossibility we shared with the authors and editor; all it says about it is:<\/p>\n<p><img decoding=\"async\" class=\"alignnone size-full wp-image-4993\" src=\"https:\/\/datacolada.org\/wp-content\/uploads\/unlikely.png\" alt=\"\" width=\"350\" srcset=\"https:\/\/datacolada.org\/wp-content\/uploads\/unlikely.png 644w, https:\/\/datacolada.org\/wp-content\/uploads\/unlikely-300x144.png 300w\" sizes=\"(max-width: 644px) 100vw, 644px\" \/><\/p>\n<p><strong>Problem 5. Last digit\u2026 in Experiment 2<\/strong><\/p>\n<p>While less visually striking than for Experiments 1 and 3, the last digit is not distributed uniform in this experiment either, with N=1600, a scale precise to 1\/100<sup>th<\/sup> of gram, rejects the uniform null with: \u03c7<sup>2<\/sup>(9)\u00a0=\u00a043.45, <em>p<\/em>&lt;.0001; see histogram .<a href=\"https:\/\/datacolada.org\/wp-content\/uploads\/2018\/08\/S3-Histogram-last-digit-for-Study-2.png\" target=\"_blank\" rel=\"noopener noreferrer\">png<\/a> [<a href=\"#footnote_7_2860\" id=\"identifier_7_2860\" class=\"footnote-link footnote-identifier-link\" title=\"This is perhaps a good place to tell you of an additional anecdotal problem: when preparing the first draft of this post, back in June, we noticed this odd row in Experiment 2. The 5 gram scale makes a surprising re-appearance on day 4, takes a break on day 9, but returns on day 10\n\">8<\/a>].<\/p>\n<p><strong>Summary.<br \/>\n<\/strong>We appreciate that the authors acknowledge some of the problems we brought to their attention and further that they cannot assuage concerns because the data collection and management necessarily occurred at such a remove. On the other hand, as readers we are at a loss. Three experiments show unambiguous signs that there are problems with all of the reported data. How can we read the paper and interpret differences across conditions as meaningful while discounting the problems as otherwise meaningless? We think that it might be warranted to take the opposite view and see meaning in the long list of problems and therefore seeing the differences across conditions as meaningless.<\/p>\n<p>We should maintain a very high burden of proof to conclude that any individual tampered with data.<\/p>\n<p>But the burden of proof for <em>dataset<\/em> concerns should be considerably lower. We do not need to know the source of contamination in order to lose trust in the data.<\/p>\n<p>Even after the correction, and the clarifications of the Expression of Concern, we still believe that these data do not deserve the trust of <em>Psychological Science<\/em> readers.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter wp-image-376\" src=\"https:\/\/datacolada.org\/wp-content\/uploads\/2014\/02\/Wide-logo-300x145.jpg\" alt=\"Wide logo\" width=\"78\" height=\"38\" srcset=\"https:\/\/datacolada.org\/wp-content\/uploads\/2014\/02\/Wide-logo-300x145.jpg 300w, https:\/\/datacolada.org\/wp-content\/uploads\/2014\/02\/Wide-logo.jpg 320w\" sizes=\"auto, (max-width: 78px) 100vw, 78px\" \/><\/p>\n<hr \/>\n<p><span style=\"color: #0000ff;\"><strong>Author feedback<\/strong><br \/>\nOur policy (.<a href=\"https:\/\/web.archive.org\/web\/20181120152002\/https:\/\/clas.ucdenver.edu\/hbsc\/meng-li\">htm<\/a>) is to share drafts of blog posts that discuss someone else\u2019s work with them to solicit feedback. As mentioned above we contacted the authors and editor of Psych Science. They provided feedback on wording and asked that we wait while they revised the correction, which we did (for over 6 months).<\/span><\/p>\n<p>J<span style=\"color: #0000ff;\">ust before posting they gave us another round of suggestions and then Meng Li\u00a0(<a style=\"color: #0000ff;\" href=\"https:\/\/web.archive.org\/web\/20181120152002\/https:\/\/clas.ucdenver.edu\/hbsc\/meng-li\">htm<\/a>) wrote a separate piece (.<a style=\"color: #0000ff;\" href=\"https:\/\/web.archive.org\/web\/20181206065030\/https:\/\/openmethods.wordpress.com\/2018\/12\/05\/response-to-datacolada\/\">htm<\/a>).<\/span><\/p>\n<p><span style=\"color: #0000ff;\">When all is said and done, the original authors have not yet provided benign mechanisms that could have generated the data they reported (neither the last digit pattern, nor the excessive similarity of means).<\/span><\/p>\n<div class=\"jetpack_subscription_widget\"><h2 class=\"widgettitle\">Subscribe to Blog via Email<\/h2>\n\t\t\t<div class=\"wp-block-jetpack-subscriptions__container\">\n\t\t\t<form action=\"#\" method=\"post\" accept-charset=\"utf-8\" id=\"subscribe-blog-1\"\n\t\t\t\tdata-blog=\"58049591\"\n\t\t\t\tdata-post_access_level=\"everybody\" >\n\t\t\t\t\t\t\t\t\t<div id=\"subscribe-text\"><p>Enter your email address to subscribe to this blog and receive notifications of new posts by email.<\/p>\n<\/div>\n\t\t\t\t\t\t\t\t\t\t<p id=\"subscribe-email\">\n\t\t\t\t\t\t<label id=\"jetpack-subscribe-label\"\n\t\t\t\t\t\t\tclass=\"screen-reader-text\"\n\t\t\t\t\t\t\tfor=\"subscribe-field-1\">\n\t\t\t\t\t\t\tEmail Address\t\t\t\t\t\t<\/label>\n\t\t\t\t\t\t<input type=\"email\" name=\"email\" autocomplete=\"email\" required=\"required\"\n\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t\tvalue=\"\"\n\t\t\t\t\t\t\tid=\"subscribe-field-1\"\n\t\t\t\t\t\t\tplaceholder=\"Email Address\"\n\t\t\t\t\t\t\/>\n\t\t\t\t\t<\/p>\n\n\t\t\t\t\t<p id=\"subscribe-submit\"\n\t\t\t\t\t\t\t\t\t\t\t>\n\t\t\t\t\t\t<input type=\"hidden\" name=\"action\" value=\"subscribe\"\/>\n\t\t\t\t\t\t<input type=\"hidden\" name=\"source\" value=\"https:\/\/datacolada.org\/wp-json\/wp\/v2\/posts\/2860\"\/>\n\t\t\t\t\t\t<input type=\"hidden\" name=\"sub-type\" value=\"widget\"\/>\n\t\t\t\t\t\t<input type=\"hidden\" name=\"redirect_fragment\" value=\"subscribe-blog-1\"\/>\n\t\t\t\t\t\t<input type=\"hidden\" id=\"_wpnonce\" name=\"_wpnonce\" value=\"b4cdbc0b54\" \/><input type=\"hidden\" name=\"_wp_http_referer\" value=\"\/wp-json\/wp\/v2\/posts\/2860\" \/>\t\t\t\t\t\t<button type=\"submit\"\n\t\t\t\t\t\t\t\t\t\t\t\t\t\t\tclass=\"wp-block-button__link\"\n\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t\tstyle=\"margin: 0; margin-left: 0px;\"\n\t\t\t\t\t\t\t\t\t\t\t\t\t\tname=\"jetpack_subscriptions_widget\"\n\t\t\t\t\t\t>\n\t\t\t\t\t\t\tSubscribe\t\t\t\t\t\t<\/button>\n\t\t\t\t\t<\/p>\n\t\t\t\t\t\t\t<\/form>\n\t\t\t\t\t\t<\/div>\n\t\t\t\n<\/div>\n<strong>Footnotes.<\/strong><\/p>\n<ol class=\"footnotes\">\n<li id=\"footnote_0_2860\" class=\"footnote\">It is also worth noting that\u00a0 this post is possible because the authors elected to post their data. [<a href=\"#identifier_0_2860\" class=\"footnote-link footnote-back-link\">&#8617;<\/a>]<\/li>\n<li id=\"footnote_1_2860\" class=\"footnote\">Basic background for the intrigued: The original demonstration is Huber, Payne, and Puto (1982 .<a href=\"https:\/\/academic.oup.com\/jcr\/article-abstract\/9\/1\/90\/1839380\">htm<\/a>). Heath &amp; Chaterjee (1995 .<a href=\"https:\/\/academic.oup.com\/jcr\/article-abstract\/22\/3\/268\/1791719\">htm<\/a>) provide a\u00a0good review of several studies [<a href=\"#identifier_1_2860\" class=\"footnote-link footnote-back-link\">&#8617;<\/a>]<\/li>\n<li id=\"footnote_2_2860\" class=\"footnote\">Study 2 used a soaking basin as a decoy instead [<a href=\"#identifier_2_2860\" class=\"footnote-link footnote-back-link\">&#8617;<\/a>]<\/li>\n<li id=\"footnote_3_2860\" class=\"footnote\">Footnote 1 in the correction reads:<br \/>\n<a href=\"https:\/\/web.archive.org\/web\/20181206065120\/https:\/\/datacolada.org\/wp-content\/uploads\/2019\/08\/Footnote-1-in-correction-2018-11-20.png\"><img decoding=\"async\" class=\"alignnone wp-image-2986 size-full\" style=\"border: 1px solid #000000;\" src=\"https:\/\/web.archive.org\/web\/20181206065120\/https:\/\/datacolada.org\/wp-content\/uploads\/2019\/08\/Footnote-1-in-correction-2018-11-20.png\" alt=\"\" width=\"250\" srcset=\"https:\/\/datacolada.org\/wp-content\/uploads\/2019\/08\/Footnote-1-in-correction-2018-11-20.png 727w, https:\/\/datacolada.org\/wp-content\/uploads\/2019\/08\/Footnote-1-in-correction-2018-11-20-300x280.png 300w\" sizes=\"(max-width: 727px) 100vw, 727px\" \/><\/a> [<a href=\"#identifier_3_2860\" class=\"footnote-link footnote-back-link\">&#8617;<\/a>]<\/li>\n<li id=\"footnote_4_2860\" class=\"footnote\">About 80 years ago, Benford (.<a href=\"https:\/\/www.jstor.org\/stable\/984802\">htm<\/a>) noticed that with collections of numbers, the leading digit (the one furthest to the left) had a predictable pattern of occurrence: 1\u2019s were more common than 2\u2019s, which were more common than 3\u2019s etc. A mathematical formula generalizing Benford's law applies to digits further to the right in a different way: as one moves right, to the 2<sup>nd<\/sup>, 3<sup>rd<\/sup>, 4<sup>th<\/sup> digit, etc., those numbers should be distributed closer and closer to uniformly (i.e., 1 is just as common as 2, 3, 4, etc.). Because those predictions are derived mathematically, and observed empirically, violations of Benford's law are a signal that something is wrong. Benford's law has, for first and last digits, been used to detect fraud in accounting, elections, and science. See <a href=\"https:\/\/en.wikipedia.org\/wiki\/Benford%27s_law\">Wikipedia<\/a>. [<a href=\"#identifier_4_2860\" class=\"footnote-link footnote-back-link\">&#8617;<\/a>]<\/li>\n<li id=\"footnote_5_2860\" class=\"footnote\">In this appendix (.<a href=\"https:\/\/datacolada.org\/appendix\/74\/Appendix_colada74.pdf\">pdf<\/a>) we document that the uniform is indeed what you'd expect for these data, even though values on this variable have just 2 digits. [<a href=\"#identifier_5_2860\" class=\"footnote-link footnote-back-link\">&#8617;<\/a>]<\/li>\n<li id=\"footnote_6_2860\" class=\"footnote\">As an extra precaution, we analyzed other datasets with grams as the dependent variable. We found studies on (i) soup consumption, (ii) brood carcass, (iii) American bullfrog size, and (iv) decomposing bags. Last digit was uniform across the board (See details: .<a href=\"https:\/\/datacolada.org\/wp-content\/uploads\/2018\/08\/Appendix-2-2018-08-24.pdf\">pdf<\/a>).  [<a href=\"#identifier_6_2860\" class=\"footnote-link footnote-back-link\">&#8617;<\/a>]<\/li>\n<li id=\"footnote_7_2860\" class=\"footnote\">This is perhaps a good place to tell you of an additional anecdotal problem: when preparing the first draft of this post, back in June, we noticed this odd row in Experiment 2. The 5 gram scale makes a surprising re-appearance on day 4, takes a break on day 9, but returns on day 10<br \/>\n<a href=\"https:\/\/datacolada.org\/wp-content\/uploads\/2018\/08\/funny-row.png\"><img loading=\"lazy\" decoding=\"async\" class=\"alignnone wp-image-2877 size-full\" style=\"border: 1px solid #000000;\" src=\"https:\/\/datacolada.org\/wp-content\/uploads\/2018\/08\/funny-row.png\" alt=\"\" width=\"1621\" height=\"263\" srcset=\"https:\/\/datacolada.org\/wp-content\/uploads\/2018\/08\/funny-row.png 1621w, https:\/\/datacolada.org\/wp-content\/uploads\/2018\/08\/funny-row-300x49.png 300w, https:\/\/datacolada.org\/wp-content\/uploads\/2018\/08\/funny-row-1024x166.png 1024w\" sizes=\"auto, (max-width: 1621px) 100vw, 1621px\" \/><\/a> [<a href=\"#identifier_7_2860\" class=\"footnote-link footnote-back-link\">&#8617;<\/a>]<\/li>\n<\/ol>\n","protected":false},"excerpt":{"rendered":"<p>Today Psychological Science issued a Corrigendum (.htm) and an expression of concern (htm) for a paper originally posted online in May 2018 (.htm). This post will spell out the data irregularities we uncovered that eventually led to the two postings from the journal today. We are not convinced that those postings are sufficient. It is&#8230;<\/p>\n","protected":false},"author":16,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"inline_featured_image":false,"jetpack_post_was_ever_published":false,"_jetpack_newsletter_access":"","_jetpack_dont_email_post_to_subs":false,"_jetpack_newsletter_tier_id":0,"_jetpack_memberships_contains_paywalled_content":false,"_jetpack_memberships_contains_paid_content":false,"footnotes":"","jetpack_publicize_message":"","jetpack_publicize_feature_enabled":true,"jetpack_social_post_already_shared":true,"jetpack_social_options":{"image_generator_settings":{"template":"highway","default_image_id":0,"font":"","enabled":false},"version":2},"_wp_rev_ctl_limit":""},"categories":[4,54],"tags":[],"class_list":["post-2860","post","type-post","status-publish","format-standard","hentry","category-paper","category-fake-data"],"jetpack_publicize_connections":[],"jetpack_featured_media_url":"","jetpack_sharing_enabled":true,"jetpack-related-posts":[],"_links":{"self":[{"href":"https:\/\/datacolada.org\/wp-json\/wp\/v2\/posts\/2860","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/datacolada.org\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/datacolada.org\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/datacolada.org\/wp-json\/wp\/v2\/users\/16"}],"replies":[{"embeddable":true,"href":"https:\/\/datacolada.org\/wp-json\/wp\/v2\/comments?post=2860"}],"version-history":[{"count":6,"href":"https:\/\/datacolada.org\/wp-json\/wp\/v2\/posts\/2860\/revisions"}],"predecessor-version":[{"id":5895,"href":"https:\/\/datacolada.org\/wp-json\/wp\/v2\/posts\/2860\/revisions\/5895"}],"wp:attachment":[{"href":"https:\/\/datacolada.org\/wp-json\/wp\/v2\/media?parent=2860"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/datacolada.org\/wp-json\/wp\/v2\/categories?post=2860"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/datacolada.org\/wp-json\/wp\/v2\/tags?post=2860"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}