{"id":994,"date":"2015-06-29T07:00:13","date_gmt":"2015-06-29T11:00:13","guid":{"rendered":"http:\/\/datacolada.org\/?p=994"},"modified":"2020-02-11T22:49:09","modified_gmt":"2020-02-12T03:49:09","slug":"40-reducing-fraud-in-science","status":"publish","type":"post","link":"https:\/\/datacolada.org\/40","title":{"rendered":"[40] Reducing Fraud in Science"},"content":{"rendered":"<p><span style=\"font-size: 12pt;\">Fraud in science is often attributed to incentives: we reward sexy-results\u2192fraud happens. The solution, the argument goes, is to reward other things. \u00a0In this post I counter-argue, proposing three alternative solutions.<\/span><\/p>\n<p><span style=\"font-size: 12pt;\"><strong>Problems with the <em>Change the Incentives<\/em> solution.<br \/>\n<\/strong>First, even if rewarding sexy-results caused fraud, it does not follow we should stop rewarding sexy-results. We should pit costs vs benefits. Asking questions with the most upside is beneficial.<\/span><\/p>\n<p><span style=\"font-size: 12pt;\">Second, if we started rewarding unsexy stuff, a likely consequence is fabricateurs continuing to fake, now just unsexy stuff. \u00a0Fabricateurs want the lifestyle of successful scientists. [<a href=\"#footnote_0_994\" id=\"identifier_0_994\" class=\"footnote-link footnote-identifier-link\" title=\"I use the word fabricateur to refer to scientists who fabricate data. Fraudster is insufficiently specific (e.g., selling 10 bagels calling them a dozen is fraud too), and fabricator has positive meanings (e.g., people who make things). Fabricateur has a nice ring to it. \">1<\/a>] Changing incentives involves making our lifestyle less appealing. (Finally, a benefit to committee meetings).\u00a0<\/span><\/p>\n<p><span style=\"font-size: 12pt;\">Third, the evidence for \u201cliking sexy\u2192fraud\u201d is just not there. Like real research, most fake research is <em>not<\/em> sexy. Life-long fabricateur Diederik Stapel mostly published dry experiments with \u201cfindings\u201d in line with the rest of the literature. That we attend to and remember the <em>sexy<\/em> fake studies is diagnostic of what we pay attention to, not what causes fraud. \u00a0<\/span><\/p>\n<p><span style=\"font-size: 12pt;\">The evidence that incentives causes fraud comes primarily from self-reports, with fabricateurs saying \"the incentives made me do it\" (see e.g., Tijdink et al\u00a0<a href=\"https:\/\/www.researchgate.net\/profile\/Joeri_Tijdink3\/publication\/273326059_Publication_Pressure_and_Scientific_Misconduct_in_Medical_Scientists\/links\/552061ad0cf29dcabb0b4478.pdf\">.pdf<\/a>; or Stapel interviews).\u00a0\u00a0To me, the guilty saying \u201cit\u2019s not\u00a0<em><u>my<\/u><\/em>\u00a0fault\u201d seems like weak evidence. What else could they say?<\/span><br \/>\n<span style=\"font-size: 12pt;\"> <em>\u201cI realized I was\u00a0not cut-out for this; it was either faking some science or getting a job with less status\u201d<\/em><\/span><br \/>\n<span style=\"font-size: 12pt;\"> \u201c<em>I am kind of a psychopath, I had fun tricking everyone\u201d<br \/>\n\u201cA voice in my head told me to do it\u201d<\/em><\/span><\/p>\n<p><span style=\"font-size: 12pt;\">Similarly weak, to me, is the observation that fraud is more prevalent in top journals; we find fraud where we look for it. Fabricateurs faking articles that don\u2019t get read don\u2019t get caught\u2026.<\/span><\/p>\n<p><span style=\"font-size: 12pt;\">It's good for universities\u00a0to ignore <em>quantity<\/em> of papers when hiring and promoting, good for journals to publish interesting questions with inconclusive answers. But that won't help with fraud.<\/span><\/p>\n<p><span style=\"font-size: 12pt;\"><strong>Solution 1. Retract without asking \u201care the data fake?\u201d<br \/>\n<\/strong>We have a high bar for retracting articles, and a higher bar for accusing people of fraud.\u00a0<\/span><span style=\"font-size: 12pt;\">The latter makes sense. The former does not.<\/span><\/p>\n<p><span style=\"font-size: 12pt;\">Retracting is not such a big deal, it just says \"we no longer have confidence in the evidence.\u201d\u00a0<\/span><\/p>\n<p><span style=\"font-size: 12pt;\">So many things can go wrong when collecting, analyzing and reporting data that this should be a relatively routine occurrence even in the absence of fraud. An accidental killing\u00a0may not land the killer in prison, but the victim goes 6 ft under regardless. I'd propose a \u00a0retraction doctrine like:<\/span><\/p>\n<blockquote><p><span style=\"font-size: 12pt; color: #3366ff;\">If something is discovered that would lead reasonable experts to believe the results did not originate in a study performed as described in a published paper, or to conclude the study was conducted with excessive sloppiness, the journal should retract the paper. \u00a0\u00a0<\/span><\/p><\/blockquote>\n<p style=\"padding-left: 30px;\"><em><span style=\"font-size: 12pt;\"><u>Example 1.<\/u> Analyses indicate published results\u00a0are implausible for a study conducted as described (e.g., excessive linearity, implausibly similar means, or a covariate is impossibly imbalanced across conditions).\u00a0Retract.<\/span><\/em><\/p>\n<p style=\"padding-left: 30px;\"><span style=\"font-size: 12pt;\"><em><u>Example 2.<\/u> Authors of a paper published in a journal that requires data sharing upon request, when asked for it, indicate to have \u201clost the data\u201d. \u00a0Retract. [<a href=\"#footnote_1_994\" id=\"identifier_1_994\" class=\"footnote-link footnote-identifier-link\" title=\"Every author publishing in an American Psychological Association journal agrees to share data upon request\">2<\/a>]<\/em><\/span><\/p>\n<p style=\"padding-left: 30px;\"><span style=\"font-size: 12pt;\"><em><u>Example 3.<\/u> Comparing original materials with posted data reveals important inconsistencies (e.g., scales ranges are 1-11 in the data but 1-7 in the original). Retract.<\/em><\/span><\/p>\n<p><span style=\"font-size: 12pt;\">When journals reject original submissions it is not their job to figure out why the authors run an uninteresting study or executed it poorly. They just reject it.<\/span><\/p>\n<p><span style=\"font-size: 12pt;\">When journals lose confidence in the data behind a published article it is not their job to figure out why the authors published data whose confidence was eventually lost. They should just retract it.<\/span><\/p>\n<p><span style=\"font-size: 12pt;\">Employers, funders, and co-authors can worry about why an author published untrustworthy data.\u00a0<\/span><\/p>\n<p><span style=\"font-size: 12pt;\"><strong>Solution 2. Show receipts<br \/>\n<\/strong><span style=\"line-height: 1.5;\">Penn, my employer, reimburses me for expenses incurred at conferences.<a href=\"https:\/\/datacolada.org\/wp-content\/uploads\/2015\/06\/receipt.jpg\"><img loading=\"lazy\" decoding=\"async\" class=\" wp-image-999 alignright\" src=\"https:\/\/datacolada.org\/wp-content\/uploads\/2015\/06\/receipt-225x300.jpg\" alt=\"receipt\" width=\"125\" height=\"167\" srcset=\"https:\/\/datacolada.org\/wp-content\/uploads\/2015\/06\/receipt-225x300.jpg 225w, https:\/\/datacolada.org\/wp-content\/uploads\/2015\/06\/receipt.jpg 360w\" sizes=\"auto, (max-width: 125px) 100vw, 125px\" \/><\/a><\/span><\/span><\/p>\n<p><span style=\"font-size: 12pt;\">However, I don\u2019t get to just say \u201chey, I bought some tacos in that Kansas City conference, please deposit $6.16 onto my checking account.\u201d I need receipts.\u00a0 They trust me, but there is a paper trail in case of need.<\/span><\/p>\n<p><span style=\"font-size: 12pt;\">When I submit the work I presented in Kansas City to a journal, in contrast, I do just say \u201chey, I collected the data this or that way.\u201d No receipts.<\/span><\/p>\n<p><span style=\"font-size: 12pt;\">The recent <em>Science<\/em> retraction, with\u00a0canvassers &amp; gay marriage, is a great example for the value of receipts. The statistical evidence \u00a0suggested\u00a0something was off, but the receipts-like paper trail helped a lot\u00a0<\/span><\/p>\n<p><span style=\"font-size: 12pt;\"><em>Author<\/em>: \u201cso and so run the survey with such and such company\u201d<br \/>\n<\/span><span style=\"font-size: 12pt;\"><em>Sleuths<\/em>: \"hello such and such company, can we talk with so and so about this survey you guys run?\"<br \/>\n<\/span><span style=\"font-size: 12pt;\"><em>Such and such company: <\/em>\u201cwe don\u2019t know any so and so, and we don\u2019t have the capability to run the survey.\u201d<\/span><\/p>\n<blockquote><p><span style=\"font-size: 12pt; color: #3366ff;\">Authors should provide as much documentation about how they run their science as they do about what they eat at conferences: where exactly was the study run, at what time and day, which research assistant run it (with contact information), how exactly were participants paid, etc.<\/span><\/p><\/blockquote>\n<p><span style=\"font-size: 12pt;\">We will trust everything researchers say. Until the need to verify arises.<\/span><\/p>\n<p><span style=\"font-size: 12pt;\"><strong>Solution 3. Post data, materials and code<br \/>\n<\/strong>Had the raw data not been available, the recent <em>Science<\/em> retraction would probably not have happened. Stapel would probably not have gotten caught. The cases against Sanna and Smeesters would not have moved forward. \u00a0To borrow from a recent paper with Joe and Leif:<\/span><\/p>\n<blockquote><p><span style=\"font-size: 12pt; color: #3366ff;\">Journals that do not increase data and materials posting requirements for publications are causally, if not morally, responsible for the continued contamination of the scientific record with fraud and sloppiness. \u00a0<\/span><\/p><\/blockquote>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter wp-image-376\" src=\"https:\/\/datacolada.org\/wp-content\/uploads\/2014\/02\/Wide-logo-300x145.jpg\" alt=\"Wide logo\" width=\"78\" height=\"38\" srcset=\"https:\/\/datacolada.org\/wp-content\/uploads\/2014\/02\/Wide-logo-300x145.jpg 300w, https:\/\/datacolada.org\/wp-content\/uploads\/2014\/02\/Wide-logo.jpg 320w\" sizes=\"auto, (max-width: 78px) 100vw, 78px\" \/><\/p>\n<hr \/>\n<p><span style=\"color: #993366;\"><strong>Feedback from Ivan Oransky, co-founder of Retraction Watch<br \/>\n<\/strong>Ivan co-wrote an editorial in the <em>New \u00a0York Times<\/em> on changing the incentives to reduce fraud (<a href=\"https:\/\/www.nytimes.com\/2015\/05\/23\/opinion\/whats-behind-big-science-frauds.html\">.html<\/a>). I reached out to him to get feedback. He directed me to some papers on the evidence of incentives and fraud. I was unaware of, but also unpersuaded by, that evidence. This prompted to\u00a0add the last paragraph in the incentives section (where I am skeptical of that evidence). \u00a0<\/span><span style=\"color: #993366;\">Despite our different takes on the role of rewarding sexy-findings on fraud, Ivan\u00a0is on board with the three non-incentive solutions proposed here. \u00a0I thank Ivan for the prompt response and useful feedback. (and for Retraction Watch!)<\/span><\/p>\n<hr \/>\n<div class=\"jetpack_subscription_widget\"><h2 class=\"widgettitle\">Subscribe to Blog via Email<\/h2>\n\t\t\t<div class=\"wp-block-jetpack-subscriptions__container\">\n\t\t\t<form action=\"#\" method=\"post\" accept-charset=\"utf-8\" id=\"subscribe-blog-1\"\n\t\t\t\tdata-blog=\"58049591\"\n\t\t\t\tdata-post_access_level=\"everybody\" >\n\t\t\t\t\t\t\t\t\t<div id=\"subscribe-text\"><p>Enter your email address to subscribe to this blog and receive notifications of new posts by email.<\/p>\n<\/div>\n\t\t\t\t\t\t\t\t\t\t<p id=\"subscribe-email\">\n\t\t\t\t\t\t<label id=\"jetpack-subscribe-label\"\n\t\t\t\t\t\t\tclass=\"screen-reader-text\"\n\t\t\t\t\t\t\tfor=\"subscribe-field-1\">\n\t\t\t\t\t\t\tEmail Address\t\t\t\t\t\t<\/label>\n\t\t\t\t\t\t<input type=\"email\" name=\"email\" autocomplete=\"email\" required=\"required\"\n\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t\tvalue=\"\"\n\t\t\t\t\t\t\tid=\"subscribe-field-1\"\n\t\t\t\t\t\t\tplaceholder=\"Email Address\"\n\t\t\t\t\t\t\/>\n\t\t\t\t\t<\/p>\n\n\t\t\t\t\t<p id=\"subscribe-submit\"\n\t\t\t\t\t\t\t\t\t\t\t>\n\t\t\t\t\t\t<input type=\"hidden\" name=\"action\" value=\"subscribe\"\/>\n\t\t\t\t\t\t<input type=\"hidden\" name=\"source\" value=\"https:\/\/datacolada.org\/wp-json\/wp\/v2\/posts\/994\"\/>\n\t\t\t\t\t\t<input type=\"hidden\" name=\"sub-type\" value=\"widget\"\/>\n\t\t\t\t\t\t<input type=\"hidden\" name=\"redirect_fragment\" value=\"subscribe-blog-1\"\/>\n\t\t\t\t\t\t<input type=\"hidden\" id=\"_wpnonce\" name=\"_wpnonce\" value=\"4e6205ebb5\" \/><input type=\"hidden\" name=\"_wp_http_referer\" value=\"\/wp-json\/wp\/v2\/posts\/994\" \/>\t\t\t\t\t\t<button type=\"submit\"\n\t\t\t\t\t\t\t\t\t\t\t\t\t\t\tclass=\"wp-block-button__link\"\n\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t\tstyle=\"margin: 0; margin-left: 0px;\"\n\t\t\t\t\t\t\t\t\t\t\t\t\t\tname=\"jetpack_subscriptions_widget\"\n\t\t\t\t\t\t>\n\t\t\t\t\t\t\tSubscribe\t\t\t\t\t\t<\/button>\n\t\t\t\t\t<\/p>\n\t\t\t\t\t\t\t<\/form>\n\t\t\t\t\t\t<\/div>\n\t\t\t\n<\/div>\n<p><span style=\"font-size: 10pt;\"><strong>Footnotes.<\/strong><\/span><\/p>\n<ol class=\"footnotes\">\n<li id=\"footnote_0_994\" class=\"footnote\">I use the word fabricateur to refer to scientists who fabricate data. Fraudster is insufficiently specific (e.g., selling 10 bagels calling them a dozen is fraud too), and fabricator has positive meanings (e.g., people who make things). Fabricateur has a nice ring to it.  [<a href=\"#identifier_0_994\" class=\"footnote-link footnote-back-link\">&#8617;<\/a>]<\/li>\n<li id=\"footnote_1_994\" class=\"footnote\">Every author publishing in an American Psychological Association journal agrees to share data upon request [<a href=\"#identifier_1_994\" class=\"footnote-link footnote-back-link\">&#8617;<\/a>]<\/li>\n<\/ol>\n","protected":false},"excerpt":{"rendered":"<p>Fraud in science is often attributed to incentives: we reward sexy-results\u2192fraud happens. The solution, the argument goes, is to reward other things. \u00a0In this post I counter-argue, proposing three alternative solutions. Problems with the Change the Incentives solution. First, even if rewarding sexy-results caused fraud, it does not follow we should stop rewarding sexy-results. We&#8230;<\/p>\n","protected":false},"author":4,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"inline_featured_image":false,"jetpack_post_was_ever_published":false,"_jetpack_newsletter_access":"","_jetpack_dont_email_post_to_subs":false,"_jetpack_newsletter_tier_id":0,"_jetpack_memberships_contains_paywalled_content":false,"_jetpack_memberships_contains_paid_content":false,"footnotes":"","jetpack_publicize_message":"","jetpack_publicize_feature_enabled":true,"jetpack_social_post_already_shared":true,"jetpack_social_options":{"image_generator_settings":{"template":"highway","default_image_id":0,"font":"","enabled":false},"version":2},"_wp_rev_ctl_limit":""},"categories":[69,54],"tags":[],"class_list":["post-994","post","type-post","status-publish","format-standard","hentry","category-tips","category-fake-data"],"jetpack_publicize_connections":[],"jetpack_featured_media_url":"","jetpack_sharing_enabled":true,"jetpack-related-posts":[],"_links":{"self":[{"href":"https:\/\/datacolada.org\/wp-json\/wp\/v2\/posts\/994","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/datacolada.org\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/datacolada.org\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/datacolada.org\/wp-json\/wp\/v2\/users\/4"}],"replies":[{"embeddable":true,"href":"https:\/\/datacolada.org\/wp-json\/wp\/v2\/comments?post=994"}],"version-history":[{"count":3,"href":"https:\/\/datacolada.org\/wp-json\/wp\/v2\/posts\/994\/revisions"}],"predecessor-version":[{"id":4790,"href":"https:\/\/datacolada.org\/wp-json\/wp\/v2\/posts\/994\/revisions\/4790"}],"wp:attachment":[{"href":"https:\/\/datacolada.org\/wp-json\/wp\/v2\/media?parent=994"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/datacolada.org\/wp-json\/wp\/v2\/categories?post=994"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/datacolada.org\/wp-json\/wp\/v2\/tags?post=994"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}