Skip to content

Why Most Published Research Findings Are False

Why this mattered

Ioannidis’s essay mattered because it changed the default interpretation of a “statistically significant” published result. The paper did not argue merely that some studies were biased or underpowered; it gave a simple probabilistic framework in which the truth of a claim depends on prior plausibility, statistical power, bias, and the number of relationships being tested. In that framing, a published positive result could be less a discovery than the expected output of a research system with small studies, flexible analyses, selective reporting, and strong incentives to find significance. The paradigm shift was to treat unreliability as a predictable property of whole research ecosystems, not just as misconduct, incompetence, or isolated bad luck. See the original PLOS Medicine essay.

After this paper, it became newly possible to ask quantitative, field-level questions about credibility: how many tested hypotheses were likely true before testing, how much power studies had, how much selective reporting would be needed to explain the literature, and whether replication failure should be surprising. That helped make “meta-research” a central empirical program rather than a set of methodological cautions. The essay also gave later reform movements a compact causal model: preregistration, trial registries, larger samples, replication, data sharing, registered reports, and correction for publication bias were not just ideals of transparency, but interventions aimed at specific failure modes in the publication process.

Its influence is visible in the replication-crisis era that followed. Large coordinated projects, such as the 2015 Open Science Collaboration study in Science, turned Ioannidis’s theoretical warning into field-scale measurement by testing whether prominent findings could be reproduced under planned protocols. The paper did not prove that every field is mostly false, and its exact conclusions depend on assumptions about priors, power, and bias. Its lasting importance is that it made those assumptions explicit, forcing scientific communities to evaluate published literatures as noisy, incentive-shaped evidence systems rather than as simple accumulations of significant results.

Abstract

There is increasing concern that most current published research findings are false. The probability that a research claim is true may depend on study power and bias, the number of other studies on the same question, and, importantly, the ratio of true to no relationships among the relationships probed in each scientific field. In this framework, a research finding is less likely to be true when the studies conducted in a field are smaller; when effect sizes are smaller; when there is a greater number and lesser preselection of tested relationships; where there is greater flexibility in designs, definitions, outcomes, and analytical modes; when there is greater financial and other interest and prejudice; and when more teams are involved in a scientific field in chase of statistical significance. Simulations show that for most study designs and settings, it is more likely for a research claim to be false than true. Moreover, for many current scientific fields, claimed research findings may often be simply accurate measures of the prevailing bias. In this essay, I discuss the implications of these problems for the conduct and interpretation of research.

Sources