Editorial · CASRAI · Research integrity and misconduct

Paper mills and tortured phrases: the integrity crisis in 2026

Cabanac’s Problematic Paper Screener, retraction wave, citation cartels, COPE flowcharts, and the United2Act initiative: where the publishing-integrity fight stands.

ByCASRAI Editorial Board

Published 12 Dec 2025· Last updated 10 Jul 2026· 7 minute read

The scholarly-publishing integrity ecosystem ended 2025 with the highest retraction rate ever recorded and the clearest evidence yet that industrial-scale fraud is structurally embedded in the literature. The numbers are sobering: Retraction Watch‘s database crossed 60,000 entries in 2025; Hindawi/Wiley alone retracted over 11,000 papers across 2023-2024 following paper-mill detection; the Problematic Paper Screener now flags new manuscripts at a rate that strains journals’ capacity to investigate. This post maps the current threats, the detection tooling that has matured, and the United2Act coordination work that is beginning to produce a coherent industry response.

Paper mills: the supply side

A paper mill is a commercial operation that fabricates manuscripts and sells authorship slots on them. The mills emerged at significant scale around 2010-2012, driven by promotion-and-tenure incentives in jurisdictions where publication count is a hard quantitative requirement (early-career clinical researchers in some countries face explicit per-promotion-step publication quotas). The mills industrialised what individual fabrication had done for decades.

The 2022-2024 Hindawi crisis (Wiley’s acquired open-access portfolio was infiltrated at scale, leading to 11,000+ retractions and the closure of several journals) made the systemic nature visible. The Hindawi pattern was: mill-generated manuscripts submitted to special-issue calls in low-rigour journals, peer-reviewed by mill-affiliated reviewers in coordinated networks, published, and used for career advancement. The breakdown was multifactorial: high-volume special-issue calls without sufficient editorial oversight; reviewer networks that the journal could not detect were coordinated; a financial incentive structure that rewarded throughput.

The 2024-2025 response was substantial. Wiley shut down the Hindawi brand, retracted at scale, and rebuilt its peer-review controls. Other publishers running similar special-issue programmes audited and tightened. The COPE-led United2Act initiative (United2Act for paper mills, launched 2023) produced industry-wide commitments to detection cooperation, transparent retraction practices, and improved reviewer verification.

Tortured phrases: a detection lever

The tortured phrases concept, coined by Guillaume Cabanac and Cyril Labbé in 2021, was a methodological breakthrough. A tortured phrase is a clumsy paraphrasing of a standard technical term, typically introduced by attempting to evade plagiarism detection by automatic word substitution. “Counterfeit consciousness” for “artificial intelligence,” “haphazard backwoods” for “random forests,” “fake neural organization” for “artificial neural network.” Once recognised, tortured phrases are a reliable signal of mill involvement, because no human author working in their field would write “haphazard backwoods” when they meant random forests.

Cabanac and Labbé’s Problematic Paper Screener (PPS) operationalises tortured-phrase detection at scale. The PPS continuously scans the published literature against a curated dictionary of tortured phrases, flagging papers that contain them. By 2026 the PPS has flagged over 14,000 papers; many have been retracted, more are under investigation, and a substantial subset will likely remain in the literature without action because the journals are unresponsive or defunct.

The PPS is open infrastructure (the dictionary is public, the methodology is published, the flagged papers are listed). It has been criticised for false positives (some flagged papers turn out to have innocent explanations, e.g., automated translation from a non-English original) but the precision is high enough that an editor receiving a PPS flag should treat it as a serious signal warranting investigation.

Image manipulation

The other major detection front is image manipulation, particularly in life-science papers where Western blots, microscopy images, and gel electrophoresis are routinely fabricated by duplication, splicing, or AI generation. Elisabeth Bik’s catalogue of image-duplication cases has been the canonical reference for over a decade. The 2022-2024 development was the deployment of automated image-similarity tools (Imagetwin, Proofig) by major publishers; by 2025 most large publishers run automated image screening on every submission.

The 2025 escalation is AI-generated images. A diffusion-model-generated Western blot is more difficult to detect than a duplicated one because there is no source to find. The detection community has begun work on AI-generated-image detection but the arms race is genuinely real, with no settled tool. The current best practice is to require raw data deposition (the original blot scan, the unprocessed microscopy stack) alongside the published image, with image-manipulation tools running on both. Several Cell Press and EMBO journals now require this for all life-science submissions.

Citation cartels

Citation cartels are coordinated networks of authors who systematically cite each other to inflate their citation counts and journal impact factors. The classic cartel pattern is journal-level: a journal’s editorial board reciprocally cites other journals’ editorial boards, all benefiting from the inflated cross-citation. The author-level pattern is similar: a network of researchers in adjacent specialties cites each other across many papers.

Detection is statistical: cartels show citation patterns that are sharply non-random in the citation graph. The 2023-2024 work by Albers Mohrman and others operationalised the detection at the journal-citation-network level; Clarivate has begun excluding cartel-implicated journals from the JCR. The author-level cartels are harder to act against, but the existence of the signal is becoming part of the institutional-integrity toolkit.

The retraction infrastructure

Retraction has historically been slow, opaque, and inconsistently practiced. The 2022 NISO recommended practice on retraction (NISO RP-45-2022) and the 2024 Crossref retraction-metadata revisions have begun to change this. A retracted paper now carries structured machine-readable metadata about the retraction reason, the implicated parties, and the relationship to other papers; downstream services (PubMed, Google Scholar, Scopus, citation databases) consume the metadata and surface retraction notices alongside the paper.

The remaining gap is the unretracted-but-suspect paper. A paper flagged by the Problematic Paper Screener but never investigated by the journal sits in the literature unmarked. The 2024 COPE-led discussion of expressions of concern as an interim status (the paper is under investigation but not yet retracted) is one direction. A more radical proposal, now being piloted by several preprint servers and one or two journals, is to surface the PPS flag directly on the article landing page even before the journal acts, with a clear distinction between “flagged by automated screener” and “retracted by publisher.”

The United2Act response

United2Act, launched in 2023 with COPE and STM coordinating, brought publishers, researchers, integrity offices, and regulators together to address paper mills. The 2024 United2Act communique committed signatories to: cooperate on detection (sharing reviewer-misconduct signals across publishers); standardise retraction practices; improve reviewer verification; coordinate with institutions on consequences for authors of fabricated papers.

The 2025 work has been operational: the COPE/STM joint paper-mill database (publishers can submit suspect manuscript signatures and the database flags coincidences); reviewer-verification protocols (ORCID iD plus institutional email plus referee history); coordination with national integrity offices in jurisdictions where paper-mill commissioning is concentrated.

The honest assessment is that United2Act has bought the industry better coordination but has not solved the structural incentive problem. As long as researchers face quantitative publication requirements for promotion, the demand for fabricated authorship slots will exist. The longer-term fix is on the responsible-assessment side (see our responsible-assessment domain); the integrity-side work is harm reduction.

COPE flowcharts: the per-case operational layer

The COPE flowcharts, maintained and updated by the Committee on Publication Ethics, are the operational toolkit for editors handling suspected misconduct. The flowcharts cover (among many) plagiarism in a submitted manuscript, plagiarism in a published article, redundant publication, fabricated data, undisclosed conflict of interest, undisclosed AI use, image manipulation, authorship disputes, paper-mill suspicion, and citation manipulation.

An editor confronted with a suspect submission in 2026 should pull the relevant COPE flowchart, follow the documented procedure, and document the decision trail. The flowcharts are not a substitute for editorial judgement, but they are an audit-defensible baseline. The 2024-2025 COPE updates added flowcharts specifically for AI-assisted fabrication, paper-mill suspicion based on tortured-phrase detection, and image-manipulation findings from automated tools.

What to do at the institutional level

For an institutional research-integrity office in 2026, the practical priorities are: (1) monitor your own institution’s authors against the PPS and the Retraction Watch database; (2) integrate retraction-metadata feeds into your CRIS so you can detect when your authors’ papers are retracted elsewhere; (3) participate in United2Act or its national-level analogues; (4) commit publicly to following COPE flowcharts and document decisions; (5) work with your promotion-and-tenure committees to remove the pure-count incentives that fuel the demand side. The research-integrity domain at CASRAI maintains the institutional-integrity playbooks.

References

Cabanac, Labbé, Magazinov, Tortured phrases: A dubious writing style emerging in science (2021 preprint and follow-up papers). Bik et al., The Prevalence of Inappropriate Image Duplication in Biomedical Research Publications (mBio, 2016). Else and Van Noorden, The fight against fake-paper factories that churn out sham science (Nature, 2021). COPE, Paper mills – research, action plans, and resources (2023, updated 2024). United2Act, Joint Communique on Paper Mills (2023).

Related editorial in this domain

More on Research integrity and misconduct

30 Jul 2026

BMJ Study: AI Flags 9.9% of Cancer Papers

A BMJ study used a BERT classifier to screen 2.6 million cancer papers, flagging 9.9% as suspected paper-mill output — a scale manual review cannot match.

30 Jul 2026

Clarivate Puts Surgery Journal Group on Hold

Clarivate paused Web of Science indexing for six IJS Publishing Group journals after mandatory reporting-guideline citations inflated their impact factor.

30 Jul 2026

Why Contribution-Blind Authorship Norms Backfire

A January 2026 Royal Society Open Science paper models why contribution-insensitive authorship norms (alphabetical, senior-last) evolve via a ‘Red King’ dynamic — and shows they measurably discourage collaboration compared to contribution-sensitive norms.

Paper mills and tortured phrases: the integrity crisis in 2026

Paper mills: the supply side

Tortured phrases: a detection lever

Image manipulation

Citation cartels

The retraction infrastructure

The United2Act response

COPE flowcharts: the per-case operational layer

What to do at the institutional level

Related dictionary entries

References

More on Research integrity and misconduct

BMJ Study: AI Flags 9.9% of Cancer Papers

Clarivate Puts Surgery Journal Group on Hold

Why Contribution-Blind Authorship Norms Backfire