Blog

The Quiet Corruption: How AI Strengthens Your Claims Beyond What Your Data Supports

November 26th, 2025, by Guy Harris

Of all the problems AI introduces into scientific writing, the most dangerous may be one researchers are little likely to notice.

A study published in Royal Society Open Science compared AI-generated summaries of research papers against human-written summaries of the same papers. The finding: AI-generated summaries were 4.85 times more likely to overgeneralize the conclusions of the original research.

Nearly five times more likely. This is not a grammar problem. It is not a formatting problem. It is a scientific integrity problem, and it operates below the threshold of casual detection.

Here is how it works. A researcher writes in their results: "Treatment A was associated with a reduction in symptom severity in the study population (p = 0.03)." This is a careful, appropriately hedged statement. It says what the data supports and implies nothing more.

The researcher then asks ChatGPT to "improve" the Discussion section. The AI produces: "Treatment A significantly reduces symptom severity, demonstrating its clinical efficacy."

The sentence reads well. It sounds authoritative. It is also a materially different claim. "Was associated with" has become "reduces." "In the study population" has disappeared, implying generalizability. "Demonstrating its clinical efficacy" is a conclusion the study design probably cannot support.

This is what the Royal Society study calls overgeneralization. Tt happens because large language models are trained to be helpful and definitive. Hedging, qualification, and careful limitation of claims, the hallmarks of rigorous scientific writing, are systematically stripped away in favor of confident, assertive prose.

For non-native English speakers, the trap is particularly insidious. The AI-polished version genuinely sounds more professional than the author's original, and something a native English speaker would write. The author, whose instinct may correctly tell them that "associated with" is the right phrase, defers to the AI's apparent linguistic authority.

What happens next is predictable. The reviewer reads the Discussion and responds: "The authors' claims exceed what the data can support." Or: "The interpretation is overly broad given the study design." Or simply: "Major revision required."

The author is confused. They thought they were improving their English. They were actually corrupting their science.

The solution is not to stop using AI, but to understand what AI does to your claims and to review the output accordingly. Every time an AI rephrases a scientific statement, check three things:

  1. Has the direction of causality changed? "Associated with" is not the same as "causes." "Correlated" is not the same as "predicts."

  2. Has the scope of the claim expanded? A finding in 47 patients at one hospital is not a finding about "patients" in general.

  3. Has hedging language been removed? "May," "might," "appears to," "suggests" — these words exist for a reason. If they have disappeared, put them back.

AI is a powerful tool for improving the readability of scientific prose. It is also, unless carefully supervised, a quiet engine of scientific overclaiming. The difference between a paper that gets published and a paper that gets desk-rejected for overstatement can be a single missing qualifier.