Signals
Back to feed
5/10 Safety & Policy 16 May 2026, 19:01 UTC

ArXiv institutes one-year ban for authors submitting entirely AI-generated scientific papers

As engineers relying on ArXiv for cutting-edge ML research, maintaining a high signal-to-noise ratio is critical. This policy provides a necessary friction point against automated paper-generation pipelines that spam the repository with unverified claims. It forces researchers to maintain human-in-the-loop accountability for their published methodologies.

What Happened

ArXiv, the premier preprint repository for computer science and physics research, has announced a strict new policy regarding the use of Large Language Models (LLMs). Authors caught submitting papers where the text, data, or research is entirely generated by AI without significant human contribution and oversight will face a one-year ban from the platform.

Technical Details

The crackdown specifically targets the "careless use" of generative AI. This is not a ban on using LLMs for copyediting, translation, or code assistance—tools that are rapidly becoming standard in modern research workflows. Instead, it targets autonomous paper-generation pipelines where an LLM is prompted to draft an entire manuscript. These zero-effort submissions often result in hallucinated citations, flawed mathematical proofs, and generic conclusions. ArXiv relies heavily on automated moderation and volunteer moderators; enforcement will likely depend on a mix of heuristic analysis, community reporting, and identifying obvious generation artifacts (e.g., "As an AI language model...") rather than relying solely on notoriously unreliable AI-text classifiers.

Why It Matters

For the engineering and AI research community, ArXiv is the primary distribution mechanism for new breakthroughs. The recent explosion of LLM capabilities has dramatically lowered the barrier to generating plausible-sounding but scientifically vacuous papers. If the repository is flooded with automated junk, the discovery of genuinely impactful research is compromised. This one-year ban introduces a severe professional penalty, shifting the risk-reward calculus for bad actors looking to artificially inflate their publication metrics.

What to Watch Next

Monitor how ArXiv handles the false-positive rate of enforcement. Since reliable AI text detection remains an unsolved technical challenge, enforcement will likely rely on blatant structural errors or community flags. If legitimate non-native English speakers using AI for heavy translation get caught in the crossfire, expect significant pushback. Additionally, watch to see if other major preprint servers like bioRxiv or medRxiv adopt similar punitive measures.

arxiv ai-policy research-integrity llm-generation