Pull to refresh
Logo
Daily Brief
Following
Why Ranks Sign Up
The AI science rush

The AI science rush

New Capabilities

Large language models earn recognition for frontier research—and expose cracks in scientific publishing

January 22nd, 2026: NeurIPS Papers Contaminated by AI Hallucinations

Overview

Science magazine named large language models doing frontier science a runner-up breakthrough of 2025. Within weeks, the prediction became reality: OpenAI's GPT-5.2 solved previously unsolved Erdős mathematics problems in 15 minutes, achieving 40% accuracy on expert-level mathematics that stumped earlier systems.

DeepMind announced its first automated laboratory in the UK for 2026, pairing Gemini with robotics to synthesize hundreds of materials daily. Google partnered with the U.S. Department of Energy on Genesis, a national AI-for-science platform mobilizing 17 national laboratories.

But the recognition marked when AI overwhelmed academic publishing. At ICLR 2026, researchers discovered that 21% of peer reviews—15,899 submissions—were fully AI-generated, with over half showing AI involvement. A Springer Nature journal retracted 129 papers early 2026 after being inundated with AI-generated manuscripts, and NeurIPS papers contained hallucinated citations; the resulting flood threatens scientific validation.

Key Indicators

40.3%
GPT-5.2 expert math accuracy
OpenAI's GPT-5.2 Thinking solved 40.3% of FrontierMath problems, previously unsolvable by AI; helped solve 11 open Erdős problems
21%
ICLR 2026 reviews fully AI-generated
Analysis found 15,899 of 75,800 peer reviews at major AI conference were entirely written by AI, triggering integrity crisis
129
Papers retracted (single journal, 2025)
Springer Nature's Neurosurgical Review retracted 129 AI-generated papers in 2025 alone, symptomatic of systemic collapse
$3B
Isomorphic Labs partnerships
DeepMind spin-out secured $3 billion from Eli Lilly and Novartis; first clinical trials delayed to late 2026
50%+
Research output increase for LLM users
Scientists flagged as using LLMs posted 33-50% more papers on arXiv, bioRxiv, and SSRN

Voices

Curated perspectives — historical figures and your fellow readers.

Ever wondered what historical figures would say about today's headlines?

Sign up to generate historical perspectives on this story.

Play

Exploring all sides of a story is often best achieved with Play.

Log in to play. Track your picks, climb the leaderboards. Log in Sign Up
Predict 5 ways this could play out. Contrarian picks score more — points lock when the scenario resolves. Log in to play
Timeline Five events from this story — drag them oldest to newest. Log in to play
Connections Sixteen names from the news. Find the four hidden groups of four. Log in to play

People Involved

Organizations Involved

Google DeepMind
Google DeepMind
AI research division
Opening first automated laboratory in UK 2026; partnering with U.S. DOE on Genesis

Alphabet's AI research lab pioneering large language models for scientific discovery across mathematics, biology, and materials science.

Periodic Labs
Periodic Labs
AI Science Startup
Building automated scientific discovery platform

AI startup automating scientific discovery, initially targeting superconductor invention.

Lila Sciences
Lila Sciences
AI Science Startup
Building scientific superintelligence with automated labs

Startup pairing specialized AI models with automated laboratories to accelerate biological research.

Sakana AI
Sakana AI
AI Research Startup
AI Scientist system faces critical evaluation showing quality concerns

Tokyo-based startup that created The AI Scientist system for fully automated research.

Science Magazine
Science Magazine
Scientific Journal
Recognized LLMs as 2025 Breakthrough runner-up

Peer-reviewed academic journal published by the American Association for the Advancement of Science.

Isomorphic Labs
Isomorphic Labs
AI Drug Discovery Company
First clinical trials expected late 2026

DeepMind spin-out using AI for drug discovery, backed by $3 billion in pharma partnerships.

Pangram Labs
Pangram Labs
AI Detection Company
Exposed ICLR 2026 peer review crisis

Developer of AI detection tools for academic integrity and plagiarism detection.

U.S. Department of Energy
U.S. Department of Energy
Federal Agency
Partnering with DeepMind on Genesis AI-for-science platform

U.S. federal agency overseeing national laboratories and energy research infrastructure.

Timeline

November 2020 January 2026

20 events Latest: January 22nd, 2026 · 4 months ago Showing 8 of 20
Tap a bar to jump to that date
  1. NeurIPS Papers Contaminated by AI Hallucinations

    Latest Publication Crisis

    Analysis reveals NeurIPS submissions contain hallucinated citations and sources invented by generative AI models as submissions increased 220% since 2020.

  2. Isomorphic Labs Delays Clinical Trial Timeline

    Business Development

    CEO Demis Hassabis announces first AI-designed drug trials expected by end of 2026, delayed from 2025 target despite $3B pharma partnerships.

  3. GPT-5.2 Solves Erdős Mathematics Problems

    Research Breakthrough

    OpenAI's GPT-5.2 achieves 40.3% on expert-level FrontierMath; helps solve 11 previously open Erdős problems in 15 minutes of processing.

  4. OpenAI Releases GPT-5.2 for Science and Math

    Technology Release

    GPT-5.2 Pro and Thinking achieve 93.2% on graduate-level GPQA Diamond benchmark; set new state-of-the-art for expert mathematics at 29.2% on FrontierMath Tier 4.

  5. Study Shows 50% Output Boost, Quality Concerns

    Research Publication

    Cornell study finds LLM users publish 33-50% more papers, but AI-polished work less likely to be accepted.

  6. Science Names LLMs Breakthrough Runner-Up

    Recognition

    Science magazine recognizes large language models doing frontier science as 2025 Breakthrough of the Year runner-up.

  7. GPT-5 Achieves 79x Lab Efficiency Gain

    Research Breakthrough

    OpenAI announces GPT-5 optimized gene-editing protocol in real wet lab, introducing novel enzyme mechanism.

  8. DeepMind Announces UK Automated Laboratory

    Infrastructure

    Google DeepMind partners with UK government to open first automated materials science lab in 2026, using Gemini and robotics to synthesize hundreds of materials daily.

  9. ICLR 2026 AI-Generated Review Scandal

    Publication Crisis

    Pangram Labs analysis finds 21% of 75,800 peer reviews for major AI conference were fully AI-generated; over 50% showed AI involvement.

  10. Research Integrity Conference Hit by AI Abstracts

    Publication Crisis

    2026 World Conference on Research Integrity discovers substantial submitted abstracts showing generative AI use, detected by Copyleaks plagiarism software.

  11. Lila Sciences Hits $1.3B Valuation

    Funding

    AI lab raises $115M extension from Nvidia, General Catalyst, bringing total raised to $550M for automated laboratories.

  12. Periodic Labs Raises $300M Seed

    Funding

    Former OpenAI and DeepMind researchers raise one of largest seed rounds ever from a16z, Nvidia, Bezos.

  13. AI Co-Scientist Validates Drug Candidates

    Research Validation

    Stanford publishes findings: Google's AI co-scientist identified liver fibrosis drugs in days versus years of human research.

  14. Gemini Wins Gold at Math Olympiad

    Competition Victory

    DeepMind's Gemini Deep Think scores 35/42 points, achieving gold medal standard at International Mathematical Olympiad.

  15. First AI-Generated Peer-Reviewed Paper

    Publication

    Sakana's AI Scientist v2 produces first entirely AI-generated paper accepted at ICLR workshop after peer review.

  16. Springer Journal Retracts 129 AI Papers

    Publication Crisis

    Neurosurgical Review retracts 129 papers after investigation finds articles with strong LLM-generation indicators submitted without proper disclosure.

  17. First AI-Enabled Nobel Prize

    Recognition

    Hassabis and Jumper win Nobel Prize in Chemistry for AlphaFold—first Nobel recognizing AI-enabled scientific discovery.

  18. Sakana Releases The AI Scientist

    Technology Release

    Sakana AI unveils open-source system automating entire research lifecycle from hypothesis to paper writing.

  19. AlphaFold Database Launched

    Platform Release

    DeepMind releases open-source AlphaFold and database with 360,000 protein structures; now exceeds 200 million predictions.

  20. AlphaFold2 Solves Protein Folding

    Scientific Breakthrough

    DeepMind's AlphaFold2 predicts protein structures with unprecedented accuracy, solving a 50-year grand challenge in biology.

Historical Context

3 moments from history that rhyme with this story — and how they unfolded.

2020-2024

AlphaFold and the 2024 Nobel Prize

DeepMind's AlphaFold solved the 50-year protein folding problem in 2020, predicting structures for 200 million proteins. The breakthrough earned Demis Hassabis and John Jumper the 2024 Nobel Prize in Chemistry—the first Nobel recognizing AI-enabled scientific discovery. Over 500,000 researchers adopted the open-source tool, generating thousands of papers on antibiotic resistance, drug discovery, and crop resilience.

Then

Proved AI could solve grand scientific challenges previously requiring decades of human effort; validated massive investment in AI for science.

Now

Established template for LLMs tackling fundamental research problems; demonstrated that open-sourcing breakthrough AI accelerates global science rather than limiting it.

Why this matters now

AlphaFold's Nobel Prize validated AI's scientific capability one year before Science magazine recognized LLMs as 2025 breakthrough runners-up, creating momentum for the current AI science investment rush.

2011-2019

IBM Watson's Failed Drug Discovery Bet

After Watson's 2011 Jeopardy triumph, IBM launched Watson for Drug Discovery, promising to revolutionize medicine by analyzing massive biomedical datasets. Major partnerships with Pfizer (2016) and MD Anderson followed. By 2019, IBM quietly discontinued the product after high-profile failures, including MD Anderson scrapping their installation. Watson made only small advances in drug discovery despite years of investment and publicity.

Then

Multi-million dollar partnerships dissolved; IBM shifted focus away from drug discovery toward clinical applications with more modest claims.

Now

Created investor skepticism about AI drug discovery claims; established cautionary tale about over-promising AI capabilities before validation.

Why this matters now

Watson's trajectory warns that today's $1.6 billion in AI science startup funding could face similar reality checks when computational predictions must translate to real-world lab results.

2020-2025

The Paper Mill Crisis and Peer Review Collapse

Paper retractions passed 10,000 in 2023, quadrupling over 20 years with the majority due to misconduct. In 2023, Hindawi retracted over 8,000 papers from organized paper mills selling fraudulent authorship. By 2025, AI tools like ChatGPT enabled industrial-scale generation of plagiarism-resistant junk science. Experts estimate one in 50 papers now show paper mill patterns, overwhelming traditional peer review gatekeeping.

Then

Major publishers implemented AI detection tools; peer review timelines lengthened; journal credibility declined in affected fields.

Now

Forced fundamental questioning of peer review's viability; created opening for AI-assisted review systems despite concerns about AI reviewing AI-generated content.

Why this matters now

The existing publishing crisis means LLMs' 50% productivity boost arrives precisely when the system can least handle increased volume, potentially triggering complete restructuring of scientific validation.

Sources

(27)