United States Federal Agency
Appears in 2 stories
Funder of the SCORE program
For decades, social science findings shaped everything from classroom teaching methods to criminal sentencing guidelines—yet no one had systematically checked whether those findings held up. A seven-year project involving 865 researchers, nearly 3,900 papers, and 62 journals across 11 disciplines found that about 55% of published claims successfully replicate and 54% of studies are precisely computationally reproducible.
Updated May 30
Concluded two-year AI Cyber Challenge; open-sourced finalist systems
For decades, finding security flaws in software has required expensive human experts or pattern-matching tools that miss complex bugs. In five months, all three frontier artificial intelligence labs (OpenAI, Anthropic, and Google) released autonomous agents that read code like a human researcher, discover vulnerabilities traditional scanners miss, and generate patches. On March 6, 2026, OpenAI launched Codex Security in research preview, an agent that scanned 1.2 million code commits in its first month of beta testing and discovered 14 previously unknown vulnerabilities serious enough to receive formal identifiers in OpenSSH, Chromium, and PHP.
No stories match your search
Try a different keyword
How would you like to describe your experience with the app today?