AI coding benchmark MirrorCode published its full results June 26, showing Claude Opus 4.7 autonomously rebuilt a 60,000-line interpreter and scored 56% overall — completing tasks that take human ...
The academy says no national benchmark existed for AI courses until now — 5,000 colleges and 500 EdTech platforms have been ...
LLVM powers the core development tools, operating systems, and most applications at Apple Computer, where it long ago ...
Local AI inference at 32B-parameter quality, no cloud API required: University of Waterloo researchers released PAW on July 2 ...
Chinese artificial intelligence developer Zhipu AI crossed the HK$1 trillion ($127 billion) market valuation mark on Monday, becoming China’s first large language model company ...
Japanese AI startup Sakana has launched Fugu, a new AI model family that the company says outperforms Anthropic's Claude ...
Local AI inference at 32B-parameter quality, no cloud API required: University of Waterloo researchers released PAW on July 2, 2026, a system that compiles any natural-language task spec into a 23MB ...
As AI gets dramatically better at finding software's flaws, Jack Li is working on the harder half of the problem — getting AI ...
Microsoft (MSFT) stock is down 22% in 2026, but Azure's 39% growth and $37B AI revenue run rate have Wall Street predicting ...
A wave of recent product updates suggests the competition among AI coding tools is moving beyond autocomplete and chat toward long-running agents that can understand projects, invoke tools, and carry ...
OpenAI Group PBC today introduced GPT-5.6, a new series of large language models that it says can outperform Claude Mythos 5 ...
Anthropic PBC today debuted Claude Sonnet 5, a midrange large language model that outperforms its predecessor in several ...