A Google study finds that the standard three to five human raters per test example often aren't enough for reliable AI ...
Alibaba's Qwen team has developed a new training algorithm for reasoning models that assigns different weights to individual tokens based on how much each step influences the subsequent chain of ...
The New York Times cut ties with freelance writer Alex Preston after it turned out an AI tool he'd used had copied from an existing book review. Preston was writing a review of Jean-Baptiste Andrea's ...
AI safety research firm Lyptus Research has published a new study on the offensive cybersecurity capabilities of AI models. The study is based on the METR time-horizon method and involved testing with ...
Several leadership changes are underway at OpenAI. Fidji Simo, CEO of the newly created "AGI Deployment" division, is taking sick leave for several weeks to deal with an autoimmune disease affecting ...
Anthropic has looked into complaints from users who were hitting their Claude Code usage limits much faster than expected. According to Anthropic's Lydia Hallie, tighter limits during peak hours and ...
Microsoft has introduced MAI-Transcribe-1, a speech-to-text model supporting 25 languages that achieves the lowest word error rate of any model tested on the FLEURS ...
The leaked blog posts have allegedly surfaced online, and the information matches what Fortune shared in a follow-up article. There are two versions of the same blog post that differ only in the model's ...
Anthropic and OpenAI are both growing fast, but they report revenue very differently, The Information reports. OpenAI's annualized revenue is around $25 billion; Anthropic's is $19 billion. Both ...
Mistral AI has released Mistral Small 4, combining fast text responses, logical reasoning, and image processing in one model. It has 119 billion parameters, but only 6 billion are active per query - ...
Nvidia fleshed out the Vera Rubin platform at GTC 2026: The POD comprises 40 racks with 1,152 Rubin GPUs and 60 exaflops of compute. The central NVL72 rack is expected to deliver 4x training ...