A new study made a version of GPT-5 Thinking admit its own misbehavior. But it's not a quick fix for bigger safety issues.
The mathematical reasoning model performed as well as humans at prestigious international mathematics competitions.
DeepSeek unveils V3.2 AI models matching GPT-5 and Google Gemini 3.0 Pro performance at a fraction of the cost, introducing breakthrough sparse attention and reasoning-with-tools capabilities in ...