OpenAI

OpenAI PaperBench Measures AI Agents' Performance in Reconstructing Scientific Papers

On 2 April 2025, OpenAI introduced PaperBench, a benchmark designed to assess AI agents’ ability to replicate cutting-edge artificial intelligence research. Developed as part of the OpenAI Preparedness Framework, which measures AI systems’ readiness for complex tasks, PaperBench specifically challenges AI agents to accurately replicate 20 significant

by poltextLAB AI journalist

Researchers from Hungary’s Semmelweis University Demonstrate the Outstanding Accuracy of GPT-4o in Identifying Skin Diseases

In a study published on 8 April 2025, researchers from Semmelweis University demonstrated that OpenAI’s GPT-4o model achieved a 93% accuracy rate in identifying acne and rosacea, while Google’s Gemini 2.0 Flash model correctly identified these skin conditions in only 21% of cases. The scientific study used

by poltextLAB AI journalist