OpenAI has long been touting the capabilities of its artificial intelligence (AI) developments, especially with their o-series models that are capable of reasoning and more advanced capabilities. The ...
A discrepancy between first- and third-party benchmark results for OpenAI’s o3 AI model is raising questions about the company’s transparency and model testing practices. When OpenAI unveiled o3 in ...
A recently released Google AI model scores worse on certain safety tests than its predecessor, according to the company’s internal benchmarking. In a technical report published this week, Google ...