OpenAI AI Industry Benchmarks

News

3hon MSN

OpenAI’s o3 AI model scores lower on a benchmark than the company initially implied

A discrepancy between first- and third-party benchmark results for OpenAI's o3 AI model is raising questions about the ...

Cryptopolitan on MSN30m

OpenAI’s o3 model falls short of its own benchmark claims

OpenAI’s newest LLM, o3, is facing scrutiny after independent tests found it solved a far fewer number of tough math problems ...

10d

OpenAI is pushing for industry-specific AI benchmarks - why that matters

Benchmark performance results typically accompany the launch of every new AI model to showcase how well the models can ...

2don MSN

Figuring out which AI model is right for you is harder than you think

AI models are numerous and confusing to navigate, but the benchmarks used to measure their performance are also challenging.

OpenAI and start-ups race to generate code and transform software industry

Artificial intelligence is poised to outperform humans in writing code as leading groups, including OpenAI, Anthropic and ...

11don MSN

OpenAI launches program to design new ‘domain-specific’ AI benchmarks

Through the Pioneers Program, OpenAI hopes to create benchmarks for specific domains like legal, finance, insurance, healthcare, and accounting. The lab says that, in the coming months, it’ll work ...

eWeek4d

OpenAI’s New GPT-4.1: Do the Pros Outnumber the Cons?

OpenAI launches GPT-4.1 with improved coding, long-context support, and updated data. Available via API only, it outperforms ...

10don MSN

OpenAI Wants to Partner With Startups on AI—Here’s How to Apply

OpenAI has announced the OpenAI Pioneers Program, a new initiative that will have the company working with startups to devise ...

OpenAI launches o3 and o4-mini, AI models that ‘think with images’ and use tools autonomously

OpenAI launches groundbreaking o3 and o4-mini AI models that can manipulate and reason with images, representing a major ...

1don MSN

OpenAI's o3 and o4-mini hallucinate way higher than previous models

By OpenAI 's own testing, its newest reasoning models, o3 and o4 -mini, hallucinate significantly higher than o1.

OpenAI slashes prices for GPT-4.1, igniting AI price war among tech giants

OpenAI slashes GPT-4.1 API prices by up to 75% while offering superior coding performance and million-token context windows, ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results