What does "PhD-level" AI mean? OpenAI's rumored $20,000 agent plan explained.

from Ars Technica 4 months ago

The latest Frontier Math benchmark reveals that EpochAI's o3 model solved 25.2% of problems, far surpassing other models. Despite impressive performance, concerns about confabulationsâproducing plausible but incorrect informationâpersist, particularly for critical applications. High costs, influenced by OpenAI's financial pressures and significant investments from backers like SoftBank, indicate that the market may expect substantial value from these advanced AI systems. However, the justification for such premium pricing is questioned, especially when compared to existing, more affordable AI options.

Despite their benchmark performances, these simulated reasoning models still struggle with confabulationsâinstances where they generate plausible-sounding but factually incorrect information. This remains a critical concern for research applications where accuracy and reliability are paramount.

Ideally, potential applications for a true PhD-level AI model would include analyzing medical research data, supporting climate modeling, and handling routine aspects of research work.

The high price points reported by The Information suggest that OpenAI believes these systems could provide substantial value to businesses, particularly with substantial investments from major backers like SoftBank.

Whether the performance difference between these tiers will match their thousandfold price difference is an open question, raising doubts about the overall value proposition for enterprises.

Read at Ars Technica

#ai-development #openai #benchmarks #pricing-strategy #confabulation-issues

Collection

[

...

]

What does "PhD-level" AI mean? OpenAI's rumored $20,000 agent plan explained.What does "PhD-level" AI mean? OpenAI's rumored $20,000 agent plan explained. Briefly

What does "PhD-level" AI mean? OpenAI's rumored $20,000 agent plan explained.
What does "PhD-level" AI mean? OpenAI's rumored $20,000 agent plan explained.
Briefly