#model-scaling

[ follow ]
fromBusiness Insider
1 week ago

Databricks CEO says AGI is already here - and Silicon Valley just keeps moving the goalposts

Everybody would say yes, but we kept moving the goalposts," Ghodsi said in the discussion, which was published Tuesday.
Artificial intelligence
Artificial intelligence
fromWIRED
1 month ago

The AI Industry's Scaling Obsession Is Headed for a Cliff

Very large, compute-heavy AI models will likely yield diminishing performance returns over the next decade, while efficiency improvements will make smaller models increasingly capable.
fromArs Technica
1 month ago

Anthropic says its new AI model "maintained focus" for 30 hours on multistep tasks

On Monday, Anthropic released Claude Sonnet 4.5, a new AI language model the company calls its "most capable model to date," with improved coding and computer use capabilities. The company also revealed Claude Code 2.0, a command-line AI agent for developers, and the Claude Agent SDK, which is a tool developers can use to build their own AI coding agents.
Artificial intelligence
Artificial intelligence
fromHackernoon
55 years ago

Multi-Token Prediction: Architecture for Memory-Efficient LLM Training | HackerNoon

Multi-token prediction enhances language modeling efficacy by allowing simultaneous forecasting of multiple tokens.
Improved model performance scales with increased size.
[ Load more ]