#llm-efficiency

[ follow ]
Data science
fromMedium
6 days ago

The Shift to Efficient AI: Why Smarter, Smaller Models Are Winning in Production

Operational constraints like cost, latency, reliability, and infrastructure now determine AI deployment choices, shifting differentiation from model scale to efficiency.
fromYcombinator
1 month ago

Show HN: LLMs consume 5.4x less mobile energy than ad-supported web search | Hacker News

On mobile devices, a standard LLM session uses on average 5.4 times less energy than a classic ad-supported web search session, highlighting the efficiency of LLMs in this context.
fromTechzine Global
4 months ago

DeepSeek breakthrough gives LLMs the highways it has long needed

As LLMs cannot grow infinitely large but do improve with size, researchers must find ways to make the technology effective at smaller scales. One well-known method is Mixture-of-Experts, where an LLM activates only a portion of itself to generate a response (text, photo, video) based on a prompt. This makes a larger model effectively smaller and faster during operation. mHC promises to be even more fundamental. It offers the chance to increase model complexity without the pain points of the past.
Artificial intelligence
Science
fromTechzine Global
6 months ago

Once again, DeepSeek suggests AI can be done much more efficiently

Feeding LLMs images of words (pixels) enables far more efficient processing, reducing model size, data footprint, and compute compared with raw word sequences.
[ Load more ]