#llm-efficiency

[ follow ]
Artificial intelligence
fromTechzine Global
9 hours ago

DeepSeek breakthrough gives LLMs the highways it has long needed

mHC (Manifold-Constrained Hyper-Connections) enables stable, more efficient information flow in LLMs, increasing model complexity and performance without simply scaling up model size.
Science
fromTechzine Global
1 month ago

Once again, DeepSeek suggests AI can be done much more efficiently

Feeding LLMs images of words (pixels) enables far more efficient processing, reducing model size, data footprint, and compute compared with raw word sequences.
[ Load more ]