Artificial intelligence
fromFuturism
8 hours agoOpenAI's Latest Thing It's Bragging About Is Actually Kind of Sad
The AI industry faces significant delays and cancellations in data center projects, impacting ambitious computing capacity goals.
gamers are probably going to feel left out since Nvidia seems to have decided renting cloud rigs to them is better than selling consumer hardware, small companies looking for AI chip compromises will be excited, and agentic AI is gonna be so hot that our Mann on the ground this week in San Jose isn't gonna need a jacket.
Intel is making a new push into GPUs, this time with a focus on data center workloads, as the chipmaker looks to reestablish itself in a market increasingly shaped by AI-driven demand and dominated by Nvidia. CEO Lip-Bu Tan said that after hiring a senior GPU architect, the company is working directly with customers to define requirements, signaling a more demand-driven approach as enterprises and cloud providers weigh their options for accelerated computing, according to a Reuters report.
The new capabilities center on two integrated components: the Dynamo Planner Profiler and the SLO-based Dynamo Planner. These tools work together to solve the "rate matching" challenge in disaggregated serving. The teams use this term when they split inference workloads. They separate prefill operations, which process the input context, from decode operations that generate output tokens. These tasks run on different GPU pools. Without the right tools, teams spend a lot of time determining the optimal GPU allocation for these phases.
The company, which is based in San Francisco and has an office in Pune, India, is targeting up to $35 million this year as it builds a royalty-driven on-device AI business. That growth has buoyed the company, which now has post-money valuation of between $270 million and $300 million, up from around $100 million in its 2022 Series B, Kheterpal said.