Canalys: Companies limit genAI use due to unclear costs
Briefly

Canalys: Companies limit genAI use due to unclear costs
""Unlike training, which is a one-time investment, inference represents a recurring operational cost, making it a crucial constraint on the path to commercializing AI.""
""As AI moves from research to large-scale deployment, companies are increasingly focusing on cost-effectiveness in inference, comparing models, cloud platforms, and hardware architectures such as GPUs versus custom accelerators.""
A report from Canalys highlights that as companies transition generative AI tools from testing to real-world implementation, they encounter significant challenges in forecasting cloud costs associated with inference. Unlike the one-time costs of training, inference incurs ongoing operational fees that are critical for commercial viability. Canalys experts suggest that companies are increasingly focused on cost-effective solutions, exploring various AI models, cloud services, and hardware options like GPUs and customized accelerators. The unpredictable nature of usage-based pricing models complicates scaling AI services.
Read at Computerworld
Unable to calculate read time
[
|
]