But the same qualities that make those graphics processing units, or GPUs, so effective at creating powerful AI systems from scratch make them less efficient at putting AI products to work. That’s ...
Intel Corp. is reportedly in talks to acquire SambaNova Systems Inc., a venture-backed developer of artificial intelligence chips. Bloomberg today cited sources as saying that the discussions are in ...
Companies are not just talking about AI inference processing; they are doing it. Analyst firm Gartner released a new report this week forecasting that global generative AI spending will hit $644 billion ...
AMD is strategically positioned to dominate the rapidly growing AI inference market, which could be 10x larger than training by 2030. The MI300X's memory advantage and ROCm's ecosystem progress make ...
The major cloud builders and their hyperscaler brethren – in many cases, one company acts as both a cloud and a hyperscaler – have made their technology choices when it comes to deploying AI ...
Nvidia is aiming to dramatically accelerate and optimize the deployment of generative AI large language models (LLMs) with a new approach to delivering models for rapid inference. At Nvidia GTC today, ...
AMD’s Q2 2025 data center revenue reached $3.24 billion, up 14% year-over-year, with EPYC CPUs powering 1,200+ cloud instances. The $800 million MI308 impairment weighed on results, but MI355 promises ...
At the GTC 2025 conference, Nvidia introduced Dynamo, a new open-source AI inference server designed to serve the latest generation of large AI models at scale. Dynamo is the successor to Nvidia’s ...
Kubernetes has become the leading platform for deploying cloud-native applications and microservices, backed by an extensive community and comprehensive feature set for managing distributed systems.
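To make that concrete, a minimal sketch of what deploying an inference service on Kubernetes looks like — the image name, port, and resource figures here are illustrative placeholders, not taken from any of the articles above:

```yaml
# Hypothetical Deployment for a containerized inference service.
# The image, labels, and GPU request are assumptions for illustration.
apiVersion: apps/v1
kind: Deployment
metadata:
  name: inference-server
spec:
  replicas: 2                      # scale out stateless inference pods
  selector:
    matchLabels:
      app: inference-server
  template:
    metadata:
      labels:
        app: inference-server
    spec:
      containers:
        - name: server
          image: example.com/inference-server:latest  # placeholder image
          ports:
            - containerPort: 8000   # placeholder serving port
          resources:
            limits:
              nvidia.com/gpu: 1     # request one GPU per pod
---
# Service exposing the pods inside the cluster.
apiVersion: v1
kind: Service
metadata:
  name: inference-server
spec:
  selector:
    app: inference-server
  ports:
    - port: 80
      targetPort: 8000
```

The `nvidia.com/gpu` resource name is the standard extended resource exposed by NVIDIA's Kubernetes device plugin; everything else (names, port, replica count) would be tailored to the actual workload.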
The CNCF is bullish about cloud-native computing working hand in glove with AI. AI inference is the technology that will make hundreds of billions for cloud-native companies. New kinds of AI-first ...