VAST Data, the AI Operating System company, today announced a new inference architecture that enables the NVIDIA Inference ...
At the GTC 2025 conference, Nvidia introduced Dynamo, a new open-source AI inference server designed to serve the latest generation of large AI models at scale. Dynamo is the successor to Nvidia’s ...
Six new chips, one system. NVIDIA’s Vera Rubin launch extends beyond a single product into a full AI infrastructure platform ...
With the AI infrastructure push reaching staggering proportions, there’s more pressure than ever to squeeze as much inference as possible out of the GPUs they have. And for researchers with expertise ...
MOUNTAIN VIEW, Calif.--(BUSINESS WIRE)--Enfabrica Corporation, an industry leader in high-performance networking silicon for artificial intelligence (AI) and accelerated computing, today announced the ...
Alibaba Cloud unveiled a host of AI-centric offerings at its annual flagship conference, including its latest network architecture for training and inference. Showcased at Apsara 2025, HPN8.0 is ...
Google LLC today announced it’s bringing its custom Ironwood chips online for cloud customers, unleashing tensor processing units that can scale up to 9,216 chips in a single pod to become the company ...
RENO, Nev.--(BUSINESS WIRE)--Positron AI, the premier company for American-made semiconductors and inference hardware, today announced the close of a $51.6 million oversubscribed Series A funding ...
AWS, Cisco, CoreWeave, Nutanix and more make the inference case as hyperscalers, neoclouds, open clouds, and storage go ...
IBM has teamed up with Groq to offer enterprise customers a reliable, cost-effective way to speed AI inferencing applications. Further, IBM and Groq plan to integrate and enhance Red Hat’s open-source ...
Semiconductor startup Tsavorite Scalable Intelligence Inc. is looking to reinvent the system-on-chip computing architecture for artificial intelligence workloads with its new Omni Processing Unit.