Ai Inference Database Server Architecture

VAST Data Redesigns Inference Architecture for Agentic AI with NVIDIA

VAST Data, the AI Operating System company, today announced a new inference architecture that enables the NVIDIA Inference ...

Forbes

Nvidia Dynamo — Next-Gen AI Inference Server For Enterprises

At the GTC 2025 conference, Nvidia introduced Dynamo, a new open-source AI inference server designed to serve the latest generation of large AI models at scale. Dynamo is the successor to Nvidia’s ...

Data Center Frontier

NVIDIA’s Rubin Redefines the AI Factory

Six new chips, one system. NVIDIA’s Vera Rubin launch extends beyond a single product into a full AI infrastructure platform ...

TechCrunch

Tensormesh raises $4.5M to squeeze more inference out of AI server loads

With the AI infrastructure push reaching staggering proportions, there’s more pressure than ever to squeeze as much inference as possible out of the GPUs they have. And for researchers with expertise ...

Business Wire

Enfabrica Unveils Industry’s First Ethernet-Based AI Memory Fabric System for Efficient Superscaling of LLM Inference

MOUNTAIN VIEW, Calif.--(BUSINESS WIRE)--Enfabrica Corporation, an industry leader in high-performance networking silicon for artificial intelligence (AI) and accelerated computing, today announced the ...

SDxCentral

Alibaba Cloud unveils 800G AI-centric network architecture

Alibaba Cloud unveiled a host of AI-centric offerings at its annual flagship conference, including its latest network architecture for training and inference. Showcased at Apsara 2025, HPN8.0 is ...

SiliconANGLE

Google unleashes Ironwood TPUs, new Axion instances as AI inference demand surges

Google LLC today announced it’s bringing its custom Ironwood chips online for cloud customers, unleashing tensor processing units that can scale up to 9,216 chips in a single pod to become the company ...

Business Wire

Positron AI Secures $51.6 Million in Oversubscribed Series A to Accelerate Inference-Optimized Hardware

RENO, Nev.--(BUSINESS WIRE)--Positron AI, the premier company for American-made semiconductors and inference hardware, today announced the close of a $51.6 million oversubscribed Series A funding ...

SDxCentral

AI inferencing will define 2026, and the market's wide open

AWS, Cisco, CoreWeave, Nutanix and more make the inference case as hyperscalers, neoclouds, open clouds, and storage go ...

Network World

IBM signs up Groq for speedy AI inferencing option

IBM has teamed up with Groq to offer enterprise customers a reliable, cost-effective way to speed AI inferencing applications. Further, IBM and Groq plan to integrate and enhance Red Hat’s open-source ...

SiliconANGLE

Tsavorite takes on Nvidia with composable AI chiplets based on Arm’s Neoverse architecture

Semiconductor startup Tsavorite Scalable Intelligence Inc. is looking to reinvent the system-on-chip computing architecture for artificial intelligence workloads with its new Omni Processing Unit.

Some results have been hidden because they may be inaccessible to you

Show inaccessible results