by Karl Freund | Sep 9, 2025 | In the News
In an industry-first, Nvidia has announced a new GPU, the Rubin CPX, to offload the compute-intensive “context processing” off another GPU. Yep, now, for some AI, you will need two GPUs to achieve maximize performance and profit. I would be surprised if the...
by Karl Freund | Sep 9, 2025 | In the News
The Nvidia juggernaught faces increased competition, but keeps innovating in silicon, software and systems design to keep its No. 1 position in the AI market. The industry standard MLCommons organization has just released the latest benchmarks for inference...
by Karl Freund | Jul 11, 2025 | In the News
While GPU performance has been the focus in data centers over the last few years, the performance of fabrics has become a key enabler or bottleneck in achieving the throughput and latency required to create and deliver artificial intelligence at scale. Nvidia’s...
by Karl Freund | Jun 12, 2025 | In the News
AMD held their now-annual Advancing AI event today in Silicon Valley, with new GPUs, new networking, new software, and even a rack-scale architecture for 2026/27 to better compete with the Nvidia NVL72 that is taking the AI world by storm. Let’s dive in! Net-Net...
by Karl Freund | Jun 4, 2025 | In the News
As you AI pros know, the 125-member MLCommons organization alternates training and inference benchmarks every three months. This time around, its all about training, which remains the largest AI hardware market, although not by much as inference drives more growth as...