by Karl Freund | Sep 9, 2025 | In the News
In an industry-first, Nvidia has announced a new GPU, the Rubin CPX, to offload the compute-intensive “context processing” off another GPU. Yep, now, for some AI, you will need two GPUs to achieve maximize performance and profit. I would be surprised if the...
by Karl Freund | Sep 5, 2025 | In the News
The latest TPU and the upcoming Ironwood supercomputer were just the start Google is taking the next step in its quest to become a serious challenger to Nvidia GPUs. As I recently noted in my post on CSP silicon costs and failures, Google is the exception. Google...
by Karl Freund | Jul 10, 2025 | In the News
The AI world continues to evolve rapidly, especially since the introduction of DeepSeek and its followers. Many have concluded that enterprises don’t really need the large, expensive AI models touted by OpenAI, Meta, and Google, and are focusing instead on...
by Karl Freund | Jun 17, 2025 | In the News
Nvidia recently announced two new cloud initiatives. First, the company announced DGX Cloud Lepton, designed to connect artificial intelligence developers with Nvidia’s wide network of cloud providers. Second, Nvidia announced a new cloud service, the Industrial AI...
by Karl Freund | Apr 2, 2025 | In the News
Everyone is not just talking about AI inference processing; they are doing it. Analyst firm Gartner released a new report this week forecasting that global generative AI spending will hit $644 billion in 2025, growing 76.4% year-over-year. Meanwhile, MarketsandMarkets...