The Latest News in AI

We publish news articles on Forbes, which are copied here for your convenience.  

NVIDIA GTC: “DPU” Smart NIC And More

NVIDIA Co-founder and CEO Jensen Huang rarely disappoints his audience nor his investors. This week he once again delivered the goods at the GPU Technology Conference. Announcing a broad range of hardware and software innovations, Jensen made it clear that he intends...

Blaize AI Emerges From Stealth

Throughout 2020, a wave of AI hardware startups will launch their companies and products. Cerebras started this wave with its wafer-scale engine last September. This week, Intel announced its AI chips from Nervana, Groq (founded by the inventors of Google TPU)...

Qualcomm Launches Cloud AI Chip

Last year, Qualcomm teased its Cloud AI100, promising strong performance and power efficiency to enable Artificial Intelligence in cloud edge computing, autonomous vehicles and 5G infrastructure. Today, the company announced it is now sampling the platform, with...

IBM Launches Granite 3.0 AI Models; Smaller, Faster, And 97% Cheaper

The newly optimized LLMs underpin AI transformation client engagements with IBM Consulting Advantage and Watsonx. The industry has been abuzz about the affordability of LLM-based generative AI. If we don’t improve the efficiency, users will struggle to achieve an...

Why Nvidia Is Entering The $30B Market For Custom Chips

Nvidia’s largest customers (e.g., Google, Amazon, Microsoft, Meta, and OpenAI) are developing AI chips that compete with Nvidia, presenting a potential long-term threat to the AI leader. Jensen’s response? “We can help you do that.” Warning: this blog contains...

AI Inference Is King; Do You Know Which Chip is Best?

Everyone is not just talking about AI inference processing; they are doing it. Analyst firm Gartner released a new report this week forecasting that global generative AI spending will hit $644 billion in 2025, growing 76.4% year-over-year. Meanwhile, MarketsandMarkets...

Does NVIDIA Selene Form A Wider Moat Than CUDA?

The annual International Supercomputer Conference (ISC), held virtually this year, kicked off today. Not surprisingly, NVIDIA has already made a few announcements of note. Especially of interest to me was the announcement of Selene, NVIDIA’s in-house 1+ Exaflop AI...

NVIDIA Keeps The Performance Crown For AI Inference For The 6th Time In A Row

In The Data Center And On The Edge, the bottom line is that the H100 (Hopper-based) GPU is up to four times faster than the NVIDIA A100 on the newly released ​MLPerf V2.1 benchmark suite. The A100 retains leadership in many benchmarks versus other available products...

NVIDIA Needed A CPU, But Did It Need To Buy Arm To Get One?

I often opine that NVIDIA needs a data center-class CPU to compete with Intel and AMD, both of whom have used tightly-coupled CPU/GPU technology to win the first three U.S. exascale supercomputer deals. Connecting massive GPUs to fast CPUs over a painfully slow PCIe...

IBM And Rapidus Will Collaborate To Build 2nm Chips In Japan.

IBM, who invented the tech everyone will use to create 2nm silicon, is partnering with Rapidus to bring it to market The world now realizes that major economies need to have reliable and secure access to semiconductors, the heart of the digital economy, not to mention...