The Latest News in AI

We publish news articles on Forbes, which are copied here for your convenience.  

New Fabrics Enable Efficient AI Acceleration

While GPU performance has been the focus in data centers over the last few years, the performance of fabrics has become a key enabler or bottleneck in achieving the throughput and latency required to create and deliver artificial intelligence at scale. Nvidia’s...

The NewReality: Fast Inference Processing For 90% Less?

Most of the investment buzz in AI hardware concentrates on the amazing accelerator chips that crunch the math required for neural networks, like Nvidia’s GPUs. But what about the rest of the story? CPUs and NICs that pre- and post-process the query add significant...

NVIDIA Performance Trounces All Competitors Who Have The Guts To Submit To MLPerf Inference 3.0

But power matters, too. Qualcomm and SiMa.ai win in Edge Data Center and Embedded Edge respectively, while Neuchips wins in data center recommendations for power. Big News! There is a potential solution on the horizon to vastly broaden the field of benchmark...

Why Intel Is Investing In Neuromorphic Computing

Intel certainly has a lot of irons in the AI fireplace, including Xeon CPUs, Movidius computer vision chips, MobileEye chips for autonomous driving and Deep Neural Network training and inference processing technology from the newly acquired Habana Labs. With all of...

How To Run Large AI Models On An Edge Device

It can be done, but it requires the edge device vendor to work to optimize the model. A hybrid approach can also extend the applicability of LLMs by combining Cloud and Edge processing. When most people think of Artificial Intelligence (AI), they imagine a berserk...

NVIDIA GTC: “DPU” Smart NIC And More

NVIDIA Co-founder and CEO Jensen Huang rarely disappoints his audience nor his investors. This week he once again delivered the goods at the GPU Technology Conference. Announcing a broad range of hardware and software innovations, Jensen made it clear that he intends...

Will Nvidia GTC Mark The Peak Of AI?

In these explosive times and stock market surges, let’s look at where we might find the nuggets of new growth. Nvidia will focus on additional growth opportunities in Inference processing, Software, Edge, and Automotive; Nvidia is nowhere near Peak AI. Yes, GTC is a...

IBM Research Innovates “Serverless” Approach To Quantum

Quantum computing promises near-miraculous performance, but it comes with a lot of caveats. Most importantly, the problem being solved must be amenable to expression into the quantum “circuits” that can run on todays hardware. But what if only parts of the problem fit...

AMD Launches New GPU And EPYC CPU Right Across NVIDIA’s Bow

The Instinct MI200 is nearly five times faster than the NVIDIA A100 for HPC, but is theoretically only 20% faster for AI. One year ago I complained that the newly announced AMD MI100 GPU was great for HPC, but inadequate for most AI workloads. Now AMD has announced...

This Cadence AI Super Agent Is World’s First To Automate Chip Design

The semiconductor industry has been applying AI to accelerate chip design for over three years, achieving 10X productivity gains and focusing primarily on tasks that now seem relatively easy, such as floor-plan optimization using massive recurrent neural networks. But...