The Latest News in AI

We publish news articles on Forbes, which are copied here for your convenience.  

d-Matrix Emerges From Stealth With Strong AI Performance And Efficiency

Startup launches “Corsair” AI platform with Digital In-Memory Computing, using on-chip SRAM memory that can produce 30,000 tokens/second at 2 ms/token latency for Llama3 70B in a single rack. Using Generative AI, called inference processing, is a memory-intensive...

NVIDIA Completely Re-Imagines The Data Center For AI

It is all about tighter integration with memory, CPUs, and accelerators for trillion-parameter AI models. For 12 years, NVIDIA has used its Spring GPU Technology Conference (GTC) to amaze its customers and investors with new GPUs for graphics and application...

A Closer Look At Graphcore ML Performance

Greater scalability and new software increases performance by 50-fold over the last twelve months. Graphcore, the UK-based AI Unicorn, submitted a raft of new benchmarks to MLCommons in December, which we covered here. Performance improved significantly with the...

Intel Gaudi2 Looked To Be A Credible Alternative To Nvidia. Until…

In the latest inference processing MLPerf benchmark contest, Gaudi 2 came surprisingly close to Nvidia H100. But Nvidia promised faster software soon, which is a constantly changing picture. In the latest round of AI benchmarks, all eyes were on the new Large Language...

Q2 Competitive Update: The AI Cambrian Explosion Rolls On

A lot of new AI Silicon and Software has been announced since March and it can be tough to keep track of it all. To help users, vendors, and investors keep track, our June Competitive Landscape Report is now available, and for a limited time, the ~70-page report is...

New Cerebras Wafer-Scale Cluster Eliminates Months Of Painstaking Work To Build Massive Intelligence

The architecture eliminates the need to decompose large models for distributed computing to train: Push-button AI? The hottest trend in AI is the emergence of massive models such as Open AI’s GPT-3. These models are surprising even its developers with capabilities...

Esperanto Sees A Bright Future For RISC-V In AI And HPC

The company is shipping its first-gen chip globally, with over 1000 cores at only 25 watts of power. Can it break into Generative AI? Suddenly, AI has become the hottest investment and cocktail party topic de jour. But the estimates for power consumption are pretty...

Cerebras Update: The Wafer Scale Engine 3 Is A Door Opener

Cerebras held an AI Day, and in spite of the concurrently running GTC, there wasn’t an empty seat in the house. As we have noted, Cerebras Systems is one of the very few startups that is actually getting some serious traction in training AI, at least from a handful of...

Qualcomm AI Research Innovates 3D Perception Techniques

The trick of AI at the edge is reducing complexity while maintaining accuracy. And that takes a lot of primary research. When we explored the eight “AI Firsts” from Qualcomm AI Research earlier this year, it was clear to us the company’s full-stack approach to AI...

NVIDIA Needed A CPU, But Did It Need To Buy Arm To Get One?

I often opine that NVIDIA needs a data center-class CPU to compete with Intel and AMD, both of whom have used tightly-coupled CPU/GPU technology to win the first three U.S. exascale supercomputer deals. Connecting massive GPUs to fast CPUs over a painfully slow PCIe...