The Latest News in AI

We publish news articles on Forbes, which are copied here for your convenience.  

The Cambrian AI Landscape: GROQ

Ex-Google TPU engineers have been there and done that! Startup Groq is now sampling its AI platform to select customers and claims to have built the most efficient DNN processor in the industry. However, we need more transparency to substantiate this claim, in my...

AMD Launches 3rd Generation EPYC

AMD’s new EPYC server chip promises better performance than Intel.

Intel Focusses On AI At Big Event, And For Good Reasons

Intel has a multi-pronged AI strategy. The company announced the AI PC and Intel Xeon Gen5 with excellent AI performance and TCO. Intel announced new products for desktops and servers, and the focus for both was AI, where Intel has a commanding lead over the...

d-Matrix Emerges From Stealth With Strong AI Performance And Efficiency

Startup launches “Corsair” AI platform with Digital In-Memory Computing, using on-chip SRAM memory that can produce 30,000 tokens/second at 2 ms/token latency for Llama3 70B in a single rack. Using Generative AI, called inference processing, is a memory-intensive...

Enhanced Memory Grace Hopper Superchip Could Shift Demand To NVIDIA CPU And Away From X86

The company’s new high bandwidth memory version is only available with the CPU-GPU Superchip. In addition, a new dual Grace-Hopper MGX Board offers 282GB of fast memory for large model inferencing. The AI landscape continues to change rapidly, and fast memory (HBM)...

ChatGPT: Massive Disruption

Our relationship with computers will never look the same. Here are the winners and losers. The web, the media, and my email are entirely full of ChatGPT missives and questions. I’ve spent hours with scores of investors, all wanting to understand the impact on...

Cerebras Gets Into The Inference Market With A Bang

Cerebras’ Wafer-Scale Engine has only been used for AI training, but new software enables leadership inference processing performance and costs. Should Nvidia be afraid? As Cerebras prepares to go public, it has expanded its target markets and competitive stance by...

NVIDIA And Intel Publish First GPT3 Benchmarks; AMD, AWS, And Google Are MIA

Let’s look into the results and explore why so many competitors chose not to play ball. Those who follow me know the drill: MLCommons publishes benchmarking results every 3 months, alternating between inferencing and training. Then I explain the results and whine...

IBM Research and NeuReality Announce Partnership For AI

NeuReality is the first licensee of IBM’s reduced-precision core for AI. IBM (NYSE: IBM) and NeuReality, an Israeli AI systems and semiconductor company, have signed an agreement to develop the next generation of high-performance AI inference platforms. IBM and...

Chip Design Moves To The Cloud With Synopsys And Microsoft Azure

CEOs Aart de Geus and Satya Nadella have teamed up to create the industry's first on-demand EDA Software as a Service offering on Microsoft Azure. The two companies have collaborated to enable engineers to design chips...