The Latest News in AI

We publish news articles on Forbes, which are copied here for your convenience.  

New Fabrics Enable Efficient AI Acceleration

While GPU performance has been the focus in data centers over the last few years, the performance of fabrics has become a key enabler or bottleneck in achieving the throughput and latency required to create and deliver artificial intelligence at scale. Nvidia’s...

Micron Looks To Be First To Market With HBM3 Update For Generative AI And HPC

According to the company, the new Gen-2 of HBM increases memory capacity by 50%, with another bump in the works for 2024. As you may have heard, in addition to NVIDIA GPUs, generative AI eats memory for lunch. And dinner. In fact, running ChatGPT takes 8 or 16 GPUs...

d-Matrix Emerges From Stealth With Strong AI Performance And Efficiency

Startup launches “Corsair” AI platform with Digital In-Memory Computing, using on-chip SRAM memory that can produce 30,000 tokens/second at 2 ms/token latency for Llama3 70B in a single rack. Using Generative AI, called inference processing, is a memory-intensive...

Why Intel Is Investing In Neuromorphic Computing

Intel certainly has a lot of irons in the AI fireplace, including Xeon CPUs, Movidius computer vision chips, MobileEye chips for autonomous driving and Deep Neural Network training and inference processing technology from the newly acquired Habana Labs. With all of...

Enhanced Memory Grace Hopper Superchip Could Shift Demand To NVIDIA CPU And Away From X86

The company’s new high bandwidth memory version is only available with the CPU-GPU Superchip. In addition, a new dual Grace-Hopper MGX Board offers 282GB of fast memory for large model inferencing. The AI landscape continues to change rapidly, and fast memory (HBM)...

AI Is Reshaping Chip Design. But Where Will It End?

Using AI reduces design costs, improves yields and performance, and shortens time-to-market for better chips. Synopsys, Cadence Design Systems, and many hyperscal chip designers are now adopting generative AI capabilities to facilitate chip design. Think of this as...

Intel Lays Out Strategy For AI: It’s Habana

Last month, Intel announced that it would acquire Israeli AI chip startup Habana Labs for $2B. At the time, I opined that this probably spelled the end for chips from the 2016 Nervana acquisition. Intel planned to bring out both the inference and the training versions...

Using A Digital Twin To Manage A Sustainable Flexible Data Center

Cadence’s acquisition of Future Facilities in 2022 opened the door to large data centers, and provides the company with the ability to manage power and cooling just when data centers need help the most as AI surges. Suppose you are running a data center, racks full of...

Blaize AI: Now In Production And Trials

Last November I covered Blaize and its silicon and software strategy, and noted that the company’s fairly large team has been focused on early customer engagements to gain insights and accelerate adoption. Now the company, backed by industrial heavyweights such as...

Xilinx Readies Versal AI Edge For 2022 Availability

Platform includes updated AI Engine with 4- and 8-bit integer math, along with new memory architecture. Xilinx has just launched the first edge model of the flexible Versal ACAP (Adaptive Compute Acceleration Platform) family, the third Versal to be announced in...