The Latest News in AI

We publish news articles on Forbes, which are copied here for your convenience.  

IBM Doubles Down On Its AI Cloud

IBM Research has doubled the capacity of its Vela AI Supercomputer, part of the IBM Cloud, to handle the strong growth in watsonx models and has aggressive plans to continue to expand and enhance AI inferencing with its own accelerator, the IBM AIU. A year ago, IBM...

read more

Is The AMD GPU Better Than We Thought For AI?

MosaicML, just acquired by DataBricks for $1.3B, published some interesting benchmarks for training LLMs on the AMD MI250 GPU, and said it is ~80% as fast as an NVIDIA A100. Did the world just change? To be brutally honest, everyone wants to see a fight, between AMD...

AMD Launches MI325 AI Accelerator To Challenge Nvidia. Stock Fell 4.5%

Chasing Nvidia is hard; Team Green simultaneously countered with new software that can triple AI performance. Consequently, AMD’s performance claims are already out of date. And AMD compared it against the older Hopper, not Blackwell which will ship long before the...

Amazon EC2 Inf1 Instances Now Support Amazon SageMaker

image: AWS Last year at AWS reInvent, out of the 100s of announcements, I chose the top 5 for overall, long-term impact. One of those was Amazon’s EC2 Inf1 Instances that used its new Inferentia machine learning inference chip. I chose Inf1 Instances as a top 5 for a...

How To Run Large AI Models On An Edge Device

It can be done, but it requires the edge device vendor to work to optimize the model. A hybrid approach can also extend the applicability of LLMs by combining Cloud and Edge processing. When most people think of Artificial Intelligence (AI), they imagine a berserk...

Skipping Nvidia Left Amazon, Apple And Tesla Behind In AI

Everyone thinks they are a comic. And everyone in big cap high tech thinks they can design better and/or cheaper AI chip alternatives to the industry-leader, Nvidia. Turns out, it’s simply not that easy. Apple and AWS have recently run aground in AI growth, and Tesla...

Cerebras, Groq And SambaNova Line Up To Compete With Nvidia

While Nvidia gets most of the press and market volume, there are three startups that have designed custom silicon and rack-scale infrastructure to compete with them head-on: Cerebras, Groq and Samba Nova. While Groq and Samba Nova seem to be getting some traction,...

Cerebras Update: The Wafer Scale Engine 3 Is A Door Opener

Cerebras held an AI Day, and in spite of the concurrently running GTC, there wasn’t an empty seat in the house. As we have noted, Cerebras Systems is one of the very few startups that is actually getting some serious traction in training AI, at least from a handful of...

Why Can’t NVIDIA Be Bested In MLPerf?

MLPerf, an industry consortium of over 70 companies and institutions, has released the second round of AI Inference processing results. These benchmarks now represent production applications from all major areas of AI deployment today. But only a few technologies...

Using A Digital Twin To Manage A Sustainable Flexible Data Center

Cadence’s acquisition of Future Facilities in 2022 opened the door to large data centers, and provides the company with the ability to manage power and cooling just when data centers need help the most as AI surges. Suppose you are running a data center, racks full of...

Could Graphcore’s Second Chip Challenge NVIDIA?

Graphcore, a UK-based startup, launched its first Intelligence Processing Unit (IPU) for AI acceleration in 2018. Today it introduced its second-generation product for AI, a massively parallel chip with 59.4 billion transistors that delivers some 250 Trillion...