The Latest News in AI

We publish news articles on Forbes, which are copied here for your convenience.  

OpenAI’s Deep Research Demands More Hardware, Not Less

Discussions about Deep Seek’s impact on Nvidia is everywhere. Yesterday, I heard an investor on CNBC’s "Fast Money" program pontificate that Deep Seek and its disruptive technology mean that “Nobody needs an Nvidia H100 anymore,” much less a Blackwell. I struggle to...

Buckle Up! Here’s The Hot News From Nvidia GTC 2024

There is so much to share, so here is our take from Jensen’s Keynote address... After a five year Covid hiatus, Jensen Huang took the stage for an in-person Keynote at the SAP Arena to an adoring crowd of techies and investors. Like in the olden days, the man in the...

MLPerf Training 4.0: It’s All About Scale

While there isn’t a lot of new hardware (none!), Nvidia and Intel show off their muscles and ability to run new models at scale. Ok, here we go again. MLCommons has released new AI benchmarks, this time for training. And again. Nvidia runs all AI models better than...

NVIDIA Keeps The Performance Crown For AI Inference For The 6th Time In A Row

In The Data Center And On The Edge, the bottom line is that the H100 (Hopper-based) GPU is up to four times faster than the NVIDIA A100 on the newly released ​MLPerf V2.1 benchmark suite. The A100 retains leadership in many benchmarks versus other available products...

Intel Lays Out Strategy For AI: It’s Habana

Last month, Intel announced that it would acquire Israeli AI chip startup Habana Labs for $2B. At the time, I opined that this probably spelled the end for chips from the 2016 Nervana acquisition. Intel planned to bring out both the inference and the training versions...

Cadence Design And Nvidia Team To Create AI Data Center Digital Twin

The data center business is in a bit of a panic. Demand is skyrocketing. Rack power requirements have increased from ~12KW per rack to over 125 KW in just the last year. Now they are preparing for a Gigawatt rack in the next two to three years (Nvidia Rubin Ultra)....

NVIDIA And Intel Publish First GPT3 Benchmarks; AMD, AWS, And Google Are MIA

Let’s look into the results and explore why so many competitors chose not to play ball. Those who follow me know the drill: MLCommons publishes benchmarking results every 3 months, alternating between inferencing and training. Then I explain the results and whine...

Cerebras Publishes 7 Trained Generative AI Models To Open Source

The AI company is the first to use Non-GPU tech to train GPT-based Large Language Models and make available to the AI community. The early days of an open AI community, sharing work and building on each other’s success, is over. As there is now much more money at...

SiFive Is Leading The Way For Innovation On RISC-V

The company appears well positioned to challenge CPU incumbents with high performance RISC-V CPUs and Vector Extensions to the open ISA architecture. The RISC-V CPU Instruction Set Architecture (ISA) is emerging as a serious challenger to current CPUs based on...

AI Startup MosaicML Comes Out Of Stealth To Aid AI Developers

Naveen Rao, ex-Nervana and Intel, leads the new company focused on improving the efficiency of AI Training Training a deep neural network takes a lot of computational horsepower. Billions of trillions of multiplications and additions calculate “weights” which are...