The Latest News in AI

We publish news articles on Forbes, which are copied here for your convenience.  

My 2026 AI Predictions Have A Few Surprises

OK, I haven’t done this in a while; no excuse other than laziness. But here are ten concrete, defensible predictions for AI in 2026, with a bias toward things that materially matter for infra, enterprises, and policy. 1. Agentic AI moves from demos to staffed “digital...

read more

AI Training: “I’m Not Dead Yet!”

With so much focus on inference processing, it is easy to overlook the AI training market, which continues to drive gigawatts of AI computing capacity. The latest benchmarks show that the training of AI models, an immense investment in power and compute, continues to...

read more

Cerebras AI Lands A Whale As It Prepares To Go Public

Cerebras, famous for being the only AI company with a full wafer-scale chip, has landed OpenAI, its first major US-based hyperscaler. Prior to this deal, Cerebras has been successful securing investments and system commitments from a relatively small number of...

NVIDIA DGX Cloud Gives CSPs And Their Customers Exactly What They Want: Fast AI, Fast

NVIDIA’s offering integrates the company’s best GPU hardware and software into AI Supercomputers of virtually any size in the cloud, enabling enterprises to build and deploy AI without infrastructure hassles. While some may confuse this as NVIDIA competing with cloud...

AMD Claims MI300X Is The World’s Fastest AI Hardware

The hardware looks quite capable, but the software optimization story has a long way to go to get close to Nvidia. But given the current demand/supply imbalance, I suspect AMD can sell all they can make. AMD launched the MI300 in San Jose to an anxious audience of...

Big AI Inference Has Become A Big Deal And A Bigger Business

Thanks to innovations like DeepSeek, training AI has become cheaper. However, inference is becoming more demanding as we ask AI to think harder before answering our questions. Nvidia, Groq, and Cerebras Systems (clients of Cambrian-AI Research) have all released...

HPE Adds Support For Qualcomm Cloud AI 100 Inference Accelerator

HPE’s endorsement for the Qualcomm Technology Cloud AI 100 is a huge step for most efficient and high-performance AI inference engines in market today. When I was working at AMD to get the first generation EPYC server SoC added to HPE servers, I learned that the...

Cerebras And Mayo Clinic Announce Foundation Model For Healthcare

JP Morgan’s annual Healthcare Conference is almost becoming an AI event as doctors and AI scientists collaborate to turn petabytes of clinical data into generative AI models and actionable insights. The progress made since Chat GPT’s arrival demonstrates that we are...

IBM Launches Granite 3.0 AI Models; Smaller, Faster, And 97% Cheaper

The newly optimized LLMs underpin AI transformation client engagements with IBM Consulting Advantage and Watsonx. The industry has been abuzz about the affordability of LLM-based generative AI. If we don’t improve the efficiency, users will struggle to achieve an...

Are AI Venture Investors Crazy, Or Are Groq And Sambanova Worth It?

Groq and Sambanova AI unicorns take in additional ~#1B in funding; customers must like what they see. UK AI leader Graphcore has raised some $700M to date. Intel purchased Habana Labs for $2B. Alibaba is spinning out their AI chip development business. Now two silicon...

Intel Lays Out Strategy For AI: It’s Habana

Last month, Intel announced that it would acquire Israeli AI chip startup Habana Labs for $2B. At the time, I opined that this probably spelled the end for chips from the 2016 Nervana acquisition. Intel planned to bring out both the inference and the training versions...

Intel Habana Launches 2nd Gen AI Chips

Habana launches new chips for training and inference, claiming 2X performance advantage over last generation NVIDIA A100 Habana launched their 1st chip at the inaugural Kisaco AI Hardware Summit in 2018 with excellent performance and efficiency. Unfortunately for...