The Latest News in AI

We publish news articles on Forbes, which are copied here for your convenience.  

Can Groq Really Take On Nvidia?

The Silicon Valley startup has just raised an astonishing $640M in additional funding, and is valued at $2.8B. Have they captured AI Lightning in a bottle? Groq has announced a $640M Series D round at a valuation of $2.8B, led by BlackRock Private Equity Partners....

read more

Micron Looks To Be First To Market With HBM3 Update For Generative AI And HPC

According to the company, the new Gen-2 of HBM increases memory capacity by 50%, with another bump in the works for 2024. As you may have heard, in addition to NVIDIA GPUs, generative AI eats memory for lunch. And dinner. In fact, running ChatGPT takes 8 or 16 GPUs...

IBM Launches Granite 3.0 AI Models; Smaller, Faster, And 97% Cheaper

The newly optimized LLMs underpin AI transformation client engagements with IBM Consulting Advantage and Watsonx. The industry has been abuzz about the affordability of LLM-based generative AI. If we don’t improve the efficiency, users will struggle to achieve an...

Cerebras Publishes 7 Trained Generative AI Models To Open Source

The AI company is the first to use Non-GPU tech to train GPT-based Large Language Models and make available to the AI community. The early days of an open AI community, sharing work and building on each other’s success, is over. As there is now much more money at...

NVIDIA Earth-2: Leveraging The Omniverse To Help Understand Climate Change

Cambrian-AI Analyst Alberto Romero contributed to this article. One of the greatest challenges humanity faces today is climate change. Although changes in climate occur naturally, during the last 200 years human activities have directly influenced the otherwise normal...

NVIDIA L40S: A Datacenter GPU For Omniverse And Graphics That Can Also Accelerate AI Training & Inference

I’m getting a lot of inquiries from investors about the potential for this new GPU and for good reasons; it is fast! NVIDIA announced a new passively-cooled GPU at SIGGRAPH, the PCIe-based L40S, and most of us analysts just considered this to be an upgrade to the...

Tenstorrent Launches AI Chip With Conditional Execution

What the heck is that you ask?  So, did I; but it looks promising! I continue to be amazed at the innovations that are coming to market to accelerate deep learning workloads.  GraphCore, Habana Labs, Cerebras, Blaize, Groq, Perceive and others are now being joined by...

Synopsys Moves To RISC-V To Help SoC Developers

When the number two provider of CPU designs jumps on the RISC-V train, it is a significant milestone. The open-source RISC-V design is on a roll, displacing Arm in many SoC development plans. ARC and Arm are both companies that design and license microprocessor (CPU)...

NVIDIA GTC ’20 Could Be Massive

As we approach NVIDIA’s annual GPU Technology Conference, everyone is anxious to see what CEO Jensen Huang has up his trademark black leather sleeves. I have no idea, but as usual, I have an opinion. An intro to GTC GTC is NVIDIA’s partners’ big annual opportunity to...

Cerebras Now The Fastest LLM Inference Processor; Its Not Even Close

The company tackled inferencing the Llama-3.1 405B foundation model and just crushed it. And for the crowds at SC24 this week in Atlanta, the company also announced it is 700 times faster than Frontier, the worlds fastest supercomputer, on a molecular dynamics...

NVIDIA DGX Cloud Gives CSPs And Their Customers Exactly What They Want: Fast AI, Fast

NVIDIA’s offering integrates the company’s best GPU hardware and software into AI Supercomputers of virtually any size in the cloud, enabling enterprises to build and deploy AI without infrastructure hassles. While some may confuse this as NVIDIA competing with cloud...