by Karl Freund | Sep 25, 2023 | In the News
The AIHW and Edge AI Summit had a treasure trove of insightful presentations from luminaries such as Andrew Ng, Lip-Bu Tan, Marc Tremblay, and many others. I hope to get around to writing about what I learned, but first, I want to share the innovations from a startup...
by Karl Freund | Sep 11, 2023 | In the News
In the latest inference processing MLPerf benchmark contest, Gaudi 2 came surprisingly close to Nvidia H100. But Nvidia promised faster software soon, which is a constantly changing picture. In the latest round of AI benchmarks, all eyes were on the new Large Language...
by Karl Freund | Sep 8, 2023 | In the News
TensorRT-LLM adds a slew of new performance-enhancing features to all NVIDIA GPUs. Just ahead of the next round of MLPerf benchmarks, NVIDIA has announced a new TensorRT software for Large Language Models (LLMs) that can dramatically improve performance and efficiency...
by Karl Freund | Aug 30, 2023 | In the News
I’m getting a lot of inquiries from investors about the potential for this new GPU and for good reasons; it is fast! NVIDIA announced a new passively-cooled GPU at SIGGRAPH, the PCIe-based L40S, and most of us analysts just considered this to be an upgrade to the...
by Karl Freund | Aug 8, 2023 | In the News
The company’s new high bandwidth memory version is only available with the CPU-GPU Superchip. In addition, a new dual Grace-Hopper MGX Board offers 282GB of fast memory for large model inferencing. The AI landscape continues to change rapidly, and fast memory (HBM)...