Tag Archives: Accelerators

The New d-Matrix JetStream 400G Ethernet Card for Data Center Scale AI Inference

Post Syndicated from Patrick Kennedy original https://www.servethehome.com/the-new-d-matrix-jetstream-400g-ethernet-card-for-data-center-scale-ai-inference/

The new d-Matrix JetStream 400G card is designed to help the company scale out its Corsair AI inference platform using lower-cost switching

The post The New d-Matrix JetStream 400G Ethernet Card for Data Center Scale AI Inference appeared first on ServeTheHome.

Google’s Ironwood TPU Swings for Reasoning Model Leadership at Hot Chips 2025

Post Syndicated from Ryan Smith original https://www.servethehome.com/googles-ironwood-tpu-swings-for-reasoning-model-leadership-at-hot-chips-2025/

Closing out the machine learning sessions at Hot Chips 2025 is Google, which is at the show to talk about its latest tensor processing unit (TPU), codenamed Ironwood. Revealed by the company a few months ago, Ironwood is the first Google TPU that is explicitly designed for large-scale AI inference (rather than AI training). Paired […]

The post Google’s Ironwood TPU Swings for Reasoning Model Leadership at Hot Chips 2025 appeared first on ServeTheHome.

AMD Dives Deep on CDNA 4 Architecture and MI350 Accelerator at Hot Chips 2025

Post Syndicated from Ryan Smith original https://www.servethehome.com/amd-dives-deep-on-cdna-4-architecture-and-mi350-accelerator-at-hot-chips-2025/

The second big machine learning accelerator talk of the afternoon belongs to AMD. The company’s chip architects are at this year’s show to tell the audience all about the CDNA 4 architecture, which is powering AMD’s new MI350 family of accelerators. Like its MI300 predecessor, AMD is using 3D die stacking to build up a […]

The post AMD Dives Deep on CDNA 4 Architecture and MI350 Accelerator at Hot Chips 2025 appeared first on ServeTheHome.

Huawei Presents UB-Mesh Interconnect for Large AI SuperNodes at Hot Chips 2025

Post Syndicated from Ryan Smith original https://www.servethehome.com/huawei-presents-ub-mesh-interconnect-for-large-ai-supernodes-at-hot-chips-2025/

The third and final machine learning presentation before the afternoon break comes from Huawei. Unlike many of the other ML vendors who are here to pitch products, Huawei’s presentation is more focused on fundamental technology. In this case, how to efficiently use meshes to interconnect the chips within large AI systems. Eyeing so-called SuperNodes […]

The post Huawei Presents UB-Mesh Interconnect for Large AI SuperNodes at Hot Chips 2025 appeared first on ServeTheHome.

d-Matrix Presents Corsair, An In-Memory Computing Architecture For Inference, at Hot Chips 2025

Post Syndicated from Ryan Smith original https://www.servethehome.com/d-matrix-presents-corsair-an-in-memory-computing-architecture-for-inference-at-hot-chips-2025/

The second machine learning presentation of the afternoon comes from d-Matrix. The company specializes in hardware for AI inference, and as of late has been tackling the matter of how to improve inference performance by using in-memory computing. Along those lines, the company is presenting their Corsair in-memory computing chiplet architecture at Hot Chips. Not […]

The post d-Matrix Presents Corsair, An In-Memory Computing Architecture For Inference, at Hot Chips 2025 appeared first on ServeTheHome.

Rebellions REBEL-Quad UCIe and 144GB HBM3E Accelerator at Hot Chips 2025

Post Syndicated from Patrick Kennedy original https://www.servethehome.com/rebellions-rebel-quad-ucie-and-144tb-hbm3e-accelerator-at-hot-chips-2025/

At Hot Chips 2025, we saw a live demo of the Rebellions REBEL-Quad, an AI accelerator with four ASICs, 144GB of HBM3E, and more using UCIe

The post Rebellions REBEL-Quad UCIe and 144GB HBM3E Accelerator at Hot Chips 2025 appeared first on ServeTheHome.

New Faster AMD Alveo V80 Accelerator with HBM2e and Fast Networking

Post Syndicated from Cliff Robinson original https://www.servethehome.com/new-faster-amd-alveo-v80-accelerator-with-hbm2e-and-fast-networking/

The new, faster AMD Alveo V80 packs 32GB of HBM2e memory, four QSFP56 200G ports, and many other features with the SoC on a pre-built card

The post New Faster AMD Alveo V80 Accelerator with HBM2e and Fast Networking appeared first on ServeTheHome.

Not Just for Oreos and Trailers AMD Helios Next-Gen AI Racks Go Double-Wide

Post Syndicated from Patrick Kennedy original https://www.servethehome.com/not-just-for-oreos-and-tailers-amd-helios-next-gen-ai-racks-go-double-wide/

The AMD Helios with the MI450 will scale to 31TB of memory and 1.4PB/s of memory bandwidth per rack. It is so big, it needs bigger racks

The post Not Just for Oreos and Trailers AMD Helios Next-Gen AI Racks Go Double-Wide appeared first on ServeTheHome.

Qualcomm Discrete NPU Spotted in Dell Pro Max Plus laptop at DTW

Post Syndicated from Will Taillac original https://www.servethehome.com/qualcomm-discrete-qualcomm-npu-spotted-at-in-dell-pro-max-plus-laptop-at-dtw/

At Dell Technologies World 2025, the company showed off a new laptop class that had Qualcomm NPUs instead of a dGPU for heavier AI workloads

The post Qualcomm Discrete NPU Spotted in Dell Pro Max Plus laptop at DTW appeared first on ServeTheHome.

Intel Arc Pro B50 and B60 For Lower Cost Pro GPUs and 18A Panther Lake Shown at Computex 2025

Post Syndicated from Patrick Kennedy original https://www.servethehome.com/intel-arc-pro-b50-and-b60-for-lower-cost-pro-gpus-and-18a-pather-lake-shown-at-computex-2025/

The Intel Arc Pro B50 is a 70W SFF GPU at $299 with 16GB of memory, and the B60 is a step up with support for AI inference and virtualization

The post Intel Arc Pro B50 and B60 For Lower Cost Pro GPUs and 18A Panther Lake Shown at Computex 2025 appeared first on ServeTheHome.