Tag Archives: Accelerators

A Quick Introduction to the NVIDIA GH200 aka Grace Hopper

Post Syndicated from Patrick Kennedy original https://www.servethehome.com/a-quick-introduction-to-the-nvidia-gh200-aka-grace-hopper-arm/

The NVIDIA GH200 or “Grace Hopper” is far from a single product. We have a quick guide so when someone says “GH200” you know what to look for

The post A Quick Introduction to the NVIDIA GH200 aka Grace Hopper appeared first on ServeTheHome.

Meta AI Acceleration in the Next-Gen Meta MTIA for Recommendation Inference

Post Syndicated from Patrick Kennedy original https://www.servethehome.com/meta-ai-acceleration-in-the-next-gen-meta-mtia-for-recommendation-inference-risc-v/

The next-gen Meta MTIA is a custom RISC-V accelerator for the company’s recommendation model AI inference workloads deployed this year

The post Meta AI Acceleration in the Next-Gen Meta MTIA for Recommendation Inference appeared first on ServeTheHome.

Broadcom AI Compute ASIC with Optical Attach Detailed at Hot Chips 2024

Post Syndicated from Patrick Kennedy original https://www.servethehome.com/broadcom-ai-compute-asic-with-optical-attach-detailed-at-hot-chips-2024/

In one of the coolest presentations at Hot Chips 2024 so far, Broadcom showed co-packaged silicon photonics for switches and AI ASICs

The post Broadcom AI Compute ASIC with Optical Attach Detailed at Hot Chips 2024 appeared first on ServeTheHome.

Get Excited Marvell Structera CXL Memory with Arm Neoverse V2

Post Syndicated from Patrick Kennedy original https://www.servethehome.com/everyone-reading-sth-get-excited-marvell-structera-cxl-memory-with-arm-neoverse-v2/

Marvell Structera CXL Memory Expansion modules accept DDR4 or DDR5. The line also has a 16-core Arm Neoverse V2 accelerated memory expander

The post Get Excited Marvell Structera CXL Memory with Arm Neoverse V2 appeared first on ServeTheHome.

Unigen Biscotti Dual Hailo-8 AI Module Spotted in AIC Booth at Computex 2024

Post Syndicated from Cliff Robinson original https://www.servethehome.com/unigen-biscotti-dual-hailo-8-ai-module-spotted-in-aic-booth-at-computex-2024/

At Computex 2024, we saw the Unigen Biscotti, a low-power dual Hailo-8 AI inference accelerator E1.S card in the AIC booth

The post Unigen Biscotti Dual Hailo-8 AI Module Spotted in AIC Booth at Computex 2024 appeared first on ServeTheHome.

AMD Instinct MI350 288GB GPU Offering 35x AI Inference Performance Next Year

Post Syndicated from Patrick Kennedy original https://www.servethehome.com/amd-instinct-mi350-288gb-gpu-offering-35x-ai-inference-performance-next-year/

AMD Instinct MI325X with 288GB of HBM3E memory is for 2024, while the MI350X with CDNA 4 offers 35x AI Inference performance in 2025

The post AMD Instinct MI350 288GB GPU Offering 35x AI Inference Performance Next Year appeared first on ServeTheHome.

Intel Ponte Vecchio Spaceship GPU No Longer Hunting New Clusters

Post Syndicated from Patrick Kennedy original https://www.servethehome.com/intel-ponte-vecchio-spaceship-gpu-no-longer-hunting-new-clusters/

Intel’s spaceship GPU, Ponte Vecchio, was perhaps too far ahead of its time and the GPU is moving into a support phase ahead of Falcon Shores

The post Intel Ponte Vecchio Spaceship GPU No Longer Hunting New Clusters appeared first on ServeTheHome.

This is Intel Gaudi 3 the New 128GB HBM2e AI Chip in the Wild

Post Syndicated from Patrick Kennedy original https://www.servethehome.com/this-is-intel-gaudi-3-the-new-128gb-hbm2e-ai-chip-in-the-wild-intel-vision-2024/

This is Intel Gaudi 3 from Intel Vision 2024. The new modules are designed for scale-out AI inference and training with 24x 200GbE links

The post This is Intel Gaudi 3 the New 128GB HBM2e AI Chip in the Wild appeared first on ServeTheHome.

Cerebras WSE-3 AI Chip Launched 56x Larger than NVIDIA H100

Post Syndicated from Patrick Kennedy original https://www.servethehome.com/cerebras-wse-3-ai-chip-launched-56x-larger-than-nvidia-h100-vertiv-supermicro-hpe-qualcomm/

The Cerebras WSE-3 is a giant AI training engineering marvel with 44GB of on-chip memory, 900,000 cores, and 125PF of AI compute

The post Cerebras WSE-3 AI Chip Launched 56x Larger than NVIDIA H100 appeared first on ServeTheHome.

AMD Infinity Fabric AFL Scale Up Competitor to NVIDIA NVLink Coming to Broadcom Switches in PCIe Gen7

Post Syndicated from Patrick Kennedy original https://www.servethehome.com/amd-infinity-fabric-afl-scale-up-competitor-to-nvidia-nvlink-coming-to-broadcom-switches-in-pcie-gen7/

AMD’s AFL Infinity Fabric scale-up competitor to NVIDIA NVLink is coming to Broadcom switches in the PCIe Gen7 era

The post AMD Infinity Fabric AFL Scale Up Competitor to NVIDIA NVLink Coming to Broadcom Switches in PCIe Gen7 appeared first on ServeTheHome.