The company’s test configuration used Mango StorageBoost NVMe/TCP Initiator (NTI) on host servers and Target (NTT) on storage servers, both connected via a 400 gigabit Ethernet fabric. In MLPerf Storage v2.0’s Fabric-attached Block Storage category, MangoBoost claims it achieved line-rate throughput, providing near-local SSD performance for workloads such as 3D-UNet on NVIDIA A100 and H100 graphics processing units (GPUs).
Technical results reported by MangoBoost include 6.2× GPU scalability for 3D-UNet on A100 GPUs compared to alternatives, and from 1.25× to 7.5× scalability on H100 GPUs. Throughput gains of 1.57× per 400 Gb of network bandwidth on A100 and up to 2.05× on H100 were recorded. According to MangoBoost, its solution outperformed local Solidigm D7-PS1030 SSD drives and delivered better performance-to-cost than NVIDIA’s BlueField-3 data processing unit (DPU), with claims of significantly reduced total cost of ownership (TCO).
The Mango StorageBoost architecture is based on three components:
- NVMe/TCP Initiator (NTI), which offloads the NVMe/TCP stack to hardware for full-duplex, line-rate performance without consuming host CPU cycles.
- NVMe/TCP Target (NTT), which accelerates TCP/IP and NVMe-over-Fabrics processing, enabling storage disaggregation over standard Ethernet with zero CPU participation.
- GPU Storage Boost (GSB), enabling direct memory access (DMA) transfers between GPU memory and local or remote storage while bypassing the CPU, which MangoBoost says improves input/output efficiency.
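For context on how such NVMe/TCP disaggregation looks from the host side: on Linux, remote namespaces are typically attached with the standard nvme-cli tool, and a hardware-offloaded initiator like NTI would be transparent to this flow (an assumption consistent with the article's claim that no software-stack changes are required). The address, port, and NQN below are illustrative placeholders, not values from the test configuration.

```shell
# Discover NVMe/TCP subsystems exported by a storage target
# (IP address, port, and NQN are hypothetical placeholders)
nvme discover -t tcp -a 192.168.1.10 -s 4420

# Connect to a discovered subsystem; its namespaces then appear
# to the host as ordinary local block devices (e.g. /dev/nvme1n1)
nvme connect -t tcp -a 192.168.1.10 -s 4420 \
  -n nqn.2024-01.io.example:subsystem1

# Confirm the remote namespace is visible alongside local SSDs
nvme list
```

Because the fabric-attached namespace surfaces as a standard block device, benchmarks such as MLPerf Storage can treat it identically to a local SSD, which is the basis of the near-local-performance comparison above.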
Mango StorageBoost integrates with standard server platforms and GPUs, does not require changes to existing hardware or software stacks, and is available for data center deployment.
Source: MangoBoost