MangoBoost showcases record NVMe/TCP storage speeds for AI data center training workloads

MangoBoost has reported record-setting results in the MLPerf Storage v2.0 benchmarks, demonstrating its Mango StorageBoost solution for data center AI training workloads. The announcement highlights performance, efficiency, and scalability improvements achieved for DPU-accelerated Non-Volatile Memory Express over TCP (NVMe/TCP) storage systems, a technology MangoBoost claims delivers near-local solid-state drive (SSD) speeds across distributed compute environments.

The company’s test configuration used Mango StorageBoost NVMe/TCP Initiator (NTI) on host servers and Target (NTT) on storage servers, both connected via a 400 gigabit Ethernet fabric. In MLPerf Storage v2.0’s Fabric-attached Block Storage category, MangoBoost claims it achieved line-rate throughput, providing near-local SSD performance for workloads such as 3D-UNet on NVIDIA A100 and H100 graphics processing units (GPUs).

Technical results reported by MangoBoost include six-point-two times GPU scalability for 3D-UNet on A100 GPUs compared to alternatives, and from one-point-two-five to seven-point-five times scalability on H100 GPUs. Throughput gains of one-point-five-seven times per 400 gigabit bandwidth on A100 and up to two-point-zero-five times on H100 were recorded. According to MangoBoost, its solution outperformed local Solidigm D7-PS1030 SSD drives and delivered better performance-to-cost against NVIDIA’s BlueField-3 data processing unit (DPU), claiming significantly reduced total cost of ownership (TCO).

The Mango StorageBoost architecture is based on three components:

  • NVMe/TCP Initiator (NTI), which offloads the NVMe/TCP stack to hardware for full-duplex, line-rate performance without CPU consumption.
  • NVMe/TCP Target (NTT), which accelerates TCP/IP and NVMe-over-Fabrics processing, enabling storage disaggregation over standard Ethernet with zero CPU participation.
  • GPU Storage Boost (GSB), enabling direct direct memory access (DMA) transfers from GPU memory to local or remote storage while bypassing the CPU, which MangoBoost says improves input/output efficiency.

Mango StorageBoost integrates with standard server platforms and GPUs, does not require changes to existing hardware or software stacks, and is available for data center deployment.

Source: MangoBoost

Get Data Center Engineering News In Your Inbox:

Popular Posts:

Elvis-Leka,-New-Product-Development-Engineer-—-Parker,-Sporlan-Division
From air to two-phase liquid: how rack cooling options compare on density and risk
Why your data center needs 135% cooling capacity- Munters
Why your data center needs 135% cooling capacity
Near-Packaged-Optics--Rethinking-the-AI-Data-Center-Interconnect
Near-packaged optics: rethinking the AI data center interconnect
OlympusMAX-Product-Image
SPX launches Marley OlympusMAX fluid cooler with dry/adiabatic modules
bedra copper alloys target AI data center cooling plates and busbars
bedra's latest copper alloys target AI data center cooling plates and busbars

Share Your Data Center Engineering News

Do you have a new product announcement, webinar, whitepaper, or article topic? 

Get Data Center Engineering News In Your Inbox: