MangoBoost showcases record NVMe/TCP storage speeds for AI data center training workloads

MangoBoost has reported record-setting results in the MLPerf Storage v2.0 benchmarks, demonstrating its Mango StorageBoost solution for data center AI training workloads. The announcement highlights performance, efficiency, and scalability improvements achieved for DPU-accelerated Non-Volatile Memory Express over TCP (NVMe/TCP) storage systems, a technology MangoBoost claims delivers near-local solid-state drive (SSD) speeds across distributed compute environments.

The company’s test configuration used Mango StorageBoost NVMe/TCP Initiator (NTI) on host servers and Target (NTT) on storage servers, both connected via a 400 gigabit Ethernet fabric. In MLPerf Storage v2.0’s Fabric-attached Block Storage category, MangoBoost claims it achieved line-rate throughput, providing near-local SSD performance for workloads such as 3D-UNet on NVIDIA A100 and H100 graphics processing units (GPUs).
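For context on what "line rate" means on a 400 gigabit link, the raw ceiling can be worked out with simple arithmetic. The Python sketch below is illustrative only: the function names are hypothetical, the 2.0 GB/s per-GPU read rate is a placeholder rather than a figure from MangoBoost or MLPerf, and Ethernet, TCP, and NVMe/TCP protocol overhead is ignored.

```python
def line_rate_gbytes_per_s(link_gbits_per_s: float) -> float:
    """Convert a raw link speed in Gb/s to GB/s (8 bits per byte, no protocol overhead)."""
    return link_gbits_per_s / 8.0


def max_gpus_fed(link_gbits_per_s: float, per_gpu_gbytes_per_s: float) -> int:
    """Upper bound on how many GPUs one link could feed at a given per-GPU read rate."""
    if per_gpu_gbytes_per_s <= 0:
        raise ValueError("per_gpu_gbytes_per_s must be positive")
    return int(line_rate_gbytes_per_s(link_gbits_per_s) // per_gpu_gbytes_per_s)


if __name__ == "__main__":
    ceiling = line_rate_gbytes_per_s(400)
    print(f"400 GbE raw ceiling: {ceiling:.1f} GB/s")
    # Assumption: 2.0 GB/s per GPU is a placeholder, not a measured requirement.
    print(f"GPUs fed at 2.0 GB/s each: {max_gpus_fed(400, 2.0)}")
```

Real-world throughput will sit below this ceiling once protocol headers are accounted for, which is why results are usually described as "near" line rate.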

Technical results reported by MangoBoost include 6.2 times the GPU scalability of alternatives for 3D-UNet on A100 GPUs, and between 1.25 and 7.5 times the scalability on H100 GPUs. The company also reported throughput gains of 1.57 times per 400 gigabits of bandwidth on A100 and up to 2.05 times on H100. According to MangoBoost, its solution outperformed local Solidigm D7-PS1030 SSDs and delivered a better performance-to-cost ratio than NVIDIA’s BlueField-3 data processing unit (DPU), which the company says significantly reduces total cost of ownership (TCO).

The Mango StorageBoost architecture is based on three components:

  • NVMe/TCP Initiator (NTI), which offloads the NVMe/TCP stack to hardware for full-duplex, line-rate performance without CPU consumption.
  • NVMe/TCP Target (NTT), which accelerates TCP/IP and NVMe-over-Fabrics processing, enabling storage disaggregation over standard Ethernet with zero CPU participation.
  • GPU Storage Boost (GSB), which enables direct memory access (DMA) transfers between GPU memory and local or remote storage while bypassing the CPU, which MangoBoost says improves input/output efficiency.

Mango StorageBoost integrates with standard server platforms and GPUs, does not require changes to existing hardware or software stacks, and is available for data center deployment.

Source: MangoBoost
