MangoBoost showcases record NVMe/TCP storage speeds for AI data center training workloads

MangoBoost has reported record-setting results in the MLPerf Storage v2.0 benchmarks, demonstrating its Mango StorageBoost solution for data center AI training workloads. The announcement highlights performance, efficiency, and scalability improvements achieved for DPU-accelerated Non-Volatile Memory Express over TCP (NVMe/TCP) storage systems, a technology MangoBoost claims delivers near-local solid-state drive (SSD) speeds across distributed compute environments.

The company’s test configuration used Mango StorageBoost NVMe/TCP Initiator (NTI) on host servers and Target (NTT) on storage servers, both connected via a 400 gigabit Ethernet fabric. In MLPerf Storage v2.0’s Fabric-attached Block Storage category, MangoBoost claims it achieved line-rate throughput, providing near-local SSD performance for workloads such as 3D-UNet on NVIDIA A100 and H100 graphics processing units (GPUs).

Technical results reported by MangoBoost include six-point-two times GPU scalability for 3D-UNet on A100 GPUs compared to alternatives, and from one-point-two-five to seven-point-five times scalability on H100 GPUs. Throughput gains of one-point-five-seven times per 400 gigabit bandwidth on A100 and up to two-point-zero-five times on H100 were recorded. According to MangoBoost, its solution outperformed local Solidigm D7-PS1030 SSD drives and delivered better performance-to-cost against NVIDIA’s BlueField-3 data processing unit (DPU), claiming significantly reduced total cost of ownership (TCO).

The Mango StorageBoost architecture is based on three components:

  • NVMe/TCP Initiator (NTI), which offloads the NVMe/TCP stack to hardware for full-duplex, line-rate performance without CPU consumption.
  • NVMe/TCP Target (NTT), which accelerates TCP/IP and NVMe-over-Fabrics processing, enabling storage disaggregation over standard Ethernet with zero CPU participation.
  • GPU Storage Boost (GSB), enabling direct direct memory access (DMA) transfers from GPU memory to local or remote storage while bypassing the CPU, which MangoBoost says improves input/output efficiency.

Mango StorageBoost integrates with standard server platforms and GPUs, does not require changes to existing hardware or software stacks, and is available for data center deployment.

Source: MangoBoost

Get Data Center Engineering News In Your Inbox:

Popular Posts:

695fcac850f073b041e711a2_karman-p-3200 copy
Karman launches 10 MW Heat Processing Unit for giga-scale AI data center cooling
Screenshot
Five AI data centers to reach 1 GW power capacity in 2026, new analysis shows
68e79d30a17eea847251fae6_img-home-product-liquidjet-main
Frore Systems updates LiquidJet direct-to-chip coldplate for 1,950 W NVIDIA Rubin data center GPUs
1765906506220
Tritium launches 800 VDC bidirectional inverter for data centers and renewable energy sites
Screenshot
HC Capital Partners and Herrmann Family Companies plan 1,500-plus-acre Energy Ranch power-linked data center campus in South Texas

Share Your Data Center Engineering News

Do you have a new product announcement, webinar, whitepaper, or article topic? 

Get Data Center Engineering News In Your Inbox: