KAYTUS Launches All-QLC Flash Storage at AI EXPO 2026 for 10,000-GPU Clusters

Published: 2026-05-08 16:16


KAYTUS’s next-generation all-QLC flash solution delivers fully linear performance scaling for massive GPU clusters, while reducing TCO by 70%, enabling ultra-large-scale computing for the era of agentic AI.

SINGAPORE--(BUSINESS WIRE)--At AI EXPO KOREA 2026, KAYTUS officially launched its All-QLC Flash Storage Solution, engineered to deliver high performance, massive scalability, and cost efficiency for 10,000-GPU clusters. The solution addresses data-delivery bottlenecks in ultra-large-scale AI training, helping maximize GPU resource utilization.

Based on the KR2280 and KR1180 server platforms, the solution is deeply integrated with industry-leading AI-native parallel file systems to eliminate the data silos inherent in traditional tiered storage. Purpose-built for read-intensive AI workloads, it overcomes the horizontal scaling limitations of massive clusters. Verified test data show that, at exabyte-scale deployment, the solution delivers 10 TB/s of aggregate bandwidth and 100 million IOPS. In addition, it reduces five-year TCO by 70% compared with traditional TLC-based solutions, accelerating model innovation for AI cloud providers and intelligent computing centers.

Limitations in Traditional AI Storage Architectures

The explosive growth of AI is fundamentally transforming enterprise computing and storage requirements. Large-scale AI model training features highly read-intensive workloads that require tens of thousands of GPUs to concurrently access exabyte-scale datasets with sub-millisecond latency. Traditional storage architectures now face three major challenges:

  • Separated Data Silos: Traditional ETL processes require data to be moved from object storage to parallel file systems before training, resulting in time-consuming physical data migration. IDC research indicates that data teams spend 81% of their time on data preparation, slowing business iteration.
  • Workload and Media Mismatch: More than 90% of AI training involves high-frequency concurrent reads. In contrast, traditional TLC flash solutions provide excessive write endurance that is unnecessary for these read-intensive workloads, driving up procurement, space, and power costs for exabyte-scale clusters and resulting in inefficient resource utilization.
  • Scalability Bottlenecks: Traditional file systems were not designed to handle the bursty I/O workloads generated by 10,000-GPU clusters. As clusters scale, metadata lock contention and communication overhead introduce latency spikes and degrade overall performance.

KAYTUS Solution: All-QLC Flash Storage Delivering High Performance, Scalability, and Cost Efficiency

The next-generation KAYTUS All-QLC Flash Storage Server Solution is purpose-built to unlock the full potential of read-intensive AI training workloads. By tightly integrating flagship compute nodes with industry-leading AI-native parallel file systems, the solution harnesses advanced hardware–software co-design to deliver breakthrough performance, seamless scalability, and superior cost efficiency for ultra-large-scale AI computing environments.

Architectural Innovation: Overcoming AI Training Efficiency Bottlenecks

The KAYTUS solution establishes a unified namespace with native multi-protocol access across file, object, and block storage. By leveraging high-capacity QLC flash pools and fully shared NVMe-oF interconnects, it redefines the unified data plane for AI storage, effectively eliminating the data silos inherent in traditional tiered architectures. Data can now flow on demand to GPU nodes without cross-system migration, enabling sub-millisecond access and significantly improving AI training data retrieval efficiency.

  • Hardware Optimization: Engineered for read-intensive workloads, the solution features a PCIe 5.0 direct-connect architecture that doubles single-node I/O bandwidth compared to the previous generation. Combined with NUMA-balanced optimization, it effectively eliminates internal throughput bottlenecks.
  • Software Synergy: The solution integrates NFS over RDMA and native GPUDirect Storage technology, enabling direct data paths from QLC flash to GPU memory. By leveraging a disaggregated architecture that decouples protocol processing from storage state, it eliminates east-west traffic and achieves fully linear scaling of bandwidth and throughput, from petabyte to exabyte scale.

10,000-GPU Cluster Benchmarks: Exceptional Performance, Scalability, and Cost Efficiency

In benchmark testing in an exabyte-scale storage environment for a 10,000-GPU data center, the solution, powered by KR2280 and KR1180 nodes and optimized with industry-leading AI-native parallel file systems, scaled seamlessly to support computing clusters of up to 10,000 GPUs.

  • Extreme Performance at Scale: The system delivers 10 TB/s sustained aggregate read bandwidth and 100 million random-read IOPS, enabling concurrent access for tens of thousands of GPUs. Performance scales linearly as additional nodes are added, while GPU utilization remains consistently above 95%, with no storage-side lock contention or queuing, effectively eliminating GPU data starvation.
  • Superior Cost Efficiency: Compared with traditional TLC all-flash solutions, the solution reduces five-year TCO by 70% and cuts power and cooling costs by more than 75%, helping enterprises avoid overpaying for unnecessary write endurance.
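As a rough sanity check on the headline figures above, the aggregate numbers can be divided evenly across the cluster to see what each GPU would receive. This is only an illustrative sketch: the even split, the decimal unit conversion (1 TB = 1,000 GB), and the helper name `per_gpu_share` are assumptions made here, not part of the KAYTUS benchmark methodology.

```python
# Headline figures quoted in the release.
AGGREGATE_READ_TBPS = 10        # sustained aggregate read bandwidth, in TB/s
AGGREGATE_IOPS = 100_000_000    # random-read IOPS
GPU_COUNT = 10_000              # cluster size


def per_gpu_share(total: float, gpus: int) -> float:
    """Evenly split a cluster-wide figure across all GPUs (hypothetical helper)."""
    return total / gpus


# Assumption: decimal units, so 1 TB/s = 1,000 GB/s.
bandwidth_gb_per_s = per_gpu_share(AGGREGATE_READ_TBPS * 1_000, GPU_COUNT)
iops_per_gpu = per_gpu_share(AGGREGATE_IOPS, GPU_COUNT)

print(f"Per-GPU read bandwidth: {bandwidth_gb_per_s:.1f} GB/s")
print(f"Per-GPU random-read IOPS: {iops_per_gpu:,.0f}")
```

Under these assumptions, each GPU would see roughly 1 GB/s of read bandwidth and 10,000 random-read IOPS. Real access patterns are unlikely to be this uniform.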

Metric (1 EB Capacity)    TLC SSD Solution    QLC SSD Solution    Difference
CAPEX                     1.0                 0.39                65% ↓
Power Cost                1.0                 0.29                75% ↓
5-Year TCO                1.0                 0.36                70% ↓

(Note: Based on 15.36 TB TLC vs. 61.44 TB QLC drive units)
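To give a sense of what the drive sizes in the note imply, the sketch below estimates how many drives of each type would be needed to reach 1 EB of raw capacity. It assumes decimal units (1 EB = 1,000,000 TB) and ignores redundancy, spares, and file system overhead, none of which the release specifies. The function name `drives_needed` is a hypothetical helper.

```python
import math

# Assumption: decimal units, 1 EB = 1,000,000 TB; raw capacity only.
EXABYTE_TB = 1_000_000

TLC_DRIVE_TB = 15.36   # TLC drive size from the note
QLC_DRIVE_TB = 61.44   # QLC drive size from the note


def drives_needed(capacity_tb: float, drive_tb: float) -> int:
    """Smallest whole number of drives whose raw capacity covers capacity_tb."""
    return math.ceil(capacity_tb / drive_tb)


tlc_drives = drives_needed(EXABYTE_TB, TLC_DRIVE_TB)
qlc_drives = drives_needed(EXABYTE_TB, QLC_DRIVE_TB)

print(f"TLC drives for 1 EB raw: {tlc_drives:,}")
print(f"QLC drives for 1 EB raw: {qlc_drives:,}")
print(f"Drive-count ratio (TLC / QLC): {tlc_drives / qlc_drives:.2f}")
```

Because the QLC drive is four times the size of the TLC drive, the drive count drops by about a factor of four, which is one plausible contributor to the space and power savings the release describes.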

 

KAYTUS All-Flash Portfolio: From High Density to Massive Capacity

KAYTUS offers a comprehensive QLC product portfolio supporting single-drive capacities of up to 122.88 TB.

  • KR1180 (1U10) — High-Density Excellence: Delivers 1 PB of capacity and 140 GB/s bandwidth in a 1U chassis, featuring optimized air cooling and 18% latency improvement under GPU workloads.
  • KR2280 (2U24) — Versatile Flagship: Supports 24 QLC drives and seven PCIe 5.0 slots. It is compatible with both Intel and AMD platforms and offers liquid-cooling options for high-efficiency data centers.
  • KR4266 (4U60) — Big Data Massive Storage: Offers industry-leading physical density with up to 7 PB per unit, delivering 260 GB/s sequential read bandwidth and 20 million IOPS.
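The per-chassis capacities above can be roughly cross-checked against the 122.88 TB maximum drive size. The sketch below is illustrative only: the 24-bay count for the KR2280 is stated in the release, but the 10-bay and 60-bay counts for the KR1180 and KR4266 are inferred here from the "1U10" and "4U60" labels and should be treated as assumptions. Capacities are raw, in decimal units.

```python
MAX_DRIVE_TB = 122.88  # largest single-drive capacity quoted in the release

# Drive bays per chassis. KR2280 is stated in the release; the other two
# are assumptions read off the "1U10" and "4U60" form-factor labels.
chassis_bays = {
    "KR1180": 10,
    "KR2280": 24,
    "KR4266": 60,
}


def raw_capacity_pb(bays: int, drive_tb: float = MAX_DRIVE_TB) -> float:
    """Raw capacity in PB for a fully populated chassis (1 PB = 1,000 TB)."""
    return bays * drive_tb / 1_000


for model, bays in chassis_bays.items():
    print(f"{model}: {bays} x {MAX_DRIVE_TB} TB = {raw_capacity_pb(bays):.2f} PB raw")
```

Under these assumptions, the results come to about 1.23 PB, 2.95 PB, and 7.37 PB, which is in line with the "1 PB" and "up to 7 PB" figures quoted for the KR1180 and KR4266.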

About KAYTUS

KAYTUS is a leading provider of AI infrastructure and liquid cooling solutions, delivering a diverse range of innovative, open, and eco-friendly products for cloud, AI, edge computing, and other emerging applications. With a customer-centric approach, KAYTUS is agile and responsive to user needs through its adaptable business model. Discover more at KAYTUS.com and follow us on LinkedIn and X.

 

Contacts

Media Contacts
media@kaytus.com

 
