KAYTUS Launches All-QLC Flash Storage at AI EXPO 2026 for 10,000-GPU Clusters
KAYTUS’s next-generation all-QLC flash solution delivers fully linear performance scaling for massive GPU clusters, while reducing TCO by 70%, enabling ultra-large-scale computing for the era of agentic AI.
SINGAPORE--(BUSINESS WIRE)--At AI EXPO KOREA 2026, KAYTUS officially launched its All-QLC Flash Storage Solution, engineered to deliver high performance, massive scalability, and cost efficiency for 10,000-GPU clusters. The solution addresses data-delivery bottlenecks in ultra-large-scale AI training, helping maximize GPU resource utilization.
Based on the KR2280 and KR1180 server platforms, the solution is deeply integrated with industry-leading AI-native parallel file systems to eliminate data silos inherent in traditional tiered storage. Purpose-built for read-intensive AI workloads, it overcomes the horizontal scaling limitations of massive clusters. Verified test-data shows that, at exabyte-scale deployment, the solution delivers 10 TB/s aggregate bandwidth and 100 million IOPS. In addition, it reduces five-year TCO by 70% compared with traditional TLC-based solutions, accelerating model innovation for AI cloud providers and intelligent computing centers.
Limitations in Traditional AI Storage Architectures.
The explosive growth of AI is fundamentally transforming enterprise computing and storage requirements. Large-scale AI model training features highly read-intensive workloads that require tens of thousands of GPUs to concurrently access exabyte-scale datasets with sub-millisecond latency. Traditional storage architectures now face three major challenges:
-
-
Separated Data Silos: Traditional ETL processes require data to be moved from object storage to parallel file systems before training, resulting in time-consuming physical data migration. IDC research indicates that data teams spend 81% of their time on data preparation, slowing business iteration.
-
-
Workload and Media Mismatch: More than 90% of AI training involves high-frequency concurrent reads. In contrast, traditional TLC flash solutions provide excessive write endurance that is unnecessary for these read-intensive workloads, driving up procurement, space, and power costs for exabyte-scale clusters and resulting in inefficient resource utilization.
-
-
Scalability Bottlenecks: Traditional file systems were not designed to handle the I/O burst workloads generated by 10,000-GPU clusters. As clusters scale, metadata lock contention and communication overhead introduce latency spikes and degraded overall performance.
-
KAYTUS Solution: All-QLC Flash Storage for Delivering High Performance, Scalability, and Cost Efficiency.
The next-generation KAYTUS All- QLC Flash Storage Server Solution is purpose-built to unlock the full potential of read-intensive AI training workloads. By tightly integrating flagship compute nodes with industry-leading AI-native parallel file systems, the solution harnesses advanced hardware–software co-design to deliver breakthrough performance, seamless scalability, and superior cost efficiency for ultra-large-scale AI computing environments.
Architectural Innovation: Overcoming AI Training Efficiency Bottlenecks.
The KAYTUS solution establishes a unified namespace with native multi-protocol access across file, object, and block storage. By leveraging high-capacity QLC flash pools and NVMe-oF fully shared interconnects, it redefines the unified data plane for AI storage, effectively eliminating the data silos inherent in traditional tiered architectures. Data can now flow on demand to GPU nodes without cross-system migration, enabling sub-millisecond access, and significantly improving AI training data retrieval efficiency.
-
-
Hardware Optimization: Engineered for read-intensive workloads, the solution features a PCIe 5.0 direct-connect architecture that doubles single-node I/O bandwidth compared to the previous generation. Combined with NUMA-balanced optimization, it effectively eliminates internal throughput bottlenecks.
-
-
Software Synergy: The solution integrates NFS over RDMA and native GPU Direct Storage technology, enabling direct data paths from QLC flash to GPU memory. By leveraging a disaggregated architecture that decouples protocol processing from storage states, it eliminates east-west traffic and achieves fully linear scaling of bandwidth and throughput, from petabyte to exabyte scale.
-
10,000-GPU Cluster Benchmarks: Exceptional Performance, Scalability, and Cost Efficiency
In benchmark testing in an exabyte-scale storage environment for a 10,000-GPU data center, the solution—powered by KR2280 and KR1180 nodes and optimized with industry-leading AI-native parallel file systems—demonstrated its capability to scale seamlessly to support computing clusters of up to 10,000 GPUs.
-
-
Extreme Performance at Scale: The system delivers 10 TB/s sustained aggregate read bandwidth and 100 million random-read IOPS, enabling concurrent access for tens of thousands of GPUs. Performance scales linearly as additional nodes are added, while GPU utilization remains consistently above 95%, with no storage-side lock contention or queuing, effectively eliminating GPU data starvation.
-
-
Superior Cost Efficiency: Compared with traditional TLC all-flash solutions, the solution reduces five-year TCO by 70%, cuts power and cooling costs by more than 75%, helping enterprises avoid overpaying for unnecessary extra write endurance.
-
|
Metric (1 EB Capacity) |
TLC SSD Solution |
QLC SSD Solution |
Difference |
|
CAPEX |
1.0 |
0.39 |
65% ↓ |
|
Power Cost |
1.0 |
0.29 |
75% ↓ |
|
5-Year TCO |
1.0 |
0.36 |
70% ↓ |
|
(Note: Based on 15.36T TLC vs 61.44T QLC drive units) |
|||
KAYTUS All-Flash Portfolio: From High Density to Massive Capacity.
KAYTUS offers a comprehensive QLC product portfolio supporting single-drive capacities of up to 122.88 TB.
-
-
KR1180 (1U10) — High-Density Excellence: Delivers 1 PB of capacity and 140 GB/s bandwidth in a 1U chassis, featuring optimized air cooling and 18% latency improvement under GPU workloads.
-
-
KR2280 (2U24) — Versatile Flagship: Supports 24 QLC drives and seven PCIe 5.0 slots. It is compatible with both Intel and AMD platforms and offers liquid-cooling options for high-efficiency data centers.
-
-
KR4266 (4U60) — Big Data Massive Storage: Offers industry-leading physical density with up to 7 PB per unit, delivering 260 GB/s sequential read bandwidth and 20 million IOPS.
-
About KAYTUS
KAYTUS is a leading provider of AI infrastructure and liquid cooling solutions, delivering a diverse range of innovative, open, and eco-friendly products for cloud, AI, edge computing, and other emerging applications. With a customer-centric approach, KAYTUS is agile and responsive to user needs through its adaptable business model. Discover more at KAYTUS.com and follow us on LinkedIn and X
- 佳因特科技:创新驱动绿色能源未来 重塑全球充电基础设施
- 韦昌铠携经云手:以民族医药之光,照亮健康中国路
- Thredd签署里程碑协议,助力Visa Cloud Connect服务全球
- Bitget 开放AI代理辅助交易功能,促进加密货币更广泛采用
- 星能冠军助学项目——民生银行河南封丘行
- 朱虹:做好重大选择 点亮人生梦想
- Esaote launches the new MyLabTM E85 GTS ultrasound system in Vienna
- 20亩丝瓜年入40万;西红柿全城热销,他们的秘诀竟然是一袋肥料!
- 星展香港及Yedpay合作推出「DBS MAX商户服务方案」一站式销售管理平台
- 容祖儿成为香港启德开幕首位表演歌手 高燃开场《越唱越强》见证荣耀时刻
- DeepSeek赋能企业研发:DevOps+AI新时代再升级
- 临商银行北京路支行推进数字化转型 提升服务效能
- 智原打造基于联电28纳米SST eFlash平台的终端AI IP解决方案
- 家政保洁服务:打造舒适宜居环境的首选
- SPIE公布17届年度棱镜奖的光子学行业最佳新产品
- 走进彝乡为彝客
- 北京狼牙国际救援中心云南省应急产业园筹备会召开
- FOODEX JAPAN盛大开幕 雪川农业集团展现中国薯世界味
- 水晶凝雅韵 手作悦心颜 中荷人寿安徽省分公司举办水晶手作蜜丝会活动
- 读张迎老师所著《我的莲花青年》:探寻成长之路,绽放莲花般的青春
- 企业群像 | 《控制工程网》采访:中科时代引领通用自动化变革,解锁产业跃迁机遇
- 移为通信亮相全美车队峰会:AIoT+多模定位技术打通北美物流“端到端”资产可视化
- CRH开发风电场为罗马尼亚水泥厂供电
- Xsolla SDK现已面向全球游戏开发者开放
- 智云健康独家商业化运营,国内首个碳酸司维拉姆干混悬剂上市
- 广西:“硬核”新品闪耀盛会 聚势同兴有“服”共享
- 围美辣妈“三角减肥法”,为全球肥胖防治提供中国方案
- 八脉六合承古韵,仁心仁术惠苍生—— 记赵氏八脉六合疏通法传承人赵连强
- Snail Games 旗下 Wandering Wizard 携手跨界合作,提升《雪山之上》和《反抗引擎》全球知名度
- 《法治中国》频道与河南安达爆破达成战略合作 共启法治护航企业发展新篇章





