Table of Contents
This white paper demonstrates through rigorous, verifiable analysis that deploying a 1 exabyte (EB) usable storage solution utilizing Solidigm™ D5-P5336 high-density 122TB Quad-Level Cell (QLC) SSDs, powered by VAST Data's AI Operating System, offers a significantly lower 10-year Total Cost of Ownership (TCO) compared to a traditional approach using 30TB Hard Disk Drives (HDDs) with CEPH software. Solidigm is primarily a hardware company, so this analysis focuses exclusively on the impact of storage-centric hardware and operational expenditures (OpEx).
The Solidigm + VAST solution delivers an estimated 10-year hardware and OpEx TCO of approximately $35.19 million USD, representing a substantial 58.9% saving compared to the HDD/CEPH solution's calculated TCO of $85.62 million USD.
Cost Category | HDD/CEPH ($M) | SSD/VAST ($M) | Difference ($M) | Difference (%) |
Total 10-Year Hardware Cost | $40.77 | $29.06 | -$11.71 | -28.7% |
Total 10-Year Power Cost | $13.65 | $3.13 | -$10.52 | -77.1% |
Total 10-Year Space Cost | $31.20 | $3.00 | -$28.20 | -90.4% |
Total 10-Year TCO | $85.62 | $35.19 | -$50.43 | -58.9% |
Table 1. Estimated 10-year TCO summary for hardware & OpEx only1 (Note: The sums are rounded)
While this analysis focuses on a 1EB deployment, the fundamental economic advantages demonstrated hold true for capacities ranging from as low as 20 Petabytes (PB) and can be extrapolated to Zettabyte (ZB) scale.
This substantial hardware and OpEx cost advantage stems from a confluence of factors inherent to the Solidigm + VAST architecture, fundamentally reshaping the economics of large-scale storage. The synergy of these elements creates a compounding benefit structure. Dramatically higher storage density, enabled by the Solidigm 122TB QLC SSD, drastically reduces the physical footprint. Fewer drives, fewer chassis, and significantly fewer data center racks are required. This initial reduction in physical hardware directly translates to lower acquisition costs for the drives themselves, and subsequently, lower power and space operational expenditures.
VAST Data's advanced software contributes compounding efficiencies. Similarity data reduction, storing only the differences in similar blocks of data, achieves significant capacity savings beyond what is physically deployed. Additionally innovative, flash-optimized erasure coding minimizes data protection overhead to industry-leading lows. Combined with VAST Data’s extended 10-year operational infrastructure lifespan and supported by VAST Data’s comprehensive warranty that includes QLC endurance, these advances eliminate the need for costly and disruptive triennial or quadrennial HDD hardware refreshes. This extended lifespan is a critical lever in the TCO calculation, as it negates a major capital expenditure cycle inherent in traditional HDD-based systems.
These factors can culminate in substantial savings in capital expenditure over the decade, with significant reductions in operational expenses related to power consumption and data center space utilization. These economies challenge the traditional paradigm where all-flash storage was reserved for high-performance, low-capacity tiers; this analysis indicates that for large-scale, long-term deployments, all-flash can be the more cost-effective solution for capacity-centric workloads as well. This analysis provides evidence-based support for this economic proposition.
Solidigm and VAST Data have collaborated to engineer a storage future that reduces the traditional trade-offs between performance, capacity, and cost. By leveraging Solidigm's leadership in high-density flash memory and VAST Data's revolutionary software architecture, this effort delivers a solution designed for the demands of the exabyte era. This analysis quantifies the profound economic benefits of this collaboration for organizations managing data at unprecedented scale, demonstrating a clear path towards a more efficient, sustainable, and cost-effective storage infrastructure.
Organizations managing data at the exabyte scale require storage solutions that are efficient, reliable, performant, and cost-effective over the long term. This paper analyzes two distinct architectural approaches designed to meet these demands:
The positioning of these solutions as "traditional" versus "modern" or "next-generation" frames an important narrative. The claims made for the modern solution, such as advanced data reduction and erasure coding, are substantiated by vendor-provided information and technical documentation, lending credibility to its advanced nature. Conversely, the industry-acknowledged limitations of legacy systems, when supported by industry observations or analyses, strengthen the case for exploring newer architectures.
The primary objective of this white paper is to provide a transparent and verifiable 10-year TCO comparison between these two solutions when deployed to deliver 1EB of usable storage capacity. This TCO excludes software platform costs, focusing solely on:
While the calculations focus on the 1EB mark, the relative economic benefits scale, making this analysis relevant for deployments starting from approximately 20PB and extending towards zettabyte (ZB) levels. By examining costs related to hardware acquisition, mandatory refreshes, power consumption, and data center space, this analysis aims to equip technical leaders with the quantitative data needed to make informed, strategic decisions regarding their long-term, large-scale storage infrastructure investments.
For transparent comparison, this TCO analysis is built upon a framework of defined assumptions, each supported by publicly available data, vendor specifications, industry reports, or direct vendor attestation where applicable. The goal is to provide a realistic, albeit specific, scenario reflecting the potential economics of deploying storage at the exabyte scale. The validity of the entire TCO model rests upon the credibility of these foundational assumptions. Therefore, each is scrutinized for its supporting evidence.
The analysis targets the delivery of 1EB of usable storage capacity for both the HDD/CEPH and SSD/VAST solutions. (1EB = 1000 Petabytes (PB) = 1,000,000 Terabytes (TB)).
Pricing reflects anticipated costs for procurement at the exabyte level, which understandably entails significant volume discounts compared to standard enterprise or retail pricing. All prices are in USD.
The assumed operational lifespan and replacement frequency differ significantly between the two solutions, driven by technology characteristics, warranty coverage, and operational best practices for maintaining performance and reliability at scale. This difference is a cornerstone of the TCO divergence and relies on contrasting the inherent properties of mechanical drives versus solid-state media, alongside vendor commitments.
To account for the expected decline in storage costs over time, each subsequent HDD refresh (at year 4 and year 8) is assumed to benefit from a 25% price reduction per TB compared to the previous purchase price. For example, the year 4 price per TB is 75% of the year 0 price, and the year 8 price per TB is 75% of the year 4 price.
Supporting Evidence: Historically, the cost per terabyte for HDDs has consistently decreased as technology advances and areal densities increase, although the rate of decline can fluctuate. Backblaze observed an average annual decrease of over 9% across various drive sizes between 2017 and 2022.5 A 25% price reduction over a 4-year period implies an average annual decrease of approximately 7%, which is considered plausible for large-volume procurement contracts anticipating continued technological progress in the HDD market.
The significant difference in assumed operational lifespan—requiring two full replacements for the HDD fleet versus none for the SSDs over 10 years—is a fundamental factor driving the TCO divergence. This difference is grounded in the physical nature of the technologies, observed failure rate patterns for HDDs in active use, and the specific software capabilities and warranty commitments provided by the VAST Data solution for managing QLC SSD endurance.
The efficiency with which raw storage capacity is converted into usable capacity varies significantly due to filesystem requirements, data protection needs, and data reduction technologies. These early-stage assumptions about efficiency have a profound multiplier effect, as they determine the total raw physical capacity required, which in turn influences hardware count, power consumption, and space utilization.
Filesystem metadata overhead: An overhead of 10% is assumed for filesystem metadata for both the HDD/CEPH and SSD/VAST solutions.
Supporting Evidence: Filesystem metadata (tracking file locations, attributes, permissions, etc.) consumes a portion of the raw storage capacity. The actual percentage varies based on the filesystem used, the average size of the files being stored, and other configuration factors, potentially ranging from less than 1% for very large files to over 20% for systems dominated by extremely small files. Industry guidelines and practical experience often use 10% as a reasonable rule-of-thumb for planning purposes in large, mixed-use environments. For instance, AWS FSx for ONTAP suggests 3% to 7% for typical file sizes but recommends planning with a 10% overhead (1 GiB SSD per 10 GiB data) for metadata associated with capacity pool storage. VMware vSAN ESA also allocates approximately 10% for global metadata.
Data protection overhead: The storage capacity beyond the size of the user data required to protect against drive failures differs based on the chosen protection scheme.
Supporting Evidence: This overhead level reflects a common balance between resilience (protecting against multiple drive failures) and capacity efficiency in large-scale HDD-based systems using erasure coding. While specific erasure coding schemes (like k+m) determine the exact overhead, 12.5% (equivalent to an 8+1 scheme's overhead, or slightly better than a typical 14+2 scheme's 14.3%) represents a reasonable and achievable efficiency target for a well-configured CEPH cluster at exabyte scale, prioritizing usable capacity while maintaining robust data protection against common failure scenarios.
Supporting Evidence: VAST Data employs novel, patented locally decodable erasure codes specifically optimized for flash media. These codes allow for much wider stripes (e.g., 146 data chunks + 4 parity chunks, or 146+4) compared to traditional Reed-Solomon codes used in many HDD-based systems. Wider stripes significantly reduce the relative overhead. A 146+4 scheme results in an overhead of m/k = 4/146 ≈ 2.7%.
Data reduction efficiency: Data reduction techniques like deduplication and compression can further increase storage efficiency by reducing the amount of physical capacity needed to store logical data. The stark contrast in assumed data reduction capabilities—0% for HDD/CEPH versus 2.5:1 for Solidigm SSD/VAST—is pivotal and requires robust justification based on the performance characteristics of the underlying media.
The combination of VAST's efficient erasure coding (low overhead) and its effective data reduction technology results in a notable compounding effect on overall storage efficiency. The end result is the Solidigm + VAST solution requires significantly less raw physical capacity to deliver the target 1EB of usable storage compared to the HDD/CEPH approach.
Based on the overheads and data reduction assumptions, the total raw physical capacity required for each solution is estimated as follows:
Parameter | HDD/CEPH | Solidigm SSD + VAST |
Usable Capacity Target (EB) | 1.000 | 1.000 |
Filesystem Overhead Factor | 1 / (1 - 0.10) | 1 / (1 - 0.10) |
Capacity After FS Overhead (EB) | 1.111 | 1.111 |
Data Protection Overhead Factor | 1 + 0.125 | 1 + 0.027 |
Capacity After Protection (EB) | 1.250 | 1.141 |
Data Reduction Ratio | 1:1 | 2.5:1 |
Calculated Raw Capacity Required (EB) | 1.250 | 0.456 |
Table 2. Effective vs. raw drive capacity calculation
This calculation illustrates the efficiency difference: the Solidigm + VAST solution requires approximately 2.7 times less raw physical storage (1.250EB / 0.456EB) than the HDD/CEPH solution to deliver the same 1EB of usable capacity. This fundamental difference in required physical hardware drives downstream reduction on component counts, capital costs, power consumption, and space utilization.
Figure 1. Calculated raw capacity for SSD vs HDD exabyte storage
The calculated raw capacity requirements translate into the following component counts for the initial solution deployment:
Component | HDD/CEPH Solution | Solidigm SSD/VAST Solution |
Seagate 30TB HDD | 41,667 | 0 |
Solidigm D5-P5336 122TB SSD | 0 | 3,738 |
4U 90-Bay HDD Chassis (Est.) | 463 | 0 |
1U VAST Ceres SSD Chassis (Est.) | 0 | 170 |
Table 3. Hardware component summary
Table 3 highlights the reduction in the number of physical components required by the Solidigm + VAST solution—approximately 11 times fewer drives and 2.7 times fewer chassis. These figures, derived directly from the raw capacity calculations and vendor-specified component capacities, make the physical scale difference concrete and easily understandable.
Standard assumptions for a modern, reasonably efficient data center environment are used.
Based on the hardware scale and infrastructure assumptions, the physical footprint and power requirements are estimated as follows:
Parameter | HDD/CEPH | Solidigm SSD/VAST | Unit |
Total Raw Capacity Required | 1.250 | 0.456 | EB |
Total Drive Count | 41,667 | 3,738 | Drives |
Total Chassis Count | 463 | 170 | Chassis |
Chassis U Height | 4 | 1 | U |
Total U Space | 1,852 | 170 | U |
Total IT Power (Average/Active) | 733.86 | 170 | kW |
Racks Needed | 52 | 5 | Racks |
Total Facility Power (incl. 1.4 PUE) | 1038.56 | 238 | kW |
Avg. IT kW per Rack (Populated) | 14.27 | 34.0 | kW/Rack |
Avg. Weight per Rack (Populated) | 1325 | 1153 | kg/Rack |
Table 4. Infrastructure footprint comparison estimate
The infrastructure footprint analysis quantifies the dramatic physical consolidation enabled by the Solidigm + VAST solution. Driven by the higher density of the Solidigm 122TB SSDs and the efficiency gains from VAST's software, the all-flash approach requires over 10 times fewer racks (5 racks vs. 52 racks). The increased chassis overhead for HDDs widens the gap in total facility power consumption (238 kW for SSD/VAST vs. 1038.56 kW for HDD/CEPH). These differences in physical and electrical footprints, derived from component specifications and industry-standard data center metrics, are primary drivers of the OpEx variations in the TCO.
Figure 2. Total racks needed for SSD vs HDD exabyte storage
The following cost categories are intentionally excluded from the TCO calculation:
This section calculates and compares the 10-year TCO for both storage solutions, aggregating the costs of hardware acquisition and refreshes, power consumption, and data center space based on the framework established in the previous section. The analysis demonstrates how the initial higher capital expenditure for the Solidigm SSD/VAST solution is overcome by significant long-term savings in refresh costs and operational expenditures.
Cost Component | HDD/CEPH Cost ($M) | Solidigm SSD/VAST Cost ($M) |
Year 0 (Initial Drives) | $16.63 | $27.36 |
Year 0 (Initial Chassis - Est.) | $2.32 | $1.70 |
Year 0 Total | $18.95 | $29.06 |
Year 4 (HDD Refresh - Drives Only) | $12.47 | $0.00 |
Year 8 (HDD Refresh - Drives Only) | $9.35 | $0.00 |
Total 10-Year Hardware Cost | $40.77 | $29.06 |
Table 5. 10-year estimated hardware investment comparison
The initial hardware investment for the Solidigm + VAST solution is estimated to be approximately 53% higher than the HDD/CEPH solution. However, due to two full drive refreshes for the HDD solution (adding ~$21.8M), its total 10-year hardware cost is approximately 40% higher than the all-flash solution. This illustrates the critical impact of lifecycle assumptions: the initial lower expense for HDDs becomes more expensive over the decade due to repeated replacement cycles, a cost that can be avoided by the longer-lifespan SSD solution backed by VAST Data’s 10-year warranty.
Figure 3. Total 10-year hardware cost for SSD vs HDD exabyte storage
Parameter | HDD/CEPH | Solidigm SSD/VAST | Unit |
Total IT Power (Avg/Active) | 733.86 | 170 | kW |
PUE Factor | 1.4 | 1.4 | - |
Total Facility Power (kW) | 1038.56 | 238 | kW |
Annual Energy Consumption | 9,098,194 | 2,085,120 | kWh |
Cost per kWh | $0.15 | $0.15 | USD/kWh |
Annual Power Cost | $1,364,729 | $312,768 | USD |
Total 10-Year Power Cost ($M) | $13.65 | $3.13 | USD M |
Table 6. 10-year estimated power cost comparison
The Solidigm + VAST solution's lower component count and higher energy efficiency result in a 77% reduction in total facility power consumption compared to the HDD/CEPH system. Over 10 years, this translates into over $10.5 million in estimated direct power cost savings. This operational saving is a direct consequence of the superior density and efficiency of SSD technology and the VAST architecture.
Figure 4. Total 10-year power cost for SSD vs HDD exabyte storage
Parameter | HDD/CEPH | Solidigm SSD/VAST | Unit |
Rack Count | 52 | 5 | Racks |
Cost per Rack per Month | $5,000 | $5,000 | USD |
Annual Space Cost | $3,120,000 | $300,000 | USD |
Total 10-Year Space Cost ($M) | $31.20 | $3.00 | USD M |
Table 7. 10-year space cost comparison
The SSD/VAST solution's extreme density (5 racks vs. 52 racks) yields an estimated $28.2 million saving in space costs over 10 years. This highlights how the physical footprint, driven by storage density, becomes a major economic differentiator in long-term, large-scale deployments, especially when using premium data center rack space.
Cost Category | HDD/CEPH ($M) | Solidigm/VAST ($M) | Difference ($M) | Difference (%) |
Total 10-Year Hardware Cost | $40.77 | $29.06 | -$11.71 | -28.7% |
Total 10-Year Power Cost | $13.65 | $3.13 | -$10.52 | -77.1% |
Total 10-Year Space Cost | $31.20 | $3.00 | -$28.20 | -90.4% |
Total 10-Year TCO | $85.62 | $35.19 | -$50.43 | -58.9% |
Table 8. Comprehensive estimated 10-year TCO savings summary hardware & OpEx only (Note: The sums are rounded)
The comprehensive 10-year TCO analysis, reflecting increased HDD chassis power overhead, further amplifies the economic advantage of the Solidigm + VAST all-flash solution. The higher power cost for the HDD/CEPH solution, combined with its large space footprint, increases its TCO. While HDDs may offer a lower initial cost per terabyte, the cumulative expenses associated with their lower efficiency, shorter refresh cycles, and higher power and space demands make them a more expensive proposition over a 10-year horizon at exabyte scale.
While the 10-year TCO analysis provides a compelling quantitative comparison, qualitative factors also differentiate the two solutions and impact their overall value proposition and strategic fit for modern data centers. These aspects, though not directly factored into the TCO calculation above, are critical for a holistic evaluation, often translating into strategic advantages that extend beyond direct cost savings.
The fundamental difference in media (mechanical spinning disk vs. flash) results in orders-of-magnitude performance differences. HDDs offer millisecond-level latency and IOPS typically in the low hundreds per drive. Solidigm SSDs, particularly NVMe-based QLC drives in an optimized system like VAST Data's, deliver microsecond-level latency and vastly higher IOPS and bandwidth.4
The Solidigm + VAST solution involves significantly fewer physical components (drives, chassis, racks) compared to the HDD/CEPH setup. SSDs generally exhibit lower annualized failure rates (AFR) than HDDs, especially as HDDs age past their initial years of service.3 The elimination of hardware refreshes for the SSD/VAST solution over the 10-year period contrasts sharply with the two full, disruptive, and labor-intensive refreshes required for the HDD fleet.
Environmental considerations are important for organizations and their customers.
The efficiencies of the all-flash solution can free up critical data center resources.
Factor | HDD/CEPH Assessment | Solidigm SSD/VAST Assessment |
Performance | ||
Latency | Millisecond-level latency. | Microsecond-level latency. |
IOPS | Low IOPS (100-200 range per drive). | Very high IOPS (100k+ per system). |
Bandwidth/Throughput | Moderate sequential throughput. | Very high sequential throughput. |
Reliability & Maintenance | ||
Component Count & Failure Points | Very high drive/chassis count. | Significantly lower drive/chassis count. |
Drive Failure Rates (AFR) | Higher AFR (1-3%+ typical, increases with age). | Generally lower AFR (~0.5-1%). |
Hardware Refreshes | Requires two disruptive, large-scale refreshes. | No hardware refreshes required (10-year warranty). |
Maintenance Effort / FTE Req. | High due to component count, failure rates, refreshes. | Significantly lower due to fewer components, lower AFR, and no refreshes. |
10-Year Environmental Profile | ||
Embodied Carbon (Lifecycle) | Lower initial per-drive CO2e, but 3x manufacturing cycles due to refreshes. | Higher initial per-drive CO2e, but 1x manufacturing cycle; lower raw capacity needed. |
Operational Power | Very high-power consumption due to drive mechanics, large component count, and high chassis overhead. | Significantly lower power consumption (~77% less) due to flash efficiency and density. |
E-Waste | Generates substantial e-waste from two full hardware refreshes. | Avoids refresh-related e-waste. |
Material Use | Requires more raw materials due to lower density. | Requires fewer raw materials due to extreme density. |
Strategic Impact | ||
Resource Reallocation | Consumes significantly more space ($28M+ higher cost) and power (~800 kW more), limiting expansion. | Frees up substantial power (~800 kW) and especially space (47 racks, $28M+ cost saving) for reallocation. |
AI/GPU Acceleration | Performance limitations can bottleneck GPUs. | High performance effectively feeds GPUs, accelerating AI/ML. |
Table 9. Qualitative factor comparison
The analysis presented in this whitepaper provides a data-driven comparison of the 10-year Total Cost of Ownership (TCO), focusing on hardware and operational expenditures, for deploying 1 exabyte of usable storage using two distinct technological approaches.
Based on the assumptions and calculations within this framework, the findings favor the modern all-flash solution. Solidigm D5-P5336 122TB QLC SSDs combined with the VAST Data AI Operating System demonstrate a lower 10-year TCO, calculated at $35.19 million USD, compared to $85.62 million USD for the traditional high-capacity HDD solution managed by CEPH. This represents an estimated savings of $50.43 million USD, or approximately 58.9%, over the decade.
This compelling economic advantage is driven by a confluence of factors inherent to the Solidigm + VAST architecture:
Beyond the TCO reduction, the Solidigm + VAST solution offers critical qualitative and strategic advantages including superior performance for AI/ML workloads, lower maintenance burdens, increased reliability, and a stronger long-term sustainability profile. The power and space savings also offer valuable opportunities for resource reallocation within the data center, potentially deferring or avoiding costly facility expansions.
Crucially, this analysis suggests that the long-held assumption of needing a separate, lower-cost HDD tier for "colder" data may no longer be economically justified when considering a 10-year horizon, particularly in new data center builds or for large-scale capacity expansions. The cumulative cost of managing less efficient, shorter-lived HDD infrastructure at scale can outweigh its initial acquisition cost benefits. The paradigm is shifting, suggesting that a single, high-density, efficient all-flash tier can be the most economical approach for a wider range of data, not just performance-critical applications.
For organizations planning and investing in storage infrastructure at the exabyte scale, the transition to an all-flash architecture powered by Solidigm QLC SSDs and VAST Data software represents a fundamentally more economical, efficient, and strategically advantageous path forward. The evidence indicates that the Solidigm + VAST solution delivers not only substantial cost savings, but also the performance, reliability, and scalability required to meet the challenges to unlock the opportunities of the data-driven future.
The estimates provided to you in this paper are for information purposes only. Some results have been estimated or simulated using internal Solidigm or other analysis or architecture simulation or modeling. Tests document performance of components on a particular test, in specific systems.
Features and benefits of Solidigm technologies depend on system configuration and may require enabled hardware, software, or service activation. Differences in hardware, software, or configuration will affect actual performance. Your results may vary.
Solidigm does not control or audit third-party data. You should consult other sources of information to evaluate accuracy and/or performance as you consider your purchase. Performance results are based on testing as of dates shown in the configurations and may not reflect all publicly available updates. See configuration disclosure for details.
Nothing herein is intended to create any express or implied warranty, including without limitation, the implied warranties of merchantability, fitness for a particular purpose, and non-infringement, or any warranty arising from course of performance, course of dealing, or usage in trade.