Data Infrastructure

Cold Storage Economics and the Compliance Drivers Reshaping Archival

Cold storage server room

Cold storage is not a glamorous category. It does not generate the conference buzz that real-time analytics or ML infrastructure does. Nobody is writing breathless threads about archival solutions. And yet, over the past three years, we have watched the cold storage and archival market quietly become one of the more interesting places to invest in data infrastructure — not because the technology changed dramatically, but because the economics and compliance pressure around data retention have.

The core tension is simple. Data volumes are growing at roughly 23% annually across the enterprise customers we work with. Retention mandates — how long a company is legally required to keep certain categories of data — are getting longer, not shorter. Storage costs have declined, but not at a rate that keeps pace with volume growth combined with longer retention requirements. The math is getting worse for enterprise IT teams every year, and the people responsible for it are looking for better options.

What Has Changed on the Compliance Side

Regulatory frameworks around data retention have not historically moved fast, but the past five years have seen meaningful change in a few areas that directly affect enterprise archival strategy.

Financial services firms are dealing with SEC and FINRA requirements that have expanded both the categories of communications that must be retained and the duration of retention. The enforcement actions in this space have been significant — over $1.8 billion in fines across major financial institutions for communications record-keeping failures since 2021. That number gets every financial services CTO's attention.

Healthcare organizations face HIPAA retention requirements that vary by record type and state law, with some states now mandating retention periods extending to 10 years or beyond for certain patient record categories. The complexity of managing retention policies across a large health system — where record types, patient ages, and state jurisdictions all intersect — is genuinely hard to do without purpose-built tooling.

The broader corporate compliance picture has also shifted. Litigation hold requirements mean companies often need to preserve data well beyond standard retention schedules when legal proceedings are reasonably anticipated. Managing litigation hold overlays on top of standard retention policies is a real operational challenge for the legal and IT functions at large enterprises.

Why the Existing Solutions Are Inadequate

Most enterprises today handle archival with one of three approaches, none of which are actually well-suited to the current environment:

Tape libraries. The legacy choice. Still in use at many large enterprises because the per-byte economics are hard to beat at scale. The problems are retrieval speed (hours to days for anything that requires physical tape handling), operational complexity, and the expertise required to maintain aging tape infrastructure. Tape is not going away for bulk archival of truly cold data, but it cannot serve as a complete solution for data that might need to be retrieved for litigation hold or regulatory inquiry on short notice.

Object storage tiers. Most cloud providers offer tiered storage with distinct pricing for infrequently accessed data. The economics are better than hot storage but are frequently poorly understood — particularly around retrieval costs, which can be significant when large data volumes need to be accessed for compliance purposes. We have seen enterprises receive unexpectedly large bills when they needed to pull significant data volumes for a regulatory inquiry. That surprise cost element undermines the planning confidence enterprises need for long-term archival budget modeling.

SAN/NAS overflow. Many enterprises simply push data to on-premise storage arrays that have been fully depreciated. This works until it does not — hardware failures, capacity limits, and the absence of any policy enforcement layer mean compliance managers often cannot actually verify what data is retained where and for how long.

What Coldforge Is Building

Our investment in Coldforge at Series B reflects a bet on a specific approach to solving this problem. Rather than competing on raw per-byte storage cost alone, Coldforge built a compliance-aware archival layer that manages retention policy enforcement, litigation hold overlays, and retrieval SLAs as first-class product capabilities. The storage layer underneath is configurable — customers can use whatever hardware or cloud storage tier fits their cost model — but the policy and compliance management layer above it is where the differentiation sits.

For a financial services customer, this means the compliance team can define retention schedules by record category, the legal team can place litigation holds with documented chain of custody, and the IT team can query exactly what is retained where without running a manual audit. Those three functions have historically operated separately, often with conflicting or poorly documented policies. Coldforge integrates them.

The retention numbers from Coldforge's early customers are strong — above 130% NRR — which in a category where switching costs are inherently high is not surprising, but the magnitude reflects genuine product satisfaction rather than just lock-in. We do not think they are the only company solving this problem well, but they have the clearest product vision for the regulated-industry use case of any company we evaluated in this space.

The Broader Market Dynamics

Cold storage and archival is not a market that gets disrupted by a single innovation. The unit economics are constrained by the fundamental cost of storing bits across time. What does shift is the complexity and risk profile of managing compliance obligations around retained data — and that is where the software layer creates sustained value.

Enterprises will spend more on archival management software over the next decade, not less. The volume of data subject to retention requirements is growing. The regulatory complexity around that retention is increasing. The litigation risk associated with inadequate retention practices is well-documented and large. Those three forces do not reverse.

For investors, the question is which layer of the stack captures the value. We think it is the policy and compliance management layer, not the raw storage layer. That is the layer where the problem is hardest, the switching costs are highest, and the buyer has the clearest sense of what failure looks like.

Continue Reading