The AI Drive Storage Shortage Threatens Internet Archive and Digital History

Enterprise drive shortages force Internet Archive to seek donations as AI companies monopolize storage market

Annemarije de Boer Avatar
Annemarije de Boer Avatar

By

Image: Beatrice Murch via Wikimedia Commons

Key Takeaways

Key Takeaways

  • AI companies purchased 2025-2026 HDD production, causing 30-50% price surges
  • Internet Archive’s 210 petabytes face existential threat from unaffordable storage costs
  • Storage shortage expected to persist until 2027-2029, threatening digital preservation projects

That favorite deleted YouTube video you love? It might be gone forever, thanks to hard drives that Internet Archive can’t afford anymore. AI data centers have created an unprecedented storage shortage, with enterprise HDDs backordered for two years and prices surging 30-50% since late 2024. This isn’t just another supply chain hiccup—it’s actively threatening digital history preservation while tech giants monopolize storage.

AI’s Storage Appetite Devours Everything

Seagate, Western Digital, and Toshiba sold their 2025-2026 production to AI companies before most people realized what was happening. “No meaningful open production remains for discretionary buyers except hyperscalers,” according to Sid Nag from Tekonyx. Those 28-30TB drives that cost $549 last year? They’re $599 now—if you can find them.

Meanwhile, hyperscalers are stockpiling storage like digital doomsday preppers, preparing for training runs that demand petabytes of capacity. The shift has manufacturers prioritizing enterprise clients with deep pockets over the archivists and researchers who’ve quietly preserved digital culture for decades.

Internet Archive Fights for Digital Survival

Brewster Kahle’s Internet Archive, which maintains 210 petabytes of human knowledge, now faces an existential crisis. The organization that preserves everything from GeoCities pages to government documents can’t secure the high-capacity drives it needs for daily operations. They’re seeking donations and manufacturer aid while watching storage costs potentially double.

When the organization dedicated to preventing digital amnesia struggles to buy memory, something’s fundamentally broken. Your access to deleted websites, archived software, and digitized books hangs in the balance of corporate procurement decisions.

Data Hoarders Declare Defeat

The r/DataHoardder subreddit—once filled with enthusiasts backing up terabytes of content—now reads like a support group. Users report “calling it quits” on new drives, hunting eBay for older hardware, and extending the life of aging equipment.

Academic projects suffer too: University of North Texas professor Mark Phillips faces doubled RAM and SSD costs for his End of Term Archive. These aren’t enterprise buyers—they’re the digital librarians preserving culture that corporations ignore.

No Relief Until 2029

“Memory price hikes are likely to become the new normal,” warns Seagate’s Ban-Seng Teh. The shortage could persist into 2027-2029, forcing archivists and researchers to develop workarounds or simply accept that some digital history will disappear.

Meanwhile, AI companies continue their data-hungry expansion, viewing storage costs as just another line item in their billions of capex spending. The irony cuts deep: AI systems trained on human knowledge are making it harder to preserve that same knowledge for future humans.

Share this

At Gadget Review, our guides, reviews, and news are driven by thorough human expertise and use our Trust Rating system and the True Score. AI assists in refining our editorial process, ensuring that every article is engaging, clear and succinct. See how we write our content here →