banner
datahoarding.org
@datahoarding.org
DataHoarding.org is an index of resources and archives related to data hoarding, web archival and self hosting.
Pinned
DataHoarding.org is an index of resources and archives related to data hoarding, web archival and self hosting.

If you know additional resources or archival sites, use the #DataHoarding tag.
Data Hoarding
An index of resources and archives related to data hoarding, web archival and self hosting.
DataHoarding.org
SciX, created by the team behind the NASA Astrophysics Data System (ADS), covers and unifies the fields of Earth science, planetary science, astrophysics, heliophysics, and the NASA-funded biological and physical sciences.

Added to #DataHoarding index. datahoarding.org/archives.htm...
November 23, 2025 at 10:16 PM
The mission of the National Centre for Scientific Research (CNRS) is to leverage all fields of sciences to tackle current global challenges.

Added to #DataHoarding index. datahoarding.org/archives.htm...
November 20, 2025 at 3:29 AM
From 1991 to 2021, the FAS Project on Government Secrecy worked to challenge excessive government secrecy and to promote public oversight in national security affairs.

Now added to the #DataHoarding index. datahoarding.org/archives.htm...
November 18, 2025 at 6:24 PM
The Discography of American Historical Recordings (DAHR) is a database of master recordings made by American record companies during the 78rpm era.

Now added to the #DataHoarding index. datahoarding.org/archives.htm...
November 18, 2025 at 6:24 PM
The VGHF Library is operated by the Video Game History Foundation, a non-profit organization dedicated to the history of video games. This is the home of video game development materials, magazines, artwork, ephemera, and more.

Added to the #DataHoarding index. datahoarding.org/archives.htm...
Data Hoarding - Archives
An index of resources and archives related to data hoarding, web archival and digital preservation.
datahoarding.org
November 11, 2025 at 12:13 AM
Bitsavers is an historical archive containing over 179,000 files including over 8.5 million text pages. It focuses on software, general computing, telecommunications, electronics components, magazines and test equipment.

Added to #DataHoarding index: datahoarding.org/archives.htm...
Data Hoarding - Archives
An index of resources and archives related to data hoarding, web archival and digital preservation.
datahoarding.org
November 6, 2025 at 4:29 PM
MalwareBazaar is a platform from abuse.ch and Spamhaus, dedicated to sharing malware samples with the infosec community, antivirus vendors, and threat intelligence providers.

Added to the #DataHoarding index. datahoarding.org/archives.htm...
Data Hoarding - Archives
An index of resources and archives related to data hoarding, web archival and digital preservation.
datahoarding.org
October 17, 2025 at 1:29 PM
Trump's Truth is a public archive of Donald Trump's communications on TRUTH Social. Meanwhile, the twitter archive used to check Twitter every 60 seconds and record every Trump tweet into a database.

Added to the #DataHoarding index. datahoarding.org/archives.htm...
Data Hoarding - Archives
An index of resources and archives related to data hoarding, web archival and digital preservation.
datahoarding.org
October 17, 2025 at 1:26 PM
Within its legal mandate, the German Meteorological Service (DWD) offers weather and climate data free of charge on its Open Data server. Data includes satellites and radar maps, radiation data, local and historical forecasts.

Indexed to #DataHoading archives: datahoarding.org/archives.htm...
Data Hoarding - Archives
An index of resources and archives related to data hoarding, web archival and digital preservation.
datahoarding.org
October 13, 2025 at 6:20 PM
GEO.ca is the definitive source for Canada’s open geospatial information. Open data. Applications. Maps. And more. Discover it all on along with the tools you need to visualize, analyze and share the insights you create.

Indexed to #DataHoarding archives: datahoarding.org/archives.htm...
Data Hoarding - Archives
An index of resources and archives related to data hoarding, web archival and digital preservation.
datahoarding.org
October 13, 2025 at 6:19 PM
Replacement Docs

The replacementdocs site provides high quality scanned images of game instruction manuals in their full, original format with all original artwork and other graphical elements intact.

Indexed: datahoarding.org/archives.htm...
October 12, 2025 at 2:13 PM
Mailbag

The Mailbag project is a draft specification and mailbagit open source tool for preserving email archives using multiple formats, such as MBOX, PDF, and WARC.

datahoarding.org/resources.ht...
Data Hoarding - Resources
An index of resources and archives related to data hoarding, web archival and digital preservation.
datahoarding.org
October 11, 2025 at 11:58 PM
The UNIX Files

This is an abandonware archive dedicated to preserving software for propriatary UNIX operating systems such as IRIX, Solaris, AIX and HP-UX. Software featured on it has been discontinued by their publishers and is no longer commercially available.

datahoarding.org/archives.htm...
Data Hoarding - Archives
An index of resources and archives related to data hoarding, web archival and digital preservation.
datahoarding.org
October 11, 2025 at 10:41 PM
IKEA Museum

The digital museum showcases the story of IKEA and is open to anyone who is curious about IKEA and life at home, anywhere and anytime. It includes the full catalogs from 1950 to 2021. Added to #DataHoarding index.

datahoarding.org/archives.htm...
Data Hoarding - Archives
An index of resources and archives related to data hoarding, web archival and digital preservation.
datahoarding.org
October 11, 2025 at 10:40 PM
Epstein Files Archive: An automatically processed, OCR'd, searchable archive of publicly released documents related to the Jeffrey Epstein case. Added to #DataHoarding index.

datahoarding.org/archives.htm...
Data Hoarding - Archives
An index of resources and archives related to data hoarding, web archival and digital preservation.
datahoarding.org
October 6, 2025 at 4:58 PM
HAL is a multidisciplinary open archive for sharing research results in open access. It contains over 1.5M scientific papers and 4M references. Added to #DataHoarding index. hal.science
Home - Archive ouverte HAL
Your publications are easy to find, well referenced by search engines and interconnected with other services (ORCID, preprint servers)
hal.science
October 4, 2025 at 3:58 PM
LibriVox volunteers record chapters of books in the public domain, and then we release the audio files back onto the net for free. Added to #DataHoarding index. librivox.org
LibriVox | free public domain audiobooks
librivox.org
October 1, 2025 at 1:14 PM
Since its foundation in 1917 IWM has been building its collections in order to illustrate and record all aspects of conflict in the twentieth and twenty-first centuries. IWM's collection has over 1 million items. Added to #DataHoarding index. www.iwm.org.uk/collections
IWM Collections
Explore over 1 million items from IWM's collections that tell the story of war and conflict.
www.iwm.org.uk
September 30, 2025 at 5:27 PM
AODL provides free universal access to cultural heritage materials from and about African countries and communities. Added to #DataHoarding index. aodl.org
Home | AODL
aodl.org
September 25, 2025 at 4:48 PM
The VintageMachinery web site is devoted to information on the history, restoration and use of vintage machinery. The site contains information concerning vintage machinery including publications, historical information and technical data. Added to #DataHoarding index. vintagemachinery.org
VintageMachinery.org | Welcome
vintagemachinery.org
August 23, 2025 at 7:01 PM
WorldCourts is one of the largest databases of international case law in the world. Established in 1999, it provides a single point of access to over 50,000 decisions of 52 international and internationalized judicial and quasi-judicial bodies. Added to #DataHoarding index. www.worldcourts.com
WorldCourts: International Case Law Database (Judgments, Advisory Opinions, Views & Decisions)
www.worldcourts.com
August 12, 2025 at 8:09 PM
Digital Archive Ontario collects digitized items held by Toronto Public Library, including over 100,000 historical photos, maps, postcards & more from Ontario. Added to #DataHoarding index. digitalarchiveontario.ca
Digital Archive Ontario
eMuseum is a powerful web publishing toolkit that integrates seamlessly with TMS to bring dynamic collection content and images to your website, intranet, and kiosks.
digitalarchiveontario.ca
July 19, 2025 at 2:02 PM
Global Energy Monitor develops and analyzes data on energy infrastructure, resources, and uses. They provide open access to information that is essential to building a sustainable energy future. Added to #DataHoarding index. globalenergymonitor.org
Home
Global Energy Monitor studies the evolving international energy landscape, creating databases, reports, and interactive tools that enhance understanding. Our work transforms complexity into clarity, enhancing the quality of public discourse on energy and the environment.
globalenergymonitor.org
July 19, 2025 at 2:35 AM
Home to over a million extraordinary artifacts and archaeological finds, the Penn Museum has been uncovering our shared humanity across continents and millennia since 1887. Added to #DataHoarding index. www.penn.museum/collections/
Digital Collections - Penn Museum
Penn Museum Collections. Search over 389,000+ object records, representing over 1.3 million objects with 279,000+ images. Read 1700+ articles. Watch over 1,100 lectures, archival, and produced films.
www.penn.museum
July 18, 2025 at 6:45 PM
Google Arts & Culture is a non-commercial initiative. They work with museums, cultural institutions and artists around the world to preserve and bring the world's art and culture online so it's accessible to anyone, anywhere. Added to #DataHoarding index. artsandculture.google.com/partner
Collections — Google Arts & Culture
Explore all Collections on Google Arts & Culture.
artsandculture.google.com
July 18, 2025 at 5:04 AM