Enterprise HTML Dumps (Legacy)

The partial mirrors of Wikimedia Enterprise HTML dumps were an experimental service that, as of 24 March 2025, are no longer replicated.

For recent dumps of article change updates (Snapshot API) or the ability to query individual articles (On-demand API), please visit Wikimedia Enterprise to sign up for a free account. Alternatively, use your developer account to access APIs within Wikimedia Cloud Services.

The historical dumps will remain available here for a specific set of namespaces and wikis for public download. To view the payload schema visit Wikimedia Enterprise API Data Dictionary.

Wikiseed is currently in the process of mirroring these dumps. Only 4 full dumps are available, older dumps have been removed from Wikimedia servers and are no longer retrievable. The available dumps are 1 February 2025, 20 February 2025, 1 March 2025, and 20 March 2025.

The dumps are available as .json files packaged in .tar.gz tarballs. Each full dump is approximately 908-920 GB in size, compressed. Uncompressed file sizes have not been calculated. Files inside the dump are organized by language then project. Individual files can be downloaded separate from the rest of the dump.

The dumps will be available as a folder sorted by dump date, which will be mirrored via the Internet Archive and torrent. Links and more information will be provided on this page as this portion of the Wikiseed project progresses.