pa_uswikimedia dump progress on 20200501
This is the Wikimedia dump service.
Please read the copyright information.
See Meta:Data dumps
for documentation on the provided data formats.
The 7zip decoder on Windows is known to have
problems with some bz2-format
files for larger wikis; we recommend the use of bzip2 for Windows for these cases.
Please report problems with these dumps on Phabricator and add the
Dumps-generation tag.
See all databases list.
Last dumped on 2020-04-20
For a machine-readable version of the information on this page,
see the json status file.
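
As a rough illustration of consuming the machine-readable status, the sketch below summarizes job states from a JSON document. The layout it expects (a top-level `jobs` object whose entries carry `status` and `updated` keys) is an assumption made for this example, as are the function name `summarize_jobs` and the sample job name; check the actual status file before relying on it.

```python
import json

def summarize_jobs(status_text):
    """Return {job_name: (status, updated)} from a dump status JSON document.

    Assumes a top-level "jobs" object whose values carry "status" and
    "updated" keys; missing keys are reported as None.
    """
    data = json.loads(status_text)
    jobs = data.get("jobs", {})
    return {
        name: (info.get("status"), info.get("updated"))
        for name, info in jobs.items()
    }

# Hypothetical sample mirroring the assumed layout, not real dump output.
sample = '{"jobs": {"pagetitlesdump": {"status": "done", "updated": "2020-05-04 21:55:35"}}}'
for name, (status, updated) in summarize_jobs(sample).items():
    print(name, status, updated)
```

A script polling for completion could wait until every entry reports `done` before starting downloads.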
Dump complete
Verify downloaded files against the md5 or sha1 checksums
to check for corrupted files.
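
A minimal sketch of that verification step follows. It assumes the checksum file uses the common `<hex digest>  <filename>` layout, one entry per line; the helper names `file_digest`, `parse_checksums`, and `verify` are hypothetical and chosen only for this example.

```python
import hashlib

def file_digest(path, algorithm="sha1", chunk_size=1 << 20):
    """Hash a file in chunks so large dumps need not fit in memory."""
    h = hashlib.new(algorithm)
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            h.update(chunk)
    return h.hexdigest()

def parse_checksums(text):
    """Parse lines of the assumed form '<hex digest>  <filename>' into a dict."""
    result = {}
    for line in text.splitlines():
        parts = line.split(None, 1)
        if len(parts) == 2:
            digest, name = parts
            result[name.strip()] = digest.lower()
    return result

def verify(path, name, checksums, algorithm="sha1"):
    """True only if `name` is listed and the local file's digest matches."""
    expected = checksums.get(name)
    return expected is not None and file_digest(path, algorithm) == expected
```

Pass `algorithm="md5"` when checking against the md5 list instead of the sha1 list.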
- 2020-05-03 04:13:38 done Articles, templates, media/file descriptions, and primary meta-pages, in multiple bz2 streams, 100 pages per stream
- 2020-05-04 21:58:12 done All pages with complete edit history (.7z)
- 2020-05-04 21:57:57 done All pages with complete page edit history (.bz2)
  2020-05-04 21:57:17: pa_uswikimedia (ID 20618) 280 pages (34.0|70.3/sec all|curr), 1495 revs (181.3|124.3/sec all|curr), 100.0%|100.0% prefetched (all|curr), ETA 2020-05-04 21:57:18 [max 1715]
- 2020-05-04 21:56:56 done Log events to all pages and users.
- 2020-05-04 08:04:06 done All pages, current versions only.
- 2020-05-02 16:07:44 done Articles, templates, media/file descriptions, and primary meta-pages.
- 2020-05-01 09:27:09 done First-pass for page XML data dumps
- 2020-05-04 21:56:11 done Extracted page abstracts for Yahoo
  2020-05-04 21:56:06: pa_uswikimedia (ID 16360) 21 pages (21.2|21.2/sec all|curr), 20 revs (20.2|20.2/sec all|curr), ETA 2020-05-04 21:56:24 [max 381]
- 2020-05-04 21:55:48 done List of all page titles
- 2020-05-04 21:55:35 done List of page titles in main namespace
- 2020-05-04 21:55:22 done Namespaces, namespace aliases, magic words.
- 2020-05-01 20:14:01 done List of pages' geographical coordinates
- 2020-05-01 20:18:51 done Wiki template inclusion link records.
- 2020-05-01 20:14:59 done Wiki category membership link records.
- 2020-05-01 20:15:27 done Wiki page-to-page link records.
- 2020-05-01 20:16:42 done Base per-page data (id, title, old restrictions, etc).
- 2020-05-01 20:14:16 done SiteMatrix information from meta.wikimedia.org, provided as a table.
- 2020-05-01 20:17:39 done Annotation (tag) names and ids.
- 2020-05-01 20:17:25 done Language proficiency information per user.
- 2020-05-01 20:16:11 done Metadata on current versions of uploaded media/files.
- 2020-05-01 20:15:58 done Redirect list
- 2020-05-01 20:14:44 done Wiki media/files usage records.
- 2020-05-01 20:17:54 done Interwiki link tracking records
- 2020-05-01 20:16:27 done A few statistics such as the page count.
- 2020-05-01 20:18:38 done Wiki external URL link records.
- 2020-05-01 20:13:47 done Nonexistent pages that have been protected.
- 2020-05-01 20:15:13 done User group assignments.
- 2020-05-01 20:14:28 done Newer per-page restrictions table.
- 2020-05-01 20:16:57 done Name/value pairs for pages.
- 2020-05-01 20:15:43 done Category information.
- 2020-05-01 20:18:09 done List of annotations (tags) for revisions and log entries
- 2020-05-01 20:17:11 done Wiki interlanguage link records.
- 2020-05-01 20:18:24 done Past user group assignments.
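
The multistream dump listed above is described as multiple bz2 streams of 100 pages each, which suggests that a single stream can be decompressed without reading the whole file. The sketch below shows one way to do that with Python's standard `bz2` module, assuming the byte offset of the stream is already known (for example from an accompanying index); the function name `read_stream` is hypothetical.

```python
import bz2

def read_stream(path, offset, chunk_size=64 * 1024):
    """Decompress the single bz2 stream that starts at byte `offset`."""
    decomp = bz2.BZ2Decompressor()
    out = []
    with open(path, "rb") as f:
        f.seek(offset)
        # Stop as soon as this stream's end marker is reached; any bytes
        # belonging to later streams are left in decomp.unused_data.
        while not decomp.eof:
            chunk = f.read(chunk_size)
            if not chunk:
                break  # file ended before the stream did
            out.append(decomp.decompress(chunk))
    return b"".join(out)
```

The returned bytes are an XML fragment covering only the pages in that stream, so a caller would typically wrap or parse it page by page rather than treat it as a complete document.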