Last edited by raybb
January 22, 2020

Open Library Data Dumps

Open Library provides dumps of all its data in various formats. These dumps are currently generated every month.

OL Dump

This dump contains the latest revision of every record in Open Library. It is a tab-separated file with the following columns:

type: type of the record (/type/edition, /type/work, etc.)
key: unique key of the record (/books/OL1M, etc.)
revision: revision number of the record
last_modified: last-modified timestamp
JSON: the complete record in JSON format

The full dump (~7.8 GB compressed) can be downloaded from:

https://openlibrary.org/data/ol_dump_latest.txt.gz
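
As a rough illustration, the dump can be streamed record by record without decompressing it to disk first. The sketch below assumes that each line holds five tab-separated fields in the order type, key, revision, last_modified, JSON, and that the file has already been downloaded locally. `DumpRecord`, `parse_line`, and `iter_dump` are hypothetical helper names, not part of any Open Library tooling.

```python
import gzip
import json
from typing import Iterator, NamedTuple


class DumpRecord(NamedTuple):
    """One row of the dump (hypothetical helper type, not an Open Library API)."""
    record_type: str
    key: str
    revision: int
    last_modified: str
    data: dict


def parse_line(line: str) -> DumpRecord:
    """Split one dump line into its five assumed columns."""
    # maxsplit=4 keeps the final JSON column in one piece.
    record_type, key, revision, last_modified, raw_json = (
        line.rstrip("\n").split("\t", 4)
    )
    return DumpRecord(
        record_type, key, int(revision), last_modified, json.loads(raw_json)
    )


def iter_dump(path: str) -> Iterator[DumpRecord]:
    """Yield records one at a time from a gzip-compressed dump file."""
    with gzip.open(path, "rt", encoding="utf-8") as handle:
        for line in handle:
            if line.strip():  # ignore blank lines
                yield parse_line(line)
```

Iterating with `for record in iter_dump("ol_dump_latest.txt.gz")` keeps memory use flat, because only one line is decoded at a time.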

For convenience, this dump is split into multiple files based on type.
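
If only the full dump is at hand, a comparable split can be reproduced locally. This is a minimal sketch that assumes the first tab-separated column is the record type; `split_by_type` is a hypothetical helper, and the output file names (for example `type_edition.txt.gz`) are an arbitrary choice that does not match the published per-type files.

```python
import gzip
import os


def split_by_type(dump_path: str, out_dir: str) -> dict:
    """Copy each dump line into a per-type gzip file; return line counts per type."""
    os.makedirs(out_dir, exist_ok=True)
    writers = {}
    counts = {}
    try:
        with gzip.open(dump_path, "rt", encoding="utf-8") as source:
            for line in source:
                if not line.strip():
                    continue  # ignore blank lines
                # Assumption: the first column is the type, e.g. "/type/edition".
                record_type = line.split("\t", 1)[0]
                if record_type not in writers:
                    name = record_type.strip("/").replace("/", "_") + ".txt.gz"
                    writers[record_type] = gzip.open(
                        os.path.join(out_dir, name), "wt", encoding="utf-8"
                    )
                    counts[record_type] = 0
                writers[record_type].write(line)
                counts[record_type] += 1
    finally:
        # Close every output file even if reading fails part-way through.
        for writer in writers.values():
            writer.close()
    return counts
```

One output file stays open per distinct type, which is reasonable as long as the number of types is small.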

OL Complete Dump

This dump contains all revisions of every record in Open Library (~18.3 GB compressed). The format is the same as that of the OL Dump.

This dump can be downloaded from:

https://openlibrary.org/data/ol_cdump_latest.txt.gz
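
Because the complete dump holds several lines per record, a common first step is to pick out the newest revision of each key. The sketch below assumes the same five tab-separated columns as the OL dump, with the key in the second column and an integer revision in the third; `latest_revisions` is a hypothetical helper name. It keeps one entry per key in memory, so it is meant for a filtered subset of lines rather than the whole file.

```python
from typing import Dict, Iterable, Tuple


def latest_revisions(lines: Iterable[str]) -> Dict[str, Tuple[int, str]]:
    """Map each record key to (highest revision seen, raw dump line)."""
    latest: Dict[str, Tuple[int, str]] = {}
    for line in lines:
        fields = line.rstrip("\n").split("\t", 4)
        if len(fields) != 5:
            continue  # skip blank or malformed lines
        key, revision = fields[1], int(fields[2])
        # Keep the line only if it is newer than what we already hold.
        if key not in latest or revision > latest[key][0]:
            latest[key] = (revision, line)
    return latest
```

The function accepts any iterable of lines, so it can be fed directly from a file handle opened with `gzip.open(path, "rt")`.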

Format of JSON records

JSON schemas for the various record types are located at https://github.com/internetarchive/openlibrary-client/tree/master/olclient/schemata

Author Record

JSON serialization of a type/author

Edition

JSON serialization of a type/edition

Work

JSON serialization of a type/work

OL Covers Dump

:TODO:

History

March 22, 2024 Edited by raybb Edited without comment.
October 8, 2023 Edited by raybb update dump sizes
February 3, 2023 Edited by Tom Morris Update sizes for dumps of main entities
November 17, 2021 Edited by raybb update file sizes
December 14, 2011 Created by Anand Chitipothu Documented Open Library Data Dumps