Open Library provides dumps of all the data in various formats. Currently these dumps are generated every month.
OL Dump
This contains dump of latest editions of all the records in Open Library. This is a tab separated file with the following columns:
-
type
- type of record (/type/edition, /type/work etc.)
-
key
- unique key of the record. (/books/OL1M etc.)
-
revision
- revision number of the record
-
last_modified
- last modified timestamp
-
JSON
- the complete record in JSON format
Dumps:
-
editions dump (~ 5.7G)
-
works dump (~ 1.6G)
-
authors dump (~ 0.3G)
-
all types dump (~ 7.8G): includes editions, works, authors, redirects, etc.
- complete dump (~ 18.3G): includes all revisions of all the records in Open Library
For past dumps, see: https://archive.org/details/ol_exports?sort=-publicdate
Format of JSON records
A JSON schema for the various types is located at https://github.com/internetarchive/openlibrary-client/tree/master/olclient/schemata
-
Author Records: JSON serialization of a type/author
-
Edition Records: JSON serialization of a type/edition
- Work Records: JSON serialization of a type/work
OL Covers Dump
:TODO:
History
- Created December 14, 2011
- 28 revisions
May 22, 2024 | Edited by raybb | Edited without comment. |
May 22, 2024 | Edited by raybb | covers url sorted by date added |
May 22, 2024 | Edited by raybb | Edited without comment. |
May 22, 2024 | Edited by raybb | placeolder for redirects and other |
December 14, 2011 | Created by Anand Chitipothu | Documented Open Library Data Dumps |