A full list of fields is provided on the advanced search page: it is recommended that the search term is built on the advanced search page, and after hitting the 'Search' button, the completed query can be copied across as an argument for this script. Various item metadata fields can be searched, enabling flexible download options - such as downloading all items associated with a collection, and/or uploaded by a particular creator. Downloading items returned from a search term Here is an example of a details page for an Internet Archive item - in this example, the item identifier to use with this script is '.1155023' (as listed in the URL, and by the 'Identifier' string on the item's details page). Downloading individual Internet Archive item(s)ĭownloading items individually requires finding the item's unique identifier. an item can be a book, a song, an album, a dataset, a movie, an image or set of images, etc. An item can be considered as a group of files. An item is a logical “thing” that we represent on one web page on. An item is defined within Internet Archive documentation as:Ī is made up of “items”. You can download individual Internet Archive item(s), and/or all items returned from an search. This script has been tested with macOS 11.6 (using Python >= 3.7 installed using Homebrew), Ubuntu 20.04, and Windows 10 20H2. Python 3.7 or later is required, with the Internet Archive Python Library installed ( Internet Archive Python Library installation instructions). Wayback Machine ( ) pages are not supported by this script. This Python script uses multithreading and multiprocessing in conjunction with the Internet Archive Python Library to provide bulk downloads of files associated with Internet Archive ( ) items and collections, with optional interrupted download resumption and file hash verification.
0 Comments
Leave a Reply. |
AuthorWrite something about yourself. No need to be fancy, just an overview. ArchivesCategories |