The Programming Historian 2: Applied Archival Downloading with Wget
Unrestricted Use
CC BY

Now that you have learned how Wget can be used to mirror websites or download specific files from sites like ActiveHistory.ca via the command line, it’s time to expand your web-scraping skills through a few more lessons that focus on other uses for Wget’s recursive retrieval function. The following tutorial provides three examples of how Wget can be used to download large collections of documents from archival websites with assistance from the Python programming language. It will teach you how to parse and generate a list of URLs using a simple Python script, and will also introduce you to a few of Wget’s other useful features. Similar results can be achieved using curl, an open-source tool capable of performing automated downloads from the command line. For this lesson, however, we will focus on Wget and on building your Python skills.
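As a rough illustration of the kind of script the lesson describes, the sketch below generates a list of sequentially numbered URLs. It is not taken from the lesson itself: the base URL, the four-digit zero padding, the `.jpg` suffix, and the function name `build_urls` are all assumptions made for the example.

```python
def build_urls(base, start, end, width=4, suffix=".jpg"):
    """Return one URL per page number from start to end, inclusive.

    Assumes the (hypothetical) archive names its files with a
    zero-padded counter, e.g. page-0001.jpg, page-0002.jpg, ...
    """
    return [f"{base}{n:0{width}d}{suffix}" for n in range(start, end + 1)]


if __name__ == "__main__":
    # Placeholder address; substitute the real pattern of the archive
    # you are working with.
    for url in build_urls("http://example.org/archive/page-", 1, 5):
        print(url)
```

If the script is saved under a name such as `make_urls.py` (also hypothetical), its output can be redirected to a file with `python make_urls.py > urls.txt` and that file passed to Wget with `wget -i urls.txt`. Adding Wget's `-w` (wait) and `--limit-rate` options keeps the load on the archive's server modest.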

Subject:
Applied Science
Computer Science
Material Type:
Diagram/Illustration
Provider:
Center for History and New Media
Author:
Kellen Kurschinski
Date Added:
06/16/2015