iliasSpider OUTDATED - USE MisterXY89/iliasScraper
THIS IS OUTDATED AND DOES NOT WORK ANYMORE. THEREFORE PLEASE GO TO: MisterXY89/iliasScraper
iliasSpider
is a web scraper which downloads your materials from an ilias course (@Uni Constance) written in python.
I am using scrapy (docs) to login and download the files.
Scrapy is an application framework for crawling web sites and extracting structured data which can be used for a wide range of useful applications, like data mining, information processing or historical archival.
For those who are not familiar with the scrapy folder structure:
The spider can be found here: ilias_spider/spiders/ilias.py
.
Install (if not satisfied): pip https://pip.pypa.io/en/stable/
-
FOR LINUX:
- Python 2.x or 3.x https://www.python.org/downloads/
- the following python reqs. via pip:
$ pip install -r requirements.txt
-
FOR WINDOWS:
- Python 2.7 (since Python 3 is not supported on Windows with Scrapy)
- follow the instructions to set up scrapy & restart
$ pip install keyring
You can create for every ilias folder a download program via create.py
. Simply run it with pyhton create.py
.
You will be prompted a few questions to config your spider. Everything you need to know will be explained there.
Since all files are downloaded with a .pdf extension there might occur an error downloading files of another file format. In most cases, it is sufficient to just change the .pdf extension to the correct one. If any errors occur, try to restart the program and check your config. If it still occurs open a new issue.