Skip to content

uFair-Tech/uCourse-crawler

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

1 Commit
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

uCourse-crawler

🎒 Scrape the courses info from the University of Nottingham's website. (Different campuses and academic years supported.)

Requirements

  • Nodejs
  • MongoDB (optional)

Usage

git clone https://github.com/Songkeys/uCourse-crawler.git
cd uCourse-crawler
npm i
npm start

Demo

demo

Output Methods

There are two output methods provided:

  1. MongoDB (Recommended)
  2. Local JSON file

Output (MongoDB)

For mongoDB, you will need to input a mongo connection string URI. The output will be stored in a table called course_[campus]_[year]. E.g. course_china_2020.

The output example:

output-mongodb

Output (JSON file)

For local JSON file, the output will be in a JSON format stored in /dist/[tablename].json.

The output example:

output-json

Size & Time

The estimated output size will be 2~3 MB per campus per year.

The estimated crawling time will be 30~50 mins per campus per year (depending on your network).

Todo

Resources

About

🎒 Scrape the courses info from the University of Nottingham's website.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published