This repository aims to get all of available matakuliah and dosen information from all of prodi and facutly in ITB for Dosen Rank Project. Built using python
I do not own the data. All of the data here are owned by ITB
- argparse
- bs4
- requests
- Clone this repository
- create virtual environment from python
virtualenv venv - install all requirements
pip install -r requirements.txt - provide your
nimandcookiefrom ITB sso service- To get your cookie, you need to logged in in SIX
- Right click >
inspect>application - Grab the
khongguancookie from there
- To get all cleaned data, first is run the
scrape.pyto get all of available data in SIX ITB - The data then get cleaned by using
clean.py - Cleaned data then get converted to
jsonorsqlusingoutput.py
.
├── README.md
├── clean.py
├── data
│ ├── cleaned.json
│ ├── dosen_id_name.json
│ ├── dosen_matkul_map.json
│ ├── fakultas.json
│ ├── fakultas_shorthand.json
│ ├── id_prodi_map.json
│ ├── matkul_dosen_map.json
│ ├── matkul_id_name.json
│ ├── prodi.json
│ └── saved.json
├── output.py
├── requirements.txt
├── scrape.py
└── sql
├── dosen.sql
├── fakultas.sql
├── matkul.sql
├── matkul_dosen.sql
└── prodi.sql