This project corresponds to an open source tool to webscraping CvLAC - Colciencias Page with self-assessment and accreditation purpuses.
The entire project is associated to the Unidad de Apoyo a Procesos de Autoevaluación y Acreditación of the Universidad Nacional de Colombia.
Previows version used to extract some products in UAPA format is available Here.
- Requirements:
- Windows 8 or higher / Ubuntu 14 or higher.
- Python 3.6.
- Internet Access.
- Any Text Editor (I used Atom 3).
- Install:
- Copy this repository to your machine.
- install bs4
pip install bs4
- install openpyxl
pip install openpyxl
- Usage:
- Locate the list of CvLAC that you want to use and place it in the file \Input\Base.xslx in this way:
- Document
- Names
- Last Names
- URL
- Save the file and open bash or a command prompt (terminal if you are on Linux OS)
- Run:
python main.py
- That is all, now you can get all your data stored in the \Resultados folder.