Skip to content

Commit a4e21ba

Browse files
authored
Update README.md
1 parent 77827fa commit a4e21ba

1 file changed

Lines changed: 10 additions & 13 deletions

File tree

README.md

Lines changed: 10 additions & 13 deletions
Original file line numberDiff line numberDiff line change
@@ -1,4 +1,4 @@
1-
FoppaInit
1+
FoppaInit v1.0.1
22
-------------------------------------------------------------------------
33
*Initialization of the FOPPA database*
44

@@ -8,20 +8,19 @@ FoppaInit is free software: you can redistribute it and/or modify it under the t
88

99
* **Lab site:** http://lia.univ-avignon.fr
1010
* **GitHub repo:** https://github.com/CompNet/FoppaInit
11-
* **Data:** https://doi.org/10.5281/zenodo.7433155
11+
* **Data:** https://doi.org/10.5281/zenodo.7443842
1212
* **Contact:** Lucas Potin <lucas.potin@univ-avignon.fr>
1313

1414
-------------------------------------------------------------------------
1515

1616
# Description
17-
These scripts create the FOPPA database from raw TED files. This database relies mainly on the award notices of public contracts related to French clients and suppliers from 2010 to 2020 in the Tenders Electronic Daily. It also proposes an enrichment of these data, thanks to the siretization of agents (i.e. the retrieval of their unique IDs, which is missing for most of them) as well as the cleaning and extraction of award criteria, and other processing.
17+
These scripts create the FOPPA database v.1.1.1 from raw TED files. This database relies mainly on the award notices of public contracts related to French clients and suppliers from 2010 to 2020 in the Tenders Electronic Daily. It also proposes an enrichment of these data, thanks to the siretization of agents (i.e. the retrieval of their unique IDs, which is missing for most of them) as well as the cleaning and extraction of award criteria, and other processing.
1818

19-
The process conducted to build the FOPPA is quite long, though (around 1 week, depeding on the hardware), so the produced database is alternatively directly available on [Zenodo](https://doi.org/10.5281/zenodo.7433155). The detail of this processing are described in a technical report [P'22].
19+
The process conducted to build the FOPPA is quite long, though (around 1 week, depeding on the hardware), so the produced database is alternatively directly available on [Zenodo](https://doi.org/10.5281/zenodo.7443842). The detail of this processing are described in a technical report [P'22].
2020

21-
This work was conducted in the framework of [DeCoMaP](https://anr.fr/Projet-ANR-19-CE38-0004) ANR project (Detection of corruption in public procurement markets). If you use this source code or the produced database, please cite bibliographical reference [P'22].
21+
This work was conducted in the framework of the [DeCoMaP](https://anr.fr/Projet-ANR-19-CE38-0004) ANR project (*Detection of corruption in public procurement markets* -- `ANR-19-CE38-0004`). If you use this source code or the produced database, please cite bibliographical reference [P'22].
2222

2323
# Organization
24-
2524
This repository is composed of the following elements:
2625
* `requirements.txt` : List of Python packages used in foppaInit.py.
2726
* `foppaInit.py` : Python script in order to create the FOPPA Database.
@@ -35,16 +34,14 @@ The script requires the TED data and leverages some additional sources:
3534

3635
# Installation
3736
You first need to install `python` and the required packages:
38-
39-
1. Install the [`python` language](https://www.python.org)
37+
1. Install the Python language: https://www.python.org
4038
2. Download this project from GitHub and unzip.
41-
3. Install CUDA (https://developer.nvidia.com/cuda-downloads)
39+
3. Install CUDA: https://developer.nvidia.com/cuda-downloads
4240
4. Execute `pip install -r requirements.txt` to install some of the required packages (see also the *Dependencies* Section).
4341

4442
Package `blazingsql` requires a specific installation:
45-
4643
1. Go to Webpage https://rapids.ai/start.html
47-
2. Configure your setup.
44+
2. Configure your setup in this Webpage.
4845
3. Execute the generated command.
4946

5047
# Use
@@ -53,7 +50,6 @@ In order to build the FOPPA database:
5350
2. Run `foppaInit.py`.
5451

5552
The script is going to perform several tasks:
56-
5753
1. Download all the necessary data (see Section *Organization*).
5854
2. Apply the processing described in [P'22].
5955
3. Export the resulting database under different forms (SQL dump, CSV sheets).
@@ -68,7 +64,8 @@ Tested with Python version 3.8.0, with the following packages:
6864
* [`dedupe`](https://pypi.org/project/dedupe/): version 2.0.19.
6965

7066
# Data
71-
The produced database is directly available publicly online on [Zenodo](https://doi.org/10.5281/zenodo.7433155), under two different forms:
67+
The produced database is directly available publicly online on [Zenodo](https://doi.org/10.5281/zenodo.7443842), under three different forms:
68+
* SQLite file: https://www.sqlite.org/index.html
7269
* SQL dump.
7370
* CSV files (one by table).
7471

0 commit comments

Comments
 (0)