You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
These scripts create the FOPPA database from raw TED files. This database relies mainly on the award notices of public contracts related to French clients and suppliers from 2010 to 2020 in the Tenders Electronic Daily. It also proposes an enrichment of these data, thanks to the siretization of agents (i.e. the retrieval of their unique IDs, which is missing for most of them) as well as the cleaning and extraction of award criteria, and other processing.
17
+
These scripts create the FOPPA database v.1.1.1 from raw TED files. This database relies mainly on the award notices of public contracts related to French clients and suppliers from 2010 to 2020 in the Tenders Electronic Daily. It also proposes an enrichment of these data, thanks to the siretization of agents (i.e. the retrieval of their unique IDs, which is missing for most of them) as well as the cleaning and extraction of award criteria, and other processing.
18
18
19
-
The process conducted to build the FOPPA is quite long, though (around 1 week, depeding on the hardware), so the produced database is alternatively directly available on [Zenodo](https://doi.org/10.5281/zenodo.7433155). The detail of this processing are described in a technical report [P'22].
19
+
The process conducted to build the FOPPA is quite long, though (around 1 week, depeding on the hardware), so the produced database is alternatively directly available on [Zenodo](https://doi.org/10.5281/zenodo.7443842). The detail of this processing are described in a technical report [P'22].
20
20
21
-
This work was conducted in the framework of [DeCoMaP](https://anr.fr/Projet-ANR-19-CE38-0004) ANR project (Detection of corruption in public procurement markets). If you use this source code or the produced database, please cite bibliographical reference [P'22].
21
+
This work was conducted in the framework of the [DeCoMaP](https://anr.fr/Projet-ANR-19-CE38-0004) ANR project (*Detection of corruption in public procurement markets* -- `ANR-19-CE38-0004`). If you use this source code or the produced database, please cite bibliographical reference [P'22].
22
22
23
23
# Organization
24
-
25
24
This repository is composed of the following elements:
26
25
*`requirements.txt` : List of Python packages used in foppaInit.py.
27
26
*`foppaInit.py` : Python script in order to create the FOPPA Database.
@@ -35,16 +34,14 @@ The script requires the TED data and leverages some additional sources:
35
34
36
35
# Installation
37
36
You first need to install `python` and the required packages:
38
-
39
-
1. Install the [`python` language](https://www.python.org)
37
+
1. Install the Python language: https://www.python.org
40
38
2. Download this project from GitHub and unzip.
41
-
3. Install CUDA (https://developer.nvidia.com/cuda-downloads)
0 commit comments