@@ -2,6 +2,7 @@ First Steps
  ===========
  The first step before running diffusion algorithms on your network using DiffuPy is to learn about the graph and data
  formats are supported. Next, you can find samples of input datasets and networks to run diffusion methods over.
+
  Input Data
  ----------

@@ -10,9 +11,8 @@ You can submit your dataset in any of the following formats:
  - CSV (.csv)
  - TSV (.tsv)

- Please ensure that the dataset has a column 'Node' containing node IDs. If you only provide the node IDs, you can
- also include a column in your dataset 'NodeType' indicating the entity type for each node. You can also optionally add
- the following columns to your dataset:
+ Please ensure that the dataset minimally has a column 'Node' containing node IDs. You can also optionally add the
+ following columns to your dataset:

  - LogFC [*]_
  - p-value
@@ -28,20 +28,19 @@ DiffuPath accepts several input formats which can be codified in different ways.
  `diffusion scores <https://github.com/multipaths/DiffuPy/blob/master/docs/source/diffusion.rst >`_ summary for more
  details.

- 1. You can provide a dataset with a column 'Node' containing node IDs along with a column 'NodeType' indicating the
- entity type.
-
- +------------+--------------+
- | Node       | NodeType     |
- +============+==============+
- | A          | Gene         |
- +------------+--------------+
- | B          | Gene         |
- +------------+--------------+
- | C          | Metabolite   |
- +------------+--------------+
- | D          | Gene         |
- +------------+--------------+
+ 1. You can provide a dataset with a column 'Node' containing node IDs.
+
+ +------------+
+ | Node       |
+ +============+
+ | A          |
+ +------------+
+ | B          |
+ +------------+
+ | C          |
+ +------------+
+ | D          |
+ +------------+

  2. You can also choose to provide a dataset with a column 'Node' containing node IDs as well as a column 'logFC' with
  their abs(LogFC).