Provdb now writes a map of shard index to instance and provider indices for use by the viz
Updated config scripts for benchmark_suite func_multimodal and c_from_python
run_services changes:
- Added viz initialization parameters for multi-instance provDB
- Committer log output now piped to file rather than through tee to reduce spam
- Viz initialization now reports error if provDB is disabled
benchmark_suite/c_from_python/chimbuko_config.sh
Lines changed: 4 additions & 1 deletion
@@ -26,15 +26,18 @@ chimbuko_services="infer" #The location of the Chimbuko service script. If set t
 use_provdb=1 #enable or disable the provDB. If disabled the provenance data will be written as JSON ASCII into the ${provdb_writedir} set below
 provdb_extra_args="" #any extra command line arguments to pass
 provdb_nshards=4 #number of database shards
+provdb_ninstances=1 #number of database server instances. Shards are distributed over instances
 provdb_engine="ofi+tcp;ofi_rxm" #the OFI libfabric provider used for the Mochi stack
 provdb_port=5000 #the port of the provenance database
-provdb_nthreads=4 #number of worker threads; should be >= the number of shards
 provdb_writedir=chimbuko/provdb #the directory in which the provenance database is written. Chimbuko creates chimbuko/provdb which can be used as a default
 provdb_commit_freq=10000 #frequency ms at which the provenance database is committed to disk. If set to 0 it will commit only at the end

 #With "verbs" provider (used for infiniband, iWarp, etc) we need to also specify the domain, which can be found by running fi_info (on a compute node)
 provdb_domain=mlx5_0 #only needed for verbs provider <------------ ***SET ME (if using verbs)***

+export FI_UNIVERSE_SIZE=1600 # Defines the expected number of provenance DB clients per instance <------------- *** SET ME (should be larger than the number of clients/instance)
+export FI_MR_CACHE_MAX_COUNT=0 # disable MR cache in libfabric; still problematic as of libfabric 1.10.1
+export FI_OFI_RXM_USE_SRX=1 # use shared recv context in RXM; should improve scalability
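The comment on FI_UNIVERSE_SIZE asks for a value larger than the number of clients per instance. A minimal sketch of how that could be checked before launch is given here. It is not part of the Chimbuko scripts: the function names and the `nclients` variable are hypothetical, and it assumes clients are spread evenly over the provDB instances, which the diff does not state.

```shell
#!/usr/bin/env bash
# Hypothetical pre-launch check; not part of the Chimbuko scripts.

# Clients each instance would see, ASSUMING clients are spread evenly
# over instances (ceiling division so a remainder rounds up).
clients_per_instance() {
    local nclients=$1 ninstances=$2
    echo $(( (nclients + ninstances - 1) / ninstances ))
}

# Succeeds only when the universe size is strictly larger than the
# per-instance client count, as the config comment recommends.
universe_size_ok() {
    local nclients=$1 ninstances=$2 universe=$3
    local per_instance
    per_instance=$(clients_per_instance "$nclients" "$ninstances")
    [ "$universe" -gt "$per_instance" ]
}

# Example values: nclients is made up for illustration; the other two
# mirror the defaults shown in the config.
nclients=1024
provdb_ninstances=1
FI_UNIVERSE_SIZE=1600

if universe_size_ok "$nclients" "$provdb_ninstances" "$FI_UNIVERSE_SIZE"; then
    echo "FI_UNIVERSE_SIZE looks large enough"
else
    echo "FI_UNIVERSE_SIZE may be too small for $nclients clients" >&2
fi
```

With the values above the check passes; raising `nclients` or lowering `FI_UNIVERSE_SIZE` until the per-instance count reaches the universe size makes it fail.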
benchmark_suite/func_multimodal/chimbuko_config.sh
Lines changed: 9 additions & 1 deletion
@@ -26,15 +26,18 @@ chimbuko_services="infer" #The location of the Chimbuko service script. If set t
 use_provdb=1 #enable or disable the provDB. If disabled the provenance data will be written as JSON ASCII into the ${provdb_writedir} set below
 provdb_extra_args="" #any extra command line arguments to pass
 provdb_nshards=4 #number of database shards
+provdb_ninstances=1 #number of database server instances. Shards are distributed over instances
 provdb_engine="ofi+tcp;ofi_rxm" #the OFI libfabric provider used for the Mochi stack
 provdb_port=5000 #the port of the provenance database
-provdb_nthreads=4 #number of worker threads; should be >= the number of shards
 provdb_writedir=chimbuko/provdb #the directory in which the provenance database is written. Chimbuko creates chimbuko/provdb which can be used as a default
 provdb_commit_freq=10000 #frequency ms at which the provenance database is committed to disk. If set to 0 it will commit only at the end

 #With "verbs" provider (used for infiniband, iWarp, etc) we need to also specify the domain, which can be found by running fi_info (on a compute node)
 provdb_domain=mlx5_0 #only needed for verbs provider <------------ ***SET ME (if using verbs)***

+export FI_UNIVERSE_SIZE=1600 # Defines the expected number of provenance DB clients per instance <------------- *** SET ME (should be larger than the number of clients/instance)
+export FI_MR_CACHE_MAX_COUNT=0 # disable MR cache in libfabric; still problematic as of libfabric 1.10.1
+export FI_OFI_RXM_USE_SRX=1 # use shared recv context in RXM; should improve scalability

 ####################################
 #Options for the parameter server
@@ -98,6 +101,11 @@ TAU_ADIOS2_FILE_PREFIX=tau-metrics #the prefix of tau adios2 files; full filena