Commit 6bf3dad

updated readme, fixed slu-cuda input
1 parent b7c5034 commit 6bf3dad

2 files changed

Lines changed: 5 additions & 3 deletions

README.md

Lines changed: 3 additions & 3 deletions
@@ -98,10 +98,10 @@ Once this is done, we can start gathering CUDA kernel profiling data with the fo
 ```
 cd $GPU_FLOPBENCH_ROOT/cuda-profiling

-LD_LIBRARY_PATH=/usr/lib/llvm-18/lib:$LD_LIBRARY_PATH DATAPATH=$PWD/../src/prna-cuda/data_tables python3 ./gatherData.py --outfile=profiling-data.csv 2>&1 | tee -a runlog.txt
+LD_LIBRARY_PATH=/usr/lib/llvm-18/lib:$LD_LIBRARY_PATH DATAPATH=$PWD/../src/prna-cuda/data_tables SLU_PATH=$PWD/../src/slu-cuda/src python3 ./gatherData.py --outfile=profiling-data.csv 2>&1 | tee -a runlog.txt
 ```
-^ This process will take about 10 hours, so please have someone around to babysit in case any unexpected issues arise.
-We tested this on our own Docker container and had no issues.
+^ This process will take about 10-15 hours, so please have someone around to babysit in case any unexpected issues arise.
+We tested this on our own Docker container and had no issues, aside from timeouts for long-running codes or out-of-memory exceptions.
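
Since the README puts the run at 10-15 hours, it may be worth checking the environment variables from the command before launching. The following is only a minimal sketch, assuming that `DATAPATH` and `SLU_PATH` are both expected to name existing directories; `missing_dirs` is a hypothetical helper and is not part of `gatherData.py`.

```python
import os

def missing_dirs(env, names=("DATAPATH", "SLU_PATH")):
    """Return the variable names that are unset or do not name a directory.

    Hypothetical pre-flight helper; the names checked are the ones the
    README command sets, and the directory requirement is an assumption.
    """
    return [name for name in names if not os.path.isdir(env.get(name, ""))]

# Report problems up front instead of hours into the profiling run.
problems = missing_dirs(os.environ)
print("unset or invalid:", ", ".join(problems) if problems else "none")
```

Passing the environment in as a mapping keeps the check easy to exercise with a plain dict instead of the real `os.environ`.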

cuda-profiling/gatherData.py

Lines changed: 2 additions & 0 deletions
@@ -500,6 +500,8 @@ def modify_exe_args_for_some_targets(targets:list):
             target['exeArgs'] = 'graph.csv 10000 output'
         elif (basename == 'atomicCost-cuda'):
             target['exeArgs'] = '16 10'
+        elif (basename == 'slu-cuda'):
+            target['exeArgs'] = '-i $SLU_PATH/nicslu/demo/ASIC_100k.mtx'


     return targets
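
The new `exeArgs` value contains a literal `$SLU_PATH`, so it only resolves to a real file if something expands the variable before `slu-cuda` receives its arguments. The diff does not show how `gatherData.py` launches its targets; if the command goes through a shell, the shell would do this expansion. For the case where no shell is involved, the sketch below shows one way to expand it in Python. `expand_exe_args` is a hypothetical helper and `/opt/slu/src` is a made-up path for illustration.

```python
import shlex
from string import Template

def expand_exe_args(exe_args, env):
    """Expand $VAR references from env, then split into an argv-style list.

    Hypothetical helper, not taken from gatherData.py. Unknown variables
    are left untouched (safe_substitute) rather than raising.
    """
    expanded = Template(exe_args).safe_substitute(env)
    return shlex.split(expanded)

args = expand_exe_args(
    "-i $SLU_PATH/nicslu/demo/ASIC_100k.mtx",
    {"SLU_PATH": "/opt/slu/src"},  # illustrative value only
)
print(args)  # ['-i', '/opt/slu/src/nicslu/demo/ASIC_100k.mtx']
```

If `SLU_PATH` is missing from the mapping, the `$SLU_PATH` text survives unchanged in the output, which makes an unset variable easy to spot in a log.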
