Skip to content

Commit 3a8857d

Browse files
committed
added some manual calls to unzip necessary files
1 parent 5bf590c commit 3a8857d

3 files changed

Lines changed: 29 additions & 15 deletions

File tree

CMakeLists.txt

Lines changed: 5 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -401,11 +401,16 @@ IF(BUILD_CUDA)
401401
UNZIPFILE("${HECBENCH_SRC_DIR}/slu-cuda/src/nicslu.tar")
402402
UNZIPFILE("${HECBENCH_SRC_DIR}/axhelm-cuda/BlasLapack.tar.gz")
403403
UNZIPFILE("${HECBENCH_SRC_DIR}/testSNAP-omp/refdata.tar.gz")
404+
UNZIPFILE("${HECBENCH_SRC_DIR}/lsqt-cuda/examples.tar.gz")
405+
UNZIPFILE("${HECBENCH_SRC_DIR}/meanshift-cuda/dataset.tar.gz")
406+
UNZIPFILE("${HECBENCH_SRC_DIR}/ced-cuda/frames.tar.gz")
404407
ENDIF()
405408

406409
IF(BUILD_OMP)
407410
UNZIPFILE("${HECBENCH_SRC_DIR}/axhelm-omp/BlasLapack.tar.gz")
408411
UNZIPFILE("${HECBENCH_SRC_DIR}/testSNAP-omp/refdata.tar.gz")
412+
UNZIPFILE("${HECBENCH_SRC_DIR}/lsqt-cuda/examples.tar.gz")
413+
UNZIPFILE("${HECBENCH_SRC_DIR}/meanshift-cuda/dataset.tar.gz")
409414
ENDIF()
410415

411416
set(TARGETS "")

Dockerfile

Lines changed: 17 additions & 9 deletions
Original file line numberDiff line numberDiff line change
@@ -1,5 +1,7 @@
11
FROM nvidia/cuda:12.6.0-devel-ubuntu24.04
22

3+
ARG HOSTOS=linux
4+
35
# Set non-interactive frontend for apt-get to avoid prompts
46
ENV DEBIAN_FRONTEND=noninteractive
57

@@ -10,6 +12,7 @@ SHELL ["/bin/bash", "-c"]
1012
RUN apt-get update && \
1113
apt-get upgrade -y && \
1214
apt-get install -y wget make git gfortran libomp-18-dev libboost-all-dev clang-18 clang-tools-18 unzip && \
15+
apt-get install -y imagemagick && \
1316
wget https://github.com/Kitware/CMake/releases/download/v3.28.0/cmake-3.28.0-linux-x86_64.sh && \
1417
chmod +x cmake-3.28.0-linux-x86_64.sh && \
1518
./cmake-3.28.0-linux-x86_64.sh --skip-license --prefix=/usr/local && \
@@ -57,15 +60,20 @@ COPY . .
5760

5861
# if we are on a windows host, we need to remove the CRLF characters from all the files
5962
#RUN find . -type f -exec sed -i 's/\r$//' {} +
60-
RUN find . -type f \
61-
-not -name "*.gz" \
62-
-not -name "*.zip" \
63-
-not -name "*.rar" \
64-
-not -name "*.7z" \
65-
-not -name "*.tar" \
66-
-not -name "*.bz2" \
67-
-not -name "*.tar.*" \
68-
-exec sed -i 's/\r$//' {} +
63+
RUN if [ "$HOSTOS" = "windows" ]; then \
64+
echo "Removing CRLF characters from files..."; \
65+
find . -type f \
66+
-not -name "*.gz" \
67+
-not -name "*.zip" \
68+
-not -name "*.rar" \
69+
-not -name "*.7z" \
70+
-not -name "*.tar" \
71+
-not -name "*.bz2" \
72+
-not -name "*.tar.*" \
73+
-exec sed -i 's/\r$//' {} +; \
74+
else \
75+
echo "HOSTOS is not windows; skipping CRLF removal."; \
76+
fi
6977

7078
# write out to the bashrc to source conda and activate the environment on container startup
7179
RUN echo 'conda activate gpu-flopbench' >> ~/.bashrc

README.md

Lines changed: 7 additions & 6 deletions
Original file line numberDiff line numberDiff line change
@@ -18,9 +18,9 @@ Target codes thus-far:
1818
This work was presented at the [HPDC 2025](https://hpdc.sci.utah.edu/2025/) [AI4Sys Workshop](https://ai4sys.github.io/).
1919
Upon having our paper accepted, we pre-published on arXiv in case people wanted to cite us.
2020

21-
[paper link here](https://dl.acm.org/doi/10.1145/3731545.3743645)
21+
📃📃 [paper link here](https://dl.acm.org/doi/10.1145/3731545.3743645) 🔗🔗
2222

23-
BibTeX reference below.
23+
📜 BibTeX reference below.
2424
```
2525
@inproceedings{10.1145/3731545.3743645,
2626
author = {Bolet, Gregory and Georgakoudis, Giorgis and Menon, Harshitha and Parasyris, Konstantinos and Hasabnis, Niranjan and Estes, Hayden and Cameron, Kirk and Oren, Gal},
@@ -54,12 +54,12 @@ For ease-of-reproducibility, we supply a `Dockerfile` with the necessary steps t
5454
The following is a list of steps to help you get set up and into the main bash shell of the container.
5555

5656
‼️‼️
57-
We note that the base container image will take up about 40 GB of storage space; once we start building codes and gathering profiling data, the disk usage will jump up to about 50 GB.
57+
We note that the base container image will take up about 15 GB of storage space, which then jumps to 40 GB when we build the container; once we start building codes and gathering profiling data, the disk usage will jump up to about 50 GB.
5858
Please ensure your system has enough storage space before continuing.
5959
‼️‼️
6060

6161
```
62-
git clone git@github.com:gregbolet/gpu-FLOPBench.git ./gpu-flopbench
62+
git clone git@github.com:gregbolet/gpu-flopbench.git ./gpu-flopbench
6363
cd ./gpu-flopbench
6464
docker build --progress=plain -t 'gpu-flopbench' .
6565
docker run -ti --gpus all --name gpu-flopbench-container --runtime=nvidia -e NVIDIA_DRIVER_CAPABILITIES=compute,utility -e NVIDIA_VISIBLE_DEVICES=all gpu-flopbench
@@ -104,7 +104,7 @@ In future work we would like to make this process of building the codes agnostic
104104

105105
Start by simply cloning our repo.
106106
```
107-
git clone git@github.com:gregbolet/gpu-FLOPBench.git ./gpu-flopbench
107+
git clone git@github.com:gregbolet/gpu-flopbench.git ./gpu-flopbench
108108
cd ./gpu-flopbench
109109
```
110110

@@ -116,6 +116,7 @@ By default, we have everything building with `clang++` and `clang`, this should
116116
source ./runBuild.sh
117117
```
118118
NOTE: If you're running this from a Docker container generated from our Dockerfile, it should work out-of-the-box.
119+
119120
We originally had the CUDA codes building with `nvcc`, but to be able to also build SYCL and OMP codes, we switched to just LLVM. You may still be able to build the codes with `nvcc`, but it may take some modifications to the build pipeline.
120121
We have future plans to sample SYCL and OMP codes, but for now, this work focuses on CUDA codes.
121122

@@ -202,7 +203,7 @@ python3 simpleScrapeKernels.py
202203
The scraped output will be a file called `simple-scraped-kernels-with-sass.json` in JSON format.
203204
We particularly do this simple form of scraping because we're struggling to have a proper AST traversal script that can properly extract CUDA kernels from source.
204205
This is a future step we're working on. For now, this file contains all the source files from each executable that was built in the `build` directory.
205-
Because this is an update version, we include SASS code in the scrape, but these are not used in the final results of this paper.
206+
Because this is an updated version, we include SASS code in the scrape, but these are not used in the final results of this paper.
206207

207208
## Pruning the Scraped Kernels
208209

0 commit comments

Comments
 (0)