Skip to content

Commit 2cfa24a

Browse files
authored
Merge pull request #152 from AI-Hypercomputer/ajkv/docker-update
Use custom docker image to run code and remove jax-tpu-emb wheel
2 parents 5d1c73e + 614c1ed commit 2cfa24a

2 files changed

Lines changed: 9 additions & 5 deletions

File tree

Binary file not shown.

training.md

Lines changed: 9 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -44,24 +44,28 @@ python dlrm_experiment_test.py
4444

4545
## Option 2: Docker (Recommended for Production)
4646

47-
If you prefer not to manage a virtual environment or want to deploy this as a container, you can build a Docker image.
47+
If you prefer not to manage a virtual environment or want to deploy this as a container, you can use a docker image. We provide two options: (1) Building your own docker image with the Dockerfile provided in this repo; (2) Use our latest docker image from Dockerhub to run the code.
4848

4949
### 1. Build the Image
5050

51-
Run this command from the root of the repository. It reads the `Dockerfile`, installs all dependencies, and creates a ready-to-run image.
51+
Run this command from the root of the repository. It reads the `Dockerfile`, installs all dependencies, and creates a ready-to-run image. You will need to have the jax-tpu-embedding wheel for building your own docker image. Steps to get the wheel can be found here: https://github.com/jax-ml/jax-tpu-embedding.
5252

5353
```bash
5454
docker build -t recml-training .
5555
```
5656

57-
### 2. Run the Image
57+
### 2. Use Our Image From Dockerhub
5858

59-
This will run the docker image and execute the command specified, which is currently set to run DLRM.
59+
The image name is: `docker.io/recsyscmcs/recml-tpu:v1.0.0`. This image contains all the latest dependencies and sets up the env for RecML to run the algorithms successfully on V6 and V7 TPUs.
60+
61+
### Run DLRM Using Docker Image
62+
63+
This will run the docker image and execute the command specified, which is currently set to run DLRM. The below command uses our latest image, but feel free to change the image to your own.
6064

6165
```bash
6266
docker run --rm --privileged \
6367
--net=host \
6468
--ipc=host \
6569
--name recml-experiment \
66-
recml-training
70+
docker.io/recsyscmcs/recml-tpu:v1.0.0
6771
```

0 commit comments

Comments
 (0)