Skip to content

Commit a74b875

Browse files
committed
docs: apply mdformat to documentation files
1 parent ba5e740 commit a74b875

2 files changed

Lines changed: 14 additions & 14 deletions

File tree

docs/guides/checkpointing_solutions/convert_checkpoint.md

Lines changed: 13 additions & 13 deletions
Original file line numberDiff line numberDiff line change
@@ -7,19 +7,19 @@ This guide provides instructions to use [checkpoint conversion scripts](https://
77
The following models are supported:
88

99
| Model Family | Sizes | HF $\to$ Orbax (scan) | HF $\to$ Orbax (unscan) | Orbax (scan) $\to$ HF | Orbax (unscan) $\to$ HF |
10-
| :---------------------- | :--------------------- | :--------------------: | :----------------------: | :--------------------: | :----------------------: |
11-
| **Gemma2** | 2B, 9B, 27B | | | | |
12-
| **Gemma3** (Multimodal) | 4B, 12B, 27B | | | | |
13-
| **Llama3.1** | 8B, 70B, 450B | | | | |
14-
| **Qwen2.5** | 1.5B, 7B, 14B | | | | |
15-
| **Qwen3** | 0.6B, 4B, 8B, 14B, 32B | | | | |
16-
| **Qwen3 MoE** | 30B, 235B, 480B | | | | |
17-
| **Mixtral** | 8x7B, 8x22B | | | | |
18-
| **GPT-OSS** | 20B, 120B | | | | |
19-
| **DeepSeek2** | 16B | | | | |
20-
| **DeepSeek3** | 671B | | | | |
21-
| **DeepSeek3.2** | 671B | | | - | - |
22-
| **Qwen3 Next** | 80B | | | | |
10+
| :---------------------- | :--------------------- | :-------------------: | :---------------------: | :-------------------: | :---------------------: |
11+
| **Gemma2** | 2B, 9B, 27B |||||
12+
| **Gemma3** (Multimodal) | 4B, 12B, 27B |||||
13+
| **Llama3.1** | 8B, 70B, 450B |||||
14+
| **Qwen2.5** | 1.5B, 7B, 14B |||||
15+
| **Qwen3** | 0.6B, 4B, 8B, 14B, 32B |||||
16+
| **Qwen3 MoE** | 30B, 235B, 480B |||||
17+
| **Mixtral** | 8x7B, 8x22B |||||
18+
| **GPT-OSS** | 20B, 120B |||||
19+
| **DeepSeek2** | 16B |||||
20+
| **DeepSeek3** | 671B |||||
21+
| **DeepSeek3.2** | 671B ||| - | - |
22+
| **Qwen3 Next** | 80B |||||
2323

2424
## Prerequisites
2525

docs/tutorials/posttraining/multimodal.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -155,7 +155,7 @@ python -m maxtext.trainers.post_train.sft.train_sft_deprecated \
155155

156156
$\text{Text Tokens} \approx 1.3 \times \text{Number of Words in Prompt}$.
157157

158-
- For Gemma3, each image is resized to 896*896 and contributes 256 tokens:
158+
- For Gemma3, each image is resized to 896\*896 and contributes 256 tokens:
159159

160160
$\text{Total Tokens} \approx \text{Text Tokens} + \text{Number of Images} * 256$.
161161

0 commit comments

Comments
 (0)