@@ -7,19 +7,19 @@ This guide provides instructions to use [checkpoint conversion scripts](https://
77The following models are supported:
88
99| Model Family | Sizes | HF $\to$ Orbax (scan) | HF $\to$ Orbax (unscan) | Orbax (scan) $\to$ HF | Orbax (unscan) $\to$ HF |
10- | :---------------------- | :--------------------- | :-------------------- : | :---------------------- : | :-------------------- : | :- ---------------------: |
11- | ** Gemma2** | 2B, 9B, 27B | √ | √ | √ | √ |
12- | ** Gemma3** (Multimodal) | 4B, 12B, 27B | √ | √ | √ | √ |
13- | ** Llama3.1** | 8B, 70B, 450B | √ | √ | √ | √ |
14- | ** Qwen2.5** | 1.5B, 7B, 14B | √ | √ | √ | √ |
15- | ** Qwen3** | 0.6B, 4B, 8B, 14B, 32B | √ | √ | √ | √ |
16- | ** Qwen3 MoE** | 30B, 235B, 480B | √ | √ | √ | √ |
17- | ** Mixtral** | 8x7B, 8x22B | √ | √ | √ | √ |
18- | ** GPT-OSS** | 20B, 120B | √ | √ | √ | √ |
19- | ** DeepSeek2** | 16B | √ | √ | √ | √ |
20- | ** DeepSeek3** | 671B | √ | √ | √ | √ |
21- | ** DeepSeek3.2** | 671B | √ | √ | - | - |
22- | ** Qwen3 Next** | 80B | √ | √ | √ | √ |
10+ | :---------------------- | :--------------------- | :-------------------: | :---------------------: | :-------------------: | :---------------------: |
11+ | ** Gemma2** | 2B, 9B, 27B | √ | √ | √ | √ |
12+ | ** Gemma3** (Multimodal) | 4B, 12B, 27B | √ | √ | √ | √ |
13+ | ** Llama3.1** | 8B, 70B, 450B | √ | √ | √ | √ |
14+ | ** Qwen2.5** | 1.5B, 7B, 14B | √ | √ | √ | √ |
15+ | ** Qwen3** | 0.6B, 4B, 8B, 14B, 32B | √ | √ | √ | √ |
16+ | ** Qwen3 MoE** | 30B, 235B, 480B | √ | √ | √ | √ |
17+ | ** Mixtral** | 8x7B, 8x22B | √ | √ | √ | √ |
18+ | ** GPT-OSS** | 20B, 120B | √ | √ | √ | √ |
19+ | ** DeepSeek2** | 16B | √ | √ | √ | √ |
20+ | ** DeepSeek3** | 671B | √ | √ | √ | √ |
21+ | ** DeepSeek3.2** | 671B | √ | √ | - | - |
22+ | ** Qwen3 Next** | 80B | √ | √ | √ | √ |
2323
2424## Prerequisites
2525
0 commit comments