Commit e735af1

Merge pull request #3560 from AI-Hypercomputer:fix_markdown
PiperOrigin-RevId: 893813217
2 parents 655c103 + 2381295 commit e735af1

10 files changed

Lines changed: 54 additions & 54 deletions
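
Every hunk in this commit makes the same mechanical change: a colon-fence directive line such as `:::\{grid-item-card}` loses the backslash before the opening brace, which appears to be what lets the docs build read it as a directive again. The following is a minimal sketch of a script that would produce the same edit, not the tool actually used for this commit; the function name `unescape_directive_fences` and the sample text are hypothetical.

```python
import re

# Three or more colons at the start of a line, followed by an escaped brace.
# Assumption: only directive fences need fixing; other backslashes are left alone.
_ESCAPED_FENCE = re.compile(r"^(:{3,})\\\{", flags=re.MULTILINE)


def unescape_directive_fences(text: str) -> str:
    """Rewrite ':::\\{name}' as ':::{name}' on every fence line in `text`."""
    return _ESCAPED_FENCE.sub(r"\1{", text)


# Hypothetical sample modelled on the removed lines in this diff.
sample = (
    "::::\\{grid} 1 2 2 2\n"
    ":gutter: 2\n"
    "\n"
    ":::\\{grid-item-card} Optimization\n"
    ":link: guides/optimization\n"
)
fixed = unescape_directive_fences(sample)
print(fixed)
```

Anchoring the pattern to the start of the line keeps the substitution from touching escaped braces that occur inside ordinary prose.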

docs/guides.md

Lines changed: 7 additions & 7 deletions
@@ -18,45 +18,45 @@

 Explore our how-to guides for optimizing, debugging, and managing your MaxText workloads.

-::::\{grid} 1 2 2 2
+::::{grid} 1 2 2 2
 :gutter: 2

-:::\{grid-item-card} ⚡ Optimization
+:::{grid-item-card} ⚡ Optimization
 :link: guides/optimization
 :link-type: doc

 Techniques for maximizing performance, including sharding strategies, Pallas kernels, and benchmarking.
 :::

-:::\{grid-item-card} 💾 Data Pipelines
+:::{grid-item-card} 💾 Data Pipelines
 :link: guides/data_input_pipeline
 :link-type: doc

 Configure input pipelines using **Grain** (recommended for determinism), **HuggingFace**, or **TFDS**.
 :::

-:::\{grid-item-card} 🔄 Checkpointing
+:::{grid-item-card} 🔄 Checkpointing
 :link: guides/checkpointing_solutions
 :link-type: doc

 Manage GCS checkpoints, handle preemption with emergency checkpointing, and configure multi-tier storage.
 :::

-:::\{grid-item-card} 🔍 Monitoring & Debugging
+:::{grid-item-card} 🔍 Monitoring & Debugging
 :link: guides/monitoring_and_debugging
 :link-type: doc

 Tools for observability: goodput monitoring, hung job debugging, and Vertex AI TensorBoard integration.
 :::

-:::\{grid-item-card} 🐍 Python Notebooks
+:::{grid-item-card} 🐍 Python Notebooks
 :link: guides/run_python_notebook
 :link-type: doc

 Interactive development guides for running MaxText on Google Colab or local JupyterLab environments.
 :::

-:::\{grid-item-card} 🌱 Model Bringup
+:::{grid-item-card} 🌱 Model Bringup
 :link: guides/model_bringup
 :link-type: doc

docs/guides/checkpointing_solutions.md

Lines changed: 5 additions & 5 deletions
@@ -2,31 +2,31 @@

 # Checkpointing

-::::\{grid} 1 2 2 2
+::::{grid} 1 2 2 2
 :gutter: 2

-:::\{grid-item-card} 💾 GCS Checkpointing
+:::{grid-item-card} 💾 GCS Checkpointing
 :link: checkpointing_solutions/gcs_checkpointing
 :link-type: doc

 Standard checkpointing to Google Cloud Storage.
 :::

-:::\{grid-item-card} 🚑 Emergency Checkpointing
+:::{grid-item-card} 🚑 Emergency Checkpointing
 :link: checkpointing_solutions/emergency_checkpointing
 :link-type: doc

 Handle preemption and recover training progress.
 :::

-:::\{grid-item-card} 🗄️ Multi-tier checkpointing
+:::{grid-item-card} 🗄️ Multi-tier checkpointing
 :link: checkpointing_solutions/multi_tier_checkpointing
 :link-type: doc

 Optimize storage costs and performance with multi-tier usage.
 :::

-:::\{grid-item-card} 🔁 Checkpoint conversion utilities
+:::{grid-item-card} 🔁 Checkpoint conversion utilities
 :link: checkpointing_solutions/convert_checkpoint
 :link-type: doc

docs/guides/monitoring_and_debugging.md

Lines changed: 8 additions & 8 deletions
@@ -16,52 +16,52 @@

 # Monitoring and debugging

-::::\{grid} 1 2 2 2
+::::{grid} 1 2 2 2
 :gutter: 2

-:::\{grid-item-card} 🕵️ Features & Diagnostics
+:::{grid-item-card} 🕵️ Features & Diagnostics
 :link: monitoring_and_debugging/features_and_diagnostics
 :link-type: doc

 Diagnostic tools and features for monitoring MaxText.
 :::

-:::\{grid-item-card} ☁️ GCP Observability
+:::{grid-item-card} ☁️ GCP Observability
 :link: monitoring_and_debugging/gcp_workload_observability
 :link-type: doc

 Observability for workloads running on Google Cloud Platform.
 :::

-:::\{grid-item-card} 🚫 Hang Playbook
+:::{grid-item-card} 🚫 Hang Playbook
 :link: monitoring_and_debugging/megascale_hang_playbook
 :link-type: doc

 Troubleshooting guide for training hangs at megascale.
 :::

-:::\{grid-item-card} 📈 Goodput
+:::{grid-item-card} 📈 Goodput
 :link: monitoring_and_debugging/monitor_goodput
 :link-type: doc

 Monitoring efficient training time (Goodput).
 :::

-:::\{grid-item-card} 📊 Logs & Metrics
+:::{grid-item-card} 📊 Logs & Metrics
 :link: monitoring_and_debugging/understand_logs_and_metrics
 :link-type: doc

 Understanding MaxText logs and performance metrics.
 :::

-:::\{grid-item-card} 📉 TensorBoard
+:::{grid-item-card} 📉 TensorBoard
 :link: monitoring_and_debugging/use_vertex_ai_tensorboard
 :link-type: doc

 Using Vertex AI TensorBoard for visualization.
 :::

-:::\{grid-item-card} ⏱️ XProf
+:::{grid-item-card} ⏱️ XProf
 :link: monitoring_and_debugging/xprof_user_guide
 :link-type: doc

docs/guides/optimization.md

Lines changed: 5 additions & 5 deletions
@@ -18,31 +18,31 @@

 Explore techniques for maximizing performance, including model customization, sharding strategies, Pallas kernels, and benchmarking.

-::::\{grid} 1 2 2 2
+::::{grid} 1 2 2 2
 :gutter: 2

-:::\{grid-item-card} 🛠️ Customizing Model Configs
+:::{grid-item-card} 🛠️ Customizing Model Configs
 :link: optimization/custom_model
 :link-type: doc

 Optimize and customize your LLM model configurations for higher performance (MFU) on TPUs.
 :::

-:::\{grid-item-card} 🥞 Sharding Strategies
+:::{grid-item-card} 🥞 Sharding Strategies
 :link: optimization/sharding
 :link-type: doc

 Choose efficient sharding strategies (FSDP, TP, EP, PP) using Roofline Analysis and understand arithmetic intensity.
 :::

-:::\{grid-item-card} ⚡ Pallas Kernels
+:::{grid-item-card} ⚡ Pallas Kernels
 :link: optimization/pallas_kernels_performance
 :link-type: doc

 Optimize with Pallas kernels for fine-grained control.
 :::

-:::\{grid-item-card} 📈 Benchmarking & Tuning
+:::{grid-item-card} 📈 Benchmarking & Tuning
 :link: optimization/benchmark_and_performance
 :link-type: doc

docs/reference.md

Lines changed: 5 additions & 5 deletions
@@ -18,31 +18,31 @@

 Deep dive into MaxText architecture, models, and core concepts.

-::::\{grid} 1 2 2 2
+::::{grid} 1 2 2 2
 :gutter: 2

-:::\{grid-item-card} 📊 Performance Metrics
+:::{grid-item-card} 📊 Performance Metrics
 :link: reference/performance_metrics
 :link-type: doc

 Understanding Model Flops Utilization (MFU), calculation methods, and why it matters for performance optimization.
 :::

-:::\{grid-item-card} 🤖 Models
+:::{grid-item-card} 🤖 Models
 :link: reference/models
 :link-type: doc

 Supported models and architectures, including Llama, Qwen, and Mixtral. Details on tiering and new additions.
 :::

-:::\{grid-item-card} 🏗️ Architecture
+:::{grid-item-card} 🏗️ Architecture
 :link: reference/architecture
 :link-type: doc

 High-level overview of MaxText design, JAX/XLA choices, and how components interact.
 :::

-:::\{grid-item-card} 💡 Core Concepts
+:::{grid-item-card} 💡 Core Concepts
 :link: reference/core_concepts
 :link-type: doc

docs/reference/architecture.md

Lines changed: 3 additions & 3 deletions
@@ -1,16 +1,16 @@
 # Architecture

-::::\{grid} 1 2 2 2
+::::{grid} 1 2 2 2
 :gutter: 2

-:::\{grid-item-card} 🗺️ Overview
+:::{grid-item-card} 🗺️ Overview
 :link: architecture/architecture_overview
 :link-type: doc

 High-level overview of MaxText design and components.
 :::

-:::\{grid-item-card} 📚 JAX/AI Libraries
+:::{grid-item-card} 📚 JAX/AI Libraries
 :link: architecture/jax_ai_libraries_chosen
 :link-type: doc

docs/reference/core_concepts.md

Lines changed: 7 additions & 7 deletions
@@ -16,45 +16,45 @@

 # Core concepts

-::::\{grid} 1 2 2 2
+::::{grid} 1 2 2 2
 :gutter: 2

-:::\{grid-item-card} 💾 Checkpoints
+:::{grid-item-card} 💾 Checkpoints
 :link: core_concepts/checkpoints
 :link-type: doc

 Understanding checkpoint formats and strategies.
 :::

-:::\{grid-item-card} ⚖️ Alternatives
+:::{grid-item-card} ⚖️ Alternatives
 :link: core_concepts/alternatives
 :link-type: doc

 Comparison with other frameworks like Megatron-LM.
 :::

-:::\{grid-item-card} 📉 Quantization
+:::{grid-item-card} 📉 Quantization
 :link: core_concepts/quantization
 :link-type: doc

 Techniques for reducing model size and improving performance.
 :::

-:::\{grid-item-card} 🧱 Tiling
+:::{grid-item-card} 🧱 Tiling
 :link: core_concepts/tiling
 :link-type: doc

 Understanding tiling strategies for partitioning logic.
 :::

-:::\{grid-item-card} ⚡ JAX/XLA/Pallas
+:::{grid-item-card} ⚡ JAX/XLA/Pallas
 :link: core_concepts/jax_xla_and_pallas
 :link-type: doc

 How MaxText leverages JAX, XLA, and Pallas for efficiency.
 :::

-:::\{grid-item-card} 🧠 MoE Configuration
+:::{grid-item-card} 🧠 MoE Configuration
 :link: core_concepts/moe_configuration
 :link-type: doc

docs/reference/models.md

Lines changed: 3 additions & 3 deletions
@@ -1,16 +1,16 @@
 # Models

-::::\{grid} 1 2 2 2
+::::{grid} 1 2 2 2
 :gutter: 2

-:::\{grid-item-card} 🥇 Tiering
+:::{grid-item-card} 🥇 Tiering
 :link: models/tiering
 :link-type: doc

 Optimized model tiers (Gold, Silver) for various TPU generations.
 :::

-:::\{grid-item-card} 🏗️ Supported Models
+:::{grid-item-card} 🏗️ Supported Models
 :link: models/supported_models_and_architectures
 :link-type: doc

docs/run_maxtext.md

Lines changed: 6 additions & 6 deletions
@@ -2,38 +2,38 @@

 Choose your environment and orchestration method to run MaxText.

-::::\{grid} 1 2 2 2
+::::{grid} 1 2 2 2
 :gutter: 2

-:::\{grid-item-card} 💻 Localhost / Single VM
+:::{grid-item-card} 💻 Localhost / Single VM
 :link: run_maxtext/run_maxtext_localhost
 :link-type: doc

 Get started quickly on a single machine. Clone the repo, install dependencies, and run your first training job on a single TPU or GPU VM.
 :::

-:::\{grid-item-card} 🎮 Single-host GPU
+:::{grid-item-card} 🎮 Single-host GPU
 :link: run_maxtext/run_maxtext_single_host_gpu
 :link-type: doc

 Run MaxText on single-host NVIDIA GPUs (e.g., A3 High/Mega). Includes Docker setup, NVIDIA Container Toolkit installation, and 1B/7B model training examples.
 :::

-:::\{grid-item-card} 🏗️ At scale with XPK (GKE)
+:::{grid-item-card} 🏗️ At scale with XPK (GKE)
 :link: run_maxtext/run_maxtext_via_xpk
 :link-type: doc

 Deploy to Google Kubernetes Engine (GKE) using XPK. Orchestrate large-scale training jobs on TPU or GPU clusters with simple CLI commands.
 :::

-:::\{grid-item-card} 🌐 Multi-host via Pathways
+:::{grid-item-card} 🌐 Multi-host via Pathways
 :link: run_maxtext/run_maxtext_via_pathways
 :link-type: doc

 Run large-scale JAX jobs on TPUs using Pathways. Supports batch and headless (interactive) workloads on GKE.
 :::

-:::\{grid-item-card} 🔌 Decoupled Mode
+:::{grid-item-card} 🔌 Decoupled Mode
 :link: run_maxtext/decoupled_mode
 :link-type: doc

docs/tutorials.md

Lines changed: 5 additions & 5 deletions
@@ -18,31 +18,31 @@

 Explore our tutorials to learn how to use MaxText, from your first run to advanced post-training techniques.

-::::\{grid} 1 2 2 2
+::::{grid} 1 2 2 2
 :gutter: 2

-:::\{grid-item-card} 🚀 Getting Started
+:::{grid-item-card} 🚀 Getting Started
 :link: tutorials/first_run
 :link-type: doc

 Installation, prerequisites, verification, and your first training run.
 :::

-:::\{grid-item-card} 📚 Pre-training
+:::{grid-item-card} 📚 Pre-training
 :link: tutorials/pretraining
 :link-type: doc

 Step-by-step guides for pre-training with real datasets like C4 using HuggingFace, Grain, or TFDS.
 :::

-:::\{grid-item-card} 🧩 Post-training
+:::{grid-item-card} 🧩 Post-training
 :link: tutorials/post_training_index
 :link-type: doc

 Techniques for SFT, RL, and other post-training workflows on TPU.
 :::

-:::\{grid-item-card} 📊 Inference
+:::{grid-item-card} 📊 Inference
 :link: tutorials/inference
 :link-type: doc
