Skip to content

Commit c9fde40

Browse files
committed
docs: expand model grid to 26 cards, 40 architectures (24 families)
1 parent 8c4df5b commit c9fde40

2 files changed

Lines changed: 10 additions & 2 deletions

File tree

content/_index.html

Lines changed: 9 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -487,7 +487,7 @@ <h3 style="font-size:1rem;font-weight:600;margin-bottom:16px">Performance journe
487487
<div class="wrap">
488488
<div class="section-head">
489489
<h2>Supported models</h2>
490-
<p>28 architectures across 16 model families. Load any GGUF model from HuggingFace.</p>
490+
<p>40 architectures across 24 model families. Load any GGUF model from HuggingFace.</p>
491491
</div>
492492
<div class="model-grid">
493493
<div class="model-card"><div class="name">Gemma 3/3n</div><div class="status prod">Transformer</div></div>
@@ -508,6 +508,14 @@ <h2>Supported models</h2>
508508
<div class="model-card"><div class="name">LLaVA/Qwen-VL</div><div class="status prod">Vision-language</div></div>
509509
<div class="model-card"><div class="name">BERT</div><div class="status prod">Encoder</div></div>
510510
<div class="model-card"><div class="name">Granite TS</div><div class="status prod">Time series</div></div>
511+
<div class="model-card"><div class="name">GLM-4/ChatGLM</div><div class="status prod">Transformer + MoE</div></div>
512+
<div class="model-card"><div class="name">Kimi K2</div><div class="status prod">Linear attention MoE</div></div>
513+
<div class="model-card"><div class="name">LFM2</div><div class="status prod">Hybrid MoE</div></div>
514+
<div class="model-card"><div class="name">OLMo 2</div><div class="status prod">Transformer</div></div>
515+
<div class="model-card"><div class="name">EXAONE</div><div class="status prod">Transformer</div></div>
516+
<div class="model-card"><div class="name">StarCoder 2</div><div class="status prod">Code generation</div></div>
517+
<div class="model-card"><div class="name">InternLM 2</div><div class="status prod">Transformer</div></div>
518+
<div class="model-card"><div class="name">DBRX</div><div class="status prod">Fine-grained MoE</div></div>
511519
</div>
512520
<div style="text-align:center;margin-top:32px">
513521
<p style="color:var(--fg3);font-size:.875rem">Uses GGUF as the sole model format. Compatible with llama.cpp, Ollama, LM Studio, and GPT4All model files.</p>

content/docs/reference/migration-v1.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -236,7 +236,7 @@ for usage of deprecated symbols.
236236
These are additive and do not require migration, but are worth knowing about:
237237

238238
- **Architecture registry** -- `inference.RegisterArchitecture` / `inference.ListArchitectures` for pluggable model support.
239-
- **28 architectures (16 model families)** -- Llama 3/4, Gemma 3/3n, Mistral, Qwen 2, Phi 3/4, DeepSeek V3, GPT-2, Nemotron-H, MiniMax M2, Falcon, Command R, Mixtral, RWKV, Jamba, Mamba 3, Whisper, and more.
239+
- **28 architectures (16 model families)** -- Llama 3/4, Gemma 3/3n, Mistral, Qwen 2, Phi 3/4, DeepSeek V3, GPT-2, Nemotron-H, MiniMax M2, GLM-4, Kimi K2, LFM2, OLMo 2, EXAONE, StarCoder 2, InternLM 2, DBRX, Falcon, Command R, Mixtral, RWKV, Jamba, Mamba 3, Whisper, and more.
240240
- **Speculative decoding** -- `inference.Model.SpeculativeGenerate` and `generate.WithSpeculativeDraft`.
241241
- **Paged KV cache** -- `generate.WithPagedKV` for memory-efficient serving.
242242
- **Prefix caching** -- `generate.WithPrefixCache` for shared system prompt reuse.

0 commit comments

Comments
 (0)