Skip to content

Commit 1e0f8ea

Browse files
committed
docs: update to 28 architectures (16 families), add GPT-2/Nemotron-H/MiniMax-M2
1 parent 8c62b60 commit 1e0f8ea

File tree

2 files changed

+2
-2
lines changed

2 files changed

+2
-2
lines changed

content/_index.html

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -487,7 +487,7 @@ <h3 style="font-size:1rem;font-weight:600;margin-bottom:16px">Performance journe
487487
<div class="wrap">
488488
<div class="section-head">
489489
<h2>Supported models</h2>
490-
<p>24 architectures across 13 model families. Load any GGUF model from HuggingFace.</p>
490+
<p>28 architectures across 16 model families. Load any GGUF model from HuggingFace.</p>
491491
</div>
492492
<div class="model-grid">
493493
<div class="model-card"><div class="name">Gemma 3</div><div class="status prod">Production</div></div>

content/docs/reference/migration-v1.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -236,7 +236,7 @@ for usage of deprecated symbols.
236236
These are additive and do not require migration, but are worth knowing about:
237237

238238
- **Architecture registry** -- `inference.RegisterArchitecture` / `inference.ListArchitectures` for pluggable model support.
239-
- **24 architectures (13 model families)** -- Llama 3, Gemma 3, Mistral, Qwen 2, Phi 3/4, DeepSeek V3, Falcon, Command R, Mixtral, RWKV, Jamba, Mamba 3, and more.
239+
- **28 architectures (16 model families)** -- Llama 3/4, Gemma 3/3n, Mistral, Qwen 2, Phi 3/4, DeepSeek V3, GPT-2, Nemotron-H, MiniMax M2, Falcon, Command R, Mixtral, RWKV, Jamba, Mamba 3, Whisper, and more.
240240
- **Speculative decoding** -- `inference.Model.SpeculativeGenerate` and `generate.WithSpeculativeDraft`.
241241
- **Paged KV cache** -- `generate.WithPagedKV` for memory-efficient serving.
242242
- **Prefix caching** -- `generate.WithPrefixCache` for shared system prompt reuse.

0 commit comments

Comments
 (0)