You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Enables a three-tier KV cache with automatic promotion and demotion across hot (GPU/CPU memory), warm (compressed CPU memory), and cold (disk) storage. Layers are tracked by access count; infrequently-used layers are demoted to lower tiers while hot layers remain in uncompressed memory. An async prefetch goroutine moves cold layers back to hot before they are needed.
225
+
226
+
When `cfg.ColdDir` is empty, a temporary directory is created and deleted by `Close()`. When `cfg.ColdDir` is non-empty, the directory is left intact after `Close()` so cold-tier files can be reused across generation calls.
227
+
228
+
```go
229
+
gen := generate.NewGenerator[float32](graph, tok, engine, cfg,
0 commit comments