Skip to content

v0.2.4

Latest

Choose a tag to compare

@zhuzilin zhuzilin released this 29 Mar 13:02
· 23 commits to main since this release

v0.2.4 is here! Thanks to everyone who contributed to this release.

Major Updates

In addition to a broad set of bug fixes and stability improvements, v0.2.4 brings several major updates:

  • Profiling and observability improvements
    Added a rollout trace timeline viewer and W&B reporting for dynamic ITL / TTFT percentile metrics.
  • Router stack unified on sgl-router
    Consolidated the router stack onto sgl-router and removed slime-router.
  • Expanded multimodal and model support
    Improved support for GLM-4.6V / GLM4V, Multimodal OPD, and Qwen3.5-related workflows.

Other Notable Changes

  • Fixed CUDA IPC cache leaks during weight updates
  • Fixed SP/CP gradient inflation in FLA layers

What's Changed

New Contributors

Full Changelog: v0.2.3...v0.2.4