Skip to content
Change the repository type filter

All

    Repositories list

    • OmniScript

      Public
      OmniScript: Towards Audio-Visual Script Generation for Long-Form Cinematic Video
      0110Updated Apr 22, 2026Apr 22, 2026
    • TimeLens

      Public
      [CVPR 2026] TimeLens: Rethinking Video Temporal Grounding with Multimodal LLMs
      Python
      Other
      1012880Updated Apr 21, 2026Apr 21, 2026
    • DSR_Suite

      Public
      Jupyter Notebook
      Apache License 2.0
      77020Updated Apr 21, 2026Apr 21, 2026
    • MotionCrafter

      Public
      [CVPR 2026 Highlight🔥] MotionCrafter: Dense Geometry and Motion Reconstruction with a 4D VAE
      Python
      Other
      615510Updated Apr 20, 2026Apr 20, 2026
    • CubeComposer

      Public
      [CVPR 2026] Spatio-Temporal Autoregressive 4K 360° Video Generation from Perspective Video
      Python
      Other
      1011020Updated Mar 24, 2026Mar 24, 2026
    • GenCompositor

      Public
      [ICLR 2026] GenCompositor: Generative Video Compositing with Diffusion Transformer
      Python
      Other
      815430Updated Mar 16, 2026Mar 16, 2026
    • Track4World: Feedforward World-centric Dense 3D Tracking of All Pixels
      Python
      Other
      2021410Updated Mar 11, 2026Mar 11, 2026
    • VerseCrafter

      Public
      VerseCrafter: Dynamic Realistic Video World Model with 4D Geometric Control
      Python
      Other
      2635680Updated Feb 26, 2026Feb 26, 2026
    • ColorFlow

      Public
      The official implementation of paper "ColorFlow: Retrieval-Augmented Image Sequence Colorization". ColorFlow:基于检索增强的图像序列上色
      Python
      Other
      41460140Updated Dec 10, 2025Dec 10, 2025
    • SEED-Voken

      Public
      SEED-Voken: A Series of Powerful Visual Tokenizers
      Python
      Apache License 2.0
      441k21Updated Nov 25, 2025Nov 25, 2025
    • ARC-Chapter

      Public
      Structuring Hour-Long Videos into Navigable Chapters and Hierarchical Summaries
      Apache License 2.0
      24140Updated Nov 19, 2025Nov 19, 2025
    • BlobCtrl

      Public
      [SIGGRAPH ASIA'25] BlobCtrl: Taming Controllable Blob for Element-level Image Editing
      Python
      Other
      32410Updated Nov 14, 2025Nov 14, 2025
    • RollingForcing

      Public
      [ICLR 2026] Official Repo for Rolling Forcing: Autoregressive Long Video Diffusion in Real Time
      Python
      Other
      18374131Updated Oct 31, 2025Oct 31, 2025
    • MindOmni

      Public
      [NeurIPS2025] The official implementation of MindOmni: Unleashing Reasoning Generation in Vision Language Models with RGPO
      Python
      Other
      313920Updated Oct 15, 2025Oct 15, 2025
    • vllm

      Public
      vllm for ARC-Hunyuan-Video-7B
      Python
      Apache License 2.0
      0305Updated Oct 6, 2025Oct 6, 2025
    • GeometryCrafter

      Public
      [ICCV 2025] GeometryCrafter: Consistent Geometry Estimation for Open-world Videos with Diffusion Priors
      Python
      Other
      1944150Updated Oct 2, 2025Oct 2, 2025
    • Moto

      Public
      [ICCV2025 Oral] Latent Motion Token as the Bridging Language for Learning Robot Manipulation from Videos
      Python
      Other
      817260Updated Oct 1, 2025Oct 1, 2025
    • ARC-Hunyuan-Video-7B

      Public
      Structured Video Comprehension of Real-World Shorts
      Python
      Other
      7237150Updated Sep 21, 2025Sep 21, 2025
    • AudioStory

      Public
      AudioStory: Generating Long-Form Narrative Audio with Large Language Models
      Jupyter Notebook
      2230131Updated Sep 21, 2025Sep 21, 2025
    • IC-Custom

      Public
      [ICLR'26] IC-Custom: Diverse Image Customization via In-Context Learning
      Python
      Other
      416010Updated Sep 15, 2025Sep 15, 2025
    • BrushEdit

      Public
      [under review] The official implementation of paper "BrushEdit: All-In-One Image Inpainting and Editing"
      Python
      Other
      30589110Updated Sep 3, 2025Sep 3, 2025
    • ToonComposer

      Public
      [ICLR 2026] Streamlining Cartoon Production with Generative Post-Keyframing
      Python
      Other
      5455990Updated Aug 20, 2025Aug 20, 2025
    • TokLIP

      Public
      TokLIP: Marry Visual Tokens to CLIP for Multimodal Comprehension and Generation
      Python
      Other
      623780Updated Aug 18, 2025Aug 18, 2025
    • FreeSplatter

      Public
      [ICCV 2025] FreeSplatter: Pose-free Gaussian Splatting for Sparse-view 3D Reconstruction
      JavaScript
      Other
      17237102Updated Aug 4, 2025Aug 4, 2025
    • HTML
      0100Updated Aug 1, 2025Aug 1, 2025
    • Video-Holmes

      Public
      Video-Holmes: Can MLLM Think Like Holmes for Complex Video Reasoning?
      Python
      Apache License 2.0
      29120Updated Jul 13, 2025Jul 13, 2025
    • SEED-Bench-R1

      Public
      Python
      Apache License 2.0
      29920Updated Jun 23, 2025Jun 23, 2025
    • GRPO-CARE

      Public
      [ACL2026 Findings] GRPO-CARE: Consistency-Aware Reinforcement Learning for Multimodal Reasoning
      Python
      Apache License 2.0
      28150Updated Jun 23, 2025Jun 23, 2025
    • AnimeGamer

      Public
      [ICCV 2025] AnimeGamer: Infinite Anime Life Simulation with Next Game State Prediction
      Python
      Other
      2934651Updated Apr 9, 2025Apr 9, 2025
    • VideoPainter

      Public
      [SIGGRAPH2025] Official repo for paper "Any-length Video Inpainting and Editing with Plug-and-Play Context Control"
      Python
      Other
      45600160Updated Apr 8, 2025Apr 8, 2025
    ProTip! When viewing an organization's repositories, you can use the props. filter to filter by custom property.