    Repositories list

    • litellm

      Public
      Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing and logging. [Bedrock, A…
      Python
      Other
      7.4k000 Updated Apr 7, 2026
    • seldon-core

      Public archive
      An MLOps framework to package, deploy, monitor and manage thousands of production machine learning models
      HTML
      Apache License 2.0
      862000 Updated Sep 16, 2025
    • lorax

      Public
      Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs
      Python
      Apache License 2.0
      3123.8k15030 Updated May 21, 2025
    • Python
      22020 Updated Sep 5, 2024
    • Jupyter Notebook
      1700 Updated Mar 3, 2024
    • Best practices for distilling large language models.
      Jupyter Notebook
      5662110 Updated Feb 1, 2024
    • huggingface_hub

      Public
      The official Python client for the Huggingface Hub.
      Python
      Apache License 2.0
      998000 Updated Dec 18, 2023
    • volcano

      Public archive
      A Cloud Native Batch System (Project under CNCF)
      Go
      Apache License 2.0
      1.3k001 Updated Dec 4, 2023
    • punica

      Public
      Serving multiple LoRA finetuned LLM as one
      Cuda
      62200 Updated Nov 24, 2023
    • volcano-apis

      Public archive
      The API (CRD) of Volcano
      Go
      Apache License 2.0
      102000 Updated Nov 8, 2023
    • LlamaIndex (GPT Index) is a data framework for your LLM applications
      Python
      MIT License
      7.3k000 Updated Aug 1, 2023
    • langchain

      Public
      ⚡ Building applications with LLMs through composability ⚡
      Python
      MIT License
      22k000 Updated Jul 20, 2023
    • Kubernetes Image Puller is used for caching images on a cluster. It creates a DaemonSet downloading and running the relevant container images on each node.
      Go
      Eclipse Public License 2.0
      41000 Updated Apr 20, 2023
    • PyBump

      Public
      Bump version in Helm Chart.yaml and setup.py files
      Python
      Apache License 2.0
      9000 Updated Dec 22, 2022
    • server

      Public
      The Triton Inference Server provides an optimized cloud and edge inferencing solution.
      Python
      BSD 3-Clause "New" or "Revised" License
      1.8k000 Updated Oct 22, 2022
    • dask-sql

      Public
      Distributed SQL Engine in Python using Dask
      Python
      MIT License
      70100 Updated Apr 5, 2022
    • Python
      BSD 3-Clause "New" or "Revised" License
      14100 Updated Feb 23, 2022
    • neuropod

      Public
      A uniform interface to run deep learning models from multiple frameworks
      C++
      Apache License 2.0
      73300 Updated Feb 23, 2022
    • GitHub action for identifying the last successful commit for a given workflow and branch.
      JavaScript
      52000 Updated Jan 5, 2021