    Repositories list

    • uwazi

      Public
      Uwazi is a web-based, open-source solution for building and sharing document collections
      TypeScript
      MIT License
      9830241115 Updated Apr 22, 2026
    • TypeScript
      0400 Updated Apr 20, 2026
    • A Docker-powered service for PDF document layout analysis. This service provides a powerful and flexible PDF analysis service. The service allows for the segmen…
      Python
      Apache License 2.0
      1271.1k47 Updated Apr 13, 2026
    • ML-Benchmarks

      Public
      Repository to store all the ML benchmarks
      0000 Updated Apr 10, 2026
    • Python API to interact with Uwazi
      Python
      0240 Updated Mar 13, 2026
    • NER-in-docker
      Python
      0607 Updated Mar 3, 2026
    • pdf-document-layout-analysis-async
      Python
      0105 Updated Feb 27, 2026
    • ml-cloud-connector
      Python
      Apache License 2.0
      0000 Updated Feb 23, 2026
    • docker-translation-service

      Public
      docker-translation-service
      Python
      Apache License 2.0
      0006 Updated Feb 18, 2026
    • text selection handling and highlighting
      TypeScript
      Apache License 2.0
      0160 Updated Jan 31, 2026
    • HTML
      MIT License
      3260 Updated Jan 28, 2026
    • pdf-features
      Python
      0300 Updated Jan 20, 2026
    • pdf_information_extraction
      Python
      1508 Updated Jan 9, 2026
    • Trainable Entity Extractor
      Python
      Apache License 2.0
      0507 Updated Jan 9, 2026
    • preserve

      Public
      Preserve is a tool for capturing and saving online digital content. Integrated with Uwazi, Preserve captures content from websites, social media and communicati…
      TypeScript
      MIT License
      161211 Updated Nov 18, 2025
    • NER-in-uwazi
      Python
      MIT License
      0000 Updated Oct 20, 2025
    • queue-processor
      Python
      Apache License 2.0
      0000 Updated Oct 2, 2025
    • Python
      0000 Updated Aug 29, 2025
    • TypeScript
      Apache License 2.0
      1300 Updated Mar 18, 2025
    • rison

      Public
      JavaScript
      Apache License 2.0
      5001 Updated Mar 11, 2025
    • This project aims to extract text from PDF files using the outputs generated by the pdf-document-layout-analysis service. By leveraging the segmentation and cla…
      Makefile
      Apache License 2.0
      43820 Updated Feb 3, 2025
    • This project aims to extract Table of Contents (TOC) information from PDF files using the outputs generated by the pdf-document-layout-analysis service. By leve…
      Makefile
      Apache License 2.0
      42020 Updated Feb 3, 2025
    • An HTTP service to OCR PDFs based on a Redis queue.
      Python
      MIT License
      0130 Updated Dec 13, 2024
    • An HTTP service to convert documents to PDF based on a Redis queue.
      Python
      MIT License
      0037 Updated Sep 19, 2024
    • Python
      3316 Updated Jul 4, 2024
    • Python
      MIT License
      64914 Updated Jul 4, 2024
    • Python
      21600 Updated Apr 26, 2024
    • Python
      MIT License
      45104 Updated May 25, 2023
    • Twitter crawler
      Python
      0101 Updated Apr 3, 2023
    • Python
      5313 Updated Dec 27, 2022