Python Machine Learning

Open-source Python projects categorized as Machine Learning

Top 23 Python Machine Learning Projects

  1. transformers

    🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

    Project mention: Mastering AI Language Models: From NLP Foundations to 2025 Innovations | dev.to | 2026-03-07

    Ready to build your own language AI? Explore Hugging Face's Transformers library and test your skills with our interactive coding challenges at AIAcademy.tech!

  2. InfluxDB

    InfluxDB – Database Purpose-Built for High-Resolution Data. Turn time series data into real-time intelligence. Manage high-volume, high-velocity data without sacrificing performance.

  3. Pytorch

    Tensors and Dynamic neural networks in Python with strong GPU acceleration

    Project mention: Avoid UUIDv4 Primary Keys | news.ycombinator.com | 2025-12-15

    > A running number also carries data. Before you know it, someone's relying on the ordering or counting on there not being gaps - or counting the gaps to figure out something they shouldn't.

    For example, if https://github.com/pytorch/pytorch/issues/111111 can be seen but https://github.com/pytorch/pytorch/issues/111110 can't, someone might infer the existence of a hidden issue relating to a critical security problem.

    Whereas if the URL was instead https://github.com/pytorch/pytorch/issues/761500e0-0070-4c0d... that risk would be avoided.

  4. nn

    🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠

  5. scikit-learn

    scikit-learn: machine learning in Python

    Project mention: Building a Personalized Meal Recommendation System | dev.to | 2026-02-26

    In practice, you’ll want to use libraries (like scikit-learn or TensorFlow.js for more advanced modeling), but the principle remains: find what similar users enjoy, and use that as a basis for recommendations.

  6. Keras

    Deep Learning for humans

    Project mention: PyTorch vs TensorFlow 2025: Which one wins after 72 hours? | dev.to | 2025-08-29

    Keras 3 multi-backend

  7. OpenBB

    Financial data platform for analysts, quants and AI agents.

    Project mention: The Death of Software Development | news.ycombinator.com | 2026-01-17

    How would you rate OpenBB [0]? It’s touted as a Bloomberg Terminal alternative and it has most certainly been included in training for all SOTA models.

    0. https://github.com/OpenBB-finance/OpenBB

  8. yolov5

    YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite

    Project mention: Teaching AI to Read Emotions: Science, Challenges, and Innovation Behind Facial Emotion Detection with YOLOv11 on Raspberry Pi | dev.to | 2025-11-23

    Ultralytics YOLO Documentation

  9. SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

  10. Face Recognition

    The world's simplest facial recognition api for Python and the command line

    Project mention: Show HN: Real-time privacy protection for smart glasses | news.ycombinator.com | 2025-08-11

    Did you look at egoblur? It's a lot more effective at face detection than https://github.com/ageitgey/face_recognition. Granted, you'd have to do your own face matching to do exception.

  11. faceswap

    Deepfakes Software For All

  12. ultralytics

    Ultralytics YOLO 🚀

    Project mention: Why DETRs are replacing YOLOs for real-time object detection | news.ycombinator.com | 2025-11-22

    > The YOLO series is developed and maintained by Ultralytics. All YOLO code and weights are released under the AGPL-3.0 license.

    The original author of YOLO and the Darknet framework [1] issued the code under pretty much every license you wish to use [2]. My preferred fork by AlexeyAB is under an equally permissive license [3].

    Ultralytics then created their own model under the AGPL-3.0 license [4], which probably would never stand up in a court as they have the model from the likes of YOLOv3 in their source [5].

    This entire article is flawed anyway, because they don't state which YOLOv11 model they are using or compare the accuracy. They appear to have just taken the pre-trained models and assumed it's apples-to-apples. They could have at least compared YOLO11n/s/m/l/x,

    [1] https://pjreddie.com/darknet/yolo/

    [2] https://github.com/pjreddie/darknet

    [3] https://github.com/AlexeyAB/darknet

    [4] https://github.com/ultralytics/ultralytics

    [5] https://github.com/ultralytics/ultralytics/tree/main/ultraly...

  13. Airflow

    Apache Airflow - A platform to programmatically author, schedule, and monitor workflows

    Project mention: Top Open-Source Data Engineering Tools- Unravelling the Best in 2026 | dev.to | 2025-12-10

    Airflow

  14. streamlit

    Streamlit — A faster way to build and share data apps.

    Project mention: Top 10 Open-Source User Interfaces for LLMs | dev.to | 2026-02-20

    GitHub Link: https://github.com/streamlit/streamlit

  15. gradio

    Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!

    Project mention: AI bot crabby-rathbun is still going | news.ycombinator.com | 2026-02-13
  16. DeepSpeed

    DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

    Project mention: AWS SageMaker HyperPod: Distributed Training for Foundation Models at Scale | dev.to | 2026-01-16

    DeepSpeed Optimization Library - An open-source library compatible with HyperPod that offers advanced pipeline and system optimizations for LLM training.

  17. Ray

    Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

    Project mention: GSoC 2026 Predictions: 30 NEW AI/ML/Security Organizations You Should Start Contributing to NOW! | dev.to | 2026-02-06

    Main: https://github.com/ray-project/ray ⭐ 34k+

  18. MindsDB

    Query Engine for AI Analytics: Build self-reasoning agents across all your live data

    Project mention: MindsDB Supercharges Google's MCP Toolbox with Unstructured Data Support | dev.to | 2025-12-29

    We’re happy to announce that we’ve integrated MindsDB with Google's open-source project, MCP (Model Context Protocol) Toolbox. This will make your AI applications very, very smart. This enhancement expands the Toolbox's reach, especially for organizations grappling with lots of siloed data.

  19. qlib

    Qlib is an AI-oriented Quant investment platform that aims to use AI tech to empower Quant Research, from exploring ideas to implementing productions. Qlib supports diverse ML modeling paradigms, including supervised learning, market dynamics modeling, and RL, and is now equipped with https://github.com/microsoft/RD-Agent to automate R&D process.

    Project mention: Choosing the Right AI Model for Stock Prediction | dev.to | 2025-10-04

    After researching different AI models in Qlib (a quantitative finance platform), here's what I learned:

  20. Open-Assistant

    OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.

  21. paperless-ngx

    A community-supported supercharged document management system: scan, index and archive all your documents

    Project mention: How to Record and Retrieve Anything You've Ever Had to Look Up Twice | news.ycombinator.com | 2026-03-02

    https://github.com/paperless-ngx/paperless-ngx might be a nice rabbit hole for you, drop the files in there and it'll be OCR'ed and searchable. There's also some AI projects you can give access to paperless to achieve your use case.

  22. gym

    A toolkit for developing and comparing reinforcement learning algorithms.

  23. supervision

    We write your reusable computer vision tools. 💜

    Project mention: Show HN: Plug-and-play Python utils for any computer-vision pipeline | news.ycombinator.com | 2025-07-21
  24. spaCy

    💫 Industrial-strength Natural Language Processing (NLP) in Python

    Project mention: Parsing Nutrition Labels with AI: From Image to Structured Data | dev.to | 2026-03-02

    For more advanced food label AI, combine pattern matching with Named Entity Recognition (NER). Libraries like spaCy (Python) or compromise (JavaScript) can identify amounts, units, and nutrient names even in noisy text.

  25. dspy

    DSPy: The framework for programming—not prompting—language models

    Project mention: Gas Town's Agent Patterns, Design Bottlenecks, and Vibecoding at Scale | news.ycombinator.com | 2026-01-23

    Has anyone contrasted gas town to Stanford's DSPY (https://dspy.ai/)? They seem related, but I have trouble understanding exactly what Gas Town is and so can't myself do a comparison?

NOTE: The open-source projects on this list are ordered by number of GitHub stars. The number of mentions indicates repo mentions in the last 12 months or since we started tracking (Dec 2020).

Python Machine Learning related posts

  • Nvidia Triton Inference Server

    1 project | news.ycombinator.com | 9 Mar 2026
  • DVC: Data Version Control

    1 project | news.ycombinator.com | 1 Mar 2026
  • We Built the First AI-Native Quantum Software Framework: Say Hello to Agentic TensorCircuit-NG

    1 project | dev.to | 27 Feb 2026
  • Building a Personalized Meal Recommendation System

    3 projects | dev.to | 26 Feb 2026
  • How Taalas "prints" LLM onto a chip?

    2 projects | news.ycombinator.com | 22 Feb 2026
  • Building a Menu Scanner with OCR and AI

    1 project | dev.to | 15 Feb 2026
  • Built a Hospital Lenght Of Stay Predictor Web app

    3 projects | dev.to | 15 Feb 2026

Index

What are some of the best open-source Machine Learning projects in Python? This list will help you:

#   Project           Stars
1   transformers      157,492
2   Pytorch           98,171
3   nn                65,861
4   scikit-learn      65,330
5   Keras             63,912
6   OpenBB            62,624
7   yolov5            56,936
8   Face Recognition  56,115
9   faceswap          55,032
10  ultralytics       54,189
11  Airflow           44,511
12  streamlit         43,803
13  gradio            41,921
14  DeepSpeed         41,769
15  Ray               41,621
16  MindsDB           38,659
17  qlib              38,364
18  Open-Assistant    37,457
19  paperless-ngx     37,133
20  gym               36,944
21  supervision       36,644
22  spaCy             33,284
23  dspy              32,580
