Top 23 Python Machine Learning Projects
-
transformers
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
Project mention: Mastering AI Language Models: From NLP Foundations to 2025 Innovations | dev.to | 2026-03-07
Ready to build your own language AI? Explore Hugging Face's Transformers library and test your skills with our interactive coding challenges at AIAcademy.tech!
-
> A running number also carries data. Before you know it, someone's relying on the ordering or counting on there not being gaps - or counting the gaps to figure out something they shouldn't.
For example, if https://github.com/pytorch/pytorch/issues/111111 can be seen but https://github.com/pytorch/pytorch/issues/111110 can't, someone might infer the existence of a hidden issue relating to a critical security problem.
Whereas if the URL was instead https://github.com/pytorch/pytorch/issues/761500e0-0070-4c0d... that risk would be avoided.
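The inference described in the quote can be sketched in a few lines of Python. This is only an illustration: `missing_ids` is a hypothetical helper, and the visible issue number 111109 is an assumption added so that a gap exists. It shows that gaps in a running number are trivially enumerable, while random UUIDs from the standard `uuid` module carry no ordering to exploit.

```python
import uuid


def missing_ids(visible_ids):
    """Return the IDs inside the visible range that an outsider cannot see."""
    visible = set(visible_ids)
    return [i for i in range(min(visible), max(visible) + 1) if i not in visible]


# Sequential IDs: seeing 111109 and 111111 reveals that 111110 must exist.
# (111109 is an assumed value for this sketch.)
visible_issue_numbers = [111109, 111111]
inferred_hidden = missing_ids(visible_issue_numbers)

# Random version-4 UUIDs: there is no ordering, so there are no gaps to count.
random_ids = [uuid.uuid4() for _ in range(3)]

if __name__ == "__main__":
    print("Inferred hidden issues:", inferred_hidden)
    print("Random identifiers:", [str(u) for u in random_ids])
```

The trade-off is readability: a UUID is harder to quote or type than a short number, which is presumably why issue trackers keep sequential IDs.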
-
nn
🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans (cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
-
In practice, you’ll want to use libraries (like scikit-learn or TensorFlow.js for more advanced modeling), but the principle remains: find what similar users enjoy, and use that as a basis for recommendations.
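The principle of finding what similar users enjoy can be sketched without any library. The following is a minimal user-based collaborative filtering example, not the method of the quoted post: the `ratings` data, the user names, and the `recommend` helper are all hypothetical. Similarity is cosine similarity over each user's rating vector, and unseen items are ranked by a similarity-weighted average rating.

```python
from math import sqrt

# Hypothetical ratings (user -> item -> score from 1 to 5).
ratings = {
    "alice": {"pasta": 5, "salad": 3, "curry": 4},
    "bob": {"pasta": 4, "salad": 2, "curry": 5, "tacos": 5},
    "carol": {"salad": 5, "soup": 4},
}


def cosine_similarity(a, b):
    """Cosine similarity between two sparse rating dicts."""
    shared = set(a) & set(b)
    if not shared:
        return 0.0
    dot = sum(a[item] * b[item] for item in shared)
    norm_a = sqrt(sum(v * v for v in a.values()))
    norm_b = sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b)


def recommend(user, ratings, top_n=3):
    """Rank items the user has not rated by similarity-weighted average rating."""
    weighted_sum, weight_total = {}, {}
    for other, other_ratings in ratings.items():
        if other == user:
            continue
        sim = cosine_similarity(ratings[user], other_ratings)
        if sim <= 0:
            continue  # ignore users with nothing in common
        for item, score in other_ratings.items():
            if item in ratings[user]:
                continue  # only suggest unseen items
            weighted_sum[item] = weighted_sum.get(item, 0.0) + sim * score
            weight_total[item] = weight_total.get(item, 0.0) + sim
    ranked = sorted(
        ((weighted_sum[i] / weight_total[i], i) for i in weighted_sum),
        reverse=True,
    )
    return [item for _, item in ranked[:top_n]]


if __name__ == "__main__":
    print(recommend("alice", ratings))
```

A library such as scikit-learn would replace the hand-written similarity loop with vectorized nearest-neighbour search, which matters once the rating matrix grows beyond a toy size.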
-
Keras 3 multi-backend
-
How would you rate OpenBB [0]? It’s touted as a Bloomberg Terminal alternative and it has most certainly been included in training for all SOTA models.
0. https://github.com/OpenBB-finance/OpenBB
-
Project mention: Teaching AI to Read Emotions: Science, Challenges, and Innovation Behind Facial Emotion Detection with YOLOv11 on Raspberry Pi | dev.to | 2025-11-23
Ultralytics YOLO Documentation
-
Project mention: Show HN: Real-time privacy protection for smart glasses | news.ycombinator.com | 2025-08-11
Did you look at egoblur? It's a lot more effective at face detection than https://github.com/ageitgey/face_recognition. Granted, you'd have to do your own face matching to handle exceptions.
-
Project mention: Why DETRs are replacing YOLOs for real-time object detection | news.ycombinator.com | 2025-11-22
> The YOLO series is developed and maintained by Ultralytics. All YOLO code and weights are released under the AGPL-3.0 license.
The original author of YOLO and the Darknet framework [1] issued the code under pretty much every license you wish to use [2]. My preferred fork by AlexeyAB is under an equally permissive license [3].
Ultralytics then created their own model under the AGPL-3.0 license [4], which probably would never stand up in a court as they have the model from the likes of YOLOv3 in their source [5].
This entire article is flawed anyway, because they don't state which YOLOv11 model they are using or compare the accuracy. They appear to have just taken the pre-trained models and assumed it's apples-to-apples. They could have at least compared YOLO11n/s/m/l/x,
[1] https://pjreddie.com/darknet/yolo/
[2] https://github.com/pjreddie/darknet
[3] https://github.com/AlexeyAB/darknet
[4] https://github.com/ultralytics/ultralytics
[5] https://github.com/ultralytics/ultralytics/tree/main/ultraly...
-
Project mention: Top Open-Source Data Engineering Tools- Unravelling the Best in 2026 | dev.to | 2025-12-10
Airflow
-
GitHub Link: https://github.com/streamlit/streamlit
-
DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Project mention: AWS SageMaker HyperPod: Distributed Training for Foundation Models at Scale | dev.to | 2026-01-16
DeepSpeed Optimization Library - An open-source library compatible with HyperPod that offers advanced pipeline and system optimizations for LLM training.
-
Ray
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
Project mention: GSoC 2026 Predictions: 30 NEW AI/ML/Security Organizations You Should Start Contributing to NOW! | dev.to | 2026-02-06
Main: https://github.com/ray-project/ray ⭐ 34k+
-
Project mention: MindsDB Supercharges Google's MCP Toolbox with Unstructured Data Support | dev.to | 2025-12-29
We’re happy to announce that we’ve integrated MindsDB with Google's open-source project, MCP (Model Context Protocol) Toolbox. This will make your AI applications very, very smart. This enhancement expands the Toolbox's reach, especially for organizations grappling with lots of siloed data.
-
qlib
Qlib is an AI-oriented Quant investment platform that aims to use AI tech to empower Quant Research, from exploring ideas to implementing productions. Qlib supports diverse ML modeling paradigms, including supervised learning, market dynamics modeling, and RL, and is now equipped with https://github.com/microsoft/RD-Agent to automate R&D process.
After researching different AI models in Qlib (a quantitative finance platform), here's what I learned:
-
Open-Assistant
OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.
-
paperless-ngx
A community-supported supercharged document management system: scan, index and archive all your documents
Project mention: How to Record and Retrieve Anything You've Ever Had to Look Up Twice | news.ycombinator.com | 2026-03-02
https://github.com/paperless-ngx/paperless-ngx might be a nice rabbit hole for you, drop the files in there and it'll be OCR'ed and searchable. There's also some AI projects you can give access to paperless to achieve your use case.
-
Project mention: Show HN: Plug-and-play Python utils for any computer-vision pipeline | news.ycombinator.com | 2025-07-21
-
Project mention: Parsing Nutrition Labels with AI: From Image to Structured Data | dev.to | 2026-03-02
For more advanced food label AI, combine pattern matching with Named Entity Recognition (NER). Libraries like spaCy (Python) or compromise (JavaScript) can identify amounts, units, and nutrient names even in noisy text.
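The pattern-matching half of that advice can be sketched with the standard `re` module alone. The regular expression, the unit list, and the `parse_label` helper below are assumptions made for illustration, and the NER step (for example with spaCy) is not shown. The sketch pulls a nutrient name, an amount, and a unit out of each line of OCR text.

```python
import re

# Assumed line shape: "<name> [: or -] <amount> <unit>", e.g. "Sodium: 160 mg".
NUTRIENT_PATTERN = re.compile(
    r"(?P<name>[A-Za-z][A-Za-z ]*?)\s*[:\-]?\s*"
    r"(?P<amount>\d+(?:\.\d+)?)\s*"
    r"(?P<unit>mg|mcg|g|kcal)\b",
    re.IGNORECASE,
)


def parse_label(text):
    """Return one dict per line that looks like a nutrient entry."""
    entries = []
    for line in text.splitlines():
        match = NUTRIENT_PATTERN.search(line.strip())
        if match is None:
            continue  # skip lines without an amount and a known unit
        entries.append(
            {
                "name": match.group("name").strip(),
                "amount": float(match.group("amount")),
                "unit": match.group("unit").lower(),
            }
        )
    return entries


if __name__ == "__main__":
    sample = "Nutrition Facts\nTotal Fat 8g\nSodium: 160 mg\nProtein 3.5 g"
    for entry in parse_label(sample):
        print(entry)
```

A rule like this breaks down on OCR errors such as "8q" for "8g", which is where a trained entity recognizer would be expected to help.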
-
Project mention: Gas Town's Agent Patterns, Design Bottlenecks, and Vibecoding at Scale | news.ycombinator.com | 2026-01-23
Has anyone contrasted Gas Town with Stanford's DSPy (https://dspy.ai/)? They seem related, but I have trouble understanding exactly what Gas Town is, so I can't do a comparison myself.
Python Machine Learning discussion
Python Machine Learning related posts
-
Nvidia Triton Inference Server
-
DVC: Data Version Control
-
We Built the First AI-Native Quantum Software Framework: Say Hello to Agentic TensorCircuit-NG
-
Building a Personalized Meal Recommendation System
-
How Taalas "prints" LLM onto a chip?
-
Building a Menu Scanner with OCR and AI
-
Built a Hospital Length of Stay Predictor Web App
-
A note from our sponsor - SaaSHub
www.saashub.com | 11 Mar 2026
Index
What are some of the best open-source Machine Learning projects in Python? This list will help you:
| # | Project | Stars |
|---|---|---|
| 1 | transformers | 157,492 |
| 2 | PyTorch | 98,171 |
| 3 | nn | 65,861 |
| 4 | scikit-learn | 65,330 |
| 5 | Keras | 63,912 |
| 6 | OpenBB | 62,624 |
| 7 | yolov5 | 56,936 |
| 8 | Face Recognition | 56,115 |
| 9 | faceswap | 55,032 |
| 10 | ultralytics | 54,189 |
| 11 | Airflow | 44,511 |
| 12 | streamlit | 43,803 |
| 13 | gradio | 41,921 |
| 14 | DeepSpeed | 41,769 |
| 15 | Ray | 41,621 |
| 16 | MindsDB | 38,659 |
| 17 | qlib | 38,364 |
| 18 | Open-Assistant | 37,457 |
| 19 | paperless-ngx | 37,133 |
| 20 | gym | 36,944 |
| 21 | supervision | 36,644 |
| 22 | spaCy | 33,284 |
| 23 | dspy | 32,580 |