Awesome Python Data Science
Probably the best curated list of data science software in Python.
Contents
- Machine Learning
- Deep Learning
- Automated Machine Learning
- Natural Language Processing
- Computer Audition
- Computer Vision
- Time Series
- Reinforcement Learning
- Graph Machine Learning
- Learning-to-Rank & Recommender Systems
- Probabilistic Graphical Models
- Probabilistic Methods
- Model Explanation
- Optimization
- Genetic Programming
- Feature Engineering
- Visualization
- Data Manipulation
- Deployment
- Statistics
- Distributed Computing
- Experimentation
- Data Validation
- Evaluation
- Computations
- Web Scraping
- Spatial Analysis
- Quantum Computing
- Conversion
- Contributing
- License
Machine Learning
General Purpose Machine Learning
- SciPy - Fundamental algorithms for scientific computing in Python.
- scikit-learn - Machine learning in Python.

- PyCaret - An open-source, low-code machine learning library in Python.

- Shogun - Machine learning toolbox.
- xLearn - High-performance, easy-to-use, and scalable machine learning package.
- cuML - RAPIDS Machine Learning Library.

- modAL - Modular active learning framework for Python 3.

- Sparkit-learn - PySpark + scikit-learn = Sparkit-learn.

- mlpack - A scalable C++ machine learning library (Python bindings).
- dlib - Toolkit for making real-world machine learning and data analysis applications in C++ (Python bindings).
- MLxtend - Extension and helper modules for Python's data analysis and machine learning libraries.

- hyperlearn - Rewrite of scikit-learn and Statsmodels with GPU support, advertised as 50%+ faster with 50%+ less RAM usage.

- Reproducible Experiment Platform (REP) - Machine Learning toolbox for Humans.

- scikit-multilearn - Multi-label classification for Python.

- seqlearn - Sequence classification toolkit for Python.

- pystruct - Simple structured learning framework for Python.

