Recommended GitHub Repositories for Data Science

A selection of GitHub repositories for data science, organized into clear categories so you can navigate by need, whether you're a beginner, an intermediate learner, or looking for projects and MLOps resources.

1. Awesome Curated Lists (Your "Master Index")

These are massive resource collections covering tools, courses, books, and more.

2. Learning Roadmaps & Structured Curricula

Perfect for self-paced learning with clear paths.

3. Core Libraries (Must-Know Foundations)

Daily tools every data scientist uses.

4. Machine Learning & Deep Learning Frameworks

For building models.

5. Hands-On Projects & Portfolio Builders

Build real projects to strengthen your GitHub profile.

6. Bonus: MLOps, Production & Specialized

Quick Start Advice

  • Beginner: Start with awesome-datascience + Data-Science-For-Beginners + pandas + 100-Days-Of-ML-Code.
  • Intermediate/Advanced: Dive into PyTorch or Transformers + build 5–10 projects from the project repos.
  • Pro Tip: Star these repos, fork the ones that interest you, and contribute small improvements. Doing so helps you learn and strengthens your resume.

Made with <3 by Muaz Hazali