MLOps & LLM Engineering Resources

Papers, repos, courses, and communities for ML platform teams, LLMOps practitioners, and engineers running models in production.

Papers

MLflow
Open-source platform for managing the ML lifecycle including experiment tracking, model registry, and deployment.

#experiment-tracking
Weights & Biases
ML experiment tracking, model versioning, and collaboration platform.

#experiment-tracking
Evidently AI
Open-source ML monitoring library for data drift, model performance, and data quality.

#monitoring
Arize AI
ML observability platform for detecting issues in production models.

#monitoring
Fiddler AI
Enterprise ML monitoring with explainability and bias detection.

#monitoring
WhyLabs
AI observability platform powered by open-source whylogs.

#monitoring
Ray
Distributed compute framework for scaling ML workloads.

#distributed-compute
vLLM
High-throughput and memory-efficient LLM inference engine.

#inference
LangSmith
Observability, testing, and evaluation platform for LLM applications.

#llmops
Prefect
Workflow orchestration platform for data and ML pipelines.

#orchestration

Eugene Yan's Newsletter
Practical ML engineering, system design, and applied research insights.

#applied-ml
MLOps.community Newsletter
Weekly roundup of MLOps news, tools, and discussions.

#mlops
The Batch (Andrew Ng)
Weekly digest of AI research and industry news.

#ml-news
Chip Huyen's Blog
Deep dives into ML systems design, production ML, and AI engineering.

#ml-systems
Lilian Weng's Blog
Technical deep-dives on ML research from OpenAI.

#research

Stanford CS329S: Machine Learning Systems Design
Full course on building and deploying scalable ML systems.

#course
MLOps Zoomcamp (DataTalks.Club)
Free practical MLOps course covering monitoring, CI/CD, and deployment.

#course
LLM Bootcamp (Full Stack Deep Learning)
End-to-end LLM application development and deployment.

#llmops

MLOps.community Slack
6k+ practitioners discussing production ML, tooling, and case studies.

#slack
Weights & Biases Discord
Active community around experiment tracking and ML best practices.

#discord
Hugging Face Forums
Transformers, datasets, model hub, and deployment discussions.

#forum
r/MachineLearning
Large community covering ML research, news, and engineering topics.

#reddit