ml-systems

Star

Here are 18 public repositories matching this topic...

mosaicml / composer

Star

Supercharge Your Model Training

machine-learning deep-learning neural-network pytorch neural-networks ml-training ml-systems ml-efficiency

Updated Nov 12, 2025
Python

mryab / efficient-dl-systems

Star

Efficient Deep Learning Systems course materials (HSE, YSDA)

machine-learning deep-learning cuda pytorch performance-optimization distributed-training ml-infrastructure mlops inference-optimization efficient-deep-learning ml-systems

Updated Jan 25, 2026
Jupyter Notebook

gagan-iitb / ComputerSysDesign

Star

Designing IT and ML Applications using Systems Thinking Approach at IIT Bhilai (CS559)

design fault-tolerance scalability systems rate-limiter system-design ml-ops system-design-interview ml-systems

Updated May 5, 2024

anastasiamkh / engineering-machine-learning-systems

Star

Structured notes on designing scalable and fault-tolerant ML systems, to refresh your knowledge and help you prepare for a system design interview. Covers system design, MLOps, and case studies.

Updated Jan 18, 2025

711nishtha / financial-fraud-detection-app

Star

Experimental web application demonstrating how an offline-trained financial fraud detection model can be exposed through a web interface. Built with Flask and a pre-trained XGBoost model to showcase ML inference flow, feature engineering, and result communication — not a production fraud prevention system.

flask web-application fraud-detection machine-learning-demo applied-ml ml-systems model-inference

Updated Jan 24, 2026
HTML

karun2328 / inference_pipeline

Star

Benchmarking and optimizing transformer inference across PyTorch, ONNXRuntime, and TensorRT with latency/throughput analysis on GPU and CPU.

gpu inference pytorch quantization tensorrt onnxruntime ml-systems

Updated Jan 16, 2026
Python

fractal360 / risk-gate-api

Star

Deterministic decision gate for AI/ML systems. Risk-Gate enforces strict, schema-driven admissibility boundaries between AI/LLM intent and real system actions. It provides a fixed, human-owned decision structure with deterministic allow/block outcomes, explicit audit logging, and environment-specific policy via configuration — no ML, no heuristics,

terraform aws-ecs system-architecture policy-enforcement fastapi applied-ml ml-systems ai-governance llm-safety deterministic-systems auditability decision-gate

Updated Jan 14, 2026
Python

yksanjo / cs249r_book

Star

Introduction to Machine Learning Systems - Educational materials for ML systems architecture, deployment, and production considerations.

open-source education machine-learning book ml-systems

Updated Nov 16, 2025
JavaScript

nikita74939 / Eksperimen_SML_Nikita

Star

An automated preprocessing pipeline for Telco Customer Churn data, including cleaning, feature engineering, and CI with GitHub Actions.

machine-learning data-preprocessing data-pipeline github-actions ml-systems telco-churn

Updated Dec 20, 2025
Jupyter Notebook

dahlp94 / feed-ranking-engine

Star

End-to-end personalized feed ranking system demonstrating retrieval → ranking pipelines, offline evaluation, realistic simulation, and business-aligned diagnostics inspired by large-scale social platforms.

machine-learning recommendation-system learning-to-rank ml-systems feed-ranking

Updated Dec 31, 2025
Python

alfonsocruzvelasco / learning-notes

Star

Public engineering notes (ML systems, CV, MIT courses). Notes-only; sources linked.

computer-science mit algorithms data-structures learning-notes ml-systems

Updated Jan 18, 2026

Krishna13k / Fraud-Anomaly-System

Star

End-to-end fraud anomaly detection system using FastAPI, Isolation Forest, Streamlit, Docker, and a CI/CD pipeline.

docker machine-learning backend pipeline ci-cd fraud-detection anomaly-detection fastapi streamlit ml-systems

Updated Jan 29, 2026
Python

Thiyaga1586 / PneumoAI

Star

Production-style ML inference system for Pneumonia detection from chest X-rays, featuring custom CNN architectures, versioned model serving, preprocessing parity, observability, drift detection, and rollback using FastAPI and Docker.

docker machine-learning computer-vision deep-learning pytorch medical-imaging model-serving chest-xrays mlops pneumonia-detection fastapi ml-systems

Updated Jan 8, 2026
Python

Arnav-Ajay / rag-failure-modes

Star

Failure-first analysis of retrieval-augmented and agentic systems, focused on isolating and attributing failures across retrieval, planning, execution, memory, and policy layers.

evaluation-framework rag failure-analysis agent-systems ai-observability ml-systems retrieval-augmented-generation system-debug agent-architecture llm-systems

Updated Jan 28, 2026
Python

rukmini-17 / mini-autodiff

Star

A lightweight, reverse-mode Automatic Differentiation (AD) engine built from scratch using Python and NumPy. Supports dynamic computational graphs and complex linear algebra operations.

python machine-learning calculus numpy automatic-differentiation mathematics autograd backpropagation scratch-implementation differentiable-programming ml-systems computational-graph reverse-mode-ad

Updated Dec 24, 2025
Jupyter Notebook

fortyfive-labs / ml-dash

Star

Scalable Training Telemetry and Metrics Visualization

visualization machine-learning robot-learning ml-systems rlhf physical-ai

Updated Jan 27, 2026
Python

mansoor-mamnoon / openlake

Star

analytics data-engineering databricks delta-lake lakehouse ml-systems

Updated Jan 19, 2026
TypeScript

Improve this page

Add a description, image, and links to the ml-systems topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the ml-systems topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ml-systems

Here are 18 public repositories matching this topic...

mosaicml / composer

mryab / efficient-dl-systems

gagan-iitb / ComputerSysDesign

anastasiamkh / engineering-machine-learning-systems

4F71 / 4F71

711nishtha / financial-fraud-detection-app

karun2328 / inference_pipeline

fractal360 / risk-gate-api

yksanjo / cs249r_book

nikita74939 / Eksperimen_SML_Nikita

dahlp94 / feed-ranking-engine

alfonsocruzvelasco / learning-notes

Krishna13k / Fraud-Anomaly-System

Thiyaga1586 / PneumoAI

Arnav-Ajay / rag-failure-modes

rukmini-17 / mini-autodiff

fortyfive-labs / ml-dash

mansoor-mamnoon / openlake

Improve this page

Add this topic to your repo