Theme · 5 episodes

Systems for ML

Definition

Systems for ML is the engineering discipline of building the substrate that machine learning runs on: distributed training, inference serving, schedulers, storage, networking, and hardware abstractions. It’s where most of the practical compute-efficiency wins of the past few years have come from.

Episodes covering this

Worth reading next

Papers we haven't done a deep dive on yet, but would recommend on this topic.