Designing Machine Learning Systems - Chip Huyen

Designing Machine Learning Systems

Chip Huyen

出版社

O’Reilly

出版时间

2022-06-28

ISBN

9781098107963

评分

★★★★★
书籍介绍

Machine learning systems are both complex and unique. They are complex because they consist of many different components and involve many different stakeholders. They are unique because they are data-dependent, and data varies wildly from one use case to the next.

This book takes a holistic approach to designing machine learning systems that are reliable, scalable, maintainable, and adaptive to changing environments and business requirements. It considers each design decision — e.g. how to create training data, what features to include, how to deploy, what to monitor, how often to retrain your model — in the context of how it can help the system as a whole achieve its objectives. The iterative framework laid out in this book is illustrated using actual case studies and backed by ample references.

Examples of the scenarios that this book will be able to help you tackle.

You have been given a business problem and a lot of raw data. You want to engineer this data and choose the right metrics to solve this problem.

Your initial models perform well in offline experiments and you want to deploy them.

You have little feedback on how your models are performing after your models are deployed, and you want to figure out a way to quickly detect, debug, and address any issue your models might run into in production.

The process of developing, evaluating, deploying, and updating models for your team has been mostly manual, slow, and error-prone. You want to automate and improve this process.

Each machine learning use case in your organization has been deployed using its own workflow, and you want to lay down the foundation (e.g. model store, feature store, monitoring tools) that can be shared and reused across use cases.

You're worried that there might be biases in your machine learning systems and you want to make your systems responsible!

Read less

Chip Huyen (https://huyenchip.com) is a co-founder of Claypot AI, a platform for real-time machine learning. Through her work at NVIDIA, Netflix, and Snorkel AI, she has helped some of the world's largest organizations develop and deploy machine learning systems. She teaches CS 329S: Machine Learning Systems Design at Stanford, whose lecture notes this book is based on.

LinkedI...

(展开全部)

用户评论
作为一个工程师而并非ML从业者,我觉得本书写得非常好,深入浅出,既有high level big picture ,又有具体算法实例,读后受益良多。本书堪比那本在后端工程师中人手一本的Designing Data-Intensive Applications
本书覆盖了machine learning system design的各个层面,论述非常全面,而且作者讲述的非常通俗易懂,并且作者总结了她在设计机器学习系统中的各个经验教训,个人感觉比较实用。作为MLE,本书涉及的内容,从采集数据到训练模型,分析模型都是我们日常生活中会遇到的问题。个人觉得如果附上一个案例分析会更实用。
20220705-20220821 作者是很会写作和教授,非常inspiring的一个人。涵盖了ML系统的各个方面,很全面。我是把这本书当作我了解ML整个领域的综述来读的,没有太多ML算法背景知识也能很好的读完而对每个步骤有个比较清晰的认识,算是一本enlighten了我“哦原来ML里面不止那些算法还有这么多别的有意思的问题亟待解决”的书。本书的主旨似乎就是说服大家ML不光是一个data问题,也同样是一个很难的infra问题。很享受读第1,6-10章。 想读日期20220617
书很有启发性,chip huyen分享了很多实践应用中的经验。自己的知识边界又被拓宽啦,发现了自己的盲区,很开心,可以顺着正确方向学。想看视频版本讲解可以搜CS329S, 她的课程涵盖了书本的大部分内容。
曾经邀请chip来做consulting,这本书写的很好,讲了技术又没讲。 把topic走了一遍,给了link,真的需要可以自己去深入研究。就是chip他们好像不赚钱😅
科普好读物
最近在做一些data dumping/loading的活,读这本书的目的主要是refresh ML knowledge和从infra的角度看ML的需求。well 大部分章节对我来说读读目录和summary就好,但少了一些“踩坑”,少了一些“practice”,不过算是了解了一下ML stack。
写作语言比较平实易懂,内容比较浅显。
作者写得还是挺简单易懂的,但是整本书只是走马观花地把machine learning system介绍一遍。当然这其中也有这个field才刚刚开始发展的原因,各个技术细节虽有提及,但基本都是一笔掠过。面试前读读复习一下挺好的。
收藏