Recording of the @MLCollective x @The_Full_Stack Interest Group on ML in Production's second meeting, on 2022-03-19. We're continuing our reading group on @ChipHuyen's book, "Designing ML Systems", with chapter 2, on Data Engineering Fundamentals.
For more on this group, including how to join and what events we're planning next, check out our home page: mlip.shor.tn/notion
00:00 - Overview of the Chapter
12:29 - Data Vocabulary: ETL, Data Lakes, Data Warehouses, and More
29:41 - Preserving Data and Continual Learning
34:57 - Data Formats and GPU Throughput
40:13 - Outro