595: Data Engineering 101

Published: July 26, 2022, 11 a.m.

Tune in as Joe Reis and Matt Housley, co-founders of Ternary Data and co-authors of the book “Fundamentals of Data Engineering,” join Jon Krohn to discuss major undercurrents across the data engineering lifecycle and their top tools and techniques.

In this episode you will learn:
• What is data engineering? [3:55]
• Why Joe and Matt identify as “recovering data scientists” [6:12]
• What kinds of people tend to become data scientists vs. data engineers? [10:38]
• Key components of Joe and Matt’s book [26:31]
• Major undercurrents across the data engineering lifecycle [28:26]
• The most under-utilized tool in a data engineer’s toolbox [34:39]
• How every data pipeline involves latency tradeoffs, though faster is typically the default assumption [38:55]
• Joe and Matt’s favorite data engineering tools and techniques [43:39]

Additional materials: www.superdatascience.com/595