Data Provenance

Published: Sept. 4, 2017, 1:35 a.m.

Software engineers are familiar with the idea of versioning code, so you can go back later and revive a past state of the system. For data scientists who might want to reconstruct past models, though, it's not just about keeping the modeling code. It's also about saving a version of the data that made the model. There are a lot of other benefits to keeping track of datasets, so in this episode we'll talk about data lineage or data provenance.