46. Ihab Ilyas - Data cleaning is finally being automated

Published: Aug. 12, 2020, 2:06 p.m.

b'

It\\u2019s clich\\xe9 to say that data cleaning accounts for 80% of a data scientist\\u2019s job, but it\\u2019s directionally true.

\\n

That\\u2019s too bad, because fun things like data exploration, visualization and modelling are the reason most people get into data science. So it\\u2019s a good thing that there\\u2019s a major push underway in industry to automate data cleaning as much as possible.

\\n

One of the leaders of that effort is Ihab Ilyas, a professor at the University of Waterloo and founder of two companies, Tamr and Inductiv, both of which are focused on the early stages of the data science lifecycle: data cleaning and data integration. Ihab knows an awful lot about data cleaning and data engineering, and has some really great insights to share about the future direction of the space\\u200a\\u2014\\u200aincluding what work is left for data scientists, once you automate away data cleaning.

'