Episode 58: Deep Learning Meets Distributed Systems with Jim Dowling

Published: March 19, 2021, 8 a.m.

Show Notes

  • (1:56) Jim went over his education at Trinity College Dublin in the late 90s/early 2000s, where he got early exposure to academic research in distributed systems.
  • (4:26) Jim discussed his research focused on dynamic software architecture, particularly the K-Component model that enables individual components to adapt to a changing environment.
  • (5:37) Jim explained his research on collaborative reinforcement learning that enables groups of reinforcement learning agents to solve online optimization problems in dynamic systems.
  • (9:03) Jim recalled his time as a Senior Consultant for MySQL.
  • (9:52) Jim shared the initiatives at the RISE Research Institute of Sweden, in which he has been a researcher since 2007.
  • (13:16) Jim dissected his peer-to-peer systems research at RISE, including theoretical results for search algorithm and walk topology.
  • (15:30) Jim went over challenges building peer-to-peer live streaming systems at RISE, such as GradientTV and Glive.
  • (18:18) Jim provided an overview of research activities at the Division of Software and Computer Systems at the School of Electrical Engineering and Computer Science at KTH Royal Institute of Technology.
  • (19:04) Jim has taught courses on Distributed Systems and Deep Learning on Big Data at KTH Royal Institute of Technology.
  • (22:20) Jim unpacked his O’Reilly article in 2017 called “Distributed TensorFlow,” which includes the deep learning hierarchy of scale.
  • (29:47) Jim discussed the development of HopsFS, a next-generation distribution of the Hadoop Distributed File System (HDFS) that replaces its single-node in-memory metadata service with a distributed metadata service built on a NewSQL database.
  • (34:17) Jim rationalized the intention to commercialize HopsFS and built Hopsworks, an user-friendly data science platform for Hops.
  • (36:56) Jim explored the relative benefits of public research money and VC-funded money.
  • (41:48) Jim unpacked the key ideas in his post “Feature Store: The Missing Data Layer in ML Pipelines.”
  • (47:31) Jim dissected the critical design that enables the Hopsworks feature store to refactor a monolithic end-to-end ML pipeline into separate feature engineering and model training pipelines.
  • (52:49) Jim explained why data warehouses are insufficient for machine learning pipelines and why a feature store is needed instead.
  • (57:59) Jim discussed prioritizing the product roadmap for the Hopswork platform.
  • (01:00:25) Jim hinted at what’s on the 2021 roadmap for Hopswork.
  • (01:03:22) Jim recalled the challenges of getting early customers for Hopsworks.
  • (01:04:30) Jim intuited the differences and similarities between being a professor and being a founder.
  • (01:07:00) Jim discussed worrying trends in the European Tech ecosystem and the role that Logical Clocks will play in the long run.
  • (01:13:37) Closing segment.
Jim’s Contact InfoMentioned Content

Research Papers

Articles

Projects

People

Programming Books