Published: March 19, 2021, 8 a.m.
Show Notes
- (1:56) Jim went over his education at Trinity College Dublin in the late 90s/early 2000s, where he got early exposure to academic research in distributed systems.
- (4:26) Jim discussed his research focused on dynamic software architecture, particularly the K-Component model that enables individual components to adapt to a changing environment.
- (5:37) Jim explained his research on collaborative reinforcement learning that enables groups of reinforcement learning agents to solve online optimization problems in dynamic systems.
- (9:03) Jim recalled his time as a Senior Consultant for MySQL.
- (9:52) Jim shared the initiatives at the RISE Research Institute of Sweden, in which he has been a researcher since 2007.
- (13:16) Jim dissected his peer-to-peer systems research at RISE, including theoretical results for search algorithm and walk topology.
- (15:30) Jim went over challenges building peer-to-peer live streaming systems at RISE, such as GradientTV and Glive.
- (18:18) Jim provided an overview of research activities at the Division of Software and Computer Systems at the School of Electrical Engineering and Computer Science at KTH Royal Institute of Technology.
- (19:04) Jim has taught courses on Distributed Systems and Deep Learning on Big Data at KTH Royal Institute of Technology.
- (22:20) Jim unpacked his O’Reilly article in 2017 called “Distributed TensorFlow,” which includes the deep learning hierarchy of scale.
- (29:47) Jim discussed the development of HopsFS, a next-generation distribution of the Hadoop Distributed File System (HDFS) that replaces its single-node in-memory metadata service with a distributed metadata service built on a NewSQL database.
- (34:17) Jim rationalized the intention to commercialize HopsFS and built Hopsworks, an user-friendly data science platform for Hops.
- (36:56) Jim explored the relative benefits of public research money and VC-funded money.
- (41:48) Jim unpacked the key ideas in his post “Feature Store: The Missing Data Layer in ML Pipelines.”
- (47:31) Jim dissected the critical design that enables the Hopsworks feature store to refactor a monolithic end-to-end ML pipeline into separate feature engineering and model training pipelines.
- (52:49) Jim explained why data warehouses are insufficient for machine learning pipelines and why a feature store is needed instead.
- (57:59) Jim discussed prioritizing the product roadmap for the Hopswork platform.
- (01:00:25) Jim hinted at what’s on the 2021 roadmap for Hopswork.
- (01:03:22) Jim recalled the challenges of getting early customers for Hopsworks.
- (01:04:30) Jim intuited the differences and similarities between being a professor and being a founder.
- (01:07:00) Jim discussed worrying trends in the European Tech ecosystem and the role that Logical Clocks will play in the long run.
- (01:13:37) Closing segment.
Jim’s Contact Info
Mentioned Content
Research Papers
Articles
Projects
People
Programming Books