659: Open-Source Tools for Natural Language Processing

Published: March 7, 2023, noon

NLP practitioners: this episode is for you. From the awareness of linguistic elements and annotation to getting the necessary people in the room, Vincent Warmerdam presents to Jon Krohn a recipe for a successful project and the open-source NLP tools to get there.

This episode is brought to you by epic LinkedIn Learning instructor Keith McCormick (linkedin.com/learning/instructors/keith-mccormick). Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information.

In this episode you will learn:
• How Vincent came to work with De Speld [08:57]
• Vincent’s role at Explosion [18:59]
• How users can apply spaCy [21:46]
• Prodigy: Annotate training data more efficiently with scripts [26:28]
• How to manage “skill anxiety” with Calmcode [32:32]
• How Vincent fixed bad labels [42:47]
• The value of understanding linguistics for NLP [54:42]
• How to constrain artificial stupidity [1:02:38]

Additional materials: www.superdatascience.com/659