648: VALL-E: Uncannily Realistic Voice Imitation from a 3-Second Clip

Published: Jan. 27, 2023, noon

Text-to-speech gets a groundbreaking update with Microsoft's VALL-E. On this Five-Minute Friday, Jon Krohn investigates how the Microsoft team modeled their tool to replicate natural human speech using just three seconds of a person's voice.
Additional materials: www.superdatascience.com/648
Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information.