Episode 070: Processing Real-world HTML

Published: Dec. 14, 2009, 5:01 p.m.

Edward O'Connor from DjangoSD gives an overview of html5lib, an HTML parser and tokenizer for both Ruby and Python that is compatible with the major desktop browsers. This talk was part of the DjangoSD/SD Ruby mashup meeting.