Edward O'Connor from djangosd gives an overview of html5lib, a major-desktop-browser-compatible HTML parser and tokenizer for both Ruby and Python. This talk was part of the DjangoSD/SD Ruby mashup meeting.