8/13/2013 8:26:34 PM
Google Open Sources Gumbo: A C library For Parsing HTML5
Gumbo, HTML5, Google Open Source
App Developer Magazine

Google Open Sources Gumbo: A C library For Parsing HTML5

Richard Harris Richard Harris in HTML5 Tuesday, August 13, 2013

Hates off to Google for trying to help the HTML5 standard issue with it's announcement that Gumbo is now open source - in hopes that all browsers can follow suite and get along with HTML5 written code in the same way!

Gumbo is an implementation of the HTML5 parsing algorithm implemented as a pure C99 library with no outside dependencies. It's designed to serve as a building block for other tools and libraries such as linters, validators, templating languages, and refactoring and analysis tools.

Goals & features:
Fully conformant with the HTML5 spec.
Robust and resilient to bad input.
Simple API that can be easily wrapped by other languages.
Support for source locations and pointers back to the original text.
Relatively lightweight, with no outside dependencies.
Passes all html5lib-0.95 tests.
Tested on over 2.5 billion pages from Google's index.

Read more: http://google-opensource.blogspot.com/2013/08/gumb...