Awesome
Jodd Lagarto
🌟 Java HTML parsers suite.
🏡 Website: lagarto.jodd.org
❤️ For developers, by developer.
🦎 Overview
Lagarto Suite is the family of HTML/XML parsers written in Java. It consists of the following libraries:
LagartoParser
is an all-purpose fast and versatile event-based HTML parser. You can use it to modify or analyze some markup content, allowing you to assemble custom complex transformations and code analysis tools quickly. It is performant and follows the rules of the official HTML specification.LagartoDom
builds a DOM tree in memory from the input. You can manipulate a tree more conveniently, with minor performance sacrifice.Jerry
is a "jQuery in Java" - you can use the familiar syntax of JavaScript library inside of Java to parse and manipulate HTML.CSSelly
- finally, the parser of CSS3 selectors.