Home

Awesome

pyMicrodata

A Microdata distiller/parser library

Note: since I retired a few months ago I do not really maintain this package any more. I would be more than happy if an interested party was interested to take over. In the meantime, I have "archived" the repository to clearly signal that there is no maintenance. I would be happy to unarchive it and transfer ownership if someone is interested.
@iherman

DOI

Microdata is a specification that defines new HTML attributes to embed simple machine-readable data in HTML documents. This a module to extract Microdata RDF from an HTML5 page. It implements the algorithm defined and published by the W3C Semantic Web Interest Group task force, in March 2012. The module can be used to produce serialized versions of the extracted graph, or simply an RDFLib Graph Object.

The distribution contains:

The package primarily depends on:

The package has been tested on Python 3.5+, in line with RDFlib.

For the details on the conversion of Microdata to RDF, see:

https://www.w3.org/TR/microdata-rdf/

Release notes (starting from version 1.0)