python-extruct 0.16.0
Propagated dependencies: python-html-text@0.5.2 python-jstyleson@0.0.2 python-lxml@4.9.1 python-mf2py@1.1.2 python-pyrdfa3@3.6.2 python-rdflib@7.1.1 python-w3lib@2.1.2
Channel: guix
Home page: https://github.com/scrapinghub/extruct
Licenses: Modified BSD
Synopsis: Extract embedded metadata from HTML markup
Description:
extruct
is a Python library for extracting embedded metadata from HTML markup. Currently, extruct supports:
W3C's HTML Microdata
embedded JSON-LD
Microformat via mf2py
Facebook's Open Graph
(experimental) RDFa via rdflib
Dublin Core Metadata (DC-HTML-2003)
Total results: 1