python-extruct 0.18.0
Propagated dependencies: python-html-text@0.7.0 python-jstyleson@0.0.2 python-lxml@6.0.1 python-mf2py@2.0.1 python-pyrdfa3@3.6.2 python-rdflib@7.1.1 python-w3lib@2.3.1
Channel: guix
Home page: https://github.com/scrapinghub/extruct
Licenses: Modified BSD
Synopsis: Extract embedded metadata from HTML markup
Description:
extruct is a Python library for extracting embedded metadata from HTML markup. Currently, extruct supports:
W3C's HTML Microdata
embedded JSON-LD
Microformat via mf2py
Facebook's Open Graph
(experimental) RDFa via rdflib
Dublin Core Metadata (DC-HTML-2003)
Total results: 1