Content extraction and URL normalization module.
This module provides utilities for extracting readable article content from HTML using Mozilla's Readability algorithm, and for converting relative URLs to absolute URLs. It handles the DOM parsing and manipulation needed to prepare content for Markdown conversion.
The main function, makeReadable, combines Mozilla Readability with linkedom (a lightweight DOM implementation) to extract clean article content from messy HTML.
Example
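A minimal sketch of the relative-to-absolute URL conversion described above, using only the standard WHATWG `URL` class. The helper name `toAbsoluteUrl`, its fallback behavior, and the sample URLs are illustrative assumptions, not this module's actual API.

```typescript
// Hypothetical helper (not part of this module's documented API):
// resolve a possibly-relative URL against the page's base URL.
// If either argument cannot be parsed, the original href is returned unchanged.
function toAbsoluteUrl(href: string, baseUrl: string): string {
  try {
    // The two-argument URL constructor resolves `href` relative to `baseUrl`.
    return new URL(href, baseUrl).href;
  } catch {
    return href;
  }
}

// Assumed base URL of the page the HTML was fetched from.
const base = "https://example.com/blog/post.html";

const parentRelative = toAbsoluteUrl("../images/cat.png", base);
const rootRelative = toAbsoluteUrl("/about", base);
const alreadyAbsolute = toAbsoluteUrl("https://other.org/x", base);
const fragmentOnly = toAbsoluteUrl("#section", base);
const unparseableBase = toAbsoluteUrl("foo", "not a url");

console.log(parentRelative); // https://example.com/images/cat.png
console.log(rootRelative);   // https://example.com/about
```

Returning the input unchanged on a parse failure is one possible design choice: it keeps a malformed link from aborting the rest of the conversion, at the cost of leaving that link unresolved.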