Extracts readable article content from HTML using Mozilla Readability.
This function uses the Readability algorithm to extract the main article content from a web page, removing navigation, ads, and other clutter.
Features:
Parameter: the raw HTML content to process.
Returns: an Article object with the extracted content and metadata.
Throws: an error when the article cannot be parsed or lacks sufficient content.
Example:
const html = await fetchHTML('https://example.com/article');
const article = makeReadable(html);
console.log(article.title); // Article title
console.log(article.textContent); // Plain text content
console.log(article.content); // HTML content
console.log(article.byline); // Author information
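Because makeReadable throws when a page cannot be parsed or lacks sufficient content, callers that process many pages may prefer a result value over an exception. The following is a minimal sketch of one way to do that, not part of the documented API: the `Article` fields are only those read in the usage example, and `Extractor`, `ExtractResult`, `tryExtract`, and `demoExtract` are hypothetical names introduced for illustration. The extractor is passed in as a parameter, so the sketch runs on its own; in real code you would pass `makeReadable`, assuming it has the signature implied by the example.

```typescript
// Fields read in the usage example; the real Article type may carry more
// metadata, and the nullability of byline is an assumption.
interface Article {
  title: string;
  textContent: string;
  content: string;
  byline: string | null;
}

// Assumed shape of makeReadable: raw HTML in, Article out, throws on failure.
type Extractor = (html: string) => Article;

// Hypothetical result type: either the article or the reason extraction failed.
type ExtractResult =
  | { status: 'ok'; article: Article }
  | { status: 'error'; reason: string };

// Hypothetical wrapper that converts a thrown error into a result value.
function tryExtract(extract: Extractor, html: string): ExtractResult {
  try {
    return { status: 'ok', article: extract(html) };
  } catch (err) {
    const reason = err instanceof Error ? err.message : String(err);
    return { status: 'error', reason };
  }
}

// Stand-in extractor used only to make the sketch runnable; it is NOT the
// Readability algorithm. It rejects empty input and returns fixed values.
const demoExtract: Extractor = (html) => {
  if (html.trim() === '') {
    throw new Error('insufficient content');
  }
  return { title: 'Demo', textContent: 'Body', content: '<p>Body</p>', byline: null };
};

const outcome = tryExtract(demoExtract, '<article><p>Body</p></article>');
if (outcome.status === 'ok') {
  console.log(outcome.article.title);
} else {
  console.log(`Skipped page: ${outcome.reason}`);
}
```

A string discriminant (`status`) is used so that the success and failure branches narrow cleanly under any compiler strictness setting.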