Skip to content

📚 content

Clean up HTML Content for Retrieval-Augmented Generation with Readability.js

Scraping web pages is one way to fetch content for your retrieval-augmented generation (RAG) application. But parsing the content from a web page can be a pain.

weeklyfoo #68 / 2025-01-19
content