Clean up HTML Content for Retrieval-Augmented Generation with Readability.js
Scraping web pages is one way to fetch content for your retrieval-augmented generation (RAG) application. But parsing the content from a web page can be a pain.
weeklyfoo #68 / 2025-01-19 / content