Defuddle is an open-source library I built to extract the main content and metadata from web pages. It can also return the content as Markdown.
I built Defuddle while working on Obsidian Web Clipper[1] (also MIT-licensed) because Mozilla's Readability appears to be mostly abandoned, and didn't work well for many sites.
Defuddle is also available as a CLI:
https://github.com/kepano/defuddle-cli
[1] https://github.com/obsidianmd/obsidian-clipper
Comments URL: https://news.ycombinator.com/item?id=44067409
Points: 3
# Comments: 0
Melden Sie sich an, um einen Kommentar hinzuzufügen
Andere Beiträge in dieser Gruppe

We started CallFS after yet another late-night “why did the uploads vanish?” incident. Our small team had stitched together rsync, a fragile NFS mount, and an S3 bucket—none of it observable, all

Article URL: https://tomrenner.com/posts/llm-inevitabilism/

Article URL: https://www.ycombinator.com/companies/martin/jobs/
Article URL: https://www.nytimes.com/2025/07/08/magazine/fda-collapse-rfk-kennedy.html
Comments URL:
Article URL: https://www.amirsharif.com/protecting-my-attention-at-the-dopamine-carnival
Comments U