Defuddle is an open-source library I built to extract the main content and metadata from web pages. It can also return the content as Markdown.
I built Defuddle while working on Obsidian Web Clipper[1] (also MIT-licensed) because Mozilla's Readability appears to be mostly abandoned, and didn't work well for many sites.
Defuddle is also available as a CLI:
https://github.com/kepano/defuddle-cli
[1] https://github.com/obsidianmd/obsidian-clipper
Comments URL: https://news.ycombinator.com/item?id=44067409
Points: 3
# Comments: 0
Autentifică-te pentru a adăuga comentarii
Alte posturi din acest grup

Article URL: https://jacobian.org/2025/jun/3/changing-directions/
Article URL: https://austinhenley.com/blog/coord2state.html
Article URL: https://www.durham.ac.uk/department
Article URL: https://www.pnas.org/doi/10.1073/pnas.2416433122

Article URL: https://www.theregister.com/2025/06/03/meta_pauses_android_tracking_tech/
Comments URL: