Defuddle is an open-source library I built to extract the main content and metadata from web pages. It can also return the content as Markdown.
I built Defuddle while working on Obsidian Web Clipper[1] (also MIT-licensed) because Mozilla's Readability appears to be mostly abandoned, and didn't work well for many sites.
Defuddle is also available as a CLI:
https://github.com/kepano/defuddle-cli
[1] https://github.com/obsidianmd/obsidian-clipper
Comments URL: https://news.ycombinator.com/item?id=44067409
Points: 3
# Comments: 0
Inicia sesión para agregar comentarios
Otros mensajes en este grupo.

Article URL: https://github.com/geohot/tt-tiny
Comments URL: https://news.ycombinator
Article URL: https://self-issued.info/?p=2708
Comments URL: https://news.ycombinator.c

Article URL: http://rednafi.com/go/di_frameworks_bleh/
Comments URL: https://
What are you working on? Any new ideas that you're thinking about?
Comments URL: https://news.ycombinator.com/item?id=44090387
