Defuddle is an open-source library I built to extract the main content and metadata from web pages. It can also return the content as Markdown.
I built Defuddle while working on Obsidian Web Clipper[1] (also MIT-licensed) because Mozilla's Readability appears to be mostly abandoned, and didn't work well for many sites.
Defuddle is also available as a CLI:
https://github.com/kepano/defuddle-cli
[1] https://github.com/obsidianmd/obsidian-clipper
Comments URL: https://news.ycombinator.com/item?id=44067409
Points: 3
# Comments: 0
Войдите, чтобы добавить комментарий
Другие сообщения в этой группе
Article URL: https://xtool.sh/tutorials/xtool/first-app/
Comments URL: http
Did you know that VSCode extensions run with full access to your system—including file system, network, and credentials? Worse, dozens of malicious extensions have already made it into the marketp

Article URL: https://www.barchart.com/story/news/33003

Article URL: https://github.com/microsoft/edit
Comments URL: https://news.ycombinator