Defuddle is an open-source library I built to extract the main content and metadata from web pages. It can also return the content as Markdown.
I built Defuddle while working on Obsidian Web Clipper[1] (also MIT-licensed) because Mozilla's Readability appears to be mostly abandoned and didn't work well on many sites.
Defuddle is also available as a CLI:
https://github.com/kepano/defuddle-cli
[1] https://github.com/obsidianmd/obsidian-clipper
Comments URL: https://news.ycombinator.com/item?id=44067409
Points: 3
# Comments: 0