Show HN: Defuddle, an HTML-to-Markdown alternative to Readability

Status
Not open for further replies.
K

kepano

Guest
Defuddle is an open-source JS library I built to parse and extract the main content and metadata from web pages. It can also return the content as Markdown.
I built Defuddle while working on Obsidian Web Clipper[1] (also MIT-licensed) because Mozilla's Readability[2] appears to be mostly abandoned, and didn't work well for many sites.
It's still very much a work in progress, but I thought I'd share it today, in light of the announcement that Mozilla is shutting down Pocket. This library could be helpful to anyone building a read-it-later app.
Defuddle is also available as a CLI:
https://github.com/kepano/defuddle-cli
[1] https://github.com/obsidianmd/obsidian-clipper
[2] https://github.com/mozilla/readability



Comments URL: https://news.ycombinator.com/item?id=44067409

Points: 53

# Comments: 10
 
Status
Not open for further replies.
Top