One of the tools I have been using daily is a web clipper that captures not just the current page, but can automatically extract key information from it. You can also do a quick lookup of your existing notes regardless which web page you are on.
Prior to this, I had been using web clipper extensions by Evernote, OneNote, and Notion, and all of them had something missing that would significantly slow me down. Wanted to share what I have built to address this. The code is integrated with the [Rumin](https://getrumin.com) backend (the other tools I built), but you can easily swap out the API calls to point to local storage or some other endpoint.
Check it out. Would love to hear feedback from the community :)
Your extension, of course, Wildcard (https://news.ycombinator.com/item?id=22439141), youtube-dl, and possibly many other could benefit from it.
Community maintained information scrapers/extractors is definitely a direction I want to build towards, collaborating with any existing efforts. Though the exact form will take some iterations (e.g. a marketplace for scripts/"recipes", built-in scripts for common sites, allowing individual users to save their own scrapers etc.)
Years ago, I spent a couple of months building a simple EverNote clone in Clojure. The weakest part of my “for my own use only” project was a FireFox extension I wrote to capture selected web page data and send it to the backend of my system.
This Web Clipper project would have really helped me. I hope the author of this gets the satisfaction of wide adoption in many cool projects.
I was hoping by sharing it I can get a better sense of what sites other people would like to have supported, and keep adding to it :)
in the meantime, it should be easy to swap out the API hostname to something else (or even local storage)
EDIT: I've added an MIT license to the code. Thanks again for pointing out
I just tried while taking a class and I believe I need your service account to use this extension.
sorry the code is a bit of a mess, I'm working on cleaning up the entire code base and documenting things better
Looks like there's a new version WebScrapBook  based on that old code base which is now available.
I'm a active notion web cliper user.
I trust Notion because they have plenty o money to not lose my data.
Why should i use rumin ?
Quick answer for "Why should I use Rumin?" is: "Perhaps you shouldn't yet, but let's stay in touch and I'd love to hear about your use cases and other ideas."
The current version of Rumin is very rough, and there's an overwhelming list of improvements to make. This is one reason why I closed it for sign ups for now. But in the meantime, I feel there's a lot the community can do even with just the web capture component being open source.
Regarding your concern about data loss. I intend to open source more and more parts of the platform, and somehow figure out a model to make the development sustainable.
Thanks again for checking it out!