Personally I’ve wrapped it up in some shell and python scripts, and it does the job just fine.
Abbyy is in my opinion by far the most powerfull OCR software. Linux command line OCR is available but unfortunately too pricey for the normal private user:
A simple button for scanning a document, the software does all the processing for you (rotation, OCR,..). You can apply tags to documents, search them, export them. The storage format is more than open (a png file + OCR results as an HTML page,...)
I would have liked a comparison between Google, Amazon and tesseract. There are projects for doing this at home and I've been meaning to do this for some time but haven't got around to it yet.
Been a while but the only one I can remember off the top of my head is paperless: https://github.com/danielquinn/paperless
If you find the term, yay; if you don't you can't really conclude it's not there.
Even $.25 seems expensive...
The article is from someone in the Oscar Health Insurance.
According to Wikipedia, the company:
is claiming transparency in claims pricing:
>Oscar Health Insurance is a technology-focused health insurance company founded in 2012 and headquartered in New York City. The company has plans to change the health insurance industry through telemedicine, healthcare focused technological interfaces, and transparent claims pricing systems.
Maybe they extended the transparency to people's health data.
Here is another article (still on Medium) that incidentally talks of the Author (as engineer employed in the company):
Do people think companies that deal with PHI have to build every service in house...? Run their own data centers? Build their own backhaul connections to their users?
Obviously I don't have a copy of the contract between Google and Oscar, but I think it's profoundly unlikely that they would write a blog post about something that would be the end of their company without the "simple" (and industry standard) step of signing a BAA with a vendor.
Defending on the service provider, there may be an upfront cost for signing a BAA, but generally the service costs remain the same (assuming you stick to "BAA approved" services). In the case of Google, there is no additional cost:
"As such, we can offer HIPAA regulated customers the same products at the same pricing that is available to all customers, including sustained use discounts. Other public clouds charge more money for their HIPAA cloud, we do not."
Any reputable cloud service implements technical and process controls to control or eliminate access to customer data. Doing something like incorporating customer data into advertising would be a very serious situation, and frankly I doubt anyone would be that dumb at scale.
Google will contractually agree to a bunch of things relative to this and has third party audits to provide additional assurance.
Perhaps using Google's service is faster or more accurate, but it would be necessary to have a comparison.