Hacker Newsnew | past | comments | ask | show | jobs | submit | apichar's commentslogin

We encourage everyone to try out our model. Here is a link to get started: https://jigsawstack.com/speech-to-text


That's great but the thing you're linking and describing is still a blog post/benchmark and you also just had an actual Show HN https://news.ycombinator.com/item?id=43368327

You should still take a look at those Show HN rules and also https://news.ycombinator.com/item?id=22336638, the spirit of the thing is showing cool stuff you've built and when it starts sounding like obvious promotion it tends to work less well. It's in your own interest to categorize/frame your own postings better - they'll get better discussion that way.


True- I saw that too, but I also wanted to start a thread to chat with the community and hear different perspectives. Would love to get your thoughts!


Our pricing model is being updated soon to be even more cost-effective. Currently, we support up to 10 pages per API request. With the upcoming token-based pricing, costs will drop significantly to just $1.40 per 1M tokens.


how do a client break a pdf into 10 page chunks? can a pdf file be uploaded, or we are expected to upload rasterized images?

I also feel that token is not a comprehensible unit for many customers. "page" is better.


You can upload a PDF directly, and the vOCR API supports a page_range parameter so you can specify which pages to process in a single request (e.g., [1,10]).

If your PDF has more than 10 pages, you can make multiple requests, each specifying a different page range (e.g., [1,10], [11,20], etc.). No need to manually split the file.

Also, I totally get that tokens can feel abstract. However by making this shift we can focus on processing time. In doing so the token-based pricing model allows us to be more flexible and provide more at a cheaper cost.

For example, instead of $0.05 per invocation it could be as low as $0.0014–$0.0035 per page depending on factors like the amount of text extracted and formatting.

We do appreciate the feedback and will continue make this approach as transparent as possible.


We're thrilled to announce that JigsawStack’s AI tools are now available in LangChain.js! Automate tasks like web scraping, transcription, and SQL generation with just a few lines of code. Our tools are designed to simplify your workflow and boost your app's AI capabilities.

Check out the full suite of tools and how you can integrate them into your next project: https://jigsawstack.com/blog/jigsawstack-now-available-in-la...

We’d love to hear your feedback!


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: