FYI, 27 times per hour is basically nothing. With GPT4 over the API, I make 2-3 ...

ZephyrBlu · on May 6, 2023

Depends heavily on your product. I can imagine there are quite a lot of use cases that have relatively infrequent API usage or highly cacheable responses.

bomewish · on May 7, 2023

> retriever-augmentation and context stuffing

Care you elaborate? This sounds very interesting & useful. Just anything about the setup and implementation would be super helpful.

ukuina · on May 8, 2023

This should get you started: https://haystack.deepset.ai/tutorials/22_pipeline_with_promp...