stephen37's comments

I love working with Ollama. I was really surprised at how easy it is to build a simple RAG system with it. For example: https://github.com/stephen37/ollama_local_rag


Nice, I've been looking out for something like this! What's Jina AI and how is it local if I need an API key for it? Also, this is the first time I'm hearing about poetry. Might be worth including in the prerequisites (unless I can just stick with pip?)


> What's Jina AI and how is it local if I need an API key for it?

Looks like it's a vector DB used for creating and looking up embeddings (vectors). The LLM is the second part of RAG; the first part is having a good embedding model.
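
To make those two parts concrete, here is a minimal sketch of the retrieval half of RAG. It is not the code from the linked repo and does not call Jina AI or Ollama: `embed` is a hypothetical toy stand-in (a bag-of-words count vector) for a real embedding model, and `build_prompt` only shows where the retrieved text would be handed to an LLM.

```python
import math
from collections import Counter


def embed(text: str) -> Counter:
    """Toy stand-in for an embedding model (assumption): bag-of-words counts."""
    return Counter(text.lower().split())


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[token] * b[token] for token in a)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve(query: str, docs: list[str], k: int = 1) -> list[str]:
    """Part one of RAG: rank documents by similarity to the query."""
    q = embed(query)
    ranked = sorted(docs, key=lambda d: cosine(q, embed(d)), reverse=True)
    return ranked[:k]


def build_prompt(query: str, docs: list[str], k: int = 1) -> str:
    """Part two of RAG: pass the retrieved context to the LLM as a prompt."""
    context = "\n".join(retrieve(query, docs, k))
    return f"Context:\n{context}\n\nQuestion: {query}"


docs = [
    "Ollama runs large language models locally",
    "Poetry manages Python dependencies",
    "Milvus stores and searches vectors",
]
```

Calling `build_prompt("how do I manage python dependencies", docs)` puts the Poetry sentence into the context, since it is the only document sharing words with the query. A real setup would swap `embed` for an actual embedding model and send the prompt to the LLM.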


Oh yes, good point. It's because at first I was using a local embedding model and then decided to use Jina AI, as they have good embeddings.

Jina AI does several things, one of which is providing embeddings. I use their English/German embeddings because in one demo I am working with German data.

You can use pip as well but yes, let me add something about Poetry in case people don't know about it :)


Off the top of my head, pgvector only supports two index types, and both run in memory only. It doesn't support GPU indexing or disk-based indexing, and it doesn't separate queries from insertions.

Also, the different people I've talked to struggle to scale it past 100K-1M vectors.

You can also have a look yourself from a performance perspective: https://ann-benchmarks.com/


It's nice to begin with, but you usually hit a scale issue quickly, from what I've tried and from talking to different people using it.

Also, as far as I know, they only offer indexes stored in memory, and they lack support for other or more advanced index types, such as GPU indexes.

https://zilliz.com/comparison/milvus-vs-pgvector


Cool to see that Supabase is adding S3 protocol! Nice to see more and more storage solutions available.

At Milvus, we've integrated S3, Parquet, and other formats so that developers can use their data no matter what they use.

For those who have used both, how do the performance and ease of integration compare between Supabase and other solutions like Milvus that have had these features for some time?


Just make your own post if you’re selling your product


Nice. Will check out Milvus.


I am French originally, and what worked well for me was speaking to myself in English all the time. At home I would actually speak English out loud, and it helped a lot with practicing my pronunciation and with learning new words.

Also reading books in English out loud can be really helpful.


What is Character.AI doing that's special to get so much more engagement than other apps?


It basically makes making characters you can chat with pretty easy. As in defining their motivations, personalities, etc. Obviously with good prompting you can do this with most LLM-based systems, but they've optimized for this use case.


Get put into these articles without a mention of funding from the publishers of the article? Mentioned five times but I'm not sure I see anything that says they're in the portfolio.


smarter waifus for the wicked


This might be the first time I've seen a phone number on a GitHub repo. I wonder how many phone calls they will actually receive. It's probably the last thing I'd like to do when looking at an open source repository.


I hope a few! We have gotten some so far. In general, I just want people who are interested in using the software to be able to get a human on the phone immediately.

