
Even an SSD won't be fast enough for most indexes due to their random-access nature. I've seen more than 1M IOPS on a large NVMe disk when using a DiskANN index.

Why not try Milvus? You get multiple index types (SIMD-based brute-force search, IVF, HNSW, and DiskANN), and you never have to worry about scaling.

Maybe check this https://zilliz.com/pricing

You can easily store 1B vectors in Zilliz Serverless, and the cost is incredibly cheap.


You cannot compare Serverless with a dedicated instance; that's unfair. A large number of writes and queries will eat up your wallet.


That's true. We can now offer the new dedicated Cost-optimized CU; pricing is

$0.318/hour ($228.96/month)

Which means you can store 20M 768-dimensional vectors for $228.96 per month.


Zilliz quoted us nearly six figures a month for our database.


Interesting case study, but I'm skeptical of the broader implications. Pinecone is notoriously expensive compared to other vector database services; a horizontal price comparison reveals several better options in the market.

Removing the calculator doesn't solve the core issue. It just obfuscates costs and makes it harder for users to compare options upfront. In my view, this approach merely removes the comparison stage, potentially leading more uninformed users to upload data without fully understanding the pricing implications.

While simplification can be valuable, in this case it seems to benefit the company more than the users. A better approach might be to improve the calculator's accuracy and usability rather than removing it entirely. Transparency in pricing is crucial, especially for B2B services where costs can scale quickly.


There seems to be big hype around adapting PG into any infra. I love PG, but this doesn't seem like the right thing.


I think for me the problem with every single new PG queue is that everyone and their mother seems to think they need to reinvent this specific wheel, and the flavor of the day doesn't often bring much new to the space. Probably because it's:

1. Pretty easy to understand and grok the problem space

2. Scratching the programmer itch of wanting something super generic that you can reuse all over the place

3. Doable with a modest effort over a reasonable scope of time

4. Built on rock solid internals (Postgres) with specific guarantees that you can lean on

Here's 7 of them just right quick:

- https://github.com/timgit/pg-boss

- https://github.com/queueclassic/queue_classic

- https://github.com/florentx/pgqueue

- https://github.com/mbreit/pg_jobs

- https://github.com/graphile/worker

- https://github.com/pgq/pgq

- https://github.com/que-rb/que

You could probably find more by searching; I only spent about 5 minutes looking and grabbed the first ones I found.

I'm all for doing this kind of thing as an academic exercise, because it's a great way to learn about this problem space. But at this point, if you're reinventing the Postgres job queue wheel and sharing it with this technical audience, you should probably also explain why your wheel is particularly interesting if you want to grab my attention.


At low-to-medium scale, this will be fine. Even at higher scale it holds up, so long as you monitor autovacuum performance on the queue table.

At some point it may become practical to bring a dedicated queue system into the stack, sure, but this can massively simplify things when you don’t need or want the additional complexity.
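
For reference, a quick way to keep an eye on that (the table name "jobs" here is just a placeholder):

  -- dead-tuple buildup and autovacuum activity on the queue table
  select relname, n_live_tup, n_dead_tup,
         last_autovacuum, autovacuum_count
  from pg_stat_user_tables
  where relname = 'jobs';

If n_dead_tup keeps climbing and last_autovacuum lags behind, the queue table probably needs more aggressive per-table autovacuum settings.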


Aside from that, the main advantage of this is transactions. I can do:

  begin;
    insert_row();
    schedule_job_for_elasticsearch();
  commit;
And it's guaranteed that both the row and job for Elasticsearch update are inserted.

If you use a dedicated queue system then this becomes a lot trickier:

  begin;
    insert_row();
    schedule_job_for_elasticsearch();
  commit; // Can fail, and then we have an ES job but no SQL row.

  begin;
    insert_row();
  commit;
  schedule_job_for_elasticsearch(); // Can fail, and then we have a SQL row and no job.
There are of course also situations where this doesn't apply, but this "insert row(s) in SQL and then queue job to do more with that" is a fairly common use case for queues, and in those cases this is a great choice.
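
Concretely, the "schedule" step is just another insert, and workers can claim jobs without trampling each other via SKIP LOCKED. A minimal sketch (the jobs schema here is invented for illustration):

  -- producer, inside the same transaction as the row insert:
  insert into jobs (kind, payload)
  values ('es_sync', '{"document_id": 42}');

  -- worker: atomically claim the next pending job
  update jobs
  set status = 'running', started_at = now()
  where id = (
    select id from jobs
    where status = 'pending'
    order by id
    limit 1
    for update skip locked
  )
  returning id, kind, payload;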


Transactional Outbox solves this. You use a table like in the first example, but instead of actually doing the Elasticsearch update, the outbox table is piped into the dedicated queue.
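
A rough sketch of that shape (all names hypothetical): the write transaction only appends to the outbox, and a separate relay process drains it into the dedicated queue:

  begin;
    insert into documents (id, body) values (42, '...');
    insert into outbox (topic, payload)
      values ('es_sync', '{"document_id": 42}');
  commit;

  -- relay worker, in a loop: pop a batch and hand it to the queue
  delete from outbox
  where id in (
    select id from outbox
    order by id
    limit 100
    for update skip locked
  )
  returning topic, payload;

The relay-to-queue handoff is still at-least-once, so downstream consumers should be idempotent.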


Most of these two-phase problems can be solved by having separate queue consumers.

And as far as I can tell, this is only a perk when your two actions are "mutate the collocated database" and "do X". For all other situations this seems like a downgrade.


Do you mean like the consumer for the first phase enqueues a job for the second phase?


I agree, there is no need for FAANG-level infrastructure. IMO, in most cases the simplicity/performance tradeoff for small/medium scale is worth it. There is also statistics tooling that helps you monitor throughput and failure rates (aggregated on a per-second basis).
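
For a rough idea of what per-second stats can look like in plain SQL (schema invented for illustration):

  select date_trunc('second', finished_at) as sec,
         count(*) filter (where status = 'done')   as succeeded,
         count(*) filter (where status = 'failed') as failed
  from jobs
  where finished_at > now() - interval '5 minutes'
  group by sec
  order by sec;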


Instead of SQS, I recently created a basic abstraction on top of PG that mimics the SQS APIs. The intention was to use it during development and simply switch to SQS later.

Never did. The production code still uses the PG-based queue (which has been improved since), and PG just works perfectly fine. We might still need to go with a dedicated queue service at some point, but it has been perfectly fine so far.
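
The core of an SQS-style receive is surprisingly small. A sketch of the idea (table and column names are made up), using a visibility timeout so unacknowledged messages reappear:

  -- receive_message: claim the next visible message and hide it for 30s
  update messages
  set visible_at = now() + interval '30 seconds'
  where id = (
    select id from messages
    where visible_at <= now()
    order by id
    limit 1
    for update skip locked
  )
  returning id, body;

  -- delete_message, once processing succeeds:
  delete from messages where id = $1;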


I use it as a job queue. Yes, it has its cons, but not dealing with another moving piece in the big picture is totally worth it.


I mean, I love Postgres like the next guy, and I like simple solutions as long as they work. I just wonder if this is truly simpler than using a Redis or RabbitMQ queue if you need queues. If you're already using a cloud provider, SQS is quite trivial as well.

I guess if you already have Postgres and don't want to use the cloud provider's solution, you can use this to avoid hosting another piece of infra.


A DB-based queue gives you the ability to query against your queues, if your use case needs it. Other options tend to dispose of the state once the job is finished.
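
E.g. (assuming some jobs table), questions like "what's stuck, and for how long?" stay answerable:

  select status, count(*) as n, min(created_at) as oldest
  from jobs
  group by status;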


Depends on your use case. I think it's not hard to push the window to 32K or even 100K if we change the position embeddings.


Something needs to be fixed. Open source is not about the license; it's about people.

OSS companies are fighting cloud vendors to survive.

https://zilliz.com/blog/Redis-tightens-its-license-How-can-a...


What I observe: simple RAG is fading, but complex RAG will persist and evolve, involving query rewriting, data cleaning, reflection, vector search, graph search, rerankers, and more intelligent chunking. Large models should not just be knowledge providers, but tool users and process drivers.


It's becoming so complex that it will stop being called RAG. It's just an application that uses an LLM as one part of it.


I see many folks mistaking RAG for a technology. It's just a process. No matter the complexity, if the underlying principle is to augment the LLM with information specific to the question at hand, it is RAG.


Folks are doing query expansion, chain of thought, all kinds of stuff that isn't even about adding extra info and doesn't involve the R at all. They are just complex LLM applications.


Do you know a good article on this?


I would personally not recommend implementing the Redis protocol on top of SQLite, as I've seen too many failed cases like this. Users may perceive your product as a drop-in Redis replacement, and it may work fine in a PoC, but once it hits production, SQLite's performance and scalability will be severely challenged.

It's much better to use RocksDB as the underlying storage engine for such a solution; RocksDB can provide the performance and scalability required. If you need a distributed solution, I would suggest looking at TiKV or FoundationDB. Both are excellent distributed storage systems that can handle production workloads much better than a SQLite-based approach.


This “SQLite isn’t good for production” mentality really needs to die. It’s a myth.


Finally we see some relationship between Golang and AI?

I'm tired of the hype around Python and Rust. I would really appreciate seeing more AI projects written in Golang.

for example: https://github.com/tmc/langchaingo

