rolisz on Oct 26, 2023 | on: Jina AI launches open-source 8k text embedding
That's not quite TF-IDF though. I agree you can get better results than that with Ada embeddings, but I would argue you can get even better results with embeddings from smaller chunks.
simonw on Oct 26, 2023
I guess technically it's BM25, since it's using the rank mechanism in SQLite FTS5:
https://www.sqlite.org/fts5.html#sorting_by_auxiliary_functi...
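For anyone who wants to see what that looks like, here is a minimal sketch using Python's built-in sqlite3 module. It assumes your SQLite build has FTS5 compiled in (most do); the table name `docs`, the column `body`, and the `search` helper are made up for illustration and are not from the tool being discussed. Ordering by the hidden `rank` column uses FTS5's default ranking, which is `bm25()`; lower (more negative) scores are better matches.

```python
import sqlite3


def search(docs, query):
    """Index docs in an in-memory FTS5 table and return matches, best first.

    Each result is (rowid, body, score), where score is the value of the
    FTS5 bm25() auxiliary function (more negative means a better match).
    """
    conn = sqlite3.connect(":memory:")
    # Hypothetical schema: a single full-text indexed column named "body".
    conn.execute("CREATE VIRTUAL TABLE docs USING fts5(body)")
    conn.executemany("INSERT INTO docs(body) VALUES (?)", [(d,) for d in docs])
    rows = conn.execute(
        # "rank" is FTS5's hidden column; by default it equals bm25(docs).
        "SELECT rowid, body, bm25(docs) FROM docs "
        "WHERE docs MATCH ? ORDER BY rank",
        (query,),
    ).fetchall()
    conn.close()
    return rows


if __name__ == "__main__":
    corpus = [
        "sqlite full text search",
        "embedding models for semantic search",
        "nothing relevant here",
    ]
    for rowid, body, score in search(corpus, "search"):
        print(rowid, score, body)
```

So it is keyword ranking rather than anything embedding-based, which is the point of the comparison above.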