Do embeddings work across languages? It must depend on the model, but are there models where similar concepts in different languages occupy the same location in vector space?
If yes it would allow to search a corpus in one language using another, without translating anything up front (neither the corpus nor the query).
Do embeddings work across languages? It must depend on the model, but are there models where similar concepts in different languages occupy the same location in vector space?
If yes it would allow to search a corpus in one language using another, without translating anything up front (neither the corpus nor the query).