The usual companies may not have ethics as you write, but politically motivated company can easily embed "ethics" in a recommendation engine like that.
Vipassana meditation to reveal fears and other mental processes that are hidden. Also to repair focus disseminated by the modern culture of constant distractions. Even 15 minute session is enough to feel the potential and become more aware.
Sometimes this awareness is enough to change bad mental habits. But there are other things that you continue to do even if you understand that you don't want it. These are "imprints" NLP terminology.
To change these imprints my friend does really long Vipassana sessions for more than 10 hours. She claims to change at least one strong imprint that was "panic when staying at home alone". Which is related to an event happened when she was less than a year old.
I have not tried so long Vipassana sessions and to change my imprints I do Connirae Andreas "Core Transformation" technique. It's easier to have other person do it to you, however with enough determination you can train to do it by yourselves. Unfortunately it's still a lot of efforts to learn it. Anyway may be that helps.
In principle if you just put a space between each character it would, though it would also make ngrams between words which you might not want.
edit: for vw, maybe the other lib has special support for character ngrams with word boundaries
They process Lucene index and create embedded representation of it. Then you can search over that representation for "semantic" matches.
Last time I checked it about a year ago the embedded collection of documents was kept in the memory and the search was implemented by a linear scan. So I suspect it can be slow on very large collection of documents.
Part of the answer is here - https://www.wheresyoured.at/the-men-who-killed-google/