exactly, embeddings destroy information.. exact keywords, acronyms, etc. They get squashed into floats.
That is why grep still beats it for code.
I generated visual schematic of every stage of the pipeline - https://vectree.io/c/retrieval-augmented-generation-embeddin...