
WalterBright 05/04/2025

> for language pairs that are not traditionally supported

Maybe translate X to English, and then to Y?


Replies

omneity 05/04/2025

Many languages (with a sizable speaker population) do not have machine translation to or from any other language.

The technique does make sense, though mostly at the training-data stage. BART-style translation models already represent concepts in a latent space regardless of the input or output language, sidestepping English entirely, so you have something like:

`source lang --encoded into--> latent space --decoded into--> target lang`

Works great to get translation support for arbitrary language combinations.
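
To make the encode/decode picture concrete, here is a toy sketch in Python. It is not how a real model works: real encoders and decoders learn continuous vectors, whereas this uses hand-written lookup tables, and the `LEXICON` table, the concept IDs, and the function names are all made up for illustration. It only shows why a shared intermediate representation gives you every language pair without routing through English.

```python
# Toy stand-in for a shared latent space: every concept has a
# language-neutral ID. The tables below are hypothetical examples,
# not data from any real translation model.
LEXICON = {
    "es": {"agua": "WATER", "casa": "HOUSE"},
    "fr": {"eau": "WATER", "maison": "HOUSE"},
    "sw": {"maji": "WATER", "nyumba": "HOUSE"},
}


def encode(words, src):
    """Map source-language words to language-neutral concept IDs."""
    return [LEXICON[src][w] for w in words]


def decode(concepts, tgt):
    """Map concept IDs back to words in the target language."""
    inverse = {concept: word for word, concept in LEXICON[tgt].items()}
    return [inverse[c] for c in concepts]


def translate(words, src, tgt):
    """source lang -> latent space -> target lang, with no English step."""
    return decode(encode(words, src), tgt)


print(translate(["agua", "casa"], "es", "sw"))
```

Adding a fourth language here means adding one table, not one table per language pair, which is the appeal of the approach.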

djvdq 05/04/2025

It's a bad idea. It makes a lot of mistakes and might totally change the meaning of some sentences.
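
One well-known way the meaning can change, sketched as a toy example: Spanish and French both distinguish informal and formal "you", but English does not, so the distinction is lost at the pivot. The dictionaries and function names below are hypothetical and cover a single word; they are only meant to illustrate the failure mode, not to model a real translation system.

```python
# Hypothetical one-word dictionaries, for illustration only.
ES_TO_EN = {"tú": "you", "usted": "you"}   # both collapse to "you"
EN_TO_FR = {"you": "tu"}                   # the pivot step has to pick one
ES_TO_FR = {"tú": "tu", "usted": "vous"}   # direct mapping keeps formality


def pivot_translate(word):
    """Spanish -> English -> French."""
    return EN_TO_FR[ES_TO_EN[word]]


def direct_translate(word):
    """Spanish -> French with no intermediate language."""
    return ES_TO_FR[word]


for word in ("tú", "usted"):
    print(word, "pivot:", pivot_translate(word), "direct:", direct_translate(word))
```

The formal "usted" comes out as informal "tu" through the pivot, while the direct mapping gives "vous".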