logoalt Hacker News

WalterBrighttoday at 2:34 AM2 repliesview on HN

> for language pairs that are not traditionally supported

Maybe translate X to English, and then to Y?


Replies

omneitytoday at 3:17 AM

Many languages (with a sizable speaker population) do not have machine translation to or from any other language.

The technique makes sense though, but in the training data stage mostly. BART-style translation models already represent concepts in latent space regardless of the input-output language sidestepping English entirely so you have something like:

`source lang —encoded into-> latent space —decoded into—> target lang`

Works great to get translation support for arbitrary language combinations.

djvdqtoday at 4:31 AM

It's a bad idea. It makes a lot of mistakes and might totally change the meaning of some sentences.