What would you suggest instead?
A non-autoregressive transformer trained with a classification objective.
A non-autoregressive transformer trained with a classification objective.