canjobear • today at 3:39 PM

The softmax, after the network has been trained, yields an estimate of the probability in the training data, but it is not that probability itself.
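
A minimal sketch of that distinction, not taken from the thread: the class counts, the context-free model (one free logit per class), the learning rate, and the helper names `softmax` and `fit_logits` are all assumptions made for illustration. It fits logits by gradient descent on the mean cross-entropy (the negative log softmax of the observed class) and compares the softmax output with the empirical frequencies in the training sample.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def fit_logits(counts, steps, lr=0.5):
    """Fit one logit per class by gradient descent on mean cross-entropy.

    Returns the softmax output after `steps` updates and the empirical
    class frequencies of the (hypothetical) training sample.
    """
    n = sum(counts)
    freqs = [c / n for c in counts]
    logits = [0.0] * len(counts)
    for _ in range(steps):
        probs = softmax(logits)
        # Gradient of the mean cross-entropy w.r.t. logit k is probs[k] - freqs[k].
        logits = [z - lr * (p - f) for z, p, f in zip(logits, probs, freqs)]
    return softmax(logits), freqs


# Hypothetical training sample: 100 observations over three classes.
counts = [70, 20, 10]

early, freqs = fit_logits(counts, steps=5)
late, _ = fit_logits(counts, steps=5000)

print("empirical frequencies:", freqs)
print("softmax after 5 steps:", early)
print("softmax after 5000 steps:", late)
```

In this toy setting the softmax only approaches the training frequencies as optimization proceeds, and those frequencies are themselves computed from a finite sample. A real network adds limited capacity, regularization, and early stopping on top of that, so its softmax output is an estimate at best.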

jmalicki • today at 3:57 PM

Which models are not trained with the log softmax as the loss function?

alt Hacker News