logoalt Hacker News

canjobeartoday at 3:39 PM1 replyview on HN

The softmax, after the network has been trained, yields an estimate of the probability in the training data, but it is not that probability itself.


Replies

jmalickitoday at 3:57 PM

Which models are not trained with the log softmax as the loss function?

show 1 reply