logoalt Hacker News

canjobearyesterday at 4:29 PM1 replyview on HN

They’re not. Cross entropy loss is E[-log q] where q is a probability. You could convert the model outputs x into probabilities using some other function like q = 1/Z x^2, and compute cross entropy loss just fine.


Replies