They’re not. Cross entropy loss is E[-log q] where q is a probability. You could convert the model o...

canjobear • yesterday at 4:29 PM • 1 reply • view on HN

They’re not. Cross entropy loss is E[-log q] where q is a probability. You could convert the model outputs x into probabilities using some other function like q = 1/Z x^2, and compute cross entropy loss just fine.

Replies

jmalicki • yesterday at 4:40 PM

Behold the softmax: https://docs.pytorch.org/docs/2.11/generated/torch.nn.CrossE...

➕ show 1 reply

alt Hacker News

Replies