Hm, that’s a multinomial classification with a very high cardinality. It’s really weird it wo...

vslira • yesterday at 11:33 PM • 4 replies • view on HN

Hm, that’s a multinomial classification with a very high cardinality. It’s really weird it works. I’m sure it does as the author states, but for how many authors (out of the whole web) does this work?

Replies

dmd • today at 1:23 AM

It worked on me, and I would be shocked if my blog (dmd.3e.org) has more than a dozen readers. I am stunned.

➕ show 1 reply

londons_explore • today at 7:33 AM

There are ~8 billion people. Sounds big, but it's only 2^33. Ie if you can find 33 things about the text which halve the number of possible writers, you have narrowed it down to 1 person.

Just a couple more things and you can accommodate some of your things being mistaken/wrong/uncertain too.

kelseyfrog • yesterday at 11:44 PM

Sure the cardinality is high, but the model isn't using a uniform prior. What do you suppose all the the values in each of the terms are, P(Text sample | Kelsey Piper) * P(Text sample) / P(Kelsey Piper)?

astrange • today at 12:14 AM

Maybe it just says all writing is Kelsey Piper.

alt Hacker News

Replies