The data is available in a SQLite database on GitHub: https://github.com/vlofgren/hn-green-clankers
You can explore the underlying data using SQL queries in your browser here: https://lite.datasette.io/?url=https%253A%252F%252Fraw.githu... (that's Datasette Lite, my build of the Datasette Python web app that runs in Pyodide in WebAssembly)
Here's a SQL query that shows the users in that data who posted the most comments containing at least one em dash - the top ones all look like legitimate accounts to me: https://lite.datasette.io/?url=https%3A%2F%2Fraw.githubuserc...
I still call voodoo on this. I use an iPhone, iPad, Mac to comment here—all of them autocorrect to em dashes at one point or another. Same goes for ellipsis.
apparently HN comments are licensed not only to HN, but also to some guy in Sweden
cool cool cool
If you change to
> select user, source, count(*), ...
it's clear that every single outlier in em-dash use in the data set is a green account.
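For anyone who wants to try that grouping locally, here is a rough sketch using Python's built-in sqlite3 module. The table name `comments`, the `body` column, the `source` values and the sample rows are all made up for illustration; the real schema in the linked database may well differ, so adjust the names before running it against the actual file.

```python
import sqlite3

# In-memory stand-in for the real database. Table name, column names and
# rows are hypothetical, not taken from the actual dataset.
conn = sqlite3.connect(":memory:")
conn.execute("create table comments (user text, source text, body text)")
rows = [
    ("alice", "established", "I use a Mac\u2014it autocorrects."),
    ("alice", "established", "No dash here."),
    ("bob", "green", "First\u2014point."),
    ("bob", "green", "Second\u2014point."),
    ("bob", "green", "Third\u2014point."),
    ("carol", "green", "Plain comment."),
]
conn.executemany("insert into comments values (?, ?, ?)", rows)

# Count comments containing an em dash (U+2014, code point 8212),
# grouped by user and by the source the comment was collected from.
query = """
select user, source, count(*) as em_dash_comments
from comments
where instr(body, char(8212)) > 0
group by user, source
order by em_dash_comments desc
"""
results = conn.execute(query).fetchall()
for row in results:
    print(row)
```

Adding `source` to both the select list and the `group by` is what lets you see at a glance which kind of account each heavy em-dash user belongs to.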