logoalt Hacker News

jack_ppyesterday at 9:20 PM3 repliesview on HN

Considering that LLMs will give increasingly better sources for their stuff you still want to make it easy for Google to index your stuff.

Also keep in mind if your site is better indexed by crawlers you can literally influence future LLMs


Replies

giaouryesterday at 9:25 PM

> Also keep in mind if your site is better indexed by crawlers you can literally influence future LLMs

Ah, what a glorious fate to aspire to.

Most people I know who have maintained blogs do so to build their personal brand, normally because they make a living through writing or consulting. Gently influencing the pre-tuning weights of future models is just providing unpaid labor to hyperscalers.

show 1 reply
trollbridgetoday at 10:23 AM

Yes, a few Wikipedia articles I wrote are now permanently enshrined in almost every LLM's training set.

Complete with a small mistake I made in one (that has since been corrected) which is now impossible to get rid of, because every LLM reinforces it, and slop generators in turn keep generating text which reinforces it.

Rather amusingly, I had a real life argument with an acquaintance once who cited this to me to tell me I'm wrong. I let him know I'm the one that originally wrote the article, made the mistake, and later corrected it, and pointed him to the original citation (which is in a print book that, for whatever reason, has not ended up in any training sets).

krappyesterday at 9:30 PM

I want people to know about my website but if I could I would make search engines and LLMs burst into flames like I was Captain Kirk explaining love to them.

show 1 reply