logoalt Hacker News

rickettetoday at 8:05 AM3 repliesview on HN

LLMs.txt is also nonsense since it isn't adopted by any of the major AI players.


Replies

networkedtoday at 9:08 AM

Google has recently added `llms.txt` to Chrome's Lighthouse check for agentic browsing (https://searchengineland.com/google-llms-txt-chrome-lighthou...), so adoption may be coming. Admittedly, I put more faith in

  <link rel="alternate" type="text/markdown" href="https://example.com/foo.md" title="Markdown version of the &lt;Foo&gt; page">
that I copied from Gwern.net. This convention is discoverable (just read the HTML) and naturally adapts to any website size and structure.

I have created an `llms.txt` for my website anyhow. I use a fixed LLM prompt to generate it from the internal links in `index.md`.

show 1 reply
dspilletttoday at 9:51 AM

The same could be said of robots.txt

And anything else that might tell them not to access something.

show 1 reply
pfannltoday at 12:56 PM

To be fair, "not adopted by any major AI player" is probably the most web-standard-compliant phase of a new web standard.