Claws are now a new layer on top of LLM agents

125 points • by Cyphase • today at 12:56 AM • 536 comments • view on HN

https://xcancel.com/karpathy/status/2024987174077432126

Comments

All: quite a few comments in this thread (and another one we merged hither - https://news.ycombinator.com/item?id=47099160) have contained personal attacks. Hopefully most of them are [flagged] and/or [dead] now.

On HN, please don't cross into personal attack no matter how strongly you feel about someone or disagree with them. It's destructive of what the site is for, and we moderate and/or ban accounts that do it.

If you haven't recently, please review https://news.ycombinator.com/newsguidelines.html and make sure that you're using the site as intended when posting here.

➕ show 1 reply

jameslk • today at 6:58 PM

One safety pattern I’m baking into CLI tools meant for agents: anytime an agent could do something very bad, like email blast too many people, CLI tools now require a one-time password

The tool tells the agent to ask the user for it, and the agent cannot proceed without it. The instructions from the tool show an all caps message explaining the risk and telling the agent that they must prompt the user for the OTP

I haven't used any of the *Claws yet, but this seems like an essential poor man's human-in-the-loop implementation that may help prevent some pain

I prefer to make my own agent CLIs for everything for reasons like this and many others to fully control aspects of what the tool may do and to make them more useful

➕ show 8 replies

daxfohl • today at 7:12 PM

I wonder how the internet would have been different if claws had existed beforehand.

I keep thinking something simpler like Gopher (an early 90's web protocol) might have been sufficient / optimal, with little need to evolve into HTML or REST since the agents might be better able to navigate step-by-step menus and questionnaires, rather than RPCs meant to support GUIs and apps, especially for LLMs with smaller contexts that couldn't reliably parse a whole API doc. I wonder if things will start heading more in that direction as user-side agents become the more common way to interact with things.

➕ show 4 replies

throwaway13337 • today at 2:34 PM

The real big deal about 'claws' in that they're agents oriented around the user.

The kind of AI everyone hates is the stuff that is built into products. This is AI representing the company. It's a foreign invader in your space.

Claws are owned by you and are custom to you. You even name them.

It's the difference between R2D2 and a robot clone trying to sell you shit.

(I'm aware that the llms themselves aren't local but they operate locally and are branded/customized/controlled by the user)

➕ show 2 replies

ZeroGravitas • today at 10:16 AM

So what is a "claw" exactly?

An ai that you let loose on your email etc?

And we run it in a container and use a local llm for "safety" but it has access to all our data and the web?

➕ show 7 replies

nevertoolate • today at 12:40 PM

My summary: openclaw is a 5/5 security risk, if you have a perfectly audited nanoclaw or whatever it is 4/5 still. If it runs with human-in-the-loop it is much better, but the value is quickly diminishing. I think llms are not bad at helping to spec down human language and possibly doing great also in creating guardrails via tests, but i’d prefer something stable over llms running in “creative mode” or “claw” mode.

simonw • today at 1:13 AM

I think "Claw" as the noun for OpenClaw-like agents - AI agents that generally run on personal hardware, communicate via messaging protocols and can both act on direct instructions and schedule tasks - is going to stick.

andai • today at 4:20 PM

We got store-brand Claw before GTA VI.

For real though, it's not that hard to make your own! NanoClaw boasted 500 lines but the repo was 5000 so I was sad. So I took a stab at it.

Turns out it takes 50 lines of code.

All you need is a few lines of Telegram library code in your chosen language, and `claude -p prooompt`.

With 2 lines more you can support Codex or your favorite infinite tokens thingy :)

https://github.com/a-n-d-a-i/ULTRON/blob/main/src/index.ts

That's it! There are no other source files. (Of course, we outsource the agent, but I'm told you can get an almost perfect result there too with 50 lines of bash... watch this space! (It's true, Claude Opus does better in several coding and computer use benchmarks when you remove the harness.))

➕ show 1 reply

mhher • today at 11:42 AM

The current hype around agentic workflows completely glosses over the fundamental security flaw in their architecture: unconstrained execution boundaries. Tools that eagerly load context and grant monolithic LLMs unrestricted shell access are trivial to compromise via indirect prompt injection.

If an agent is curling untrusted data while holding access to sensitive data or already has sensitive data loaded into its context window, arbitrary code execution isn't a theoretical risk; it's an inevitability.

As recent research on context pollution has shown, stuffing the context window with monolithic system prompts and tool schemas actively degrades the model's baseline reasoning capabilities, making it exponentially more vulnerable to these exact exploits.

➕ show 2 replies

daxfohl • today at 9:18 PM

I don't think AI will kill software engineering anytime soon, though I wonder if claws will largely kill the need for frontend specialists.

➕ show 2 replies

yoyohello13 • today at 5:38 PM

I’ve been building my own “OpenClaw” like thing with go-mcp and cloudflare tunnel/email relay. I can send an email to Claude and it will email me back status updates/results. Not as easy to setup as OpenClaw obviously but alt least I know exactly what code is running and what capabilities I’m giving to the LLM.

vivzkestrel • today at 3:10 AM

I still dont understand the hype for any of this claw stuff

zmmmmm • today at 8:40 PM

It seems like the people using these are writing off the risks - either they think it's so unlikely to happen it doesn't matter or they assume they won't be held responsible for the damage / harm / loss.

So I'm curious how it will go down once serious harm does occur. Like someone loses their house, or their entire life savings or have their identity completely stolen. And these may be the better scenarios, because the worse ones are it commits crimes, causes major harm to third parties, lands the owner in jail.

I fully expect the owner to immediately state it was the agent not them, and expect they should be alleviated of some responsibility for it. It already happened in the incident with Scott Shambaugh - the owner of the bot came forward but I didn't see any point where they did anything to take responsibility for the harm they caused.

These people are living in a bubble - Scott is not suing - but I have to assume whenever this really gets tested that the legal system is simply going to treat it as what it is: best case, reckless negligence. Worst case (and most likely) full liability / responsibility for whatever it did. Possibly treating it as with intent.

Unfortunately, it seems like we need this to happen before people will actually take it seriously and start to build the necessary safety architectures / protocols to make it remotely sensible.

➕ show 1 reply

thomassmith65 • today at 1:31 PM

  giving my private data/keys to 400K lines of vibe coded monster that is being actively attacked at scale is not very appealing at all

https://nitter.net/karpathy/status/2024987174077432126

If this were 2010, Google, Anthropic, XAI, OpenAI (GAXO?) would focus on packaging their chatbots as $1500 consumer appliances.

It's 2026, so, instead, a state-of-the-art chatbot will require a subscription forever.

➕ show 1 reply

tomjuggler • today at 10:12 AM

There's a gap in the market here - not me but somebody needs to build an e-commerce bot and call it Santa Claws

➕ show 1 reply

7777777phil • today at 10:11 AM

Karpathy has a good ear for naming things.

"Claw" captures what the existing terminology missed, these aren't agents with more tools (maybe even the opposite), they're persistent processes with scheduling and inter-agent communication that happen to use LLMs for reasoning.

➕ show 5 replies

ksynwa • today at 10:25 AM

Why mac mini instead of something like a raspberry pi? Aren't thede claw things delegating inference to OpenAI, Antropic etc.?

➕ show 6 replies

ollybrinkman • today at 7:33 PM

The challenge with layering on top of LLM agents is payment — agents need to call external tools and services, but most APIs still require accounts and API keys that agents can't manage. The x402 standard (HTTP 402 + EIP-712 USDC signatures) solves this cleanly: agent holds a wallet, signs a micropayment per call, no account needed. Worth considering as a primitive for agent-to-agent commerce in these architectures.

➕ show 1 reply

mittermayr • today at 10:29 AM

I wonder how long it'll take (if it hasn't already) until the messaging around this inevitably moves on to "Do not self-host this, are you crazy? This requires console commands, don't be silly! Our team of industry-veteran security professionals works on your digital safety 24/7, you would never be able to keep up with the demands of today's cybersecurity attack spectrum. Any sane person would host their claw with us!"

Next flood of (likely heavily YC-backed) Clawbase (Coinbase but for Claws) hosting startups incoming?

➕ show 8 replies

ianbutler • today at 7:45 PM

I'm not sure I like this trend of taking the first slightly hypey app in an existing space and then defining the nomenclature of the space relative to that app, in this case even suggesting it's another layer of the stack.

It implies an ubiquity that just isn't there (yet) so it feels unearned and premature in my mind. It seems better for social media narratives more than anything.

I'll admit I don't hate the term claws I just think it's early. Like Bandaid had much more perfusion and mindshare before it became a general term for anything as an example.

I also think this then has an unintended chilling effect in innovation because people get warned off if they think a space is closed to taking different shapes.

At the end of the day I don't think we've begun to see what shapes all of this stuff will take. I do kind of get a point of having a way to talk about it as it's shaping though. Idk things do be hard and rapidly changing.

hmokiguess • today at 3:50 PM

Are these things actually useful or do we have an epidemic of loneliness and a deep need for vanity AI happening?

I say this because I can’t bring myself to finding a use case for it other than a toy that gets boring fast.

One example in some repos around scheduling capabilities mentions “open these things and summarize them for me” this feels like spam and noise not value.

A while back we had a trending tweet about wanting AI to do your dishes for you and not replace creativity, I guess this feels like an attempt to go there but to me it’s the wrong implementation.

➕ show 2 replies

trcf23 • today at 2:18 PM

Has anyone find a useful way to to something with Claws without massive security risk?

As a n8n user, i still don't understand the business value it adds beyond being exciting...

Any resources or blog post to share on that?

➕ show 2 replies

verdverm • today at 9:35 PM

I can say with confidence that I will not use "claw" or any derivations because it attracts a certain kind of ilk.

"team" is plenty good enough, we already use it, it makes for easier integration into hybrid carbon-silicon collaboration

pvtmert • today at 11:43 AM

Does one really need to _buy_ a completely new desktop hardware (ie. mac mini) to _run_ a simple request/response program?

Excluding the fact that you can run LLMs via ollama or similar directly on the device, but that will not have a very good token/s speed as far as I can guess...

➕ show 5 replies

bravetraveler • today at 10:29 AM

I read [and comment on] two influencers maintaining their circles

deadbabe • today at 7:26 PM

Instead of posts about claws I would like to see more examples of what people are actually doing with claws. Why are you giving it access to your bank account?

Even if I had a perfectly working assistant right now, I don’t even know what I would ask it to do. Read me the latest hackernews headlines and comments?

fxj • today at 11:12 AM

He also talks about picoclaw which even runs on $10 hardware and is a fork by sipeed, a chinese company who does IoT.

https://github.com/sipeed/picoclaw

another chinese coompany m5stack provides local LLMs like Qwen2.5-1.5B running on a local IoT device.

https://shop.m5stack.com/products/m5stack-llm-large-language...

Imagine the possibilities. Soon we will see claw-in-a-box for less than $50.

➕ show 2 replies

SV_BubbleTime • today at 9:30 PM

Did Claws the name from Claude? I haven’t been following but didn’t some make OpenClaude and that turned in OpenClaw and ta-da a new name of a thing?

alecco • today at 6:48 PM

> Bought a new Mac mini to properly tinker with claws over the weekend.

Disappointing. There is a Rust-based assistant that can run comfortably in a Raspberry PI (or some very old computer you are not using) https://zeroclawlabs.ai/ https://github.com/zeroclaw-labs/zeroclaw (Built by Harvard and MIT students, looks like)

EDIT: sorry top Google result led to a fake ZeroClaw!

➕ show 4 replies

vatsachak • today at 4:33 PM

This is all so unscientific and unmeasurable. Hopefully we can construct more order parameters on weights and start measuring those instead of "using claws to draw pelicans on bicycles"

arjie • today at 8:45 AM

The openclaw rough architecture isn’t bad but I enjoyed building my own version. I chose rustlang and it works like I want. I made it a separate email address etc. and Apple ID. The biggest annoyance is that I can’t share Google contacts. But otherwise it’s great. I’m trying to find a way to give it a browser and a credit card (limited spend of course) in a way I can trust.

It’s lots of fun.

➕ show 1 reply

edf13 • today at 7:20 PM

That’s one of the reasons we’re building grith.ai ~ these ‘claw’ tools are getting too easy for use (which is good)… but they need securing!

➕ show 1 reply

throw03172019 • today at 4:35 PM

What are people using Claws for? It is interesting to see it everywhere but I haven’t had any good ideas for using them.

Anyone to share their use case? Thanks!

➕ show 2 replies

bjackman • today at 10:14 AM

Does anyone know a Claw-like that:

- doesnt do its own sandboxing (I'll set that up myself)

- just has a web UI instead of wanting to use some weird proprietary messaging app as its interface?

➕ show 4 replies

_boffin_ • today at 3:59 PM

I just realized i built open claw over a year, but never released it to anyone. Should have released it and got the fame. Shucks.

hoss1474489 • today at 5:16 AM

It’s a slow burn, but if you keep using it, it seems to eventually catch fire as the agent builds up scripts and skills and together you build up systems of getting stuff done. In some ways it feels like building rapport with a junior. And like a junior, eventually, if you keep investing, the agent starts doing things that blow by your expectations.

By giving the agent its own isolated computer, I don’t have to care about how the project gets started and stored, I just say “I want ____” and ____ shows up. It’s not that it can do stuff that I can’t. It’s that it can do stuff that I would like but just couldn’t be bothered with.

thih9 • today at 1:49 PM

How much does it cost to run these?

I see mentions of Claude and I assume all of these tools connect to a third party LLM api. I wish these could be run locally too.

➕ show 3 replies

DonHopkins • today at 9:11 PM

simonw> It even comes with an established emoji [lobster emoji]

Good thing they didn't call it OpenSeahorse!

dainiusse • today at 11:39 AM

I don't understand the mac mini hype. Why can it not be a vm?

➕ show 5 replies

_pdp_ • today at 10:25 AM

You can take any AI agent (Codex, Gemini, Claude Code, ollama), run it on a loop with some delay and connect to a messaging platform using Pantalk (https://github.com/pantalk/pantalk). In fact, you can use Pantalk buffer to automatically start your agent. You don't need OpenClaw for that.

What OpenClaw did is to show the messages that this is in fact possible to do. IMHO nobody is using it yet for meaningful things, but the direction is right.

➕ show 1 reply

mikewarot • today at 4:21 PM

I too am interested in "Claws", but I want to figure out how to run it locally inside a capabilities based secure OS, so that it can be tightly constrained, yet remain useful.

derefr • today at 7:41 PM

> I'm definitely a bit sus'd to run OpenClaw specifically - giving my private data/keys to 400K lines of vibe coded monster that is being actively attacked at scale is not very appealing at all.

So... why do that, then?

To be clear, I don't mean "why use agents?" I get it: they're novel, and it's fun to tinker with things.

But rather: why are you giving this thing that you don't trust, your existing keys (so that it can do things masquerading as you), and your existing data (as if it were a confidante you were telling your deepest secrets)?

You wouldn't do this with a human you hired off the street. Even if you're hiring them to be your personal assistant. Giving them your own keys, especially, is like giving them power-of-attorney over your digital life. (And, since they're your keys, their actions can't even be distinguished from your own in an audit log.)

Here's what you would do with a human you're hiring as a personal assistant (who, for some reason, doesn't already have any kind of online identity):

1. you'd make them a new set of credentials and accounts to call their own, rather than giving them access to yours. (Concrete example: giving a coding agent its own Github account, with its own SSH keys it uses to identify as itself.)

2. you'd grant those accounts limited ACLs against your own existing data, just as needed to work on each new project you assign to them. (Concrete example: letting a coding agent's Github user access to fork specific private repos of yours, and the ability to submit PRs back to you.)

3. at first, you'd test them by assigning them to work on greenfield projects for you, that don't expose any sensitive data to them. (The data created in the work process might gradually become "sensitive data", e.g. IP, but that's fine.)

To me, this is the only sane approach. But I don't hear about anyone doing this with agents. Why?

fullstackchris • today at 9:26 PM

so... MCP? can anyone explain what a "claw" is apposed to a "skill" or similar? if not, let's assume in three weeks a new term called "waffle" appears - can you explain what that is?

if not, youre all hype idiots.

its still tokens in, tokens out you fools.

Havoc • today at 7:15 PM

Are people buying mac minis to run the models locally?

➕ show 3 replies

ozim • today at 12:41 PM

I am waiting for Mac mini with M5 processor since M5 MacBook - seems like I need to start saving more money each month for that goal because it is going to be a bloodbath at the moment they land.

trippyballs • today at 10:16 AM

lemme guess there is going to be inter claw protocol now

➕ show 1 reply

Dilettante_ • today at 11:49 AM

I still haven't really been able to wrap my head around the usecase for these. Also fingers crossed the name doesn't stick. Something about it rubs my brain the wrong way.

➕ show 2 replies

zkmon • today at 10:20 AM

AI pollution is "clawing" into every corner of human life. Big guys boast it as catching up with the trend, but not really thinking about where this is all going.

ggrab • today at 10:48 AM

IMO the security pitchforking on OpenClaw is just so overdone. People without consideration for the implications will inevitably get burned, as we saw with the reddit posts "Agentic Coding tool X wiped my hard drive and apologized profusely". I work at a FAANG and every time you try something innovative the "policy people" will climb out of their holes and put random roadblocks in your way, not for the sake of actual security (that would be fine but would require actual engagement) but just to feel important, it reminds me of that.

➕ show 18 replies

alt Hacker News

Claws are now a new layer on top of LLM agents

Comments

🔗 View 38 more comments