Hacker News

ChatGPT Containers can now run bash, pip/npm install packages and download files

404 points by simonw yesterday at 7:19 PM | 290 comments

Comments

smeej today at 4:17 PM

As a person who's worked in support roles in tech companies and has a working familiarity with Python but is not a software developer or engineer at all, it's been fascinating to watch the changes.

In the last couple weeks, both Gemini and Claude have asked me, "Can I use the computer?" to answer some particular question. In both cases, my question to each was, "What computer? Mine, or do you have your own?" Here I had thought they were computers, in the vague Star Trek sense. I'm just using the free version in the browser, so I would have been surprised if it had been able to use my computer.

They had their own, and I could watch them script something up in Python to run the calculations I was looking for. It made me wonder who it was at Google/Anthropic who first figured out that the way to get LLMs to stop wetting their metaphorical pants when asked to do calculations was to give them a computer to use.

It did make me scratch my head when I was trying to prompt Nano Banana to generate something and it was like Gemini started talking about the image generator in the third person: "The AI is getting stuck on the earlier instruction, even though we've now abandoned that approach." Felt a little "turtles all the way down" with that one!

e12e today at 3:30 PM

Hmm.. what's this?

> gmail (read-only) # gmail.search_email_ids → any # > Description: Search Gmail message IDs by query/tags (read-only).

The ChatGPT app on Android disavows having this... In what context does ChatGPT get (read) access to Gmail? Desktop app?

simonw yesterday at 7:20 PM

Regular default ChatGPT can also now run code in Node.js, Ruby, Perl, PHP, Go, Java, Swift, Kotlin, C and C++.

I'm not sure when these new features landed because they're not listed anywhere in the official ChatGPT release notes, but I checked it with a free account and it's available there as well.

dangoodmanUT today at 1:14 AM

Giving agents Linux has compounding benefits in our experience. They're able to sort through weirdness that normal tooling wouldn't allow. For example, they can read an image, get an error back from the API, and see it wasn't the expected format. They read the magic bytes to see it was a JPEG despite being named .png, and then read it correctly.
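
For anyone curious what that magic-bytes check might look like, here is a minimal sketch in Python. The helper names (`sniff_image_format`, `extension_matches`) are hypothetical and only a few well-known signatures are covered; this is not what any particular agent actually runs, just an illustration of the idea.

```python
from typing import Optional

# Leading byte signatures ("magic bytes") for a few common image formats.
# This table is deliberately small; real tools recognize many more formats.
MAGIC_SIGNATURES = {
    b"\x89PNG\r\n\x1a\n": "png",
    b"\xff\xd8\xff": "jpeg",
    b"GIF87a": "gif",
    b"GIF89a": "gif",
}


def sniff_image_format(data: bytes) -> Optional[str]:
    """Return the format implied by the leading bytes, or None if unknown."""
    for signature, name in MAGIC_SIGNATURES.items():
        if data.startswith(signature):
            return name
    return None


def extension_matches(filename: str, data: bytes) -> bool:
    """Check whether the file extension agrees with the sniffed format."""
    ext = filename.rsplit(".", 1)[-1].lower()
    if ext == "jpg":  # treat .jpg and .jpeg as the same format
        ext = "jpeg"
    return sniff_image_format(data) == ext


# A JPEG payload saved under a misleading .png name.
jpeg_bytes = b"\xff\xd8\xff\xe0" + b"\x00" * 16
print(sniff_image_format(jpeg_bytes))
print(extension_matches("photo.png", jpeg_bytes))
```

With a real file you would read only the first few bytes, e.g. `open(path, "rb").read(16)`, and pass them to `sniff_image_format`.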

candiddevmike yesterday at 8:59 PM

Seems like everyone is trying to get ahead of tool calling moving people "off platform" and creating differentiators around what tools are available "locally" to the models etc. This also takes the wind out of the sandboxing folks, as it probably won't be long before the "local" tool calling can effectively do anything you'd need to do on your local machine.

I wonder when they'll start offering virtual, persistent dev environments...

tgq2915 yesterday at 10:53 PM

[flagged]

distalx today at 2:43 AM

This is either going to save hours… or create very educational outages.

sheepscreek today at 2:04 AM

Nice work detective Simon! I love these “discovery” posts the most because you can’t find this stuff anywhere.

randomtoast yesterday at 8:58 PM

Maybe soon we'll have single-use applications, where ChatGPT writes an app for you on the fly in a cloud sandbox, you interact with it in the browser to fulfill your goal, and afterwards the app is shut down and thrown away.

Fernicia yesterday at 9:49 PM

Has Gemini lost its ability to run JavaScript and Python? I swear it could when it was launched, but now it's saying it doesn't have the ability. Annoying regression when Claude and ChatGPT are so good at it.

jmacd yesterday at 8:44 PM

I wonder how long npm/pip etc. will even make sense.

Dependencies introduce unnecessary LOC and features which are, more and more, just written by LLMs themselves. It is easier to just write the necessary functionality directly. Whether that is more maintainable or not is a bit YMMV at this stage, but I would wager it is improving.

trolleski today at 9:43 AM

Wow, it can do what I could do 20 years back using Ctrl+T? The progress! Give them another 10 billion, scratch that, 20 billion, scratch that, 75 trillion. - Written by SarcastAI.

skybrian yesterday at 9:25 PM

Not sure if this is still working. I tried getting it to install cowsay and it ran into authentication issues. Does it work for other people?

pplonski86 today at 7:26 AM

Thank you for sharing. Is there a new container for each code run, or does it stay the same for the whole conversation?

carterschonwald yesterday at 10:55 PM

But… will GPT still get confused by the ellipses that its document viewer UI hack adds? Probably yes.

xnx yesterday at 9:46 PM

How much compute do you get in these containers? Could I have it run Whisper on an MP3 it downloads?

CSMastermind today at 1:44 AM

Thank God, this was extremely annoying

LowLevelKernel yesterday at 11:53 PM

Aren’t those ChatGPT’s internal MCP tools?

behnamoh yesterday at 8:58 PM

I wonder if the era of dynamic programming languages is over. Python/JS/Ruby/etc. were good tradeoffs when developer time mattered. But now that most code is written by LLMs, it's as "hard" for the LLM to write Python as it is to write Rust/Go (assuming enough training data on the language ofc; LLMs still can't write Gleam/Janet/CommonLisp/etc.).

Esp. with Go's quick compile time, I can see myself using it more and more even in my one-off scripts that would have used Python/Bash otherwise. Plus, I get a binary that I can port to other systems w/o problem.

Compiled is back?

jacquesm yesterday at 11:19 PM

How long before they'll be mining crypto?

blobbers yesterday at 11:33 PM

Did I miss the boat on ChatGPT? Is there something more to it than the web chat interface?

I jumped on the Claude Code bandwagon and dropped off ChatGPT.

I find the ChatGPT voice interface to be infuriating; it literally talks in circles and just spews summary garbage whenever I ask it anything remotely specific.

shevy-java today at 2:03 AM

And so it begins - Skynet 3.0.

syngrog66 today at 4:56 AM

ahhh... yet more things I've been able to do for decades already

nottorp yesterday at 9:37 PM

... as root?

bofadeez today at 4:58 AM

[flagged]

bandrami today at 2:13 AM

As an infosec guy I'm going to go ahead and buy a bigger house
