This doesn't have API access yet, but OpenAI seem to approve of the Codex API backdoor used by OpenClaw these days... https://twitter.com/steipete/status/2046775849769148838 and https://twitter.com/romainhuet/status/2038699202834841962
And that backdoor API has GPT-5.5.
So here's a pelican: https://simonwillison.net/2026/Apr/23/gpt-5-5/#and-some-peli...
I used this new plugin for LLM: https://github.com/simonw/llm-openai-via-codex
UPDATE: I got a much better pelican by setting the reasoning effort to xhigh: https://gist.github.com/simonw/a6168e4165a258e4d664aeae8e602...
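For anyone who wants to reproduce this, here's a minimal Python sketch using LLM's Python API. The model ID and the reasoning_effort option name are my assumptions (check `llm models` and the plugin README for the real names); the prompt is the usual "Generate an SVG of a pelican riding a bicycle".

    import llm

    # Assumes the plugin is installed first: llm install llm-openai-via-codex
    # The model ID below is a guess; run `llm models` to see what the plugin registers.
    model = llm.get_model("gpt-5.5-codex")

    # Model options are passed as keyword arguments; "reasoning_effort" is an
    # assumed option name based on the xhigh result linked above.
    response = model.prompt(
        "Generate an SVG of a pelican riding a bicycle",
        reasoning_effort="xhigh",
    )

    # Save whatever SVG markup the model returns.
    with open("pelican.svg", "w") as f:
        f.write(response.text())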
That pelican you posted yesterday from a local model looks nicer than this one.
Edit: this one has crossed legs lol
Isn't it awful? After 5.5 versions it still can't draw a basic bike frame. How is the front wheel supposed to turn sideways?
The pelican doesn't really matter anymore, since models are tuned for it in the knowledge that people will ask.
So the pelican must have become a mandatory test case for all model providers to pass before launch.
I made pelicans at different thinking efforts:
https://hcker.news/pelican-low.svg
https://hcker.news/pelican-medium.svg
https://hcker.news/pelican-high.svg
https://hcker.news/pelican-xhigh.svg
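If you want to generate a set like that yourself, here's a rough sketch along the same lines (same caveats as above: the model ID and option name are assumptions, not confirmed):

    import llm

    model = llm.get_model("gpt-5.5-codex")  # hypothetical ID; run `llm models` to check

    # One SVG per reasoning effort level, saved as pelican-<effort>.svg
    for effort in ["low", "medium", "high", "xhigh"]:
        response = model.prompt(
            "Generate an SVG of a pelican riding a bicycle",
            reasoning_effort=effort,  # assumed option name
        )
        with open(f"pelican-{effort}.svg", "w") as f:
            f.write(response.text())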
Someone needs to make a pelican arena, I have no idea if these are considered good or not.
Is this direct API usage allowed by their terms? I remember Anthropic really not liking such usage.
It's amazing that the default did that much in just 39 "reasoning tokens" (no idea what a reasoning token is, but that's still shockingly few tokens).
Hmm. Any idea why it's so much worse than the other ones you have posted lately? Even the open weight local models were much better, like the Qwen one you posted yesterday.
I for one delight in bicycles where neither wheel can turn!
It continues to amaze me that these models, which definitely know what bicycle geometry actually looks like somewhere in their weights, produce such implausibly bad geometry.
It's also mildly interesting, and generally consistent with my experience with LLMs, that it produced the same obvious geometry issue both times.
What is your setup for drawing the pelican? Do you ask the model to check the generated image, find issues, and iterate on it, which would demonstrate the model's real abilities?
Thank you for continuing to post these! Very interesting benchmark.
Wait, I thought we were onto raccoons on e-scooters to avoid (some of) the issues with Goodhart's Law coming into play.
You know they are 1000% training these models to draw pelicans; this hasn't been a valid benchmark for 6+ months.
At some point, OpenAI is going to cheat and hardcode a pelican on a bicycle into the model. 3D modelling has Suzanne and the teapot; LLMs will have the pelican.
OpenAI hired the guy behind OpenClaw, so it makes sense that they’re more lenient towards its usage.