logoalt Hacker News

6ak74rfyyesterday at 12:50 AM1 replyview on HN

I too was thinking about something like this a few months ago. There were couple of reasons I didn't pursue the idea. One, the image generation AI wasn't reliable enough. Like, I couldn't get it to generate 2 images where the characters looked consistent, let alone a book worth of images. Two, the margins were quite small, so didn't seem like a viable business.

Wondering if you've thought about such things and your perspective.


Replies

storystarlingyesterday at 7:09 AM

Character consistency was the hardest problem, and honestly what took the longest to get right. We use reference images as style anchors, run multiple generation passes, and have an LLM "critic" that checks for visual inconsistencies and triggers regeneration when needed. It's not perfect but it's gotten to the point where parents are happy with the results.

On margins - tight but workable.