logoalt Hacker News

Aeoluntoday at 4:45 PM5 repliesview on HN

I would not use Claude to get a second opinion on anything that’s an image.


Replies

rmbyrrotoday at 6:07 PM

I agree with you for some kinds of images, but not all.

LLMs are the best PDF-to-markdown converters, in my experience. I have a CLI that converts PDF to PNG, then run a background agent to "read" each PNG and write it down as markdown; it works flawlessly even for complex math formulas, it can "translate" complex charts, graphs, and tables into words.

It's slow and arguably expensive compared to traditional OCR, but very effective and precise.

maxall4today at 4:48 PM

Especially an MRI which is a 3D medium —something current LLMs are very bad at.

show 2 replies
yolo3000today at 4:51 PM

I used it on an ankle fracture xray, it was quite useful to make sense of things. But not like a 2nd opinion.

behnamohtoday at 4:59 PM

What's wrong with Claude? I've asked it to analyze images and even Opus 4 would perfect nail it.

show 2 replies