logoalt Hacker News

jannyfertoday at 4:23 AM4 repliesview on HN

That was fun, and the script on github looks hand-written which is refreshing after having been reading AI-written code for months.

I have 120k photos in iCloud that I'm sure have duplicates (I exported my library to Google Photos years ago and exported it back to iCloud). The iOS duplicate detection stopped flagging duplicates for me to merge a while back. I gotta do something like this script...


Replies

xenophenestoday at 5:52 PM

yeah there's something you can still feel in hand-written code - the variable names show how someone was actually thinking about the problem. AI code is grammatically perfect but quite hollow

mceachentoday at 2:23 PM

If you're really wanting to do perceptual hash based deduplication, use multiple, heterogeneous hash algorithms (phash, dct hash, mean hash, ...) as it is likely that a given hash algo will happily lossily match with very very different images--but if all hashes match, you're much less likely to have false positives.

I wrote up what I do here: https://photostructure.com/guide/what-do-you-mean-by-dedupli...

Aboutplantstoday at 10:43 AM

Ah yes, artisanal code!

RetroTechietoday at 9:15 AM

> and the script on github looks hand-written which is refreshing after having been reading AI-written code for months.

We really need a short for "is it AI or not? has entered the discussion".