> But I guess it's about why so we associate those with spiky shapes, though surely it's because they represent sharp immediate changes in frequency?
Sure, but it's a very abstract connection between objects being sharp in vision and frequencies changing sharply in hearing. There's no guarantee any given organism would make the connection.
I don't think it's abstract at all. Rub something sharp (anything from a stick to a phonograph needle) on an object and you'll directly transcribe its spatial frequency spectrum into an audio frequency spectrum.
In the book „the design of everyday things“ it is mentioned that „natural mappings“ exist. Moving the knob of a vertical slider to the upper end universally means „brighter“ or „louder“, not „less bright“ or „more silent“.