logoalt Hacker News

philajantoday at 4:40 PM1 replyview on HN

This is neat. Have you noticed any difference in duplicate detection between strongly typed and loosely typed languages / code bases?


Replies

rkochanowskitoday at 5:42 PM

No. It depends the most on general code quality and architecture. Some implementations require more code similarity by design. Some languages, like Java, may tend to have more duplication, but it's only a theoretical guess. It also depends on what kind of software is developed with what language.

If you are interested in data, you can check my article. Analysis was done with this tool, but a previous version where exact-copy duplicates were excluded from analysis. https://rkochanowski.com/article/analysis-code-duplication/