you are viewing a single comment's thread.

view the rest of the comments →

[–]neolib 1 insightful - 2 fun1 insightful - 1 fun2 insightful - 2 fun -  (3 children)

Do you plan to implement perceptual image hashing (https://en.wikipedia.org/wiki/Perceptual_hashing)? It would be possible to ban bad images by hash then (and also it's useful for finding alternative versions of images - of better quality for example).

[–][deleted] 1 insightful - 2 fun1 insightful - 1 fun2 insightful - 2 fun -  (2 children)

That's easy

[–]neolib 1 insightful - 1 fun1 insightful - 0 fun2 insightful - 1 fun -  (1 child)

PhotoDNA (which big websites use) is closed source though, and it's unclear (for me) whether open source ones from wiki article are good enough (https://www.phash.org & https://github.com/commonsmachinery/blockhash/).

[–][deleted] 3 insightful - 1 fun3 insightful - 0 fun4 insightful - 1 fun -  (0 children)

I have a C source file I can post. As I recall, the keywords are ahash, vhash, and ddhash. Or you can always go hacky and just downsample the colors to 5 bits and simhash the thing.