Artificial intelligence is so rolling up in the field of image recognition. Classification of cats and dogs has long been out. Now the popular version is the "Lianlian" Plus, such as recognizing which model of sports car this year, or whether the bird's eyebrows are as thick as those of Lao Wang next door.
But the problem is that neural network is "smart" and is smart, but when it makes it clear, "Why should I say this is this?", it is a bit like a poor student being asked about the problem-solving ideas, and hesitating for a long time can't come up with it. The traditional Class Activation Map (CAM) is like putting a glowing aperture on the neural network's head, telling you "Well, it mainly depends on this part", but what exactly does it look at?
Why look at this place? When it encounters nuances of the "twins" level, it is stunned and points to a bunch of similar places and says, "Probably...it's here... maybe..."
At critical moments, there will always be heroes! The research tycoons at Ohio State University can't stand it anymore. They have created a magical tool - Finer-CAM . This thing is simply equipped with a high-definition night vision goggles + microscope for neural networks! Its core trick is ** "What do you look at? You look at differently! "
Traditional CAM is a single-handed battle, staring at the target; while Finer-CAM is a team PK, which will pull out the target categories and those that look like "Old Wang next door" and let them "face to battle" .
By calculating the differences between their prediction results, Finer-CAM can accurately identify those "rebellious" and unique characteristics and severely suppress those "popular faces". It feels like playing "Everyone is here to find fault". In the past, I used to point to a few places and say "I think it's here." Now with Finer-CAM, it can tell you: "Wrong! The real difference is this hair!"
When the Finer-CAM is released, it has a halo, and the functions are so bright that it makes people think of "Wow":
Such a fun and practical thing, of course you have to experience it together! The Imageomics team is quite powerful and directly released the source code of Finer-CAM and the Colab demonstration . You just need to move your fingers, install a gadget called grad-cam
, and then run the generate_cam.py
script they provide to generate "find faults" results, and then use visualize.py
to see the effect.
The emergence of Finer-CAM is like installing a more advanced image analysis system to neural networks, allowing them to see clearly and clearly when facing nuances.
In the future, let AI recognize things that "look exactly the same", and it can finally say confidently: "Hmph! I have long seen the difference between you two!" This technology not only improves the accuracy of image interpretation, but also gives us a deeper understanding of the decision-making process of AI.
Project: https://github.com/Imageomics/Finer-CAM
demo:https://colab.research.google.com/drive/1plLrL7vszVD5r71RGX3YOEXEBmITkT90