TL; dr: This part explains what makes one classifier better than other in MSD-Net Having laid the foundations in Part 1 (link), let’s get straight into the meat of this here. We use the Cifar-10 dataset for experimentation which is an image classification dataset made up of 60K color images with a resolution of 32x32. Note that the low resolution can even make it tough for us, humans to classify these images. First, for reference, Fig 1 plots the global accuracies of each classifier on the CIFAR-10 dataset. The performance of classifiers generally improves as the number of blocks increase. This makes sense as an increase in the number of blocks, also means more parameters. Although, the best classifier on average is B6 (after six blocks). This is possibly because as the number of blocks increase, overfitting becomes more prominent. Now let’s see some images of dogs —- visualized with Grad-CAM!
————————————————– Fig 2 —————————————————-
In Fig 2, we have several images of dogs that are correctly/incorrectly classified by different MSD-Net classifiers. Some of these samples are more difficult than the others. This can be estimated by the number of classifiers that correctly classify the given image, mentioned in the brackets in the first column. In the paper, the authors instead learn to predict a confidence value corresponding to each prediction which is a more principled way to do it. The key points to note here as follows: ————————————– ** Table 1**———————————————————-
————————————————– Fig 3 —————————————————- Although it is easy to judge the performance of these models otherwise using their prediction scores, for unsupervised models such as GANs and VAEs, these visualization techniques can potentially lead to automated evaluation techniques which remain the holy grail. With this note, I’ll conclude the 2-part series. Hope it would have helped you in getting an idea of qualitatively understanding the performance of models, and through the previous post, how to localize model attention.Visualizing MSD-Net with Grad-CAM Part-2
————————————————– Fig 1 —————————————————-
B1
B2
B3
B4
B5
B6
B7
B8
B9
B10
0.939
0.941
0.960
0.956
0.966
0.966
0.966
0.962
0.958
0.958