Are there studies that compare dropout with other regularization methods?
Are there any published papers that compare different regularization methods for neural networks, preferably across different domains (or at least different datasets)?
I am asking because I currently have the impression that most people use only dropout for regularization in computer vision. I would like to know whether there is a reason to use (or not to use) other regularization methods.
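To make concrete what such a comparison would look like in practice, here is a minimal sketch (assuming PyTorch) of the same small CNN trained once with dropout and once with L2 weight decay; the architecture and hyperparameters are illustrative, not taken from any particular paper.

```python
import torch
import torch.nn as nn


def make_cnn(dropout_p: float = 0.0) -> nn.Sequential:
    """Small CNN for 32x32 RGB inputs; dropout_p=0 disables dropout."""
    return nn.Sequential(
        nn.Conv2d(3, 16, kernel_size=3, padding=1),
        nn.ReLU(),
        nn.MaxPool2d(2),                      # -> 16 x 16 x 16
        nn.Conv2d(16, 32, kernel_size=3, padding=1),
        nn.ReLU(),
        nn.MaxPool2d(2),                      # -> 32 x 8 x 8
        nn.Flatten(),
        nn.Dropout(p=dropout_p),              # regularizer variant 1: dropout
        nn.Linear(32 * 8 * 8, 10),
    )


# Variant A: dropout only.
model_dropout = make_cnn(dropout_p=0.5)
opt_dropout = torch.optim.SGD(model_dropout.parameters(), lr=0.01)

# Variant B: no dropout, L2 weight decay instead (regularizer variant 2).
model_l2 = make_cnn(dropout_p=0.0)
opt_l2 = torch.optim.SGD(model_l2.parameters(), lr=0.01, weight_decay=5e-4)

# Train both variants on identical data splits and compare validation accuracy.
```

The papers I am looking for would essentially run this kind of controlled comparison at scale, across architectures and datasets.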
Topic dropout regularization convnet computer-vision neural-network
Category Data Science