Weighted loss functions vs weighted sampling?
For image classification tasks, is there a practical difference between using weighted loss functions vs. using weighted sampling? (I would appreciate theoretical arguments, experience or published papers, anything really.)
Some details:
By "weighted sampling", I mean attributing different sampling probabilities for each sample in the training set.
By "weighted loss functions", I mean weighting error terms differently depending on the sample considered.