How does the margin constant (alpha) in the triplet loss affect the training process when it is a constant?
How does the margin constant in the triplet loss formula affect the gradient calculation when its derivative will be zero?
Topic gradient loss-function deep-learning
Category Data Science