What is the derivative of negative log MLE (MLE used as a negative cost) when the variables are passed through softmax activations? 19.03.2019