The activation function that should be used for the very last layer to get probabilities for each animal is the Softmax function. This is because the problem describes a multi-class classification scenario, where an image belongs to one and only one of 10 distinct animal categories, and the goal is to obtain a probability distribution across these mutually exclusive classes. The Softmax function takes a vector of ar....
Log in to view the answer