The primary hyperparameter used to prevent overfitting in Gradient Boosting Machines is the learning rate, often referred to as shrinkage. Gradient boosting builds an ensemble by adding decision trees sequentially, where each new tree attempts to correct the errors made by the combination of all previous trees. If the model lear....
Log in to view the answer