Avoiding Data Leakage in Machine Learning

To properly evaluate a machine learning model, the available data must be split into training and test subsets. Data leakage occurs when information about the test set inappropriately influences the training or evaluation of the model. This causes us to overestimate the model's performance. We will detail a …
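
As a rough illustration of the definition above, here is a minimal sketch of one common way test-set information can slip into training: fitting a preprocessing step (here, feature standardization) on all rows before splitting. It uses only the Python standard library; the helper names `train_test_split`, `fit_scaler`, and `standardize`, as well as the toy data, are hypothetical and chosen just for this example.

```python
import random
import statistics


def train_test_split(rows, test_fraction=0.25, seed=0):
    """Shuffle a copy of rows and split it into (train, test)."""
    rng = random.Random(seed)
    shuffled = list(rows)
    rng.shuffle(shuffled)
    n_test = int(len(shuffled) * test_fraction)
    return shuffled[n_test:], shuffled[:n_test]


def fit_scaler(values):
    """Return the (mean, standard deviation) used to standardize a feature."""
    return statistics.mean(values), statistics.pstdev(values)


def standardize(values, mean, std):
    """Apply previously fitted statistics to a list of values."""
    return [(v - mean) / std for v in values]


# Toy one-feature data set (hypothetical values, for illustration only).
rng = random.Random(42)
data = [rng.gauss(10.0, 3.0) for _ in range(200)]

train, test = train_test_split(data)

# Leaky: the scaling statistics are computed on all rows, test rows included,
# so the test set has already influenced what the model will see in training.
leaky_mean, leaky_std = fit_scaler(data)

# Leak-free: the statistics come from the training rows only
# and are then reused unchanged on the test rows.
clean_mean, clean_std = fit_scaler(train)
train_scaled = standardize(train, clean_mean, clean_std)
test_scaled = standardize(test, clean_mean, clean_std)
```

The point of the sketch is the ordering: split first, fit any data-dependent step on the training subset alone, then apply it as-is to the test subset. The same principle would apply to other fitted steps such as imputation or feature selection.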