The objective of this page is to give the business a simple explanation (with few technical details) of all the steps of the modeling process.
Our original data contains both numerical and categorical features (region, product taxonomy features, ...).
A machine learning model, or a similarity distance computed between CPCs, requires numerical features only.
So we transform the categorical features into numerical ones by applying "Target Encoding", which replaces each category with the average value of the target observed for that category:
![]()
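
For readers who want to see the idea in code, here is a minimal sketch of target encoding using pandas. The column names (`region`, `target`), the helper `target_encode`, and the toy values are hypothetical and chosen only for illustration; they are not taken from the actual project data or pipeline.

```python
import pandas as pd


def target_encode(df: pd.DataFrame, column: str, target: str) -> pd.Series:
    """Replace each category in `column` by the mean of `target` for that category.

    Hypothetical helper for illustration, not the project's actual implementation.
    """
    # Mean of the target for each distinct category value
    category_means = df.groupby(column)[target].mean()
    # Map every row's category to its category mean
    return df[column].map(category_means)


# Toy data (invented for this example)
data = pd.DataFrame({
    "region": ["North", "South", "North", "East", "South", "North"],
    "target": [10.0, 20.0, 14.0, 30.0, 24.0, 12.0],
})

data["region_encoded"] = target_encode(data, "region", "target")
print(data)
```

In this toy example, every "North" row receives 12.0, the average of 10, 14 and 12. In practice, target encoding is often combined with smoothing or computed on separate data folds to limit overfitting on rare categories; this sketch leaves that out for simplicity.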