Along the adjustment steps described in the previous page, we have other post-processing strategies to tune the results based on the needs of the different product families.
Hard boundaries
A hard-boundary is a feature for which if the value differs between a target CPC and a comparable, we filter out the comparable from the set.
...
- Note: We should aim to add as few hard boundaries as possible. Adding too many hard boundaries is not recommended because it impacts the number of CPCs in the comparable set as well as the coverage.
Inverse boundaries :
Inverse boundaries are the opposite as hard boundaries.
We currently use it on shipto_code and soldto_code to not compare with CPC on different products of the same customer.
Volume boundaries :
A threshold can be set at a product family level to limit the comparison of CPC having a too big difference in volumes.
...
If a threshold of 10 is used, it means that a CPC will only be compared with others in a range of 10x its volume, be it lesser or greater. (e.g. a CPC with a volume of 1 000 can only be compared to CPCs with a volume between 100 and 10 000).
Similarity threshold
To define a similarity threshold, we set a matching percentage for certain features that must be identical between comparables.
...