CPC definition
The term "CPC" appears often in the following documentation because it is the main level of granularity used throughout the application. Most of our datasets therefore have one record per CPC for a given GBU.
...
The rule used to define the unit price is as follows:
- Average of volume-weighted prices for the last 3 months with sales.
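As an illustration only, the rule above could be sketched as follows. This is not the production implementation: the field names `month`, `sales` and `volume` are hypothetical, and the sketch assumes that "volume-weighted price" means total sales divided by total volume over the three most recent months that actually have sales. If the intended rule is instead a simple mean of monthly prices, the last two lines of the function would change accordingly.

```python
from typing import Dict, List, Optional


def unit_price(monthly: List[Dict], n_months: int = 3) -> Optional[float]:
    """Volume-weighted average price over the last n_months that have sales.

    Each item uses hypothetical keys: "month" (sortable string such as
    "2023-04"), "sales" (revenue) and "volume" (quantity sold).
    """
    # Keep only months with actual sales, most recent first.
    with_sales = sorted(
        (m for m in monthly if m["volume"] > 0 and m["sales"] > 0),
        key=lambda m: m["month"],
        reverse=True,
    )[:n_months]
    if not with_sales:
        return None  # no sales history: the price cannot be computed
    total_sales = sum(m["sales"] for m in with_sales)
    total_volume = sum(m["volume"] for m in with_sales)
    return total_sales / total_volume


history = [
    {"month": "2022-12", "sales": 999.0, "volume": 1.0},   # older than the 3 retained months
    {"month": "2023-01", "sales": 100.0, "volume": 10.0},
    {"month": "2023-02", "sales": 0.0, "volume": 0.0},     # no sales: skipped
    {"month": "2023-03", "sales": 240.0, "volume": 20.0},
    {"month": "2023-04", "sales": 0.0, "volume": 0.0},     # no sales: skipped
    {"month": "2023-05", "sales": 330.0, "volume": 30.0},
]
price = unit_price(history)  # uses 2023-05, 2023-03 and 2023-01
```

Months without sales are skipped rather than counted as zero, so a CPC that sells irregularly still gets a price based on its three latest selling months.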
...
These price drivers come from several data sources, described below.
Forecasts and historical data
The main data source we currently use is the Pricing Data Lake in BigQuery, in particular the following two datasets:
- V_FACT_sales_forecast_enriched_current: Forecast data.
- V_FACT_sales_history_cpc_last12months: Historical data for the past 12 months.
These datasets include:
- Sales and volume measures, which are also used to compute the unit price that serves as the target for our models (see the dedicated section above).
- Dimensions used as features / price drivers (see the dedicated section above).
[Novecare] Detailed
...
processing steps
[SpP] Detailed
...
processing steps
Manual input
All manual data sources are gathered in a spreadsheet specific to each GBU, under the responsibility of the business.
For Novecare, the following GSheet is used to process the manual inputs for product groupings, manual regions and manufacturing plant groups.
For SpP, the following GSheet is used to process the manual inputs for product taxonomy, and this one for manual regions.
Update the inputs directly from the GSheet (Novecare only)
Follow these steps to use this manual process:
- The user clicks the link button displayed in the "README" tab to trigger an update of the file content (this takes a few seconds and is finished as soon as the button reappears).
- This launches a scenario in Dataiku that retrieves the data from the latest run and writes it to the corresponding tabs of the GSheet.
Let's take the example of the "Regions" tab:
We can see that new records have appeared for countries that do not yet have a manual_region value for the given product_family_h4.
- The user can now enter the right manual_region in the green background column or update an existing value. Note: The original_ prefixed columns always display the current value, to help users revert changes if needed.
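To make the "Regions" example more concrete, here is a minimal sketch of how such a tab could be assembled. It is not the actual Dataiku scenario: the function name `build_regions_tab`, the `country` column name and the in-memory data structures are assumptions made for illustration. Only `manual_region`, `product_family_h4` and the `original_` prefix come from the description above.

```python
from typing import Dict, List, Tuple


def build_regions_tab(
    latest_run: List[Tuple[str, str]],
    manual_regions: Dict[Tuple[str, str], str],
) -> List[Dict[str, str]]:
    """Return one row per (product_family_h4, country) seen in the latest run.

    latest_run: (product_family_h4, country) pairs found in the latest data.
    manual_regions: existing manual mapping, keyed by the same pair.
    """
    rows = []
    for product_family_h4, country in sorted(set(latest_run)):
        # Empty string when no manual region has been entered yet.
        current = manual_regions.get((product_family_h4, country), "")
        rows.append({
            "product_family_h4": product_family_h4,
            "country": country,
            "original_manual_region": current,  # reference copy of the current value
            "manual_region": current,           # editable (green) column
        })
    return rows


rows = build_regions_tab(
    latest_run=[("FAM_A", "FR"), ("FAM_A", "BR"), ("FAM_B", "FR")],
    manual_regions={("FAM_A", "FR"): "EMEA"},
)
# Rows the user still has to fill in: no manual_region yet.
missing = [r for r in rows if not r["manual_region"]]
```

Keeping the current value in a separate `original_` column means an edited `manual_region` can always be compared with, or reset to, the value it had before the change.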

