In the project, we use several layers of scenarios.


805There are a few things to know when using scenarios:

For clarity, only scenarios with specific behavior will be listed below.
The only automated scenario used in the production environment. It will launch the other scenarios iteratively from the extraction of input data to the output data storage for front-ends.
This scenario should have an active trigger on both the MASTER project in design and the automation project (refer to the technical architecture here if these terms are not familiar).

The trigger will be SQL based but the condition of reload will be different from one GBU to another (based on historical date change for SpP and forecast date change for Novecare).
The first custom step is used to update the version_name of the project automatically, this allows us to make sure the MASTER and prod project versions are always correct and different from the versions used in dev environments.
Contains a custom python step at the end to update the "version_date" variable based on the latest data available (either in forecasts or historical also, depending on the GBU).
This is essential to have the proper "version_data_date" displayed in the final outputs and to make sure we only keep the latest run of each month for a version in case we have several (reruns in prod due to errors or bad data for example).
Once in a while ( recommended every 6 months or so), the model will have to be re-optimized for each product family. This is the purpose of this scenario.
It contains a first custom Python step to enable the run_cross_val variable which is normally set to False for the monthly runs. Then, the Optimized_hyperparams dataset is executed and another custom Python step sets back the run_cross_val variable to False.