Procedures
Here is a suggested operation book template:
https://docs.google.com/document/d/1xf3wWoQBgHKQtefaWi04Hx545rPB8HzQ
Procedure guide on how to operate the application
DataPrep Flow
Start
Incremental process
How to start from a pre-deployed and running app: verify that all tables and views have been created, then execute the plan.
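The pre-start check can be automated. The following is a minimal sketch only, assuming a SQLite database for illustration; the object names (fact_sales, v_sales, dim_customer) and the helper missing_objects are hypothetical, and the catalog query must be adapted to the actual database engine.

```python
import sqlite3

def missing_objects(conn, expected):
    """Return the expected table/view names that are absent from the database."""
    rows = conn.execute(
        "SELECT name FROM sqlite_master WHERE type IN ('table', 'view')"
    ).fetchall()
    present = {name.lower() for (name,) in rows}
    return sorted(n for n in expected if n.lower() not in present)

# Demo with an in-memory database and hypothetical object names.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE fact_sales (id INTEGER, amount REAL)")
conn.execute("CREATE VIEW v_sales AS SELECT id, amount FROM fact_sales")

missing = missing_objects(conn, ["fact_sales", "v_sales", "dim_customer"])
# Only execute the plan when nothing is missing.
ready_to_execute = not missing
print(missing, ready_to_execute)
```

In this demo dim_customer was never created, so the check reports it as missing and the plan should not be executed yet.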
Termination
How to verify that the application's process has terminated: check that the flow shows as completed in the TMC.
Restart
Is the process the same as for a start? Yes.
Does the same apply when restarting after an error? If the job failed with an error, delete the data from the FACT tables before restarting.
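The cleanup before a restart after an error could look like the sketch below. It assumes a SQLite database for illustration; the table names (fact_sales, fact_orders) and the helper purge_fact_tables are hypothetical, and the real list of FACT tables must come from the application's data model.

```python
import sqlite3

def purge_fact_tables(conn, fact_tables):
    """Delete all rows from the given fact tables in a single transaction.

    Returns the number of rows that each table held before the purge.
    """
    # Table names cannot be bound as SQL parameters, so validate them first.
    for table in fact_tables:
        if not table.isidentifier():
            raise ValueError(f"unsafe table name: {table!r}")
    deleted = {}
    with conn:  # commits on success, rolls back if any statement fails
        for table in fact_tables:
            count = conn.execute(f"SELECT COUNT(*) FROM {table}").fetchone()[0]
            conn.execute(f"DELETE FROM {table}")
            deleted[table] = count
    return deleted

# Demo with an in-memory database and hypothetical fact tables.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE fact_sales (id INTEGER)")
conn.execute("CREATE TABLE fact_orders (id INTEGER)")
conn.executemany("INSERT INTO fact_sales VALUES (?)", [(1,), (2,)])
conn.execute("INSERT INTO fact_orders VALUES (1)")
conn.commit()

deleted = purge_fact_tables(conn, ["fact_sales", "fact_orders"])
remaining = conn.execute("SELECT COUNT(*) FROM fact_sales").fetchone()[0]
print(deleted, remaining)
```

Running the deletes in one transaction means a failure part-way through leaves every FACT table untouched, so the cleanup can simply be retried.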
Pause
Procedure
How? Stop the flow in the TMC.
Alert contacts
Who should be alerted? Contacts are defined on this page.
Stop
Procedure
How? Stop the flow in the TMC.
Alert contacts
Who should be alerted? Contacts are defined on this page.
Reset
How?
Scheduling
Trigger
What is the start trigger? Is it event-based or time-based?
Are there differences between DataPrep and DataApp?
Expected results
For each brick, what is the expected output?
Intervention
What is the time frame for intervention (when downtime is acceptable or scheduled)?
Monitoring
Runtime
Where and how can we see the application status (stopped, waiting, running, etc.)?
Run history
Where is the history of run actions recorded?
What form does it take (logs, for example)?
Resources
Memory, disk, and CPU used by the application
Additional metrics
According to operational requirements, detail the application metrics to collect (processed volume, process duration, etc.).
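As an illustration of the two metrics named above, here is a minimal sketch that records processed volume and process duration for one step. The RunMetrics class, the run_step helper, and the step name load_fact are hypothetical; a real application would report these values through its own monitoring stack.

```python
import time
from dataclasses import dataclass

@dataclass
class RunMetrics:
    step: str
    processed_rows: int = 0
    duration_seconds: float = 0.0

def run_step(step, batches):
    """Process batches for one step and record volume and duration."""
    metrics = RunMetrics(step=step)
    started = time.perf_counter()
    for batch in batches:
        # Real processing would happen here; this sketch only counts rows.
        metrics.processed_rows += len(batch)
    metrics.duration_seconds = time.perf_counter() - started
    return metrics

# Demo: two batches of 3 and 2 rows for a hypothetical step.
metrics = run_step("load_fact", [[1, 2, 3], [4, 5]])
print(metrics.step, metrics.processed_rows)
```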
Logging
Where can the logs for each step be found?
Error handling
As a general guideline, the application should stop as soon as possible when an error occurs.
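One way to apply this guideline is a fail-fast runner that stops at the first failing step and reports where it stopped. The sketch below is illustrative only; the run_flow helper and the step names are hypothetical and do not describe how the flow is actually orchestrated.

```python
def run_flow(steps):
    """Run steps in order; stop at the first failure and report where."""
    completed = []
    for name, action in steps:
        try:
            action()
        except Exception as exc:
            # Stop immediately: later steps are not attempted.
            return {
                "status": "FAILED",
                "failed_step": name,
                "error": str(exc),
                "completed": completed,
            }
        completed.append(name)
    return {
        "status": "COMPLETED",
        "failed_step": None,
        "error": None,
        "completed": completed,
    }

def failing_step():
    raise RuntimeError("source file not found")

calls = []
result = run_flow([
    ("extract", lambda: calls.append("extract")),
    ("transform", failing_step),
    ("load", lambda: calls.append("load")),
])
print(result["status"], result["failed_step"], calls)
```

Because the runner returns at the first exception, the load step never runs in this demo, which limits the amount of data to clean up before a restart.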
Alerts
- Contacts
- Meaningful message (timestamps, description, criticality)
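A meaningful alert message carrying the three elements listed above could be built as in this sketch. The Alert class, the build_alert helper, and the criticality levels (INFO, WARNING, CRITICAL) are assumptions for illustration, not a format prescribed by this page.

```python
from dataclasses import dataclass
from datetime import datetime, timezone

# Hypothetical criticality levels; adapt to the team's own scale.
CRITICALITIES = ("INFO", "WARNING", "CRITICAL")

@dataclass(frozen=True)
class Alert:
    timestamp: datetime
    criticality: str
    description: str

    def render(self):
        """Format the alert as a single line with an ISO 8601 timestamp."""
        return f"[{self.timestamp.isoformat()}] {self.criticality}: {self.description}"

def build_alert(criticality, description, now=None):
    if criticality not in CRITICALITIES:
        raise ValueError(f"unknown criticality: {criticality!r}")
    # Use UTC so that timestamps are comparable across systems.
    timestamp = now or datetime.now(timezone.utc)
    return Alert(timestamp, criticality, description)

alert = build_alert(
    "CRITICAL",
    "DataPrep flow failed while loading FACT tables",
    now=datetime(2024, 1, 15, 8, 30, tzinfo=timezone.utc),
)
message = alert.render()
print(message)
# prints: [2024-01-15T08:30:00+00:00] CRITICAL: DataPrep flow failed while loading FACT tables
```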
Specificity
Detail the procedure for specific error cases.