Here is a suggested operation book template:
https://docs.google.com/document/d/1xf3wWoQBgHKQtefaWi04Hx545rPB8HzQ
Procedure guide on how to operate the application
To run the incremental process, execute the Talend flow F001_Daily_flow, which is scheduled daily.
Then check in the Talend TMC that the job has completed.
Is the process the same ?
Also for restarting after an error ?
To restart after an error, re-run the incremental process. Because the process is incremental (and some tables are always reloaded in full), no cleanup or special recovery step is needed: simply run the process again.
How?
No pause
Who should be alerted ?
If an issue occurs, contact the business owner listed on this page.
How ?
To stop the daily flow, remove its trigger in the Talend TMC.
Who should be alerted ?
If an issue occurs, contact the business owner listed on this page.
How ?
No need.
What is the start trigger ? Event based ? Time based ?
Are there differences between DataPrep and DataApp ?
The trigger is time-based: the Talend flow is scheduled every day at 6 am CET.
For each brick, what is the expected output ?
The BigQuery tables of the project prj-dash-eln-prod are updated.
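One possible way to verify this output is to compare each table's last-modified timestamp with the most recent scheduled start. The sketch below is an illustration under stated assumptions, not part of the existing tooling: CET is treated as a fixed UTC+1 offset (daylight saving time is ignored), the helper names are hypothetical, and fetch_modified_times needs the google-cloud-bigquery package and valid credentials. A table flagged as stale is not necessarily an error, since an incrementally loaded table may simply have received no new data.

```python
from datetime import datetime, time, timedelta, timezone

# Assumption: CET is modelled as a fixed UTC+1 offset (no daylight saving time).
CET = timezone(timedelta(hours=1))
SCHEDULED_START = time(6, 0)  # daily start time stated in this document


def last_scheduled_run(now: datetime) -> datetime:
    """Return the most recent 06:00 CET scheduled start at or before `now`."""
    local_now = now.astimezone(CET)
    candidate = datetime.combine(local_now.date(), SCHEDULED_START, tzinfo=CET)
    if candidate > local_now:
        candidate -= timedelta(days=1)
    return candidate


def stale_tables(modified_by_table: dict[str, datetime], now: datetime) -> list[str]:
    """Return sorted names of tables last modified before the latest scheduled run."""
    cutoff = last_scheduled_run(now)
    return sorted(name for name, modified in modified_by_table.items() if modified < cutoff)


def fetch_modified_times(project: str = "prj-dash-eln-prod") -> dict[str, datetime]:
    """Collect the last-modified timestamp of every table in the project."""
    # Imported lazily so the pure helpers above work without the package installed.
    from google.cloud import bigquery

    client = bigquery.Client(project=project)
    result = {}
    for dataset in client.list_datasets():
        for item in client.list_tables(dataset.reference):
            table = client.get_table(item.reference)
            result[f"{table.dataset_id}.{table.table_id}"] = table.modified
    return result
```

A typical call would be stale_tables(fetch_modified_times(), datetime.now(timezone.utc)); an empty list means every table was modified since the last scheduled start.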
When is the time frame to intervene ? (when downtime is acceptable or scheduled)
The project is not critical, so you can intervene at any time.
Where and how can we see the application status (Stopped, waiting, running, etc) ?
Check the job status in the Talend TMC.
Where is the history of run actions ?
What form does it take ? Logs ?
Memory / Disk / CPU used by application
According to operational requirements, detail application metrics (Processed Volume, Process duration, ...)
Where to find each step logs ?
As a general guideline, the application should stop as soon as possible.
- Contacts
- Meaningful message (timestamps, description, criticality)
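As an illustration of what a meaningful message could contain, the sketch below formats an error report with a UTC timestamp, a criticality level, a description, and a contact. The ErrorReport class, its field names, and the criticality values are hypothetical and not taken from the application; adapt them to the alerting channel actually in use.

```python
from dataclasses import dataclass
from datetime import datetime, timezone


@dataclass
class ErrorReport:
    job: str
    description: str
    criticality: str  # hypothetical scale, e.g. "LOW", "MEDIUM", "HIGH"
    contact: str
    occurred_at: datetime  # must be timezone-aware

    def format(self) -> str:
        """Render the report as a single log-friendly line with a UTC timestamp."""
        ts = self.occurred_at.astimezone(timezone.utc).strftime("%Y-%m-%dT%H:%M:%SZ")
        return f"[{ts}] [{self.criticality}] {self.job}: {self.description} (contact: {self.contact})"


report = ErrorReport(
    job="F001_Daily_flow",
    description="BigQuery load failed",  # example text only
    criticality="LOW",
    contact="business owner",
    occurred_at=datetime(2024, 1, 10, 5, 12, 0, tzinfo=timezone.utc),
)
print(report.format())
# [2024-01-10T05:12:00Z] [LOW] F001_Daily_flow: BigQuery load failed (contact: business owner)
```

Keeping the timestamp in UTC avoids ambiguity when the message is read alongside logs from systems in other time zones.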
Detail the procedure for specific error cases