High Level Project Architecture
Here is a the link of the data architecture schema.
Architecture Data Flow
Here is a suggested template for Data Model + Data Mapping :
https://docs.google.com/spreadsheets/d/1bD8AIgsNUI2sgANoOEKTuHBlkxhsNVTD8cmOPYEloLw
DataPrep Flow
Schema showing the different STEPS of the application flow - with the data involved at each step
Steps descriptions
Describe the data and process involved at each step
DataSource 1
Description
What is it ?
Tools
The tool used
Access rights
Is there any credentials used ? Where are they stored ?
Source
Location
Where is this data collected ?
Format
The format of the source data
Destination
Location
Where is this data stored ?
Format
The format of the data saved in the databank
Sizing
Expected data volume for :
- full process
- incremental process
Assessment
How to validate that the generated output is valid
Scheduling
Is there an automatic schedule ? At what frequency ? What is the trigger ?
Timing
The average time expected for :
- full process
- incremental process
Criticality
High / Medium / Low
Logging
Logging location
DataSource 2
SAME QUESTIONS