Note: Please create a similar document for each project that the target project depends on.
Data Flow: -
Upload the architecture diagram of the project (data flow).
List all upstream and downstream dependencies.
- Upstream Dependencies
  - Upstream Dependency 1
    - Details about the dependency
      - How is the data generated?
      - What is the source of the data?
      - What is the output of the data?
      - How frequently is the data loaded to the output?
      - Mention the point of contact/team to reach out to in case of any issue.
      - Dataset 1 (the input source to Dataiku)
        - List any intermediate data transformations.
        - Give details about each data transformation (e.g. function, calculation).
      - Dataset 2
        - Mention the same details as for Dataset 1.
      - Dependent scenario
        - Add the name of the scenario that is triggered by the upstream dependency, and mention its trigger time.
        - Mention whether the dependent scenario trigger is required by the Dataiku team.
  - Upstream Dependency 2
    - Mention the same details for any other upstream dependency.
- Downstream Dependencies
  - Downstream Dependency 1
    - Details about the dependency
      - Where is the data loaded (e.g. a data source, or a platform such as Talend or Tableau)?
      - How frequently should the data be loaded?
      - Does any additional reporter need to be added (i.e. who will receive the email on success or failure)?
      - Number of users impacted.
      - Mention the point of contact/team to reach out to in case of any issue.
      - Dataset 1
        - Mention any quality requirements or specific details that need to be maintained or checked.
        - Potential issues/concerns in case of data loss.
        - Mention any backup/rollback options available.
      - Dataset 2
        - Mention the same details as for Dataset 1 if any other dataset is connected.
  - Downstream Dependency 2
    - Mention the same details for any other downstream dependency.
Procedures: -
- Project name
- URL
- Short description of the project (overview).
- Diagram of the complete flow of the project, covering all flow zones.
- List the number of zones.
  - Zone 1
  - Zone 2
  - Zone 3
- List the number of recipes in the project.
  - Number of code recipes
    - Number of SQL recipes
    - Number of Python recipes
  - Number of visual recipes
Provide a table with the details of all recipes:

| Recipe Name     | Recipe Type                     | Environment Name  |
|-----------------|---------------------------------|-------------------|
| Visual recipe 1 | Recipe type 1 (e.g. Sync, Join) | Visual recipe     |
| Visual recipe 2 | Recipe type 1 (e.g. Sync, Join) | Visual recipe     |
| SQL recipe 1    | SQL                             | SQL recipe        |
| Python recipe 1 | Python                          | Python env name 1 |
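The recipe counts and table rows above could be filled in from an exported list of recipes instead of by hand. The following is a minimal sketch, assuming each recipe is available as a dict with `name` and `type` keys. The `CODE_TYPES` set, the type strings in it, and the sample recipe names are hypothetical placeholders, not a confirmed list of Dataiku identifiers.

```python
from collections import Counter

# Assumption: type strings that should count as code recipes.
# These values are illustrative; replace them with the ones in your export.
CODE_TYPES = {"python", "sql_query"}


def summarize_recipes(recipes):
    """Return total, code, and visual recipe counts plus a per-type breakdown."""
    by_type = Counter(recipe["type"] for recipe in recipes)
    code = sum(count for rtype, count in by_type.items() if rtype in CODE_TYPES)
    return {
        "total": len(recipes),
        "code": code,
        "visual": len(recipes) - code,
        "by_type": dict(by_type),
    }


# Hypothetical inventory used only to demonstrate the function.
sample_recipes = [
    {"name": "sync_orders", "type": "sync"},
    {"name": "join_customers", "type": "join"},
    {"name": "clean_orders", "type": "sql_query"},
    {"name": "score_orders", "type": "python"},
]
summary = summarize_recipes(sample_recipes)
print(summary)
```

Anything not listed in `CODE_TYPES` is counted as a visual recipe here, so the set should be reviewed against the recipe types actually present in the project.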
- List the number of datasets in the project.
List all the datasets and their details:

| Dataset Name | Connection Type             | Connection Name | Dataset Type                                            |
|--------------|-----------------------------|-----------------|---------------------------------------------------------|
| Dataset 1    | Connection type (e.g. GCS, BQ) | Connection name | Dataset type (e.g. Base/Input/Source, Transformation, Output) |
| Dataset 2    | Connection type (e.g. GCS, BQ) | Connection name | Dataset type (e.g. Base/Input/Source, Transformation, Output) |
| Dataset 3    | Connection type (e.g. GCS, BQ) | Connection name | Dataset type (e.g. Base/Input/Source, Transformation, Output) |
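If the dataset details are already available as structured data, the table above can be generated rather than typed. This is a minimal sketch, assuming a hand-maintained list of dicts; the keys `name`, `connection_type`, `connection`, and `role`, the helper `render_table`, and the sample values are all hypothetical.

```python
def render_table(headers, rows):
    """Render rows as a pipe-delimited plain-text table with padded columns."""
    # Column width is the longest cell in that column, header included.
    widths = [
        max(len(str(cell)) for cell in column)
        for column in zip(headers, *rows)
    ]

    def fmt(cells):
        padded = (str(cell).ljust(width) for cell, width in zip(cells, widths))
        return "| " + " | ".join(padded) + " |"

    separator = "|" + "|".join("-" * (width + 2) for width in widths) + "|"
    return "\n".join([fmt(headers), separator] + [fmt(row) for row in rows])


# Hypothetical dataset inventory used only to demonstrate the helper.
datasets = [
    {"name": "orders_raw", "connection_type": "GCS",
     "connection": "gcs_landing", "role": "Input"},
    {"name": "orders_clean", "connection_type": "BQ",
     "connection": "bq_analytics", "role": "Output"},
]
headers = ["Dataset Name", "Connection Type", "Connection Name", "Dataset Type"]
rows = [
    [d["name"], d["connection_type"], d["connection"], d["role"]]
    for d in datasets
]
table = render_table(headers, rows)
print(table)
```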
- Description of all the zones
  - Zone 1
    - Snapshot of Zone 1.
    - List all sources to the zone (e.g. dataset names).
    - Short description of how and where the source data is generated.
    - Mention any queries used in the datasets.
    - Provide a detailed description of the data transformation and data flow in the zone.
    - Mention each recipe and its problem statement.
    - Mention the queries used in the code recipes (e.g. BW queries, SQL queries).
    - Where is the destination/output dataset used?
    - Mention whether any dataset is loaded in append mode, and whether the dataset is backed up.
  - Zone 2
    - Provide the same description as for Zone 1 for each additional zone.
- Details about libraries
  - Library 1
    - Mention whether any Python code is available in the library.
    - Detailed description of the Python code.
    - Mention whether maintenance is needed from the DataOps team.
    - Mention the changes that need to be made.
    - Where is this library used?
  - Library 2
    - Mention the same details for any other library.
- Details about the API
  - Mention whether any API was created in the API Designer.
  - Short description of the API.
  - Mention any maintenance/code changes needed per instance (e.g. URL change, variable change).
- Project/scenario variables
  - Mention whether any variables are stored in the project variables.
  - Mention whether any data needs to be maintained in variables.
  - Purpose of each variable and how it is used.
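To document the variables listed above, their current values can be flattened into one row per variable. This is a minimal sketch, assuming the variables have been exported as a mapping with `standard` and `local` sections; that layout, the helper `variable_rows`, and the sample names and values are assumptions for illustration only.

```python
import json


def variable_rows(variables):
    """Flatten a {'standard': {...}, 'local': {...}} mapping into (scope, name, value) rows."""
    rows = []
    for scope in ("standard", "local"):
        # Sort by name so the documented order is stable between exports.
        for name, value in sorted(variables.get(scope, {}).items()):
            rows.append((scope, name, json.dumps(value)))
    return rows


# Hypothetical variables used only to demonstrate the helper.
sample_variables = {
    "standard": {"target_table": "orders_clean", "lookback_days": 7},
    "local": {"api_base_url": "https://example.invalid/api"},
}
rows = variable_rows(sample_variables)
for scope, name, value in rows:
    print(f"{scope}\t{name}\t{value}")
```

Values are serialized with `json.dumps` so that strings, numbers, and nested structures are all shown unambiguously. The purpose of each variable still has to be written by hand.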
- Details about webapps/dashboards
  - Webapp/Dashboard 1
    - Description of the webapp/dashboard.
    - List of datasets connected to the webapp/dashboard.
    - Mention whether any scenario/job updates or loads the webapp/dashboard.
    - How frequently the webapp/dashboard is to be updated.
    - etc.
  - Webapp/Dashboard 2
    - Mention the same details as for Webapp/Dashboard 1 for any other webapp/dashboard.
- Plugin information
  - Mention whether any plugin is used.
  - Detailed description of the plugin.
  - Mention whether the plugin was installed from the Dataiku plugin store or developed in-house.
  - Mention the environment details if it was developed in-house.
- Resource consumption details
  - Memory consumption
  - CPU consumption
Scheduling and Monitoring: -
- Details about the scenarios
  - Scenario 1
    - Short description of the scenario (what does it do?).
    - Mention the steps.
    - Mention the datasets that it builds.
    - Mention the trigger details.
    - Mention any custom code in the steps/triggers.
    - Mention any further changes needed in the scenario.
  - Scenario 2
    - Mention the same details as for Scenario 1 for any other scenario.
Error Handling: -
- Potential problems/concerns
  - Problem 1
    - Any recurring or potential issues for the project.
    - Point of contact in case of any issues.
    - Related documentation.
  - Problem 2
    - Mention the same details as for Problem 1.
- Links to additional documents, if available (Dataiku projects)
  - Dataset-related documents
  - PPTs
  - Docs
- Points of contact for any issue in the Dataiku project, from the DT team and the business team.
- Number of impacted users.

