Author

Version

Comments

Date

Ramprasad Tak

1.0

Initial Version

6/02/2023

Ramprasad TaK

1.1

Version 1

 

Ramprasad TaK

1.2

Version 1

 










Summary
1- Introduction
1.1- Purpose of the document
1.2- Intended Audience
2- How it works
2.1- Description
2.2- Logging Process
3- Logging Model
3.1- Description
3.2- Model
3.2- Tables

1- Introduction

1.1- Purpose of the document

This document describes how the datahub process works for all the GCP environments.

1.2- Intended Audience

This document is intended for the Data Architectures, Data Engineering and operational team.
It will be used as reference for any project or domain for the developments of the models.

2- How it works

2.1- Description


The objective of this datahub system is to guarantee that we are able to store data sets ( for example reference data) at one common place and utilize it with different projects.
Using a data hub we are able to track all the runs in GCP, if the files were well integrated, if there is an abort where it happens, when and what do we need to do to fix the issues.
This will help to support the operational teams to visualize all the file integration in the system and have the models and reports well loaded.

2.2- Datahub Process

   

                                     
Gudsis:

 Gudsis is example of project using the datahub, as it is sending telephone numbers from the helix to GCP and it will get transformed and then send it to other application
     


  1. First Step is the bucket area. The process needs to load all the files that arrive in GCP; the information will be provided if they are integrated or not.
  2. The second step is the run between bucket and reference data set where common data is stored
  3. Third step is load data logs in control data set which can track the logs of file
  4. The fourth step is the run between source data set to destination data set here for example IN_helix to OUT_IDM8 table
  5. The fifth step is send data from


3- Datahub Model

3.1- Description

Describe all the tables that are needed for a full logging process.

3.2- Model