Name convention: [001]_[002]_[003]_[004]_[005]_[006]_[007]_[008]
Where:
# | Category | Description |
|---|---|---|
001 | Area | Start with the prefix STG to identify 3 characters for this category |
002 | System Name | System the data come from. Ex: HLX ; ELN ; MES 3 Characters for this category |
003 | Site Name | Site that data come from Ex: 0000 4 Characters for this category |
004 | System Reference | System the data come from. Ex: HLX ; ELN ; MES 4 Characters for this category |
005 | File Code | For each file code we will have a dedicated staging table Ex: F001; F002 4 Characters for this category |
006 | Extraction Type | If it’s a full or incremental extraction (F= Full; I = Incremental) 1 character for this category |
007 | Frequency | If it’s Monthly, weekly, daily, quarterly etc… Ex: M = monthly, W = weekly 1 character for this category |
008 | File name | Identify the content of the table. Ex: Cannot exceed 63 characters |
Example : STG_HLX_0000_0000_F001_F_W_stellar_escalation_follow_up
Columns:
Views
# | Field Name | Description | Type | Example |
|---|---|---|---|---|
001 | meta_run_id | Id of the run, this information come from STG_runs | Integer | E.g. 12408 |
002 | meta_md5_hsh | file check sum | String | E.g. 69095ac6258ec1c9fd151cec979ace71 |
003 | meta_file_name | Bucket File name | String | E.g. 1291619-3d223530ac3c11ecae3500000ad02795-Emulsion Polymerization.json |
004 | meta_file_path | Path of the bucket | String | |
005 | meta_execution_id | System Execution id Prefix of the tool used to populate the field + execution id that match tool log. | String | E.g. Talend_6Oyhb6 |
006 | meta_bucket_load_date | Date the bucket file was loaded (UTC) | timestamp | E.g. 2022-12-12 17:00:21 UTC |
007 | meta_business_date | This date comes from the business (UTC), it’s when the data was refreshed from the business side. | timestamp | E.g. 2022-12-12 17:00:21 UTC |
008 | meta_stg_insert_date | When the data was inserted in the staging | timestamp | E.g. 2022-12-12 18:00:42 UTC |
009 | meta_source_system | Identification of the source where the data comes from. | String | E.g SAP; BW; ORA |
Name convention: [001]_[002]_[003]_[004]_[005]_[006]_[007]
Where:
# | Category | Description |
001 | Area | Start with the prefix ODS to identify 3 characters for this category |
002 | System Name | System the data come from. Ex: HLX ; ELN ; MES 3 Characters for this category |
003 | Site Name | Site that data come from Ex: 0000 4 Characters for this category |
004 | File Code | For each file code we will have a dedicated staging table Ex: F001; F002 4 Characters for this category |
005 | Extraction Type | If it’s a full or incremental extraction (F= Full; I = Incremental) 1 character for this category |
006 | Frequency | If it’s Monthly, weekly, daily, quarterly etc… Ex: M = monthly, W = weekly 1 character for this category |
007 | File name | Identify the content of the table. Ex: Cannot exceed 63 characters |
Example of the staging naming convention:
ODS_0000_F001_F_W_stellar_escalation_follow_up
# | Field Name | Description | Type | Example |
|---|---|---|---|---|
001 | meta_run_id | Id of the run, this information come from STG_runs | Integer | E.g. 12408 |
002 | meta_md5_hsh | file checksum | String | E.g. 69095ac6258ec1c9fd151cec979ace71 |
003 | meta_table_name | Name of the table where the data comes from | String | E.g. |
004 | meta_business_date | This date comes from the business (UTC), it’s when the data was refreshed from the business side. | timestamp | E.g. 2022-12-12 17:00:21 UTC |
005 | meta_execution_id | System Execution id Prefix of the tool used to populate the field + execution id that match tool log. | String | E.g. Talend_6Oyhb6 |
006 | meta_ods_insert_date | When the data was inserted in the staging | timestamp | E.g. 2022-12-12 18:00:42 UTC |
007 | meta_source_system | Identification of the source where the data comes from. | String | E.g SAP; BW; ORA |
# | Field Name | Description | Type | Example |
|---|---|---|---|---|
001 | business_id | Primary key of source table. Could be a single attribute or combination of the attributes | String | E.g 0001; “ABC” |
002 | <Table_name>_key | Generated SGK. As stated before, must be unique. The procedure should be implemented as a Hash function based on the Business Id, concatenate with the Extraction Date to reinforce the key uniqueness | String | Eg: gbu_key |
003 | meta_run_id | Id of the run, this information come from STG_runs | Integer | E.g. 12408 |
004 | meta_execution_id | System Execution id Prefix of the tool used to populate the field + execution id that match tool log. | String | |
005 | inserted_date | Datetime when the record was originally inserted into this table (NOW) and will never change. | timestamp | E.g. 2022-12-12 18:00:42 UTC |
006 | updated_date | Datetime and set to the same date as Inserted Date when the record is inserted for the first time. | timestamp | E.g. 2022-12-12 18:00:42 UTC |
# | Field Name | Description | Type | Example |
|---|---|---|---|---|
001 | start_date | Date when this version of the Business Id is considered to be effective. | Timestamp | Populate with 1900-01-01 00:00:00 for the first version |
002 | end_date | Date when that record is no longer active Implement the rules previously defined | Timestamp | Populate with 9999-12-31 00:00:00 for all active records |
003 | current_flag | Identify the active record, the latest version of the record Synonym for End_Date = 9999-12-31T00:00:00 | Boolean | The Active record will have the value set to True (all others should be set to False) |
# | Field Name | Description | Type | Example |
|---|---|---|---|---|
001 | <Table_name>_key | Generated SGK. As stated before, must be unique. The procedure should be implemented as a Hash function based on the Business Id, concatenate with the Extraction Date to reinforce the key uniqueness | String | Eg: fact_invoices_key |
002 | meta_run_id | Id of the run, this information come from STG_runs | Integer | Compulsory E.g. 12408 |
003 | meta_execution_id | System Execution id Prefix of the tool used to populate the field + execution id that match tool log. | String | Compulsory |
004 | inserted_date | Should be a datetime at least at the second level. | timestamp | E.g. CURRENT_TIMESTAMP() |