Entity-Level Testing:
| Attribute | Table in Lake | Attribute in Lake | Match Ratio | Sample # | Lake # | Matched # | Unmatched # | Transformation | RAG | Comments | Actions |
| GBU | V_DIM_gbu | GBU_Lake | 100.00% | 1 | 36 | 1 | 0 | Yes | OK | > Transformation Required. Requires concatenation of fields | > Transformation requirement to be discussed |
| Material_Name | V_DIM_material | material_name | 97.26% | 2259 | 1969393 | 2197 | 62 | No | DQ | > 61 out the 62 Product Names have Product Code in the Sample > When matching to the lake by Product Code, 61 are found with potential DQ Issues > The one not found is 209081 / AUGEO® CRYSTAL DRUM 210 KG | > Fix Data Quality Issues with Registered Mark > Find the remaining products not containing Registered Mark > Default logic implementation |
| Material_Code | V_DIM_material | material_id | 99.72% | 2460 | 2339144 | 2453 | 7 | No | DQ | >The Material_Names not matched contain vales that seem to be default values "OTHER INDUSTRIAL SERVICES", "NONE", "OTHER PASS THROUGH ADMINISTRATION","WATER", "ELECTRICITY", "GAZ NON RAW MATERIAL", "AUGEO® CRYSTAL DRUM 210 KG" | > Product Codes to be found at the source > Default logic implementation |
| Ship_To_Name | V_DIM_customer | customer_medium_name | 98.68% | 2422 | 289137 | 2390 | 32 | No | DQ | > All except 1 of the unmatched have Ship_To_Code > At least some not matching names due DQ Issues ("ACT CHEMICAL CO., LTD" Vs. "ACT CHEMICAL CO.. LTD") > Unmatched: WP1_400/NOT ASSIGNED | > NA > Default logic implementation |
| Ship_To_Code | V_DIM_customer | customer_id | 99.86% | 3514 | 514721 | 3509 | 5 | No | OK | > 4 codes not found, 1 is set as NONE | > Ayoub to check the source of the codes to see where they coming from > Default logic implementation |
| Sold_To_Name | V_DIM_customer | customer_medium_name | 98.88% | 1956 | 289137 | 1934 | 22 | No | DQ | > All except 1 have code > At least some not matching due DQ Issues in the names > Unmatched: WP1_400/NOT ASSIGNED | > NA - Attributed to DQ Issues > Default logic to be applied |
| Sold_To_Code | V_DIM_customer | customer_id | 99.86% | 2150 | 514721 | 2147 | 3 | No | OK | > 2 codes not found, 1 is set as NONE | > Ayoub to check the source of the codes > Default logic to be applied |
| Sold_To_Group | V_DIM_customer | customer_corporate_group_name | 97.78% | 632 | 5883 | 618 | 14 | No | DQ | > All except 1 have code > At least some not matching due DQ Issues in the names | > Several customer ids can be found under the same Sold_To_Group, which is a rolled-up customer level. So it should not be retrieved from customer id. Additionally, the Sold To Group Id is not available in the transparency dashboard. > Discussion needed to see if we need to take the ID from Salesforce, including Frank, Naveen, Ayoub... > Analysis is required to see if the dashboard has everything that is in CRM |
| Group_of_Activity | V_DIM_group_of_activity | Group of Activity Lake | 87.50% | 8 | 191 | 7 | 1 | Yes | OK | > Only unmatched value is "NOT ASSIGNED (#)" > Transformation Required. | > Default Logic to be applied > Transformation requirement to be discussed |
| Country_Ship_To | V_DIM_country | Country Ship To Lake | 98.86% | 88 | 264 | 87 | 1 | Yes | OK | > Only unmatched value is "NONE" > Transformation Required. | > Default Logic to be applied > Transformation requirement to be discussed |
| Country_Sold_To | V_DIM_country | Country Sold To Lake | 98.80% | 83 | 264 | 82 | 1 | Yes | OK | > Only unmatched value is "NONE" > Transformation Required. | > Default Logic to be applied > Transformation requirement to be discussed |
| GBU_Region | V_DIM_geo_zone | GBU Region Lake | 83.33% | 6 | 5 | 5 | 1 | Yes | OK | > Only unmatched value is "NONE" > Transformation Required. | > Default Logic to be applied > Transformation requirement to be discussed |
| End_Use | V_DIM_gbr_enduse | gbr_enduse_name | 98.61% | 72 | 1572 | 71 | 1 | No | OK | > Only unmatched value is "NONE" | > Default Logic to be applied |
| GBU_Zone_H1 | V_DIM_mini_zone | GBU Zone H1 Lake | 50.00% | 6 | 4 | 3 | 3 | Yes | DQ | > 3 unmatched values "CS/# (#)", "EUROPE (EU)", "NONE" > Transformation Required. | > Transformation requirement to be discussed > Europe is the one real gap, needs to be added. All the others are Not Assigned > Default logic to be applied |
| Market | V_DIM_gbr_market | gbr_market_name | 81.82% | 11 | 19 | 9 | 2 | No | OK | > 2 unmatched values "NOT ASSIGNED", "NONE" | > Default logic to be applied |
| GBU_Customer_Segment | V_DIM_customer_segmentation | GBU Customer Segment Lake | 60.00% | 10 | 7 | 6 | 4 | Yes | OK | > 4 unmatched values "NOT ASSIGNED", "NONE", "NOT YET ASSIGNED", "NOT VALID" > Transformation Required. | > Transformation requirement to be discussed > Default logic to be applied |
| Group_Customer_Segment | V_DIM_customer_segmentation | Group Customer Segment Lake | 60.00% | 10 | 7 | 6 | 4 | Yes | OK | > 4 unmatched values "NOT ASSIGNED", "NONE", "NOT YET ASSIGNED", "NOT VALID" > Transformation Required. | > Transformation requirement to be discussed > Default logic to be applied |
| Product_Name | V_DIM_material | commercial_material_group_name | 15.33% | 1539 | 32111 | 236 | 1303 | No | DQ | NA | > Matching logic based on GBU needs to be applied (this is for Novecare) > Default logic to be applied |
| Product_Name | V_DIM_material | material_name | 84.73% | 1303 | 1969393 | 1104 | 199 | No | DQ | > All except 1 have code > At least some not matching due DQ Issues in the names | > Product Name should be matched to the Material Group Name, which is higher level of the material name. These two need to be separated > Pau to investigate Material Group Name --> Only one value matched (OFFICE SVCS), non of the others > Default logic to be applied |
| Product_Code | V_DIM_material | commercial_material_group_id | 99.94% | 1567 | 77778 | 1566 | 1 | Yes | OK | > The only unmatched is a "NONE" value > Transformation required to remove the zeros in front of the number | > The gap seems to be in what/where/how do get the names out of the codes > Transformation required to remove the zeros in front of the number > Default logic to be applied |
| Product_Brand_Name | V_DIM_product_line | product_line_name | 7.30% | 137 | 5187 | 10 | 127 | No | FAILED | > Data needed not present the Data Lake yet | > To be brought with transactions |