This page presents the data development documentation for Electrochemistry for all sites.

Summary

Description

Data Ingestion

The data ingestion phase for batteries follows the standard approach described in LB Data Dev Architecture - General.

Data Sources

ELN

The spreadsheets extracted from the JSON files coming from ELN are listed in the Data Mapping of the next section.

Related documents

| Document Name | Link |
| Battery - ELN Data Model | https://app.diagrams.net/#G1sD6OqKnBzSR_SrGvzlhEl7s5F5vUGQqD |
| Battery - ELN Template Documentation | |

Instruments

Raw Data Pairing (all sites)


Aubervilliers and NOH data

Most instrument files are manually added to a Google Drive folder or to a folder on the lab servers.

The spreadsheet "Where to find instrument files?" has the full list of instrument folders and their locations.

Data Mapping

No data mapping is needed, as the data is copied exactly as it is in the source.
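As a minimal sketch of what "copied exactly" can mean in practice, the snippet below copies a file unchanged and compares checksums of the source and the copy. It is not the Talend implementation; the function names `sha256_of` and `copy_verbatim` and the use of SHA-256 are assumptions made for illustration only.

```python
import hashlib
import shutil
from pathlib import Path


def sha256_of(path: Path) -> str:
    """Return the SHA-256 hex digest of a file, read in chunks."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(65536), b""):
            digest.update(chunk)
    return digest.hexdigest()


def copy_verbatim(source: Path, target_dir: Path) -> Path:
    """Copy a file unchanged (hypothetical helper) and check the copy is byte-identical."""
    target_dir.mkdir(parents=True, exist_ok=True)
    target = target_dir / source.name
    shutil.copy2(source, target)
    if sha256_of(source) != sha256_of(target):
        raise IOError(f"Copy of {source.name} differs from the source")
    return target
```

A checksum comparison of this kind is one simple way to confirm that no transformation happened during ingestion, which is why no mapping is required at this stage.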

Talend Jobs

The jobs F010 and F011 are responsible for orchestrating the Data Ingestion:

Data Preparation or Parsing

The data preparation phase for batteries follows the standard approach described in LB Data Dev Architecture - General.

Data Mapping

The spreadsheet below presents all data transformations between the raw (extracted) files and a BigQuery delta table. Some files are unstructured or semi-structured. This step structures the files in the target table format and checks that the column types (schema) conform to what is expected.
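The structuring and schema check described above can be sketched as follows. This is an illustration under stated assumptions, not the logic of the Talend jobs: the column names in `EXPECTED_SCHEMA` and the function `structure_row` are hypothetical, and the real target schemas are the ones defined in the mapping spreadsheet.

```python
from datetime import datetime

# Hypothetical target schema: column name -> converter for the expected type.
EXPECTED_SCHEMA = {
    "sample_id": str,
    "measured_at": datetime.fromisoformat,
    "conductivity": float,
}


def structure_row(raw: dict) -> dict:
    """Cast one raw record to the expected schema, or raise ValueError."""
    missing = [name for name in EXPECTED_SCHEMA if name not in raw]
    if missing:
        raise ValueError(f"Missing columns: {missing}")
    structured = {}
    for name, convert in EXPECTED_SCHEMA.items():
        try:
            structured[name] = convert(raw[name])
        except (TypeError, ValueError) as error:
            raise ValueError(
                f"Column {name!r} has an unexpected value: {raw[name]!r}"
            ) from error
    # Columns not listed in the schema are dropped from the structured row.
    return structured
```

Rejecting a row as soon as a column is missing or cannot be cast keeps malformed records out of the target table; whether the actual jobs reject, log, or quarantine such rows is not stated on this page.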


Talend Jobs

The jobs F020 and F021 are responsible for orchestrating the Data Preparation:

Data Integration or Computing

The data integration phase for batteries follows the standard approach described in LB Data Dev Architecture - General.

Data Mapping

The spreadsheet below presents all data transformations between the tables in Staging and ODS. This step aims to add calculations and intelligence to the data. Not all tables need to pass through this step.

Talend Jobs

The job F030 is responsible for orchestrating the Data Integration:

Data Presentation or DW/DM

Data Presentation

The data presentation phase for batteries follows the standard approach described in LB Data Dev Architecture - General.

Data Mapping

No data mapping is available, as there are no transformations for now. One must be created if such requirements arise.

Talend Jobs

No jobs are needed, as there are no steps to load DW/DM. A priori, all data is presented as views.

Data Model

The data model presents the tables/views available in the DW/DM dataset and the relations between them.

{Add model}

Orchestrating Jobs

All the jobs are run in sequence under the following project and job name on TAC/Talend Cloud:

| Project | Job/Flow |
| RnI_ACN_Battery | F000_RnI_ACN_Battery_Orch_Flow |

For scheduling details, check the Operational documentation.
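The sequential run can be pictured with the short sketch below. The job identifiers come from this page; everything else is an assumption for illustration: the order of jobs within a phase, the stop-on-first-failure behaviour, and the `run_in_sequence` function with its `runners` argument are hypothetical and do not describe how the orchestration flow is actually built in Talend.

```python
from typing import Callable, Dict, List

# Ingestion (F010, F011), preparation (F020, F021), integration (F030).
# The order inside each phase is assumed.
PIPELINE: List[str] = ["F010", "F011", "F020", "F021", "F030"]


def run_in_sequence(jobs: List[str], runners: Dict[str, Callable[[], bool]]) -> List[str]:
    """Run jobs in order, stop at the first failure, return the jobs that completed."""
    completed: List[str] = []
    for job in jobs:
        succeeded = runners[job]()  # each runner reports success as True
        if not succeeded:
            break  # assumed behaviour: later phases depend on earlier ones
        completed.append(job)
    return completed
```

Stopping at the first failure reflects the dependency between phases: preparation reads what ingestion wrote, and integration reads what preparation wrote.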

Specific Naming Conventions

Folders and Table Names

Most files are organized by their "Workflow name" and "Method name". For example:

| Workflow name | Method name | Instrument name |
| Conductivity | Conductivity | Biologic SP-150 |
| Electrochemical tests | Conductivity | Biologic VMP 300 |
| Raw materials | Lithium Metal Compatibility | Biologic VMP 300 |
| Raw materials | Tortuosity factor | Biologic VMP 300 |
| Mechanosynthesis | Calcination / Drying | Rotative Oven |
| Dry Process | Coating | Coating line |
| Wet process | Mechanical test | Mark-10 90° Peel Test Fixture Model G1109 |

All instrument folder structures and table names follow the same logic, depending on the file or table format. For example:

Folders

  

Tables (Staging)

Abbreviations

The following abbreviations are specific to Battery and are used in the Talend job and table names:

| Name | Abbreviation | Note |
| Instrument | Instr | |
| Raw Materials | RawMat | |
| Conductivity | Conduct | |
| Mechanosynthesis | Mechano | |
| Critical Current Density | Crit_Curr_Density | |
| Lithium Metal Compatibility | Lith_Met_Comp | |
| Tortuosity factor | Tortuosity | |
| Dry Process | Dry | |
| Wet process | Wet | |
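For scripts that need to derive job or table name fragments, the abbreviations can be kept in a simple lookup. The sketch below is only an illustration: the dictionary values are copied from the table above, while the `abbreviate` function and its case-insensitive matching are assumptions, not part of the Talend jobs.

```python
# Abbreviations copied from the Battery abbreviation table on this page.
ABBREVIATIONS = {
    "Instrument": "Instr",
    "Raw Materials": "RawMat",
    "Conductivity": "Conduct",
    "Mechanosynthesis": "Mechano",
    "Critical Current Density": "Crit_Curr_Density",
    "Lithium Metal Compatibility": "Lith_Met_Comp",
    "Tortuosity factor": "Tortuosity",
    "Dry Process": "Dry",
    "Wet process": "Wet",
}

# Lower-cased keys so that "Raw materials" and "Raw Materials" both match.
_LOOKUP = {name.lower(): short for name, short in ABBREVIATIONS.items()}


def abbreviate(name: str) -> str:
    """Return the Battery abbreviation for a name, or the name unchanged if none is listed."""
    return _LOOKUP.get(name.strip().lower(), name)
```

Matching is case-insensitive because the same name appears with different capitalization on this page (for example "Raw materials" in the workflow table and "Raw Materials" in the abbreviation table). Names without a listed abbreviation, such as "Coating", are returned unchanged.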