This page presents the data development documentation for the Batteries workflows at the Aubervilliers and NOH sites.

Summary

...

...

Our team aims to develop the next generation of lithium batteries, which contain no liquid (All-Solid-State Batteries). To do so, new battery materials have to be synthesized and new ways to assemble them have to be developed.

Data Ingestion

Data Sources

ELN

The list of the spreadsheets extracted from JSON files coming from ELN can be found in the Data Mapping of the next section.
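As an illustration of how a JSON export can become a spreadsheet-like file, here is a minimal sketch in Python. It is not the actual extraction job: the field names in `sample` and the helper names `flatten` and `json_to_csv` are hypothetical, and the real ELN structure may differ.

```python
import csv
import io
import json


def flatten(record, parent_key="", sep="."):
    """Flatten nested dicts into a single-level dict with dotted keys."""
    flat = {}
    for key, value in record.items():
        full_key = f"{parent_key}{sep}{key}" if parent_key else key
        if isinstance(value, dict):
            flat.update(flatten(value, full_key, sep))
        else:
            flat[full_key] = value
    return flat


def json_to_csv(json_text):
    """Convert a JSON array of records into CSV text (one row per record)."""
    rows = [flatten(r) for r in json.loads(json_text)]
    # Union of all keys, in first-seen order, so that every row fits the header.
    header = list(dict.fromkeys(k for row in rows for k in row))
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=header, restval="", lineterminator="\n")
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()


# Hypothetical ELN export: field names are illustrative only.
sample = '[{"experiment_id": "E1", "sample": {"name": "S1", "mass_g": 1.5}}]'
print(json_to_csv(sample))
```

Nested objects become dotted column names (for example `sample.name`), which keeps one row per record without losing the original hierarchy.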

Related documents

...

Google Drive Live Link: https://docs.google.com/document/d/1myJ7zU4cTW1LG6eAj8z5rcKpMqhPOejl/edit#heading=h.gjdgxs

Instruments

Most instrument files are added manually to a Google Drive folder or to a folder on the lab servers.

The spreadsheet "Where to find instrument files?" has the full list of instrument folders and their locations:

Google Drive Live Link: https://docs.google.com/spreadsheets/d/1Sc53zcj5ScNdDUTmtFwa9AeUoD4MZxB6RAUyQ4kJ0P0/edit#gid=0

Data Mapping

No data mapping is needed, as the data is copied exactly as it appears in the source.

Talend Jobs

The jobs F010 and F011 are responsible for orchestrating the Data Ingestion:

[Image: Talend jobs F010 and F011]

Data Preparation or Parsing

Data Mapping

The spreadsheet below presents all data transformations between the raw files (extracted files) and a BigQuery delta table. Some files are unstructured or semi-structured. This step aims to structure the files in the target table format and to check that the column types (schema) conform to what is expected.

Google Drive Live Link: https://docs.google.com/spreadsheets/d/1lYbtU9-s7P0AGVvwKH9YKhJNAeoexTvB/edit?usp=sharing&ouid=103674584428097024221&rtpof=true&sd=true
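The schema check described in this step can be sketched as follows. This is only an illustration in Python, not the Talend implementation: `EXPECTED_SCHEMA`, its column names, and the `check_schema` helper are hypothetical, and the real target schemas are the ones defined in the mapping spreadsheet.

```python
from datetime import date

# Hypothetical target schema: column name -> expected Python type.
EXPECTED_SCHEMA = {"sample_id": str, "conductivity": float, "measured_on": date}


def check_schema(row, schema):
    """Return a list of human-readable problems; an empty list means the row conforms."""
    problems = []
    for column, expected_type in schema.items():
        if column not in row:
            problems.append(f"missing column: {column}")
        elif not isinstance(row[column], expected_type):
            actual = type(row[column]).__name__
            problems.append(f"{column}: expected {expected_type.__name__}, got {actual}")
    for column in row:
        if column not in schema:
            problems.append(f"unexpected column: {column}")
    return problems


good = {"sample_id": "S1", "conductivity": 1.2e-3, "measured_on": date(2023, 1, 5)}
bad = {"sample_id": "S2", "conductivity": "n/a"}

print(check_schema(good, EXPECTED_SCHEMA))
print(check_schema(bad, EXPECTED_SCHEMA))
```

Returning a list of problems rather than raising on the first one makes it easier to report every non-conforming column of a file in a single pass.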

Talend Jobs

The jobs F020 and F021 are responsible for orchestrating the Data Preparation:

[Image: Talend jobs F020 and F021]

Data Integration or Computing

Data Mapping

The spreadsheet below presents all data transformations between the tables on Staging and ODS. This step aims to add calculations and intelligence to the data. Not all tables need to pass through this step.

Google Drive Live Link: https://docs.google.com/spreadsheets/d/18Z0oMjqq1Krcu1hxYi2Nv7iqLnYojzUq/edit?usp=sharing&ouid=103674584428097024221&rtpof=true&sd=true

Talend Jobs

The job F030 is responsible for orchestrating the Data Integration:

[Image: Talend job F030]

Data Presentation or DW/DM

Data Mapping

No data mapping is available, as there are no transformations for now. One must be created if such requirements arise.

Talend Jobs

No jobs are needed, as there are no steps to load DW/DM. In principle, all data is presented as views.

Data Model

The data model shows the tables/views exposed in the DW/DM dataset and the relationships between them.

{Add model}

Orchestrating Jobs

All the jobs run in sequence under the following job and project name on TAC/Talend Cloud:

...

For scheduling details, see the Operational documentation.
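To make the idea of running the jobs in sequence concrete, here is a minimal Python sketch. It does not call TAC or Talend Cloud: `placeholder_job` stands in for a real job trigger, the order F010 to F030 is assumed from the job numbering on this page, and stopping at the first failure is an assumption about the desired behavior, not a documented feature of the orchestrating job.

```python
def run_in_sequence(jobs):
    """Run (name, callable) pairs in order; stop at the first failure.

    Returns the list of job names that completed successfully.
    """
    completed = []
    for name, job in jobs:
        try:
            job()
        except Exception as error:
            print(f"{name} failed: {error}; skipping remaining jobs")
            break
        completed.append(name)
    return completed


def placeholder_job():
    """Stand-in for a real job trigger (hypothetical)."""


# Job names come from this page; the order is assumed from their numbering.
pipeline = [(name, placeholder_job) for name in ("F010", "F011", "F020", "F021", "F030")]
print(run_in_sequence(pipeline))
```

Stopping at the first failure avoids preparing or integrating data that was never ingested; a different policy may be preferable if the jobs are independent.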

Specific Naming Conventions

Folders and Table Names

Most files are organized by their "Workflow name" and "Method name". For example:

...

All instrument folder structures and table names follow the same logic, depending on the file or table format. For example:

Folders

[Image: example of the folder structure]

Tables (Staging)

[Image: example of the Staging table names]

Abbreviations

Abbreviations specific to Battery, used in the Talend job names and table names:

...

Instrument

...

Raw Materials

...

...

Conductivity

...

Mechanosynthesis

...

Critical Current Density

...

Lithium Metal Compatibility

...

Tortuosity Factor

...

Dry Process

...

Wet Process

...