Technical documentation for the Bollate raw data load and integration.

All schemas can be found on the following GDrive link: #133 Battery Data management platform >> 06 - Documentation >> Architecture >> Battery Data Scheme and Diagrams |
Raw data files have different structures and are extracted from different the servers:
| Cyclers | Description | Data Status | Flow Status |
|---|---|---|---|
Maccor | CSV Raw files coming from Maccor machine. The source server is located in Italy | ||
Biologic BCS | CSV Raw files coming from Biologic BCS machine. The source server is located in Italy | ||
Biologic VMP | CSV Raw files coming from Biologic VMP machine. The source server is located in Italy | ||
| Arbin | Data coming from SQL Server database? for Arbin machine |

| N | Description |
|---|---|
| A | A Talend job is responsible for getting all files from the sources and bringing them to the Talend server to finally load the raw data to BigQuery. |
| a1 | All new raw data is loaded in a "delta" table. A SQL script is responsible to consolidate the new data to the already existing. Every cycler has its own tables. |
B | Once the data is consolidated, it's taken to another dataset where data is transformed, enriched and integrated on the same table with other Bollate's cycles. |
| b1 | Data is transformed, enriched and integrated passing through many tables. There is one or many SQL scripts to do this task. |
| C | Data coming from ELN is integrated at this point. Check ELN page for more details. |
| D | Data is pushed to the enriched_data dataset where it can be accessed by the users. |
All SQL scripts can be found:
|
Data mapping to have the same table structure for all the cyclers files:
This project holds all jobs necessary to load all Battery Bollate raw data
![]()