ELN API  •  Oracle Views  •  Datalab  →  Talend  →  GCP BigQuery


Project Name:  Each ELN product follows a distinct IDBS access rights workflow
ETL:           Talend
Status:        Deployed in PROD

1.  Project Overview

Objectives

The ELN system hosts multiple independent research projects. The Talend ETL flow enforces a project-level access policy: each collaborator is assigned to one or more specific ELN projects, and the pipeline only extracts and loads the data belonging to that collaborator's assigned projects.
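The project-level policy described above can be illustrated with a minimal Python sketch. It is not the Talend implementation: the assignment map, the collaborator names, the project identifiers and the `project_id` field are all hypothetical and only show the filtering rule.

```python
from typing import Dict, List, Set

# Hypothetical assignment map: collaborator -> ELN project ids they are assigned to.
ASSIGNMENTS: Dict[str, Set[str]] = {
    "alice": {"PRJ-001", "PRJ-002"},
    "bob": {"PRJ-002"},
}

# Hypothetical extracted records, each tagged with the project it belongs to.
RECORDS: List[Dict[str, str]] = [
    {"project_id": "PRJ-001", "experiment": "EXP-10"},
    {"project_id": "PRJ-002", "experiment": "EXP-20"},
    {"project_id": "PRJ-003", "experiment": "EXP-30"},
]


def records_for_collaborator(
    collaborator: str,
    records: List[Dict[str, str]],
    assignments: Dict[str, Set[str]] = ASSIGNMENTS,
) -> List[Dict[str, str]]:
    """Keep only the records whose project is assigned to the collaborator."""
    # An unknown collaborator has no assigned projects, so nothing is returned.
    allowed = assignments.get(collaborator, set())
    return [r for r in records if r["project_id"] in allowed]
```

Under these assumptions, a collaborator with no assignment receives no data, which is the safe default for an access-rights flow.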

Scope

2.  Architecture & Data Flow

The pipeline follows a classic extract-transform-load (ETL) pattern orchestrated by Talend. The high-level data flow is shown below:

    [ELN API]          ─┐
    [Oracle Views]     ─┼──► [Talend ETL Jobs] ──► [GCP BigQuery Tables]
    [Datalab Platform] ─┘
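As a rough illustration of this fan-in shape (three sources, one transformation stage, one target), the following Python sketch combines rows from several extractors and hands them to a single load step. The function name `run_pipeline`, the source keys and the row layout are assumptions for the example; the real jobs are built in Talend.

```python
from typing import Callable, Dict, List

Row = Dict[str, str]


def run_pipeline(
    extractors: Dict[str, Callable[[], List[Row]]],
    load: Callable[[List[Row]], None],
) -> int:
    """Run every extractor, tag each row with its source, then load once."""
    combined: List[Row] = []
    for source, extract in extractors.items():
        for row in extract():
            # Keep track of where each row came from (hypothetical column).
            combined.append({**row, "source": source})
    load(combined)
    return len(combined)


# Stand-ins for the three sources and for the BigQuery load step.
loaded: List[Row] = []
count = run_pipeline(
    {
        "eln_api": lambda: [{"project_id": "PRJ-001"}],
        "oracle_views": lambda: [{"project_id": "PRJ-001"}, {"project_id": "PRJ-002"}],
        "datalab": lambda: [],
    },
    loaded.extend,
)
```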

Flow Steps

3.  Source Systems


Source System      Type                     Description
-----------------  -----------------------  -----------
ELN API            REST API                 Electronic Lab Notebook – exposes project data per collaborator access level
Oracle Views       Relational DB Views      Pre-aggregated relational data extracted via JDBC connections
Datalab Platform   Internal Data Platform   Analytical datasets and processed outputs from the Datalab environment

3.1  ELN API

Authentication:

        (TEST)

        (PROD)

Note:

Example:

3.2  Oracle DB 

Authentication:

          (TEST)

          (PROD)

View:

Note:

3.3  Datalab 

3.4  Target table

BigQuery: DM.ELN_IDBS_AccessRights

Talend tasks:

    (Master)
    (Get Experiments)
    (Cross experiments with collaborators and roles)
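The "cross" task can be pictured as an inner join between the experiments and the collaborator/role assignments. The Python sketch below assumes the join key is a project identifier and that the output has one row per experiment, collaborator and role; the field names and sample values are hypothetical and do not reflect the actual schema of DM.ELN_IDBS_AccessRights.

```python
from typing import Dict, List

Row = Dict[str, str]

# Hypothetical output of the "Get Experiments" task.
EXPERIMENTS: List[Row] = [
    {"experiment_id": "EXP-10", "project_id": "PRJ-001"},
    {"experiment_id": "EXP-20", "project_id": "PRJ-002"},
    {"experiment_id": "EXP-30", "project_id": "PRJ-003"},
]

# Hypothetical collaborator/role assignments per project.
ASSIGNED_ROLES: List[Row] = [
    {"project_id": "PRJ-001", "collaborator": "alice", "role": "editor"},
    {"project_id": "PRJ-002", "collaborator": "alice", "role": "viewer"},
    {"project_id": "PRJ-002", "collaborator": "bob", "role": "editor"},
]


def cross_access_rights(experiments: List[Row], assignments: List[Row]) -> List[Row]:
    """Inner-join experiments with assignments on project_id."""
    # Index assignments by project so each experiment is matched in one lookup.
    by_project: Dict[str, List[Row]] = {}
    for a in assignments:
        by_project.setdefault(a["project_id"], []).append(a)

    rows: List[Row] = []
    for exp in experiments:
        # Experiments in projects without any assignment produce no row.
        for a in by_project.get(exp["project_id"], []):
            rows.append({
                "experiment_id": exp["experiment_id"],
                "project_id": exp["project_id"],
                "collaborator": a["collaborator"],
                "role": a["role"],
            })
    return rows
```

With this sample data, the experiment in PRJ-003 is dropped because no collaborator is assigned to that project.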

Error Handling Strategy

Scheduling

4.  Annex document

See the annex document for more details on ELN access rights. (link)

5.  Contacts & responsibilities: