You are viewing an old version of this page. View the current version.

Compare with Current View Page History

« Previous Version 24 Next »

Overview


A Data Model represents the way data is structured in a set of data or a database, such as in Lab Booster’s data ocean.

The data model defines how the data ocean is connected to:

- The data input i.e. the ELN

- The data output i.e. DataLab the WebApp in which users can access data

Context

As of mid-2023, each market in Lab Booster has its own data model i.e. its own way to structure data.

This means that we are forced to build the connections between ELN/data ocean/DataLab at each new project.

Objective

Our aim is to have a common data model for all markets so that the connections between ELN/data ocean/DataLab are built, once and for all.


This will accelerate delivery of new projects and ensure better performance. 


This page is divided two sections

  1. Entity-Relationship Diagram (ERD), which served as a basis to design the data model
  2. Data model


Entity-Relationship Diagram (ERD)

Data Models are generally based on a diagram or schema called Entity-Relationship Diagram defining

  • Entities i.e. a definable object or concept within a system
  • Relationships i.e. how entities are related to one another

Building the ERD is a preliminary step to designing the actual data model to ensure that all required entities and relationships are accurately defined and represented.

This section is split in two parts

  1. Entity-Relationship Diagram design
  2. ERD mapping with R&I workflows

Entity-Relationship Diagram design 

Entity dictionary 


EntityDefinitionExample(s)
Experiment

A recording of actions performed in the lab by an operator at a given date to achieve an objective

Actions recorded in an Experiment include

  • Activities
  • Samples 
  • Tests
  • Planning

Emulsion Polymerization experiment

Solvent screening experiment

Solvay UserA recording of the user that created the Experiment, including Solvay ID and email
User PermissionsA setting determining whether a user has access or not to an Experiment and its contents
Request

Planning

Activity

A group of Processes performed in the lab in a specific order.
An Activity is constrained to one Experiment, while several Activities can be performed in one Experiment 

Chemical synthesis is an Activity that encompasses certain Processes
Process

A Process is defined by:

  • The action carried out in the lab 
  • The conditions in which it is carried out
  • The input (process ingredients or formulations) and outputs (end-product and/or numerical results)

Synonym: step

Within a chemical synthesis Activity, several Processes are carried out in the following order 1. First chemical reaction 2. Treatment by liquid-liquid extraction Purification 3. Second chemical reaction 4. Purification

Process Step

Process End Product

Ingredient

A chemical product defined by its name and unique ID and recorded in an inventory

Ingredient characteristics include date, batch number, supplier, physical state (liquid/solid), density, color...

An Ingredient can be:

  • A Formulation Batch
  • Sample
  • Process End-Product

Formulation

A combination of chemical products defined by the Ingredients and the Ingredients target proportions 

Formulation characteristics include total number of chemical products, target concentration, target volume, calculated density... 


Formulation Batch

A combination of chemical products defined by the Ingredients, the Ingredients target proportions, its unique ID and date

Formulation Batch characteristics include total number of chemical products, actual concentration, total volume, density, container (vessel, jar, bottle)... 

Formulation Batch is a Formulation that has been created in the lab


Sample

Step Sample

Sample Test Plan

Test Group

Test

Measure

Conditions

Results

Results Series

Aggregated Results










Process Ingredient

A product, component, material, sample of formulation batch recorded in chemical inventory used as an input in a Process

Synonym: reagent 

Ferulic Acid is a Process Ingredient in the Biocinversion Process for Fermentation Activity 
Process formulationThe Process Ingredients (generic) and their proportion (target) composing a formula that is used in a Process
Process formulation batchThe Process Ingredients (chemical inventory) and their proportion (actual) composing a formula that is prepared in the lab and used in a Process
Process Sample

A Sample taken from the Process End-Product(s) or from the Process while it is ongoing

A Process Sample is constrained to one Process or Process End-Product, while several Process Samples can be taken from a Process or a Process End-Product.

A Test or several Tests can be performed on it.

Samples taken during the Growth Process of the fermentation Activity
Process End-product

The output of a Process, characterized by its name, composition, aspect etc. 

An End-product is constrained to one Process, while there can be several End-Products for one Process.

Vanillin is an End-Product of the Process Bioconversion of the fermentation Activity
Formulation

Formulation ABC = 

1,2- Dibromoethane - 89%

+ Rhodacal 60 - 5%

+ Rhodasurf L-20 - 6%

Formulation Batch

Formulation ABC Batch xxxx =

1,2- Dibromoethane Batch xxxx Sigma Aldrich - 89.1%

+ Rhodacal 60 Batch xxxx - 5.2%

+ Rhodasurf L-20 Batch xxxx- 5.7%

Ingredient
Rhodasurf L-20 Batch SP4D26X01 100% active received 08/06/23
Sample

A piece of the product/component on which a Test is perfomed

The Sample can be taken from:

  • A raw material: a product/component received from a supplier 
  • A formulation batch: a combination of raw materials and/or synthesized products/components
  • A synthesized product/component: a product/component resulting from the combination and transformation of raw materials and/or other synthesized products/component (see Process Sample)

Raw material: methanol from Sigma Aldrich

Formulation sample: combination of solvent, active ingredient and other additives

Synthesized product: latex 

Test groupA group of Tests performed on the same sampleCharacterization tests 
Test 

Observation/analysis of the behavior of a Sample of a component/product when a procedure is carried out in a set of conditions :

  • Sample preparation
  • Sample analysis
  • Sample performance in application

It has an SOP (Standard Operating Procedure)

Conductivity test 

Viscosity test 

Paint application test

ConditionA variable/setting defined by the operator for a Test and affecting its Result

Temperature/pressure at which experiment is carried out

Solute concentration

ResultThe outcome of a Test performed on a Sample in specified Conditions
It can be a numerical value, a set of numerical values (i.e. curve) or a non numerical value (i.e. observations)
pH = 8.4
 Shear stress vs. Shear rate
Pass/Fail
Result SerieA set of Results, obtained for a Test performed the same Conditions, and measured with a time interval between themSample aging
MeasureA property of the Sample 
In the context of a Test it can be a Condition and/or a Result
Temperature 


Relationships definition

One Experiment can have;

  • One or more Samples;
  • One or more Formulations;

A Formulation can have:

  • One or more Ingredients;
  • One or more Formulation Batches;

A Formulation Batch can have:

  • One or more Ingredients;

A Lab Activity can:

  • Have one or more Processes;
  • One or more Samples;
  • Be associated to only one Experiment;

A Sample can:

  • only be part of a Formulation Batch;
  • have one or more processes associated;
  • be part of one or more processes;

A Process can:

  • Have/use multiples Samples;
  • Be a part of a group of processes used within a Sample;

A Formulation Batch can be associated to one or more Samples;

A Test Group can:

  • Only be part of one Sample
  • Have one or more Tests

A Test can:

  • Only be part of one Test Group
  • Have one or more Conditions
  • Have one or more Result Series

A Condition can:

  • Only be part of one Test;
  • Have one or more Measures;

A Result Serie can:

  • Only be part of one Test;
  • Have one or more Results;

A Result can:

  • Only be part of one Result Serie;
  • Have one or more Measures

A Measure can be used both as a Condition as well as a Result;


Entity-Relationship Diagram 


ERD mapping with R&I workflows (WIP)

Three types of R&I workflows were distinguished

  • Formulation workflows
  • Synthesis workflows
  • Analysis workflows

This was done in order to ensure that the ERD defined accomodates all types of R&I workflows.

The mapping done for different workflows is summarized in the table below.


GBU/F- R&IWorkflow nameWorkflow typeMapping statusLink to mappingDocumentation - Data capture
Novecare GBUSeed Care FormulationFormulationOngoingSeed Care mappingELN template
Novecare GBUSeed Care Request & ResultsFormulationOngoingSeed Care mappingELN template
Battery PlatformMecanosynthesisSynthesisDoneMecanosynthesis mappingELN template
Aroma Performance GBUFermentationSynthesisDoneFermentation mappingELN spreadsheet mockup
BioMatTech PlatformBiodegradabilityAnalysisDoneBiodegradability mappingLIMS spreadsheet mockup
Specialty Polymers GBU
AnalysisOngoing

Specialty Polymers GBU
SynthesisTo do

Novecare GBUEP CoatingsSynthesisTo do

Novecare GBUAgroFormulationTo do

Novecare GBUPaint CoatingsFormulationTo do

Corporate R&ISolvent platform - Solubilization
To do

Corporate R&I
AnalysisTo do

Green Hydrogen PlatformConductivityAnalysisTo do



Data model


  • No labels