Overview
A Data Model represents the way data is structured in a set of data or a database, such as in Lab Booster’s data ocean.
The data model defines how the data ocean is connected to:
- The data input i.e. the ELN
- The data output i.e. DataLab the WebApp in which users can access data
Context
As of mid-2023, each market in Lab Booster has its own data model i.e. its own way to structure data.
This means that we are forced to build the connections between ELN/data ocean/DataLab at each new project.
Objective
Our aim is to have a common data model for all markets so that the connections between ELN/data ocean/DataLab are built, once and for all.
This will accelerate delivery of new projects and ensure better performance.
This page is divided two sections
- Entity-Relationship Diagram (ERD), which served as a basis to design the data model
- Data model
Entity-Relationship Diagram (ERD)
Data Models are generally based on a diagram or schema called Entity-Relationship Diagram defining
- Entities i.e. a definable object or concept within a system
- Relationships i.e. how entities are related to one another
Building the ERD is a preliminary step to designing the actual data model to ensure that all required entities and relationships are accurately defined and represented.
This section is split in three parts
- Entities dictionary
- Relationship definition
- ERD representation
Entity dictionary
| Entity | Definition | Example(s) |
|---|---|---|
| Experiment | A defined set of lab activities performed on samples of specific products/components | Emulsion Polymerization experiment Solvent screening experiment |
| Formulation | The ingredients (generic) and their proportion (target) composing a formula | Formulation ABC = 1,2- Dibromoethane - 89% + Rhodacal 60 - 5% + Rhodasurf L-20 - 6% |
| Formulation Batch | A use of a formula prepared in the lab with the real ingredients (chemical inventory) and their proportion (actual) | Formulation ABC Batch xxxx = 1,2- Dibromoethane Batch xxxx Sigma Aldrich - 89.1% + Rhodacal 60 Batch xxxx - 5.2% + Rhodasurf L-20 Batch xxxx- 5.7% |
| Ingredient | A product, component, material, sample of formulation batch recorded in chemical inventory | Rhodasurf L-20 Batch SP4D26X01 100% active received 08/06/23 |
| Sample | A piece of the product/component on which the test is perfomed The sample can be taken from:
| Raw material: methanol from Sigma Aldrich Formulation sample: combination of solvent, active ingredient and other additives Synthesized product: latex |
| Activity Group | A group of lab activities performed on the same sample | Syrup experiment includes photographing, measuring pH, measuring conductivity, measuring viscosity of one sample |
| Lab Activity | Observation/analysis of the behavior of a sample of a component/product when a procedure is carried out in a set of conditions :
It has an SOP (Standard Operating Procedure) | Formulation Conductivity Paint application |
| Process | ||
| Condition | A variable/setting defined by the operator for a lab activity and affecting its outcome | Temperature/pressure at which experiment is carried out Solute concentration |
| Result | The outcome of a lab activity performed on a sample in specified conditions It can be a numerical value, a set of numerical values (i.e. curve) or a non numerical value (i.e. observations) | pH = 8.4 Shear stress vs. Shear rate Pass/Fail |
| Result Serie | A set of results, obtained for a lab activity performed the same conditions, and measured with a time interval between them | Sample aging |
| Measure | A property of the sample In the context of a lab activity it can be a condition and/or a result | Temperature |
Relationships definition
One Experiment can have;
- One or more Samples;
- One or more Formulations;
A Formulation can have:
- One or more Ingredients;
- One or more Formulation Batches;
A Formulation Batch can have:
- One or more Ingredients;
A Lab Activity can:
- Have one or more Processes;
- One or more Samples;
- Be associated to only one Experiment;
A Sample can:
- only be part of a Formulation Batch;
- have one or more processes associated;
- be part of one or more processes;
A Process can:
- Have/use multiples Samples;
- Be a part of a group of processes used within a Sample;
A Formulation Batch can be associated to one or more Samples;
A Test Group can:
- Only be part of one Sample
- Have one or more Tests
A Test can:
- Only be part of one Test Group
- Have one or more Conditions
- Have one or more Result Series
A Condition can:
- Only be part of one Test;
- Have one or more Measures;
A Result Serie can:
- Only be part of one Test;
- Have one or more Results;
A Result can:
- Only be part of one Result Serie;
- Have one or more Measures
A Measure can be used both as a Condition as well as a Result;
Entity-Relationship Diagram
Legend
Entity
Primary key
Foreign key
Attribute
Relationships
- The ring represents "zero"
- The dash represents "one"
- The crow's foot represents "many"
Diagram
ERD mapping with R&I workflows
Distinguish
- Formulation workflows
- Synthesis workflows
- Analysis workflows
Ensure that data model fits for all
Formulation
Seed Care
Synthesis
Mecanosynthesis
Fermentation
Analysis
Biodeg
