EU Pollinator Hub

, ,

Dataset Report
Unique identifier: SNITS17.0.0
Title: SI Units
Long title: A selection of SI units provided by the Bureau International des Poids et Mesures (BIPM)
Status: Quality Validated
Current Version: v. 1.0
Published: 2022-11-23
Reviewed by:
Citation proposal:
EU Pollinator Hub 2022 Report of dataset SI Units, v. 1.0 [SNITS17.0.0]. EU Pollinator Hub. [2026-02-24] app.pollinatorhub.eu
Compliance with FAIR* principles
Findable
Accessible
Interoperable
Reusable
See https://www.go-fair.org/fair-principles for more information about FAIR principles
Data Quality
Under evaluation

Document history

Release

Version v. 1.0 released on 2022-11-23.

Revision

Table 1. List of revisions made to the document. Identifier of revision (No); date of revision (Date); description of revision (Description); reason for revision (Reason).
No Date Description Reason
1 2022-11-23 00:11:00 Initial release. n/a

Abbreviations

BIPM
Bureau International des Poids et Mesures
EUPH
EU Pollinator Hub
n.a.
not available

Executive summary

Data overview:

The dataset contains SI units provided by the Bureau International des Poids et Mesures (BIPM) [1] for internal use on the EU Pollinator Hub.

Data value:

The dataset is required by the EUPH for administrative purposes and fulfils an important role in data standardisation across datasets.

Data description:

File bipm.csv contains contains 81 records with base units, derived unit with special name, other derived units and accepted non-SI units, provided by the Bureau International des Poids et Mesures (BIPM) [1], including a version enhanced with HTML code for correct representation to be used in front end display.

Data application:

This data will be used for the standardisation of data integrated on the EU Pollinator Hub (EUPH).

Unresolved issues:

n/a

Introduction

tandardisation is an important goal of the EU Pollinator Hub (EUPH). Existing standards are prioritised to achieve this goal. The Bureau International des Poids et Mesures (BIPM) is an important source for internationally recognized standards. The standards are published in the SI Brochure, which is distributed under the terms of the Creative Commons Attribution 4.0 International License. The dataset is required by the EUPH for administrative purposes and fulfils an important role in data standardisation across datasets.

Material and methods

Data acquisition

Raw data integrated into this dataset consists of 1 file:

Table 2. List of raw data and metadata files included in the dataset. Identifier of table row (No); name of the file (File); the type of the file (Type); file contains data (D); file contains metadata (M); date of upload of the file to the EU Pollinator Hub (Arrival); number of data points contained within the file (if applicable); uploaded file size.
No File Type D M Arrival Data points File size
1 euph_000017_siunits_table_bipm.csv CSV - Comma seperated values Yes No 2024-01-26 19:01:09 567 8.31 KiB

Data preparation

Data was copied into a Microsoft® Excel® worksheet (Microsoft Corporation, Version 2210 Build 16.0.15726.20188). Prepared data was then converted to CSV and imported for profiling into a SQL database (MariaDB foundation, Server-Version 10.4.24) running in a XAMPP environment (BitRock, version 3.3.0). Data was exported from the MySQL database to CSV format using utf-8 coding. Data profiling was performed according to SOP-005 Data profiling using phpMyAdmin (version 5.2.0).

Data validation

None

Data analysis

None

Data description

Dataset

Table 3. Summary of tables belonging to the dataset. Table row identifier (No); name of the table (Table); description of the table (Description).
No Table Description
1 BIPM This table contains standard units for metrology, electricity, photometry, radiometry, ionizing radiation, time scales and chemistry.
Table 4. Standardised metadata of the dataset. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
interactions.single.uid SNITS17.0.0
Title SI Units
Long title A selection of SI units provided by the Bureau International des Poids et Mesures (BIPM)
Target IRI https://app.pollinatorhub.eu/dataset-discovery/SNITS17.0.0
interactions.single.section-details.licence CC BY 4.0
DOI n/a
Created 2022-11-25
Published 2022-11-23
Contact n/a
Keywords n/a
Data collection years n/a
Regions, the data was collected in n/a
Abstract

The dataset contains information from “The International System of Units (SI)”, a periodically updated brochure issued by from the International Bureau of Weights and Measures (BIPM), in which all the decisions and recommendations concerning units are collected.

Table 5. Standardised metadata of the data provider EU Pollinator Hub. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Name EU Pollinator Hub
Url
Acronym EUPH
IRI https://app.pollinatorhub.eu/data-providers/euph
Address
Country Belgium
Contact https://www.linkedin.com/company/beelife-european-beekeeping-coordination/ pollinatorhub.eu
Description

The EU Pollinator Hub (EUPH) is a data hub related to pollinators, which is provided by the European Food Safety Authority (EFSA).

Tables

BIPM

Table 6. Standardised metadata of the dataset. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Unique identifier SNITS17.BIPMA6.0
Name BIPM
Target IRI https://app.pollinatorhub.eu/dataset-discovery/parts/SNITS17.BIPMA6.0
Table Type File
Licence CC BY 4.0
Description

This table contains standard units for metrology, electricity, photometry, radiometry, ionizing radiation, time scales and chemistry.

This table contains standard units for metrology, electricity, photometry, radiometry, ionizing radiation, time scales and chemistry.

Metadata

n/a
Table 7. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Column Name Column Description Datatype Descriptor Unit
SiId
Integer number pms:recordID [0.0.RCRDD344]

n/a

Status
String Text [0.0.TEXTA315]

n/a

QuantityName
String Text [0.0.TEXTA315]

n/a

QuantitySymbol
String Text [0.0.TEXTA315]

n/a

UnitName
String bipm:siUnit [0.0.SNITA850]

n/a

UnitSymbol
String Text [0.0.TEXTA315]

n/a

UnitBaseUnit
String Text [0.0.TEXTA315]

n/a

Metadata of individual tables can be found in Annex 1.

Descriptive measures

Table 8. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Column Name Range Mean Minimum Q1 Median Q3 Maximum Total Missing Zero Blank Distinct
SiId 1 - 2 41.0 1 20.5 41 61.5 81 81 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 81 ( 100.0% )
Status 9 - 30 n/a base unit n/a n/a n/a derived unit… 81 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 4 ( 4.9% )
QuantityName 4 - 44 n/a area n/a n/a n/a electric flu… 81 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 69 ( 85.2% )
QuantitySymbol 0 - 28 n/a n/a n/a n/a ρ, &gamm… 81 61 ( 75.3% ) 0 ( 0.0% ) 0 ( 0.0% ) 20 ( 24.7% )
UnitName 3 - 31 n/a bel n/a n/a n/a watt per squ… 81 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 77 ( 95.1% )
UnitSymbol 0 - 33 n/a n/a n/a n/a J K−1 mol−1 81 13 ( 16.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 68 ( 84.0% )
UnitBaseUnit 0 - 63 n/a n/a n/a n/a kg m2 s−2 mo… 81 10 ( 12.3% ) 0 ( 0.0% ) 0 ( 0.0% ) 63 ( 77.8% )

Quality measures

Table 9. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Column Name Completeness Uniqueness Most Common Value Least Common Value
SiId
100.00%
100.00%
1 1
Status
100.00%
4.94%
derived unit base unit
QuantityName
100.00%
85.19%
time electric current
QuantitySymbol
24.69%
24.69%
null <i>t</i>
UnitName
100.00%
95.06%
second metre
UnitSymbol
83.95%
83.95%
null s
UnitBaseUnit
87.65%
77.78%
null m m<sup>-1</sup>

Changes made to preparatory file

None

Changes made to data

None

Unresolved issues

None

References

There are no sources in the current document.

Annex 1: Table column reports

Table: BIPM

Column: SiId

Table 10. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Parameter Content
Column name SiId
Description
Data type Integer number
Descriptor pms:recordID [UID:0.0.RCRDD344]
Descriptor description

Unique sequence of integers associated with a record within a certain table.

Descriptor target IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.RCRDD344
Unit

n/a

Table 11. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Column Name Range Mean Minimum Q1 Median Q3 Maximum Total Missing Zero Blank Distinct
SiId 1 - 2 41.0 1 20.5 41 61.5 81 81 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 81 ( 100.0% )
Table 12. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Column Name Completeness Uniqueness Most Common Value Least Common Value
SiId
100.00%
100.00%
1 1

Continuous Data Distribution

Figure 1. Distribution of values in the column.

Outliers

Figure 2. Visualization of median, min, max, and outliers in the column.

Completeness

Figure 3. Visualization of completeness of the data in the column.

Uniqueness

Figure 4. Visualization of uniqueness of the data in the column.

Column: Status

Table 13. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Parameter Content
Column name Status
Description
Data type String
Descriptor Text [UID:0.0.TEXTA315]
Descriptor description

In computer programming, a string is traditionally a sequence of characters, either as a literal constant or as some kind of variable. The latter may allow its elements to be mutated and the length changed, or it may be fixed (after creation). A string is generally considered as a data type and is often implemented as an array data structure of bytes (or words) that stores a sequence of elements, typically characters, using some character encoding. String may also denote more general arrays or other sequence (or list) data types and structures.

Descriptor target IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.TEXTA315
Unit

n/a

Table 14. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Column Name Range Mean Minimum Q1 Median Q3 Maximum Total Missing Zero Blank Distinct
Status 9 - 30 n/a base unit n/a n/a n/a derived unit… 81 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 4 ( 4.9% )
Table 15. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Column Name Completeness Uniqueness Most Common Value Least Common Value
Status
100.00%
4.94%
derived unit base unit

Data Distribution Top 20

Figure 5. Distribution of 20 most common values, from highest to lowest.

Completeness

Figure 6. Visualization of completeness of the data in the column.

Uniqueness

Figure 7. Visualization of uniqueness of the data in the column.

Column: QuantityName

Table 16. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Parameter Content
Column name QuantityName
Description
Data type String
Descriptor Text [UID:0.0.TEXTA315]
Descriptor description

In computer programming, a string is traditionally a sequence of characters, either as a literal constant or as some kind of variable. The latter may allow its elements to be mutated and the length changed, or it may be fixed (after creation). A string is generally considered as a data type and is often implemented as an array data structure of bytes (or words) that stores a sequence of elements, typically characters, using some character encoding. String may also denote more general arrays or other sequence (or list) data types and structures.

Descriptor target IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.TEXTA315
Unit

n/a

Table 17. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Column Name Range Mean Minimum Q1 Median Q3 Maximum Total Missing Zero Blank Distinct
QuantityName 4 - 44 n/a area n/a n/a n/a electric flu… 81 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 69 ( 85.2% )
Table 18. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Column Name Completeness Uniqueness Most Common Value Least Common Value
QuantityName
100.00%
85.19%
time electric current

Completeness

Figure 8. Visualization of completeness of the data in the column.

Uniqueness

Figure 9. Visualization of uniqueness of the data in the column.

Column: QuantitySymbol

Table 19. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Parameter Content
Column name QuantitySymbol
Description
Data type String
Descriptor Text [UID:0.0.TEXTA315]
Descriptor description

In computer programming, a string is traditionally a sequence of characters, either as a literal constant or as some kind of variable. The latter may allow its elements to be mutated and the length changed, or it may be fixed (after creation). A string is generally considered as a data type and is often implemented as an array data structure of bytes (or words) that stores a sequence of elements, typically characters, using some character encoding. String may also denote more general arrays or other sequence (or list) data types and structures.

Descriptor target IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.TEXTA315
Unit

n/a

Table 20. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Column Name Range Mean Minimum Q1 Median Q3 Maximum Total Missing Zero Blank Distinct
QuantitySymbol 0 - 28 n/a n/a n/a n/a &rho;, &gamm… 81 61 ( 75.3% ) 0 ( 0.0% ) 0 ( 0.0% ) 20 ( 24.7% )
Table 21. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Column Name Completeness Uniqueness Most Common Value Least Common Value
QuantitySymbol
24.69%
24.69%
null <i>t</i>

Completeness

Figure 10. Visualization of completeness of the data in the column.

Uniqueness

Figure 11. Visualization of uniqueness of the data in the column.

Column: UnitName

Table 22. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Parameter Content
Column name UnitName
Description
Data type String
Descriptor bipm:siUnit [UID:0.0.SNITA850]
Descriptor description

The International System of Units (Système International d'Unités). maintained by the Bureau International des Poids et Mesures (BIPM) is a recommended practical system of units of measurement, with the international abbreviation SI.

Descriptor target IRI https://app.pollinatorhub.eu/dashboard/descriptors/850
Unit

n/a

Table 23. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Column Name Range Mean Minimum Q1 Median Q3 Maximum Total Missing Zero Blank Distinct
UnitName 3 - 31 n/a bel n/a n/a n/a watt per squ… 81 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 77 ( 95.1% )
Table 24. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Column Name Completeness Uniqueness Most Common Value Least Common Value
UnitName
100.00%
95.06%
second metre

Completeness

Figure 12. Visualization of completeness of the data in the column.

Uniqueness

Figure 13. Visualization of uniqueness of the data in the column.

Column: UnitSymbol

Table 25. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Parameter Content
Column name UnitSymbol
Description
Data type String
Descriptor Text [UID:0.0.TEXTA315]
Descriptor description

In computer programming, a string is traditionally a sequence of characters, either as a literal constant or as some kind of variable. The latter may allow its elements to be mutated and the length changed, or it may be fixed (after creation). A string is generally considered as a data type and is often implemented as an array data structure of bytes (or words) that stores a sequence of elements, typically characters, using some character encoding. String may also denote more general arrays or other sequence (or list) data types and structures.

Descriptor target IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.TEXTA315
Unit

n/a

Table 26. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Column Name Range Mean Minimum Q1 Median Q3 Maximum Total Missing Zero Blank Distinct
UnitSymbol 0 - 33 n/a n/a n/a n/a J K−1 mol−1 81 13 ( 16.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 68 ( 84.0% )
Table 27. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Column Name Completeness Uniqueness Most Common Value Least Common Value
UnitSymbol
83.95%
83.95%
null s

Completeness

Figure 14. Visualization of completeness of the data in the column.

Uniqueness

Figure 15. Visualization of uniqueness of the data in the column.

Column: UnitBaseUnit

Table 28. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Parameter Content
Column name UnitBaseUnit
Description
Data type String
Descriptor Text [UID:0.0.TEXTA315]
Descriptor description

In computer programming, a string is traditionally a sequence of characters, either as a literal constant or as some kind of variable. The latter may allow its elements to be mutated and the length changed, or it may be fixed (after creation). A string is generally considered as a data type and is often implemented as an array data structure of bytes (or words) that stores a sequence of elements, typically characters, using some character encoding. String may also denote more general arrays or other sequence (or list) data types and structures.

Descriptor target IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.TEXTA315
Unit

n/a

Table 29. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Column Name Range Mean Minimum Q1 Median Q3 Maximum Total Missing Zero Blank Distinct
UnitBaseUnit 0 - 63 n/a n/a n/a n/a kg m2 s−2 mo… 81 10 ( 12.3% ) 0 ( 0.0% ) 0 ( 0.0% ) 63 ( 77.8% )
Table 30. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Column Name Completeness Uniqueness Most Common Value Least Common Value
UnitBaseUnit
87.65%
77.78%
null m m<sup>-1</sup>

Completeness

Figure 16. Visualization of completeness of the data in the column.

Uniqueness

Figure 17. Visualization of uniqueness of the data in the column.