|
|
, , • • |
| Unique identifier: | UNITS7.0.0 |
| Title: | Units |
| Long title: | EUPH Reference dataset containing units |
| Status: | Quality Validated |
| Current Version: | v. 1.0 |
| Published: | 2023-02-04 |
| Reviewed by: | |
| Citation proposal: |
EU Pollinator Hub 2023 Report of dataset Units, v. 1.0 [UNITS7.0.0]. EU Pollinator Hub. [2026-02-24] app.pollinatorhub.eu
|
|
Data Quality
Under evaluation
|
|||||||||||||||
Document history
Release
Version v. 1.0 released on 2023-02-04.
Revision
| No | Date | Description | Reason |
|---|---|---|---|
| 1 | 2023-02-04 00:02:00 | Initial release. | n/a |
Abbreviations
Executive summary
Data overview:
The reference dataset for units contains a collection of units used by other datasets. Data sources in their original state were merged into one single table. If available, identifiers and descriptions of the original datasets are maintained to guaranty their integrity and to enable linkage across datasets.
Data value:
The dataset provides a unique identifier for all units to be integrated on the platform. It is required by the EU Pollinator Hub (EUPH) for administrative purposes and fulfils an important role in data standardisation across datasets.
Data description:
The dataset contains 1 table. Units and their descriptions were obtained from 3 sources: from the United Nations Statistics Division (UNSD), from the Bureau International des Poids et Mesures (BIPM), from the Statistics Division of the Food and Agriculture Organization of the United Nations (FAOSTAT) and from the Organization for Standardization (ISO).
Data application:
This data will be used for the standardisation of data integrated on the EUPH by data providers.
Unresolved issues:
Introduction
Standardisation is an important goal of the EU Pollinator Hub (EUPH). Existing standards are prioritised to achieve this goal. Variables are described by numerical facts which we call data. Data is commonly expressed in a unit, which is one of many attributes that may be used to classify data. On the EUPH a unit is treated as metadata. One important goal of the EUPH is to allow users to link data from different sources. It is therefore necessary to provide a standardised description of the data, including units. The general policy of the EUPH is not to modify raw data once it has been prepared for integration. Since the EUPH aims to allow analysis and visualisation of data on the spot, converting the data to a common unit may represent a major challenge, both in terms of usability during the process of data integration and time required for data processing. It has therefor been decided that data will be standardised during the integration process. When data is integrated on the platform, data providers define the unit in which the data to be integrated is expressed from a list of preconfigured set of units (base units and derived units as well as decimal multiples and sub-multiples of the units). If the preconfigured set of units does not meet the requirements of the data provider, the provider has to transform the data to an accepted unit before integration. There is a potentially unlimited number of derived units. Depending on the future management of the EUPH, the user might therefore also request the integration of a unit into the EUPH. The present dataset contains a preconfigured set of units used by international standardising organisations (SI, ISO, United Nations) and the information required for the conversion of units into base units. This allows linkage and visualisation in a variety of units on the EUPH.
Material and methods
Data acquisition
Raw data integrated into this dataset comes from 4 different sources on the EUPH:
| No | File | Type | D | M | Arrival | Data points | File size |
|---|---|---|---|---|---|---|---|
| 1 | euph_000007_units_table_units.csv | CSV - Comma seperated values | Yes | No | 2023-09-01 10:09:26 | 5,808 | n/a |
Data preparation
Data was copied into a Microsoft® Excel® worksheet (Microsoft Corporation, Version 2210 Build 16.0.15726.20188). Prepared data was then converted to CSV and imported for profiling into a SQL database (MariaDB foundation, Server-Version 10.4.24) running in a XAMPP environment (BitRock, version 3.3.0). Data was exported from the MySQL database to CSV format using utf-8 coding. Data profiling was performed according to SOP-005 Data profiling using phpMyAdmin (version 5.2.0).
Data validation
None
Data analysis
None
Data description
Dataset
| No | Table | Description |
|---|---|---|
| 1 | Units | Data in this table was obtained from various sources. It contains standardised information on units, including standardised units, decimal multiples… |
| Parameter | Content |
|---|---|
| interactions.single.uid | UNITS7.0.0 |
| Title | Units |
| Long title | EUPH Reference dataset containing units |
| Target IRI | https://app.pollinatorhub.eu/dataset-discovery/UNITS7.0.0 |
| interactions.single.section-details.licence | EU Pollinator Hub |
| DOI | n/a |
| Created | 2022-11-26 |
| Published | 2023-02-04 |
| Contact | n/a |
| Keywords | n/a |
| Data collection years | n/a |
| Regions, the data was collected in | n/a |
| Abstract | This dataset contains standardised information on units. |
| Parameter | Content |
|---|---|
| Name | EU Pollinator Hub |
| Url | |
| Acronym | EUPH |
| IRI | https://app.pollinatorhub.eu/data-providers/euph |
| Address | |
| Country | Belgium |
| Contact | https://www.linkedin.com/company/beelife-european-beekeeping-coordination/ pollinatorhub.eu |
| Description | The EU Pollinator Hub (EUPH) is a data hub related to pollinators, which is provided by the European Food Safety Authority (EFSA). |
Tables
Units
| Parameter | Content |
|---|---|
| Unique identifier | UNITS7.UNITS4.0 |
| Name | Units |
| Target IRI | https://app.pollinatorhub.eu/dataset-discovery/parts/UNITS7.UNITS4.0 |
| Table Type | File |
| Licence | EU Pollinator Hub |
| Description |
Data in this table was obtained from various sources. It contains standardised information on units, including standardised units, decimal multiples and submultiples, information on how to convert units to base units (function and conversion factor), as well as the name of the quantities, which these units describe, to be used for data standardisation on the EU Pollinator Hub (EUPH). |
Data in this table was obtained from various sources. It contains standardised information on units, including standardised units, decimal multiples and submultiples, information on how to convert units to base units (function and conversion factor), as well as the name of the quantities, which these units describe, to be used for data standardisation on the EU Pollinator Hub (EUPH).
Metadata
| Column Name | Column Description | Datatype | Descriptor | Unit |
|---|---|---|---|---|
| UnitId |
|
Integer number | pms:unitID [0.0.NITID86] | n/a |
| UnitName |
|
String | dwc:measurementUnit [0.0.NTNME87] | n/a |
| UnitStandardised |
|
String | Text [0.0.TEXTA315] | n/a |
| QuantityDescription |
|
String | Text [0.0.TEXTA315] | n/a |
| UnitDescription |
|
String | Text [0.0.TEXTA315] | n/a |
| BaseUnit |
|
String | Text [0.0.TEXTA315] | n/a |
| ConFunct |
|
String | Text [0.0.TEXTA315] | n/a |
| Multiplier |
|
String | Text [0.0.TEXTA315] | n/a |
| FaostatUnit |
|
String | faostat:Unit [0.0.UNITA94] | n/a |
| UnsdUnit |
|
String | unsd:unsdUnitID [0.0.NSDNT849] | n/a |
| SiUnit |
|
String | bipm:siUnit [0.0.SNITA850] | n/a |
| CurrencyUnit |
|
String | iso-4217:aphabeticCurrencyCode [0.0.PHBTC851] | n/a |
Metadata of individual tables can be found in Annex 1.
Descriptive measures
| Column Name | Range | Mean | Minimum | Q1 | Median | Q3 | Maximum | Total | Missing | Zero | Blank | Distinct |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| UnitId | 1 - 3 | 242.5 | 1 | 121.25 | 242.5 | 363.75 | 484 | 484 | 0 ( 0.0% ) | 0 ( 0.0% ) | 0 ( 0.0% ) | 484 ( 100.0% ) |
| UnitName | 0 - 23 | n/a | n/a | n/a | n/a | units/100 km… | 484 | 0 ( 0.0% ) | 0 ( 0.0% ) | 1 ( 0.2% ) | 468 ( 96.7% ) | |
| UnitStandardised | 1 - 54 | n/a | % | n/a | n/a | n/a | (102 km2 lan… | 484 | 0 ( 0.0% ) | 0 ( 0.0% ) | 0 ( 0.0% ) | 467 ( 96.5% ) |
| QuantityDescription | 4 - 36 | n/a | NULL | n/a | n/a | n/a | activity ref… | 484 | 0 ( 0.0% ) | 0 ( 0.0% ) | 0 ( 0.0% ) | 78 ( 16.1% ) |
| UnitDescription | 3 - 65 | n/a | Lek | n/a | n/a | n/a | The codes as… | 484 | 0 ( 0.0% ) | 0 ( 0.0% ) | 0 ( 0.0% ) | 440 ( 90.9% ) |
| BaseUnit | 1 - 19 | n/a | % | n/a | n/a | n/a | g N2O/kg dry… | 484 | 0 ( 0.0% ) | 0 ( 0.0% ) | 0 ( 0.0% ) | 413 ( 85.3% ) |
| ConFunct | 4 - 4 | n/a | NULL | n/a | n/a | n/a | NULL | 484 | 0 ( 0.0% ) | 0 ( 0.0% ) | 0 ( 0.0% ) | 1 ( 0.2% ) |
| Multiplier | 4 - 13 | n/a | NULL | n/a | n/a | n/a | 2.777778 E-0… | 484 | 0 ( 0.0% ) | 0 ( 0.0% ) | 0 ( 0.0% ) | 30 ( 6.2% ) |
| FaostatUnit | 1 - 30 | n/a | % | n/a | n/a | n/a | per 100 squa… | 484 | 0 ( 0.0% ) | 0 ( 0.0% ) | 0 ( 0.0% ) | 59 ( 12.2% ) |
| UnsdUnit | 1 - 4 | n/a | 1 | n/a | n/a | n/a | NULL | 484 | 0 ( 0.0% ) | 0 ( 0.0% ) | 0 ( 0.0% ) | 13 ( 2.7% ) |
| SiUnit | 3 - 31 | n/a | bel | n/a | n/a | n/a | watt per squ… | 484 | 0 ( 0.0% ) | 0 ( 0.0% ) | 0 ( 0.0% ) | 78 ( 16.1% ) |
| CurrencyUnit | 3 - 4 | n/a | ADP | n/a | n/a | n/a | NULL | 484 | 0 ( 0.0% ) | 0 ( 0.0% ) | 0 ( 0.0% ) | 304 ( 62.8% ) |
Quality measures
| Column Name | Completeness | Uniqueness | Most Common Value | Least Common Value |
|---|---|---|---|---|
| UnitId |
100.00%
|
100.00%
|
1 | 1 |
| UnitName |
100.00%
|
96.69%
|
kt | null |
| UnitStandardised |
100.00%
|
96.49%
|
10<sup>3</sup> t | NULL |
| QuantityDescription |
100.00%
|
16.12%
|
monetary value | power |
| UnitDescription |
100.00%
|
90.91%
|
thousand tonnes | percent |
| BaseUnit |
100.00%
|
85.33%
|
kg | % |
| ConFunct |
100.00%
|
0.21%
|
NULL | NULL |
| Multiplier |
100.00%
|
6.20%
|
1 E+00 | 2.388915 E-07 |
| FaostatUnit |
100.00%
|
12.19%
|
NULL | % |
| UnsdUnit |
100.00%
|
2.69%
|
NULL | 1 |
| SiUnit |
100.00%
|
16.12%
|
NULL | metre |
| CurrencyUnit |
100.00%
|
62.81%
|
NULL | EUR |
Changes made to preparatory file
Changes made to data
Unresolved issues
References
There are no sources in the current document.
Annex 1: Table column reports
Table: Units
Column: UnitId
| Parameter | Content |
|---|---|
| Column name | UnitId |
| Description |
|
| Data type | Integer number |
| Descriptor | pms:unitID [UID:0.0.NITID86] |
| Descriptor description |
Identifier of a measurement unit. May be a global unique identifier or an identifier specific to a collection or institution. |
| Descriptor target IRI | https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.NITID86 |
| Unit | n/a |
| Column Name | Range | Mean | Minimum | Q1 | Median | Q3 | Maximum | Total | Missing | Zero | Blank | Distinct |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| UnitId | 1 - 3 | 242.5 | 1 | 121.25 | 242.5 | 363.75 | 484 | 484 | 0 ( 0.0% ) | 0 ( 0.0% ) | 0 ( 0.0% ) | 484 ( 100.0% ) |
| Column Name | Completeness | Uniqueness | Most Common Value | Least Common Value |
|---|---|---|---|---|
| UnitId |
100.00%
|
100.00%
|
1 | 1 |
Continuous Data Distribution
Outliers
Completeness
Uniqueness
Column: UnitName
| Parameter | Content |
|---|---|
| Column name | UnitName |
| Description |
|
| Data type | String |
| Descriptor | dwc:measurementUnit [UID:0.0.NTNME87] |
| Descriptor description |
|
| Descriptor target IRI | https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.NTNME87 |
| Unit | n/a |
| Column Name | Range | Mean | Minimum | Q1 | Median | Q3 | Maximum | Total | Missing | Zero | Blank | Distinct |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| UnitName | 0 - 23 | n/a | n/a | n/a | n/a | units/100 km… | 484 | 0 ( 0.0% ) | 0 ( 0.0% ) | 1 ( 0.2% ) | 468 ( 96.7% ) |
| Column Name | Completeness | Uniqueness | Most Common Value | Least Common Value |
|---|---|---|---|---|
| UnitName |
100.00%
|
96.69%
|
kt | null |
Completeness
Uniqueness
Column: UnitStandardised
| Parameter | Content |
|---|---|
| Column name | UnitStandardised |
| Description |
|
| Data type | String |
| Descriptor | Text [UID:0.0.TEXTA315] |
| Descriptor description |
|
| Descriptor target IRI | https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.TEXTA315 |
| Unit | n/a |
| Column Name | Range | Mean | Minimum | Q1 | Median | Q3 | Maximum | Total | Missing | Zero | Blank | Distinct |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| UnitStandardised | 1 - 54 | n/a | % | n/a | n/a | n/a | (102 km2 lan… | 484 | 0 ( 0.0% ) | 0 ( 0.0% ) | 0 ( 0.0% ) | 467 ( 96.5% ) |
| Column Name | Completeness | Uniqueness | Most Common Value | Least Common Value |
|---|---|---|---|---|
| UnitStandardised |
100.00%
|
96.49%
|
10<sup>3</sup> t | NULL |
Completeness
Uniqueness
Column: QuantityDescription
| Parameter | Content |
|---|---|
| Column name | QuantityDescription |
| Description |
|
| Data type | String |
| Descriptor | Text [UID:0.0.TEXTA315] |
| Descriptor description |
|
| Descriptor target IRI | https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.TEXTA315 |
| Unit | n/a |
| Column Name | Range | Mean | Minimum | Q1 | Median | Q3 | Maximum | Total | Missing | Zero | Blank | Distinct |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| QuantityDescription | 4 - 36 | n/a | NULL | n/a | n/a | n/a | activity ref… | 484 | 0 ( 0.0% ) | 0 ( 0.0% ) | 0 ( 0.0% ) | 78 ( 16.1% ) |
| Column Name | Completeness | Uniqueness | Most Common Value | Least Common Value |
|---|---|---|---|---|
| QuantityDescription |
100.00%
|
16.12%
|
monetary value | power |
Completeness
Uniqueness
Column: UnitDescription
| Parameter | Content |
|---|---|
| Column name | UnitDescription |
| Description |
|
| Data type | String |
| Descriptor | Text [UID:0.0.TEXTA315] |
| Descriptor description |
|
| Descriptor target IRI | https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.TEXTA315 |
| Unit | n/a |
| Column Name | Range | Mean | Minimum | Q1 | Median | Q3 | Maximum | Total | Missing | Zero | Blank | Distinct |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| UnitDescription | 3 - 65 | n/a | Lek | n/a | n/a | n/a | The codes as… | 484 | 0 ( 0.0% ) | 0 ( 0.0% ) | 0 ( 0.0% ) | 440 ( 90.9% ) |
| Column Name | Completeness | Uniqueness | Most Common Value | Least Common Value |
|---|---|---|---|---|
| UnitDescription |
100.00%
|
90.91%
|
thousand tonnes | percent |
Completeness
Uniqueness
Column: BaseUnit
| Parameter | Content |
|---|---|
| Column name | BaseUnit |
| Description |
|
| Data type | String |
| Descriptor | Text [UID:0.0.TEXTA315] |
| Descriptor description |
|
| Descriptor target IRI | https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.TEXTA315 |
| Unit | n/a |
| Column Name | Range | Mean | Minimum | Q1 | Median | Q3 | Maximum | Total | Missing | Zero | Blank | Distinct |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| BaseUnit | 1 - 19 | n/a | % | n/a | n/a | n/a | g N2O/kg dry… | 484 | 0 ( 0.0% ) | 0 ( 0.0% ) | 0 ( 0.0% ) | 413 ( 85.3% ) |
| Column Name | Completeness | Uniqueness | Most Common Value | Least Common Value |
|---|---|---|---|---|
| BaseUnit |
100.00%
|
85.33%
|
kg | % |
Completeness
Uniqueness
Column: ConFunct
| Parameter | Content |
|---|---|
| Column name | ConFunct |
| Description |
|
| Data type | String |
| Descriptor | Text [UID:0.0.TEXTA315] |
| Descriptor description |
|
| Descriptor target IRI | https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.TEXTA315 |
| Unit | n/a |
| Column Name | Range | Mean | Minimum | Q1 | Median | Q3 | Maximum | Total | Missing | Zero | Blank | Distinct |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| ConFunct | 4 - 4 | n/a | NULL | n/a | n/a | n/a | NULL | 484 | 0 ( 0.0% ) | 0 ( 0.0% ) | 0 ( 0.0% ) | 1 ( 0.2% ) |
| Column Name | Completeness | Uniqueness | Most Common Value | Least Common Value |
|---|---|---|---|---|
| ConFunct |
100.00%
|
0.21%
|
NULL | NULL |
Data Distribution Top 20
Completeness
Uniqueness
Column: Multiplier
| Parameter | Content |
|---|---|
| Column name | Multiplier |
| Description |
|
| Data type | String |
| Descriptor | Text [UID:0.0.TEXTA315] |
| Descriptor description |
|
| Descriptor target IRI | https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.TEXTA315 |
| Unit | n/a |
| Column Name | Range | Mean | Minimum | Q1 | Median | Q3 | Maximum | Total | Missing | Zero | Blank | Distinct |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Multiplier | 4 - 13 | n/a | NULL | n/a | n/a | n/a | 2.777778 E-0… | 484 | 0 ( 0.0% ) | 0 ( 0.0% ) | 0 ( 0.0% ) | 30 ( 6.2% ) |
| Column Name | Completeness | Uniqueness | Most Common Value | Least Common Value |
|---|---|---|---|---|
| Multiplier |
100.00%
|
6.20%
|
1 E+00 | 2.388915 E-07 |
Completeness
Uniqueness
Column: FaostatUnit
| Parameter | Content |
|---|---|
| Column name | FaostatUnit |
| Description |
|
| Data type | String |
| Descriptor | faostat:Unit [UID:0.0.UNITA94] |
| Descriptor description |
Unit in which FAOSTAT provides data, summarised in the "Units" list on the web page "Definitions and standards used in FAOSTAT" of the FAOSTAT website. |
| Descriptor target IRI | https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.UNITA94 |
| Unit | n/a |
| Column Name | Range | Mean | Minimum | Q1 | Median | Q3 | Maximum | Total | Missing | Zero | Blank | Distinct |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| FaostatUnit | 1 - 30 | n/a | % | n/a | n/a | n/a | per 100 squa… | 484 | 0 ( 0.0% ) | 0 ( 0.0% ) | 0 ( 0.0% ) | 59 ( 12.2% ) |
| Column Name | Completeness | Uniqueness | Most Common Value | Least Common Value |
|---|---|---|---|---|
| FaostatUnit |
100.00%
|
12.19%
|
NULL | % |
Completeness
Uniqueness
Column: UnsdUnit
| Parameter | Content |
|---|---|
| Column name | UnsdUnit |
| Description |
|
| Data type | String |
| Descriptor | unsd:unsdUnitID [UID:0.0.NSDNT849] |
| Descriptor description |
Special units maintained by the United Nations Statistics Division (UNSD)). |
| Descriptor target IRI | https://app.pollinatorhub.eu/dashboard/descriptors/849 |
| Unit | n/a |
| Column Name | Range | Mean | Minimum | Q1 | Median | Q3 | Maximum | Total | Missing | Zero | Blank | Distinct |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| UnsdUnit | 1 - 4 | n/a | 1 | n/a | n/a | n/a | NULL | 484 | 0 ( 0.0% ) | 0 ( 0.0% ) | 0 ( 0.0% ) | 13 ( 2.7% ) |
| Column Name | Completeness | Uniqueness | Most Common Value | Least Common Value |
|---|---|---|---|---|
| UnsdUnit |
100.00%
|
2.69%
|
NULL | 1 |
Data Distribution Top 20
Completeness
Uniqueness
Column: SiUnit
| Parameter | Content |
|---|---|
| Column name | SiUnit |
| Description |
|
| Data type | String |
| Descriptor | bipm:siUnit [UID:0.0.SNITA850] |
| Descriptor description |
The International System of Units (Système International d'Unités). maintained by the Bureau International des Poids et Mesures (BIPM) is a recommended practical system of units of measurement, with the international abbreviation SI. |
| Descriptor target IRI | https://app.pollinatorhub.eu/dashboard/descriptors/850 |
| Unit | n/a |
| Column Name | Range | Mean | Minimum | Q1 | Median | Q3 | Maximum | Total | Missing | Zero | Blank | Distinct |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| SiUnit | 3 - 31 | n/a | bel | n/a | n/a | n/a | watt per squ… | 484 | 0 ( 0.0% ) | 0 ( 0.0% ) | 0 ( 0.0% ) | 78 ( 16.1% ) |
| Column Name | Completeness | Uniqueness | Most Common Value | Least Common Value |
|---|---|---|---|---|
| SiUnit |
100.00%
|
16.12%
|
NULL | metre |
Completeness
Uniqueness
Column: CurrencyUnit
| Parameter | Content |
|---|---|
| Column name | CurrencyUnit |
| Description |
|
| Data type | String |
| Descriptor | iso-4217:aphabeticCurrencyCode [UID:0.0.PHBTC851] |
| Descriptor description |
A three-letter code from ISO 4217 that represents a currency. |
| Descriptor target IRI | https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.PHBTC851 |
| Unit | n/a |
| Column Name | Range | Mean | Minimum | Q1 | Median | Q3 | Maximum | Total | Missing | Zero | Blank | Distinct |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| CurrencyUnit | 3 - 4 | n/a | ADP | n/a | n/a | n/a | NULL | 484 | 0 ( 0.0% ) | 0 ( 0.0% ) | 0 ( 0.0% ) | 304 ( 62.8% ) |
| Column Name | Completeness | Uniqueness | Most Common Value | Least Common Value |
|---|---|---|---|---|
| CurrencyUnit |
100.00%
|
62.81%
|
NULL | EUR |