EU Pollinator Hub

, ,

Dataset Report
Unique identifier: PLLNT287.0.0
Title: Pollinator Ontology V.1.1
Long title: The Pollinator Ontology in the EU Pollinator Hub Controlled Vocabulary (EUPH-CV)
Status: Quality Validated
Current Version: v. 1.1
Published: 2026-04-25
Reviewed by:
Citation proposal:
Rubinigg 2026 Report of dataset Pollinator Ontology V.1.1, v. 1.1 [PLLNT287.0.0]. EU Pollinator Hub. [2026-05-01] app.pollinatorhub.eu
Compliance with FAIR* principles
Findable
Accessible
Interoperable
Reusable
See https://www.go-fair.org/fair-principles for more information about FAIR principles
Data Quality
Under evaluation

Document history

Release

Version v. 1.1 released on 2026-04-25.

Revision

Table 1. List of revisions made to the document. Identifier of revision (No); date of revision (Date); description of revision (Description); reason for revision (Reason).
No Date Description Reason
1 2026-04-25 00:04:00 Initial release. n/a

Abbreviations

No abbreviations.

Executive summary

Data overview:

The EU Pollinator Hub (EUPH) provides a controlled vocabulary (EUPH-CV) for datastandardisation. A subset of the EUPH-CV has been designed as an ontology, the Pollinator Ontology, in which terms are arranged in a hierarchical structure, enriched with definitions, formal relationships between terms, and other properties (e.g., relations to metadata standard elements, comments, images, and version information).

Data value:

The classes im the controlled vocabulary and the relationships between them serve as a backbone (1) for the definition of the metadata standard elements (EUPH-MS), (2) for the standardisation of data stored on the EUPH in the form of literals, resource identifiers or resource description frameworks (RDF), and (3) for the alignment of terminologies from different vocabularies. The Pollinator Ontology is a crucial technology in knowledge management, big data processing, and machine learning. The use of terms from ontologies to annotate research data greatly improves automated data interpretation and interoperability and, hence, compliance with FAIR principles.

Data description:

The dataset contains 4 files, which are required to analyse the ontoplogy metrics. File terms.csv (442 733 bytes) contains the basic information on the classes. File terms_to_terms.csv (117 650 bytes) contains the relationships between classes. File translations.csv (2 285 199 bytes) contains the translations of the classes to human languages. The file language.csv (1 441 bytes) contains the name of the human languages.

Data application:

The data should be used to validate the method used in the manuscript "The EU Pollinator Hub Controlled Vocabulary: An Ontology for Pollinators" submitted for publication in December 2025.

Unresolved issues:

n/a

Introduction

The Pollinator Ontology of the EU Pollinator Hub Controlled Vocabulary (EUPH-CV) is an ontology for information and data about pollinators and their interaction with the abiotic and biotic environment, particularly humans. It contains a series of concepts, referred to as classes, their properties and the relationships between them. A first version of the ontology has been deployed as part of a web-based open-source software application, the EU Pollinator Hub, a tool that has been developed, among others, to promote (1) the standardisation and internationalisation of data related to pollinators, (2) the community-driven development of the ontology and (3) the integration of existing vocabularies in the future.

Material and methods

Data acquisition

Data was collected with the vocabulary management user interface of the EU Pollinator Hub, which has been developed to create and maintene the Controlled Vocabulary and which is regulated by a set of SOP (Standard Operating Procedures) and Work Instructions (WI).

Table 2. List of raw data and metadata files included in the dataset. Identifier of table row (No); name of the file (File); the type of the file (Type); file contains data (D); file contains metadata (M); date of upload of the file to the EU Pollinator Hub (Arrival); number of data points contained within the file (if applicable); uploaded file size.
No File Type D M Arrival Data points File size
1 terms.csv CSV - Comma seperated values Yes No 2026-04-26 09:04:39 48,384 508.64 KiB
2 terms_to_terms.csv CSV - Comma seperated values Yes No 2026-04-26 09:04:35 17,888 126.24 KiB
3 language.csv CSV - Comma seperated values Yes No 2026-04-26 09:04:51 172 1.41 KiB
4 translations.csv CSV - Comma seperated values Yes No 2026-04-26 09:04:53 317,925 2.54 MiB
5 definitions.csv CSV - Comma seperated values Yes No 2026-04-26 09:04:25 48,438 1.19 MiB
6 ontology_analysis.ipynb Miscellaneous No Yes 2026-04-26 10:04:49 n/a 393.34 KiB
7 POLON.owl Miscellaneous No Yes 2026-04-26 17:04:53 n/a 4.13 MiB

Data preparation

A detailed description of the method will become available after the publication of the manuscript.

Data validation

None

Data analysis

Data analysis was performrd with Jupyter Lab notebook (Version 4.5.1).

Data description

Dataset

Table 3. Summary of tables belonging to the dataset. Table row identifier (No); name of the table (Table); description of the table (Description).
No Table Description
1 terms Basic properties of the classes of a subset of the EUPH-CV used to construct the Pollinator Ontology.
2 terms to terms Relationship between classes of the subset of the EUPH-CV used to construct the Pollinator Ontology.
3 language List of languages of the translation of classes of the subset of the EUPH-CV used to construct the Pollinator Ontology.
4 translations Translations to human languages of the classes of the subset of the EUPH-CV used to construct the Pollinator Ontology.
5 definitions Definitions of the classes of a subset of the EUPH-CV used to construct the Pollinator Ontology.
Table 4. Standardised metadata of the dataset. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
interactions.single.uid PLLNT287.0.0
Title Pollinator Ontology V.1.1
Long title The Pollinator Ontology in the EU Pollinator Hub Controlled Vocabulary (EUPH-CV)
Target IRI https://app.pollinatorhub.eu/dataset-discovery/PLLNT287.0.0
interactions.single.section-details.licence CC BY-NC 4.0
DOI https://doi.org/10.5281/zenodo.19754718
Created 2026-04-26
Published 2026-04-25
Contact n/a
Keywords Pollinator, controlled vocabulary, honey bee, ontology
Data collection years n/a
Regions, the data was collected in n/a
Abstract

The Pollinator Ontology, a subset of the EU Pollinator Hub Controlled Vocabulary (EUPH-CV), is an ontology for information and data about pollinators and their interaction with the abiotic and biotic environment, particularly humans. It contains a series of concepts, referred to as classes, their properties and the relationships between them. A first version of the ontology has been deployed as part of the EU Pollinator Hub, a tool that has been developed, among others, to promote (1) the standardisation and internationalisation of data related to pollinators, (2) the community-driven development of the ontology and (3) the integration of existing vocabularies in the future.

Table 5. Standardised metadata of the data provider Rubinigg. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Name Rubinigg
Url
Acronym VR
IRI https://app.pollinatorhub.eu/data-providers/visualife-rubinigg
Contact https://orcid.org/0000-0002-7061-920X https://www.linkedin.com/in/rubinigg/
Description

Tables

terms

Table 6. Standardised metadata of the dataset. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Unique identifier PLLNT287.TERMS706.0
Name terms
Target IRI https://app.pollinatorhub.eu/dataset-discovery/parts/PLLNT287.TERMS706.0
Table Type File
Licence CC BY-NC 4.0
Description

Basic properties of the classes of a subset of the EUPH-CV used to construct the Pollinator Ontology.

Basic properties of the classes of a subset of the EUPH-CV used to construct the Pollinator Ontology.

Metadata

  • uri links to external IRI
  • user_id links to the user identifier of the creator of the class
  • dataset_id links to the EUPH identifier if the dataset to which the class has been linked
  • synonym_id links to terms.id in case the class is a synonym of another class
Table 7. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Column Name Column Description Datatype Descriptor Unit

Metadata of individual tables can be found in Annex 1.

Descriptive measures

Table 8. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Column Name Range Mean Minimum Q1 Median Q3 Maximum Total Missing Zero Blank Distinct

Quality measures

Table 9. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Column Name Completeness Uniqueness Most Common Value Least Common Value

Changes made to preparatory file

No changes have been made to the preparatory file

Changes made to data

No changes have been made to the data in the table

Unresolved issues

No unresolved issues have been detected

terms to terms

Table 10. Standardised metadata of the dataset. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Unique identifier PLLNT287.TRMST708.0
Name terms to terms
Target IRI https://app.pollinatorhub.eu/dataset-discovery/parts/PLLNT287.TRMST708.0
Table Type File
Licence CC BY-NC 4.0
Description

Relationship between classes of the subset of the EUPH-CV used to construct the Pollinator Ontology.

Relationship between classes of the subset of the EUPH-CV used to construct the Pollinator Ontology.

Metadata

  • term_id links to terms.id
  • parent_id links to terms.id
  • relationship_id links to terms.id
Table 11. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Column Name Column Description Datatype Descriptor Unit

Metadata of individual tables can be found in Annex 1.

Descriptive measures

Table 12. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Column Name Range Mean Minimum Q1 Median Q3 Maximum Total Missing Zero Blank Distinct

Quality measures

Table 13. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Column Name Completeness Uniqueness Most Common Value Least Common Value

Changes made to preparatory file

No changes have been made to the preparatory file

Changes made to data

No changes have been made to the data in the table

Unresolved issues

No unresolved issues have been detected

language

Table 14. Standardised metadata of the dataset. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Unique identifier PLLNT287.LNGGE710.0
Name language
Target IRI https://app.pollinatorhub.eu/dataset-discovery/parts/PLLNT287.LNGGE710.0
Table Type File
Licence CC BY-NC 4.0
Description

List of languages of the translation of classes of the subset of the EUPH-CV used to construct the Pollinator Ontology.

List of languages of the translation of classes of the subset of the EUPH-CV used to construct the Pollinator Ontology.

Metadata

n/a
Table 15. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Column Name Column Description Datatype Descriptor Unit

Metadata of individual tables can be found in Annex 1.

Descriptive measures

Table 16. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Column Name Range Mean Minimum Q1 Median Q3 Maximum Total Missing Zero Blank Distinct

Quality measures

Table 17. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Column Name Completeness Uniqueness Most Common Value Least Common Value

Changes made to preparatory file

No changes have been made to the preparatory file

Changes made to data

No changes have been made to the data in the table

Unresolved issues

No unresolved issues have been detected

translations

Table 18. Standardised metadata of the dataset. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Unique identifier PLLNT287.TRNSL712.0
Name translations
Target IRI https://app.pollinatorhub.eu/dataset-discovery/parts/PLLNT287.TRNSL712.0
Table Type File
Licence CC BY-NC 4.0
Description

Translations to human languages of the classes of the subset of the EUPH-CV used to construct the Pollinator Ontology.

Translations to human languages of the classes of the subset of the EUPH-CV used to construct the Pollinator Ontology.

Metadata

  • term_id links to terms.id
  • language_id links to language.id
  • user_id links to the user identifier of the creator of the translation
  • synonym_id links to translations.id in case the translation is a synonym of another translation
Table 19. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Column Name Column Description Datatype Descriptor Unit

Metadata of individual tables can be found in Annex 1.

Descriptive measures

Table 20. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Column Name Range Mean Minimum Q1 Median Q3 Maximum Total Missing Zero Blank Distinct

Quality measures

Table 21. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Column Name Completeness Uniqueness Most Common Value Least Common Value

Changes made to preparatory file

No changes have been made to the preparatory file

Changes made to data

No changes have been made to the data in the table

Unresolved issues

No unresolved issues have been detected

definitions

Table 22. Standardised metadata of the dataset. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Unique identifier PLLNT287.DFNTN714.0
Name definitions
Target IRI https://app.pollinatorhub.eu/dataset-discovery/parts/PLLNT287.DFNTN714.0
Table Type File
Licence CC BY-NC 4.0
Description

Definitions of the classes of a subset of the EUPH-CV used to construct the Pollinator Ontology.

Definitions of the classes of a subset of the EUPH-CV used to construct the Pollinator Ontology.

Metadata

  • term_id links to terms.id
  • user_id links to the user identifier of the creator of the translation
Table 23. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Column Name Column Description Datatype Descriptor Unit

Metadata of individual tables can be found in Annex 1.

Descriptive measures

Table 24. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Column Name Range Mean Minimum Q1 Median Q3 Maximum Total Missing Zero Blank Distinct

Quality measures

Table 25. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Column Name Completeness Uniqueness Most Common Value Least Common Value

Changes made to preparatory file

No changes have been made to the preparatory file

Changes made to data

No changes have been made to the data in the table

Unresolved issues

No unresolved issues have been detected

References

  1. Noa SD., Santiago O., Gregor S., Michael R., Gilles SM. 2025 The EU Pollinator Hub: Operationalisation of the EU Bee Partnership Platform for Harmonised Data Collection and Sharing Among Stakeholders on Bees and Pollinators. EFSA Supporting Publications. Vol. 22, (1) p. 9219E. doi: 10.2903/sp.efsa.2025.EN-9219

Annex 1: Table column reports

Table: terms

Table: terms to terms

Table: language

Table: translations

Table: definitions