EU Pollinator Hub

, ,

Dataset Report
Unique identifier: BGDGN198.0.0
Title: B-GOOD Genotyping Subspecies
Long title: Dataset from the B-GOOD project, containing results for Tier 1 and Tier 2 genotyping on idividual worker bee for subspecies differentiation using an SNP chip.
Status: Quality Validated
Current Version: v. 1.0
Published: 2025-03-17
Reviewed by: Rubinigg Michael as Data scientist
Citation proposal:
B-GOOD Bee Health Data Portal 2025 Report of dataset B-GOOD Genotyping Subspecies, v. 1.0 [BGDGN198.0.0]. EU Pollinator Hub. [2026-02-24] app.pollinatorhub.eu
Compliance with FAIR* principles
Findable
Accessible
Interoperable
Reusable
See https://www.go-fair.org/fair-principles for more information about FAIR principles
Data Quality
Good

This document is intended for use by collaborators of the EU Pollinator Hub and may be passed on with the express permission of the leader of the consortium and for the purpose determined by the leader of the consortium.

Document history

Release

Version v. 1.0 released on 2025-03-17. Reviewed by Rubinigg Michael.

Revision

Table 1. List of revisions made to the document. Identifier of revision (No); date of revision (Date); description of revision (Description); reason for revision (Reason).
No Date Description Reason
1 2025-03-17 00:03:00 Initial release. n/a

Abbreviations

CSV
Comma-Separated Values
EU
European Union
EUPH
EU Pollinator Hub
MLU
Martin-Luther-Universität Halle-Wittenberg (Martin Luther University of Halle-Wittenberg)
SNP
Single Nucleotide Polymorphism

Executive summary

Data overview:

The data was published by Tehel A, Paxton R (MLU) on the B-GOOD Bee Health Data Portal as part of the B-GOOD project (grant agreement 817622), funded under the EU Horizon 2020 Research and Innovation Programme. It contains genotyping data of idividual worker bees with bee array markers (ca. 4000 SNP) for subspecies differentiation (Momeni et al. 2021), a reference model representing the European subspecies: Apis mellifera mellifera, A. m. iberiensis, A. m. ligustica, A. m. carnica, A. m. macedonica, A. m. cecropia, A. m. adami, A. m. cypria, A. m. anatoliaca, A. m. caucasica, A. m. armeniaca, A. m. ruttneri and A. m. siciliana (and Buckfast bees).

Data value:

The objectives of the B-GOOD project were: (1) Facilitate decision making for beekeepers and other stakeholders by establishing ready-to-use tools for operationalising the HSI; (2) Test, standardise and validate methods for measuring and reporting selected indicators affecting bee health; (3) Explore the various socio-economic and ecological factors beyond bee health; (4) Foster an EU community to collect and share knowledge related to honey bees and their environment; (5) Engender a lasting learning and innovation system (LIS); (6) Minimise the impact of biotic and abiotic stressors.

Data description:

n/a

Data application:

Currently, the data integrated from the B-GOOD Bee Health Data Portal contains major issues and does not comply with the FAIR Guiding Principles for scientific data management and stewardship applied on the EU Pollinator Hub. More descriptive information about the context, quality and condition, or characteristics of the data (e.g. protocols, measurement devices used, units of the captured data, or any other details about the study) must be provided. More metadata in the form of accurate and relevant attributes (*e.g. *metadata that describes the scope of the data has been described, any particularities or limitations about the data that other users should be aware of, specification of the date of generation/collection of the data, the lab conditions, who prepared the data, the parameter settings, the name and version of the software used, specification of whether it is raw or processed data, explanation of all variable names are explained if they are not self-explanatory) must be provided. The dataset requires myjor revision by the data provider.

Unresolved issues:

n/a

Introduction

n/a

Material and methods

Data acquisition

All raw data files were downloaded from the B-GOOD Bee Health Data Portal on 2024-09-27 06:21:34.

List of raw data obtained from the data provider.

  1. File wp1-genotyping-snpchip-tier1-tier2.xlsx accessed on 2024-09-27 06:21:34, provided by B-GOOD Bee Health Data Portal

Metadata was obtained from the dataset's web page.

Table 2. List of raw data and metadata files included in the dataset. Identifier of table row (No); name of the file (File); the type of the file (Type); file contains data (D); file contains metadata (M); date of upload of the file to the EU Pollinator Hub (Arrival); number of data points contained within the file (if applicable); uploaded file size.
No File Type D M Arrival Data points File size
1 Data_PREP_MR_241102.csv CSV - Comma seperated values Yes No 2024-11-02 17:11:54 7,150 36.13 KiB
2 Data including Buckfast_PREP_MR_241102.csv CSV - Comma seperated values Yes No 2024-11-02 17:11:28 6,144 28.54 KiB

Data preparation

All files in the zip-archives were extracted using File Explorer (Microsoft Corporation, version 22H2).

File wp1-genotyping-snpchip-tier1-tier2.xlsx was procesed with MS Excel (Microsoft Corporation, version 2409). The worksheets were exported in CSV format (utf-8 encoding).

Data was then exported to the respective preparatory files and uploaded to the EU Pollinator Hub according to SOP-017 (Dataset integration.

Data validation

No data validation was performed.

Data analysis

No data analysis was performed.

Data description

Dataset

Table 3. Summary of tables belonging to the dataset. Table row identifier (No); name of the table (Table); description of the table (Description).
No Table Description
1 Data Data file
2 Data including Buckfast Data file including Buckfast
Table 4. Standardised metadata of the dataset. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
interactions.single.uid BGDGN198.0.0
Title B-GOOD Genotyping Subspecies
Long title Dataset from the B-GOOD project, containing results for Tier 1 and Tier 2 genotyping on idividual worker bee for subspecies differentiation using an SNP chip.
Target IRI https://app.pollinatorhub.eu/dataset-discovery/BGDGN198.0.0
interactions.single.section-details.licence CC BY-NC-ND 4.0
DOI n/a
Created 2024-11-02
Published 2025-03-17
Contact n/a
Keywords SNP, genotyping, subspecies
Data collection years n/a
Regions, the data was collected in n/a
Abstract

Dataset containing data on genotyping on idividual worker bees with the Eurofins bee array markers (~ 4000 SNP) for subspecies differentiation (Momeni et al. 2021). Reference model representing the European subspecies: Apis mellifera mellifera, A. m. iberiensis, A. m. ligustica, A. m. carnica, A. m. macedonica, A. m. cecropia, A. m. adami, A. m. cypria, A. m. anatoliaca, A. m. caucasica, A. m. armeniaca, A. m. ruttneri and A. m. siciliana (and Buckfast bees). It was published by Tehel A, Paxton R (MLU) on the B-GOOD Bee Health Data Portal as part of the B-GOOD project (grant agreement 817622), funded under the EU Horizon 2020 Research and Innovation Programme.

Table 5. Standardised metadata of the data provider B-GOOD Bee Health Data Portal. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Name B-GOOD Bee Health Data Portal
Url
Acronym B-GOOD
IRI https://app.pollinatorhub.eu/data-providers/b-good-bee-health-data-portal
Address https://b-good-project.eu
Country Belgium
Contact b-good-project.eu
Description

Project funded by the EU Horizon 2020 Research and Innovation Programme under grant agreement No 817622. Project website: https://b-good-project.eu

Tables

Data

Table 6. Standardised metadata of the dataset. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Unique identifier BGDGN198.DATAA484.0
Name Data
Target IRI https://app.pollinatorhub.eu/dataset-discovery/parts/BGDGN198.DATAA484.0
Table Type File
Licence CC BY-NC-ND 4.0
Description

Data file

Data file

Metadata

n/a
Table 7. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Column Name Column Description Datatype Descriptor Unit
Sample_ID

Identifer of the sample.

String dwc:materialSampleID [0.0.MTRLS489]

n/a

Sample_nr

Not specified by the data provider.

String Text [0.0.TEXTA315]

n/a

Type

Not specified by the data provider.

String Text [0.0.TEXTA315]

n/a

country_code

ISO 3166-1 Alpha-2 country code of the country from which the sample was taken.

String iso-3166:alpha-2CountryCode [0.0.LPHCN4]

n/a

country

Name of the country from which the sample was taken.

String dwc:country [0.0.CNTRY159]

n/a

wp

Not specified by the data provider. Presumably the Work Packege of the B-GOOD project.

Integer number Integer [0.0.NTGER313]

n/a

subspecies

Name of the species analysed.

String dwc:scientificName [0.0.SCNTF503]

n/a

d/w

Not specified by the data provider. Presumably the honey bee caste that was sampled.

String Text [0.0.TEXTA315]

n/a

Adami

Not specified by the data provider.

Decimal number DecimalNumber [0.0.DCMLN314]

n/a

Anatoliaca

Not specified by the data provider.

Decimal number DecimalNumber [0.0.DCMLN314]

n/a

Armeniaca

Not specified by the data provider.

Decimal number DecimalNumber [0.0.DCMLN314]

n/a

Carnica

Not specified by the data provider.

Decimal number DecimalNumber [0.0.DCMLN314]

n/a

Carpatica

Not specified by the data provider.

Decimal number DecimalNumber [0.0.DCMLN314]

n/a

Caucasica

Not specified by the data provider.

Decimal number DecimalNumber [0.0.DCMLN314]

n/a

Cecropia

Not specified by the data provider.

Decimal number DecimalNumber [0.0.DCMLN314]

n/a

Cypria

Not specified by the data provider.

Decimal number DecimalNumber [0.0.DCMLN314]

n/a

Iberiensis

Not specified by the data provider.

Decimal number DecimalNumber [0.0.DCMLN314]

n/a

Ligustica

Not specified by the data provider.

Decimal number DecimalNumber [0.0.DCMLN314]

n/a

Macedonica

Not specified by the data provider.

Decimal number DecimalNumber [0.0.DCMLN314]

n/a

Mellifera

Not specified by the data provider.

Decimal number DecimalNumber [0.0.DCMLN314]

n/a

Rodopica

Not specified by the data provider.

Decimal number DecimalNumber [0.0.DCMLN314]

n/a

Ruttneri

Not specified by the data provider.

Decimal number DecimalNumber [0.0.DCMLN314]

n/a

Max_predicted
Decimal number DecimalNumber [0.0.DCMLN314]

n/a

Predicted

Not specified by the data provider.

String Text [0.0.TEXTA315]

n/a

Reported

Not specified by the data provider.

String Text [0.0.TEXTA315]

n/a

Metadata of individual tables can be found in Annex 1.

Descriptive measures

Table 8. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Column Name Range Mean Minimum Q1 Median Q3 Maximum Total Missing Zero Blank Distinct
Sample_ID 8 - 8 n/a ADBYNMMN n/a n/a n/a ZZUPHAAK 286 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 285 ( 99.7% )
Sample_nr 5 - 5 n/a B4222 n/a n/a n/a B5741 286 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 286 ( 100.0% )
Type 3 - 3 n/a TST n/a n/a n/a TST 286 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 1 ( 0.3% )
country_code 2 - 2 n/a BE n/a n/a n/a UK 286 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 10 ( 3.5% )
country 5 - 15 n/a Belgium n/a n/a n/a the Netherla… 286 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 10 ( 3.5% )
wp 1 - 1 3.0 3 3 3 3 3 286 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 1 ( 0.3% )
subspecies 14 - 14 n/a Apis mellife… n/a n/a n/a Apis mellife… 286 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 1 ( 0.3% )
d/w 6 - 6 n/a worker n/a n/a n/a worker 286 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 1 ( 0.3% )
Adami 1 - 4 4.39 0 0.4 1.1 3.625 93.3 286 0 ( 0.0% ) 13 ( 4.5% ) 0 ( 0.0% ) 92 ( 32.2% )
Anatoliaca 1 - 3 0.11 0 0 0 0.1 9.8 286 0 ( 0.0% ) 204 ( 71.3% ) 0 ( 0.0% ) 11 ( 3.8% )
Armeniaca 1 - 3 0.11 0 0 0.1 0.1 3 286 0 ( 0.0% ) 80 ( 28.0% ) 0 ( 0.0% ) 9 ( 3.1% )
Carnica 1 - 4 42.95 0 1.5 27.4 90.175 98.1 286 0 ( 0.0% ) 11 ( 3.8% ) 0 ( 0.0% ) 185 ( 64.7% )
Carpatica 1 - 4 4.07 0 0 0.1 0.5 87.3 286 0 ( 0.0% ) 115 ( 40.2% ) 0 ( 0.0% ) 52 ( 18.2% )
Caucasica 1 - 3 0.01 0 0 0 0 0.3 286 0 ( 0.0% ) 268 ( 93.7% ) 0 ( 0.0% ) 4 ( 1.4% )
Cecropia 1 - 4 0.11 0 0 0 0.1 13.1 286 0 ( 0.0% ) 190 ( 66.4% ) 0 ( 0.0% ) 10 ( 3.5% )
Cypria 1 - 3 0.20 0 0.1 0.1 0.2 3.9 286 0 ( 0.0% ) 30 ( 10.5% ) 0 ( 0.0% ) 16 ( 5.6% )
Iberiensis 1 - 4 7.87 0 0.1 0.1 0.2 99.2 286 0 ( 0.0% ) 14 ( 4.9% ) 0 ( 0.0% ) 20 ( 7.0% )
Ligustica 1 - 4 13.04 0 0.3 0.9 2.55 98.8 286 0 ( 0.0% ) 2 ( 0.7% ) 0 ( 0.0% ) 83 ( 29.0% )
Macedonica 1 - 3 0.06 0 0 0 0 7.3 286 0 ( 0.0% ) 249 ( 87.1% ) 0 ( 0.0% ) 10 ( 3.5% )
Mellifera 1 - 4 26.59 0 0.2 1.45 63.7 98.9 286 0 ( 0.0% ) 30 ( 10.5% ) 0 ( 0.0% ) 141 ( 49.3% )
Rodopica 1 - 3 0.01 0 0 0 0 0.8 286 0 ( 0.0% ) 266 ( 93.0% ) 0 ( 0.0% ) 5 ( 1.7% )
Ruttneri 1 - 4 0.44 0.1 0.1 0.2 0.3 54.7 286 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 15 ( 5.2% )
Max_predicted 2 - 4 85.48 32.4 78.25 93.05 96.5 99.2 286 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 171 ( 59.8% )
Predicted 5 - 10 n/a Adami n/a n/a n/a Ruttneri 286 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 7 ( 2.4% )
Reported 5 - 12 n/a Adami n/a n/a n/a Not_assigned 286 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 6 ( 2.1% )

Quality measures

Table 9. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Column Name Completeness Uniqueness Most Common Value Least Common Value
Sample_ID
100.00%
99.65%
RLJLBCKU LGYKGAXM
Sample_nr
100.00%
100.00%
B4806 B4806
Type
100.00%
0.35%
TST TST
country_code
100.00%
3.50%
NL FR
country
100.00%
3.50%
the Netherlands France
wp
100.00%
0.35%
3 3
subspecies
100.00%
0.35%
Apis mellifera Apis mellifera
d/w
100.00%
0.35%
worker worker
Adami
100.00%
32.17%
0.1 58.1
Anatoliaca
100.00%
3.85%
0 3.9
Armeniaca
100.00%
3.15%
0.1 1.7
Carnica
100.00%
64.69%
0.1 95.5
Carpatica
100.00%
18.18%
0 28.1
Caucasica
100.00%
1.40%
0 0.3
Cecropia
100.00%
3.50%
0 2.9
Cypria
100.00%
5.59%
0.1 0.8
Iberiensis
100.00%
6.99%
0.1 4.3
Ligustica
100.00%
29.02%
0.1 95.9
Macedonica
100.00%
3.50%
0 0.5
Mellifera
100.00%
49.30%
0.1 98.6
Rodopica
100.00%
1.75%
0 0.8
Ruttneri
100.00%
5.24%
0.2 1.7
Max_predicted
100.00%
59.79%
99.2 95.5
Predicted
100.00%
2.45%
Carnica Ruttneri
Reported
100.00%
2.10%
Not_assigned Adami

Changes made to preparatory file

  1. Column countrycode was inserted and filled with the ISO 3166-1 alpha-2 country code in order to facilitate the automated analysis of the data.

Changes made to data

  1. Names of the countries in column countries were modified to match the names used by ISO 3166.

Unresolved issues

  1. It is unclear what column Sample_nr contains. The data provider is requested to make this information available.
  2. It is unclear what column Type contains. The data provider is requested to make this information available.
  3. It is unclear what column wp contains. The data provider is requested to make this information available.
  4. It is unclear what column d/w contains. The data provider is requested to make this information available.
  5. It is unclear what column Adami contains. The data provider is requested to make this information available.
  6. It is unclear what column Anatoliaca contains. The data provider is requested to make this information available.
  7. It is unclear what column Armeniaca contains. The data provider is requested to make this information available.
  8. It is unclear what column Carnica contains. The data provider is requested to make this information available.
  9. It is unclear what column Carpatica contains. The data provider is requested to make this information available.
  10. It is unclear what column Caucasica contains. The data provider is requested to make this information available.
  11. It is unclear what column Cecropia contains. The data provider is requested to make this information available.
  12. It is unclear what column Cypria contains. The data provider is requested to make this information available.
  13. It is unclear what column Iberiensis contains. The data provider is requested to make this information available.
  14. It is unclear what column Ligustica contains. The data provider is requested to make this information available.
  15. It is unclear what column Macedonica contains. The data provider is requested to make this information available.
  16. It is unclear what column Mellifera contains. The data provider is requested to make this information available.
  17. It is unclear what column Rodopica contains. The data provider is requested to make this information available.
  18. It is unclear what column Ruttneri contains. The data provider is requested to make this information available.
  19. It is unclear what column Max_predicted contains. The data provider is requested to make this information available.
  20. It is unclear what column Predicted contains. The data provider is requested to make this information available.
  21. It is unclear what column Reported contains. The data provider is requested to make this information available.

Data including Buckfast

Table 10. Standardised metadata of the dataset. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Unique identifier BGDGN198.DTNCL485.0
Name Data including Buckfast
Target IRI https://app.pollinatorhub.eu/dataset-discovery/parts/BGDGN198.DTNCL485.0
Table Type File
Licence CC BY-NC-ND 4.0
Description

Data file including Buckfast

Data file including Buckfast

Metadata

n/a
Table 11. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Column Name Column Description Datatype Descriptor Unit
Buckfast

Not specified by the data provider.

Decimal number DecimalNumber [0.0.DCMLN314]

n/a

Carnica

Not specified by the data provider.

Decimal number DecimalNumber [0.0.DCMLN314]

n/a

Carpatica

Not specified by the data provider.

Decimal number DecimalNumber [0.0.DCMLN314]

n/a

Caucasica

Not specified by the data provider.

Decimal number DecimalNumber [0.0.DCMLN314]

n/a

Cecropia

Not specified by the data provider.

Decimal number DecimalNumber [0.0.DCMLN314]

n/a

Cypria

Not specified by the data provider.

Decimal number DecimalNumber [0.0.DCMLN314]

n/a

Iberiensis

Not specified by the data provider.

Decimal number DecimalNumber [0.0.DCMLN314]

n/a

Ligustica

Not specified by the data provider.

Decimal number DecimalNumber [0.0.DCMLN314]

n/a

Macedonica

Not specified by the data provider.

Decimal number DecimalNumber [0.0.DCMLN314]

n/a

Mellifera

Not specified by the data provider.

Decimal number DecimalNumber [0.0.DCMLN314]

n/a

Rodopica

Not specified by the data provider.

Decimal number DecimalNumber [0.0.DCMLN314]

n/a

Ruttneri

Not specified by the data provider.

Decimal number DecimalNumber [0.0.DCMLN314]

n/a

Siciliana

Not specified by the data provider.

Decimal number DecimalNumber [0.0.DCMLN314]

n/a

Max_predicted

Not specified by the data provider.

Decimal number DecimalNumber [0.0.DCMLN314]

n/a

Predicted

Not specified by the data provider.

String Text [0.0.TEXTA315]

n/a

Reported

Not specified by the data provider.

String Text [0.0.TEXTA315]

n/a

Sample_ID

Identifer of the sample.

String dwc:materialSampleID [0.0.MTRLS489]

n/a

Sample_nr

Not specified by the data provider.

String Text [0.0.TEXTA315]

n/a

Type

Not specified by the data provider.

String Text [0.0.TEXTA315]

n/a

Pool_sample

Not specified by the data provider.

String Text [0.0.TEXTA315]

n/a

ssp_region

Not specified by the data provider.

String Text [0.0.TEXTA315]

n/a

country_code

ISO 3166-1 Alpha-2 country code of the country from which the sample was taken.

String iso-3166:alpha-2CountryCode [0.0.LPHCN4]

n/a

country

Name of the country from which the sample was taken.

String Text [0.0.TEXTA315]

n/a

hygienic

Not specified by the data provider.

String Text [0.0.TEXTA315]

n/a

wp

Not specified by the data provider. Presumably the Work Packege of the B-GOOD project.

Integer number Integer [0.0.NTGER313]

n/a

species

Name of the species analysed.

String dwc:scientificName [0.0.SCNTF503]

n/a

ssp_country

Not specified by the data provider.

String Text [0.0.TEXTA315]

n/a

d/w

Not specified by the data provider. Presumably the honey bee caste that was sampled.

String Text [0.0.TEXTA315]

n/a

branch

Not specified by the data provider.

String Text [0.0.TEXTA315]

n/a

Adami

Not specified by the data provider.

Decimal number DecimalNumber [0.0.DCMLN314]

n/a

Anatoliaca

Not specified by the data provider.

Decimal number DecimalNumber [0.0.DCMLN314]

n/a

Armeniaca

Not specified by the data provider.

Decimal number DecimalNumber [0.0.DCMLN314]

n/a

Metadata of individual tables can be found in Annex 1.

Descriptive measures

Table 12. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Column Name Range Mean Minimum Q1 Median Q3 Maximum Total Missing Zero Blank Distinct
Buckfast 1 - 4 3.27 0 0.3 0.7 1.95 71.5 192 0 ( 0.0% ) 13 ( 6.8% ) 0 ( 0.0% ) 56 ( 29.2% )
Carnica 1 - 4 38.05 0 0.525 15.1 84.075 98.2 192 0 ( 0.0% ) 9 ( 4.7% ) 0 ( 0.0% ) 132 ( 68.8% )
Carpatica 1 - 4 4.35 0 0.1 0.5 1.275 87.7 192 0 ( 0.0% ) 39 ( 20.3% ) 0 ( 0.0% ) 42 ( 21.9% )
Caucasica 1 - 3 0.03 0 0 0 0 0.4 192 0 ( 0.0% ) 153 ( 79.7% ) 0 ( 0.0% ) 5 ( 2.6% )
Cecropia 1 - 3 0.09 0 0 0.1 0.1 0.6 192 0 ( 0.0% ) 74 ( 38.5% ) 0 ( 0.0% ) 7 ( 3.6% )
Cypria 1 - 3 0.07 0 0 0 0.1 1.5 192 0 ( 0.0% ) 127 ( 66.1% ) 0 ( 0.0% ) 9 ( 4.7% )
Iberiensis 1 - 4 5.39 0 0.1 0.1 0.175 99.3 192 0 ( 0.0% ) 35 ( 18.2% ) 0 ( 0.0% ) 15 ( 7.8% )
Ligustica 1 - 4 14.78 0 0.1 0.75 2.275 97.8 192 0 ( 0.0% ) 36 ( 18.8% ) 0 ( 0.0% ) 67 ( 34.9% )
Macedonica 1 - 3 0.14 0 0 0.1 0.2 1.2 192 0 ( 0.0% ) 61 ( 31.8% ) 0 ( 0.0% ) 8 ( 4.2% )
Mellifera 1 - 4 31.30 0 1.7 10.4 69.6 99.8 192 0 ( 0.0% ) 11 ( 5.7% ) 0 ( 0.0% ) 128 ( 66.7% )
Rodopica 1 - 3 0.11 0 0 0.1 0.1 1.6 192 0 ( 0.0% ) 75 ( 39.1% ) 0 ( 0.0% ) 9 ( 4.7% )
Ruttneri 1 - 3 0.49 0 0.1 0.1 0.3 54 192 0 ( 0.0% ) 15 ( 7.8% ) 0 ( 0.0% ) 14 ( 7.3% )
Siciliana 1 - 3 0.22 0 0.1 0.1 0.2 2.7 192 0 ( 0.0% ) 19 ( 9.9% ) 0 ( 0.0% ) 14 ( 7.3% )
Max_predicted 2 - 4 83.56 33.9 73.175 91.85 97.675 99.8 192 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 134 ( 69.8% )
Predicted 5 - 10 n/a Adami n/a n/a n/a Ruttneri 192 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 8 ( 4.2% )
Reported 7 - 12 n/a Carnica n/a n/a n/a Not_assigned 192 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 5 ( 2.6% )
Sample_ID 2 - 8 n/a AFHKUMFR n/a n/a n/a ZZUPHAAK 192 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 192 ( 100.0% )
Sample_nr 5 - 5 n/a B4806 n/a n/a n/a B5741 192 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 192 ( 100.0% )
Type 3 - 3 n/a TST n/a n/a n/a TST 192 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 1 ( 0.5% )
Pool_sample 2 - 2 n/a Na n/a n/a n/a Na 192 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 1 ( 0.5% )
ssp_region 2 - 2 n/a Na n/a n/a n/a Na 192 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 1 ( 0.5% )
country_code 2 - 2 n/a BE n/a n/a n/a UK 192 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 10 ( 5.2% )
country 5 - 15 n/a Belgium n/a n/a n/a the Netherla… 192 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 10 ( 5.2% )
hygienic 2 - 2 n/a Na n/a n/a n/a Na 192 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 1 ( 0.5% )
wp 1 - 1 3.0 3 3 3 3 3 192 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 1 ( 0.5% )
species 14 - 14 n/a Apis mellife… n/a n/a n/a Apis mellife… 192 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 1 ( 0.5% )
ssp_country 2 - 2 n/a Na n/a n/a n/a Na 192 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 1 ( 0.5% )
d/w 6 - 6 n/a worker n/a n/a n/a worker 192 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 1 ( 0.5% )
branch 1 - 1 n/a M n/a n/a n/a M 192 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 1 ( 0.5% )
Adami 1 - 4 1.53 0 0.1 0.2 0.4 63 192 0 ( 0.0% ) 37 ( 19.3% ) 0 ( 0.0% ) 23 ( 12.0% )
Anatoliaca 1 - 3 0.05 0 0 0 0.1 1.6 192 0 ( 0.0% ) 139 ( 72.4% ) 0 ( 0.0% ) 9 ( 4.7% )
Armeniaca 1 - 3 0.03 0 0 0 0 0.4 192 0 ( 0.0% ) 154 ( 80.2% ) 0 ( 0.0% ) 5 ( 2.6% )

Quality measures

Table 13. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Column Name Completeness Uniqueness Most Common Value Least Common Value
Buckfast
100.00%
29.17%
0.3 3.6
Carnica
100.00%
68.75%
0.1 93.5
Carpatica
100.00%
21.88%
0 13.6
Caucasica
100.00%
2.60%
0 0.4
Cecropia
100.00%
3.65%
0.1 0.5
Cypria
100.00%
4.69%
0 1.1
Iberiensis
100.00%
7.81%
0.1 4.3
Ligustica
100.00%
34.90%
0 71.6
Macedonica
100.00%
4.17%
0.1 1
Mellifera
100.00%
66.67%
0 11.3
Rodopica
100.00%
4.69%
0 1.6
Ruttneri
100.00%
7.29%
0.1 0.7
Siciliana
100.00%
7.29%
0.1 0.9
Max_predicted
100.00%
69.79%
99.2 93.5
Predicted
100.00%
4.17%
Carnica Ruttneri
Reported
100.00%
2.60%
Not_assigned Iberiensis
Sample_ID
100.00%
100.00%
LGYKGAXM LGYKGAXM
Sample_nr
100.00%
100.00%
B4806 B4806
Type
100.00%
0.52%
TST TST
Pool_sample
100.00%
0.52%
Na Na
ssp_region
100.00%
0.52%
Na Na
country_code
100.00%
5.21%
NL FR
country
100.00%
5.21%
the Netherlands France
hygienic
100.00%
0.52%
Na Na
wp
100.00%
0.52%
3 3
species
100.00%
0.52%
Apis mellifera Apis mellifera
ssp_country
100.00%
0.52%
Na Na
d/w
100.00%
0.52%
worker worker
branch
100.00%
0.52%
M M
Adami
100.00%
11.98%
0.2 42.9
Anatoliaca
100.00%
4.69%
0 0.9
Armeniaca
100.00%
2.60%
0 0.4

Changes made to preparatory file

  1. Column countrycode was inserted and filled with the ISO 3166-1 alpha-2 country code in order to facilitate the automated analysis of the data.

Changes made to data

  1. Names of the countries in column countries were modified to match the names used by ISO 3166.

Unresolved issues

  1. It is unclear what column Sample_nr contains. The data provider is requested to make this information available.
  2. It is unclear what column Type contains. The data provider is requested to make this information available.
  3. It is unclear what column Pool_sample contains. The data provider is requested to make this information available.
  4. It is unclear what column ssp_region contains. The data provider is requested to make this information available.
  5. It is unclear what column hygienic contains. The data provider is requested to make this information available.
  6. It is unclear what column wp contains. The data provider is requested to make this information available.
  7. It is unclear what column ssp_country contains. The data provider is requested to make this information available.
  8. It is unclear what column d/w contains. The data provider is requested to make this information available.
  9. It is unclear what column branch contains. The data provider is requested to make this information available.
  10. It is unclear what column Adami contains. The data provider is requested to make this information available.
  11. It is unclear what column Anatoliaca contains. The data provider is requested to make this information available.
  12. It is unclear what column Armeniaca contains. The data provider is requested to make this information available.
  13. It is unclear what column Buckfast contains. The data provider is requested to make this information available.
  14. It is unclear what column Carnica contains. The data provider is requested to make this information available.
  15. It is unclear what column Carpatica contains. The data provider is requested to make this information available.
  16. It is unclear what column Caucasica contains. The data provider is requested to make this information available.
  17. It is unclear what column Cecropia contains. The data provider is requested to make this information available.
  18. It is unclear what column Cypria contains. The data provider is requested to make this information available.
  19. It is unclear what column Iberiensis contains. The data provider is requested to make this information available.
  20. It is unclear what column Ligustica contains. The data provider is requested to make this information available.
  21. It is unclear what column Macedonica contains. The data provider is requested to make this information available.
  22. It is unclear what column Mellifera contains. The data provider is requested to make this information available.
  23. It is unclear what column Rodopica contains. The data provider is requested to make this information available.
  24. It is unclear what column Ruttneri contains. The data provider is requested to make this information available.
  25. It is unclear what column Max_predicted contains. The data provider is requested to make this information available.
  26. It is unclear what column Predicted contains. The data provider is requested to make this information available.
  27. It is unclear what column Reported contains. The data provider is requested to make this information available.

References

  1. Tehel A., Paxton R. 2023 genotyping SNPchip Tier1 Tier2 [WP1]. B-GOOD Bee Health Data Portal. [2024-11-2] beehealthdata.org
  2. Momeni J., Parejo M., Nielsen RO., Langa J., Montes I., Papoutsis L. et al. 2021 Authoritative subspecies diagnosis tool for European honey bees based on ancestry informative SNPs. BMC Genomics. Vol. 22, (1) p. 101. doi: 10.1186/s12864-021-07379-7

Annex 1: Table column reports

Table: Data

Column: Sample_ID

Table 14. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Parameter Content
Column name Sample_ID
Description

Identifer of the sample.

Data type String
Descriptor dwc:materialSampleID [UID:0.0.MTRLS489]
Descriptor description

An identifier for a material sample.

Descriptor target IRI http://rs.tdwg.org/dwc/terms/materialSampleID
Unit

n/a

Table 15. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Column Name Range Mean Minimum Q1 Median Q3 Maximum Total Missing Zero Blank Distinct
Sample_ID 8 - 8 n/a ADBYNMMN n/a n/a n/a ZZUPHAAK 286 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 285 ( 99.7% )
Table 16. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Column Name Completeness Uniqueness Most Common Value Least Common Value
Sample_ID
100.00%
99.65%
RLJLBCKU LGYKGAXM

Completeness

Figure 1. Visualization of completeness of the data in the column.

Uniqueness

Figure 2. Visualization of uniqueness of the data in the column.

Column: Sample_nr

Table 17. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Parameter Content
Column name Sample_nr
Description

Not specified by the data provider.

Data type String
Descriptor Text [UID:0.0.TEXTA315]
Descriptor description

In computer programming, a string is traditionally a sequence of characters, either as a literal constant or as some kind of variable. The latter may allow its elements to be mutated and the length changed, or it may be fixed (after creation). A string is generally considered as a data type and is often implemented as an array data structure of bytes (or words) that stores a sequence of elements, typically characters, using some character encoding. String may also denote more general arrays or other sequence (or list) data types and structures.

Descriptor target IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.TEXTA315
Unit

n/a

Table 18. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Column Name Range Mean Minimum Q1 Median Q3 Maximum Total Missing Zero Blank Distinct
Sample_nr 5 - 5 n/a B4222 n/a n/a n/a B5741 286 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 286 ( 100.0% )
Table 19. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Column Name Completeness Uniqueness Most Common Value Least Common Value
Sample_nr
100.00%
100.00%
B4806 B4806

Completeness

Figure 3. Visualization of completeness of the data in the column.

Uniqueness

Figure 4. Visualization of uniqueness of the data in the column.

Column: Type

Table 20. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Parameter Content
Column name Type
Description

Not specified by the data provider.

Data type String
Descriptor Text [UID:0.0.TEXTA315]
Descriptor description

In computer programming, a string is traditionally a sequence of characters, either as a literal constant or as some kind of variable. The latter may allow its elements to be mutated and the length changed, or it may be fixed (after creation). A string is generally considered as a data type and is often implemented as an array data structure of bytes (or words) that stores a sequence of elements, typically characters, using some character encoding. String may also denote more general arrays or other sequence (or list) data types and structures.

Descriptor target IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.TEXTA315
Unit

n/a

Table 21. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Column Name Range Mean Minimum Q1 Median Q3 Maximum Total Missing Zero Blank Distinct
Type 3 - 3 n/a TST n/a n/a n/a TST 286 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 1 ( 0.3% )
Table 22. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Column Name Completeness Uniqueness Most Common Value Least Common Value
Type
100.00%
0.35%
TST TST

Data Distribution Top 20

Figure 5. Distribution of 20 most common values, from highest to lowest.

Completeness

Figure 6. Visualization of completeness of the data in the column.

Uniqueness

Figure 7. Visualization of uniqueness of the data in the column.

Column: country_code

Table 23. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Parameter Content
Column name country_code
Description

ISO 3166-1 Alpha-2 country code of the country from which the sample was taken.

Data type String
Descriptor iso-3166:alpha-2CountryCode [UID:0.0.LPHCN4]
Descriptor description

A two-letter code that represents a country name, recommended as the general purpose code.

Descriptor target IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.LPHCN4
Unit

n/a

Table 24. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Column Name Range Mean Minimum Q1 Median Q3 Maximum Total Missing Zero Blank Distinct
country_code 2 - 2 n/a BE n/a n/a n/a UK 286 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 10 ( 3.5% )
Table 25. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Column Name Completeness Uniqueness Most Common Value Least Common Value
country_code
100.00%
3.50%
NL FR

Data Distribution Top 20

Figure 8. Distribution of 20 most common values, from highest to lowest.

Completeness

Figure 9. Visualization of completeness of the data in the column.

Uniqueness

Figure 10. Visualization of uniqueness of the data in the column.

Column: country

Table 26. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Parameter Content
Column name country
Description

Name of the country from which the sample was taken.

Data type String
Descriptor dwc:country [UID:0.0.CNTRY159]
Descriptor description

A collective generic term that refers here to a wide variety of dependencies, areas of special sovereignty, uninhabited islands, and other entities in addition to the traditional countries or independent states.

Descriptor target IRI http://rs.tdwg.org/dwc/terms/country
Unit

n/a

Table 27. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Column Name Range Mean Minimum Q1 Median Q3 Maximum Total Missing Zero Blank Distinct
country 5 - 15 n/a Belgium n/a n/a n/a the Netherla… 286 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 10 ( 3.5% )
Table 28. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Column Name Completeness Uniqueness Most Common Value Least Common Value
country
100.00%
3.50%
the Netherlands France

Data Distribution Top 20

Figure 11. Distribution of 20 most common values, from highest to lowest.

Completeness

Figure 12. Visualization of completeness of the data in the column.

Uniqueness

Figure 13. Visualization of uniqueness of the data in the column.

Column: wp

Table 29. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Parameter Content
Column name wp
Description

Not specified by the data provider. Presumably the Work Packege of the B-GOOD project.

Data type Integer number
Descriptor Integer [UID:0.0.NTGER313]
Descriptor description

A number with no fractional part, including the negative and positive numbers as well as zero.

Descriptor target IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.NTGER313
Unit

n/a

Table 30. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Column Name Range Mean Minimum Q1 Median Q3 Maximum Total Missing Zero Blank Distinct
wp 1 - 1 3.0 3 3 3 3 3 286 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 1 ( 0.3% )
Table 31. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Column Name Completeness Uniqueness Most Common Value Least Common Value
wp
100.00%
0.35%
3 3

Data Distribution Top 20

Figure 14. Distribution of 20 most common values, from highest to lowest.

Continuous Data Distribution

Figure 15. Distribution of values in the column.

Outliers

Figure 16. Visualization of median, min, max, and outliers in the column.

Completeness

Figure 17. Visualization of completeness of the data in the column.

Uniqueness

Figure 18. Visualization of uniqueness of the data in the column.

Column: subspecies

Table 32. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Parameter Content
Column name subspecies
Description

Name of the species analysed.

Data type String
Descriptor dwc:scientificName [UID:0.0.SCNTF503]
Descriptor description

The full scientific name, with authorship and date information if known.

Descriptor target IRI http://rs.tdwg.org/dwc/terms/scientificName
Unit

n/a

Table 33. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Column Name Range Mean Minimum Q1 Median Q3 Maximum Total Missing Zero Blank Distinct
subspecies 14 - 14 n/a Apis mellife… n/a n/a n/a Apis mellife… 286 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 1 ( 0.3% )
Table 34. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Column Name Completeness Uniqueness Most Common Value Least Common Value
subspecies
100.00%
0.35%
Apis mellifera Apis mellifera

Data Distribution Top 20

Figure 19. Distribution of 20 most common values, from highest to lowest.

Completeness

Figure 20. Visualization of completeness of the data in the column.

Uniqueness

Figure 21. Visualization of uniqueness of the data in the column.

Column: d/w

Table 35. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Parameter Content
Column name d/w
Description

Not specified by the data provider. Presumably the honey bee caste that was sampled.

Data type String
Descriptor Text [UID:0.0.TEXTA315]
Descriptor description

In computer programming, a string is traditionally a sequence of characters, either as a literal constant or as some kind of variable. The latter may allow its elements to be mutated and the length changed, or it may be fixed (after creation). A string is generally considered as a data type and is often implemented as an array data structure of bytes (or words) that stores a sequence of elements, typically characters, using some character encoding. String may also denote more general arrays or other sequence (or list) data types and structures.

Descriptor target IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.TEXTA315
Unit

n/a

Table 36. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Column Name Range Mean Minimum Q1 Median Q3 Maximum Total Missing Zero Blank Distinct
d/w 6 - 6 n/a worker n/a n/a n/a worker 286 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 1 ( 0.3% )
Table 37. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Column Name Completeness Uniqueness Most Common Value Least Common Value
d/w
100.00%
0.35%
worker worker

Data Distribution Top 20

Figure 22. Distribution of 20 most common values, from highest to lowest.

Completeness

Figure 23. Visualization of completeness of the data in the column.

Uniqueness

Figure 24. Visualization of uniqueness of the data in the column.

Column: Adami

Table 38. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Parameter Content
Column name Adami
Description

Not specified by the data provider.

Data type Decimal number
Descriptor DecimalNumber [UID:0.0.DCMLN314]
Descriptor description

Any of the rational or irrational numbers.

Descriptor target IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.DCMLN314
Unit

n/a

Table 39. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Column Name Range Mean Minimum Q1 Median Q3 Maximum Total Missing Zero Blank Distinct
Adami 1 - 4 4.39 0 0.4 1.1 3.625 93.3 286 0 ( 0.0% ) 13 ( 4.5% ) 0 ( 0.0% ) 92 ( 32.2% )
Table 40. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Column Name Completeness Uniqueness Most Common Value Least Common Value
Adami
100.00%
32.17%
0.1 58.1

Continuous Data Distribution

Figure 25. Distribution of values in the column.

Outliers

Figure 26. Visualization of median, min, max, and outliers in the column.

Completeness

Figure 27. Visualization of completeness of the data in the column.

Uniqueness

Figure 28. Visualization of uniqueness of the data in the column.

Column: Anatoliaca

Table 41. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Parameter Content
Column name Anatoliaca
Description

Not specified by the data provider.

Data type Decimal number
Descriptor DecimalNumber [UID:0.0.DCMLN314]
Descriptor description

Any of the rational or irrational numbers.

Descriptor target IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.DCMLN314
Unit

n/a

Table 42. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Column Name Range Mean Minimum Q1 Median Q3 Maximum Total Missing Zero Blank Distinct
Anatoliaca 1 - 3 0.11 0 0 0 0.1 9.8 286 0 ( 0.0% ) 204 ( 71.3% ) 0 ( 0.0% ) 11 ( 3.8% )
Table 43. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Column Name Completeness Uniqueness Most Common Value Least Common Value
Anatoliaca
100.00%
3.85%
0 3.9

Data Distribution Top 20

Figure 29. Distribution of 20 most common values, from highest to lowest.

Continuous Data Distribution

Figure 30. Distribution of values in the column.

Completeness

Figure 31. Visualization of completeness of the data in the column.

Uniqueness

Figure 32. Visualization of uniqueness of the data in the column.

Column: Armeniaca

Table 44. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Parameter Content
Column name Armeniaca
Description

Not specified by the data provider.

Data type Decimal number
Descriptor DecimalNumber [UID:0.0.DCMLN314]
Descriptor description

Any of the rational or irrational numbers.

Descriptor target IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.DCMLN314
Unit

n/a

Table 45. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Column Name Range Mean Minimum Q1 Median Q3 Maximum Total Missing Zero Blank Distinct
Armeniaca 1 - 3 0.11 0 0 0.1 0.1 3 286 0 ( 0.0% ) 80 ( 28.0% ) 0 ( 0.0% ) 9 ( 3.1% )
Table 46. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Column Name Completeness Uniqueness Most Common Value Least Common Value
Armeniaca
100.00%
3.15%
0.1 1.7

Data Distribution Top 20

Figure 33. Distribution of 20 most common values, from highest to lowest.

Continuous Data Distribution

Figure 34. Distribution of values in the column.

Outliers

Figure 35. Visualization of median, min, max, and outliers in the column.

Completeness

Figure 36. Visualization of completeness of the data in the column.

Uniqueness

Figure 37. Visualization of uniqueness of the data in the column.

Column: Carnica

Table 47. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Parameter Content
Column name Carnica
Description

Not specified by the data provider.

Data type Decimal number
Descriptor DecimalNumber [UID:0.0.DCMLN314]
Descriptor description

Any of the rational or irrational numbers.

Descriptor target IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.DCMLN314
Unit

n/a

Table 48. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Column Name Range Mean Minimum Q1 Median Q3 Maximum Total Missing Zero Blank Distinct
Carnica 1 - 4 42.95 0 1.5 27.4 90.175 98.1 286 0 ( 0.0% ) 11 ( 3.8% ) 0 ( 0.0% ) 185 ( 64.7% )
Table 49. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Column Name Completeness Uniqueness Most Common Value Least Common Value
Carnica
100.00%
64.69%
0.1 95.5

Continuous Data Distribution

Figure 38. Distribution of values in the column.

Outliers

Figure 39. Visualization of median, min, max, and outliers in the column.

Completeness

Figure 40. Visualization of completeness of the data in the column.

Uniqueness

Figure 41. Visualization of uniqueness of the data in the column.

Column: Carpatica

Table 50. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Parameter Content
Column name Carpatica
Description

Not specified by the data provider.

Data type Decimal number
Descriptor DecimalNumber [UID:0.0.DCMLN314]
Descriptor description

Any of the rational or irrational numbers.

Descriptor target IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.DCMLN314
Unit

n/a

Table 51. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Column Name Range Mean Minimum Q1 Median Q3 Maximum Total Missing Zero Blank Distinct
Carpatica 1 - 4 4.07 0 0 0.1 0.5 87.3 286 0 ( 0.0% ) 115 ( 40.2% ) 0 ( 0.0% ) 52 ( 18.2% )
Table 52. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Column Name Completeness Uniqueness Most Common Value Least Common Value
Carpatica
100.00%
18.18%
0 28.1

Continuous Data Distribution

Figure 42. Distribution of values in the column.

Outliers

Figure 43. Visualization of median, min, max, and outliers in the column.

Completeness

Figure 44. Visualization of completeness of the data in the column.

Uniqueness

Figure 45. Visualization of uniqueness of the data in the column.

Column: Caucasica

Table 53. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Parameter Content
Column name Caucasica
Description

Not specified by the data provider.

Data type Decimal number
Descriptor DecimalNumber [UID:0.0.DCMLN314]
Descriptor description

Any of the rational or irrational numbers.

Descriptor target IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.DCMLN314
Unit

n/a

Table 54. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Column Name Range Mean Minimum Q1 Median Q3 Maximum Total Missing Zero Blank Distinct
Caucasica 1 - 3 0.01 0 0 0 0 0.3 286 0 ( 0.0% ) 268 ( 93.7% ) 0 ( 0.0% ) 4 ( 1.4% )
Table 55. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Column Name Completeness Uniqueness Most Common Value Least Common Value
Caucasica
100.00%
1.40%
0 0.3

Data Distribution Top 20

Figure 46. Distribution of 20 most common values, from highest to lowest.

Continuous Data Distribution

Figure 47. Distribution of values in the column.

Completeness

Figure 48. Visualization of completeness of the data in the column.

Uniqueness

Figure 49. Visualization of uniqueness of the data in the column.

Column: Cecropia

Table 56. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Parameter Content
Column name Cecropia
Description

Not specified by the data provider.

Data type Decimal number
Descriptor DecimalNumber [UID:0.0.DCMLN314]
Descriptor description

Any of the rational or irrational numbers.

Descriptor target IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.DCMLN314
Unit

n/a

Table 57. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Column Name Range Mean Minimum Q1 Median Q3 Maximum Total Missing Zero Blank Distinct
Cecropia 1 - 4 0.11 0 0 0 0.1 13.1 286 0 ( 0.0% ) 190 ( 66.4% ) 0 ( 0.0% ) 10 ( 3.5% )
Table 58. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Column Name Completeness Uniqueness Most Common Value Least Common Value
Cecropia
100.00%
3.50%
0 2.9

Data Distribution Top 20

Figure 50. Distribution of 20 most common values, from highest to lowest.

Continuous Data Distribution

Figure 51. Distribution of values in the column.

Completeness

Figure 52. Visualization of completeness of the data in the column.

Uniqueness

Figure 53. Visualization of uniqueness of the data in the column.

Column: Cypria

Table 59. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Parameter Content
Column name Cypria
Description

Not specified by the data provider.

Data type Decimal number
Descriptor DecimalNumber [UID:0.0.DCMLN314]
Descriptor description

Any of the rational or irrational numbers.

Descriptor target IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.DCMLN314
Unit

n/a

Table 60. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Column Name Range Mean Minimum Q1 Median Q3 Maximum Total Missing Zero Blank Distinct
Cypria 1 - 3 0.20 0 0.1 0.1 0.2 3.9 286 0 ( 0.0% ) 30 ( 10.5% ) 0 ( 0.0% ) 16 ( 5.6% )
Table 61. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Column Name Completeness Uniqueness Most Common Value Least Common Value
Cypria
100.00%
5.59%
0.1 0.8

Data Distribution Top 20

Figure 54. Distribution of 20 most common values, from highest to lowest.

Continuous Data Distribution

Figure 55. Distribution of values in the column.

Outliers

Figure 56. Visualization of median, min, max, and outliers in the column.

Completeness

Figure 57. Visualization of completeness of the data in the column.

Uniqueness

Figure 58. Visualization of uniqueness of the data in the column.

Column: Iberiensis

Table 62. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Parameter Content
Column name Iberiensis
Description

Not specified by the data provider.

Data type Decimal number
Descriptor DecimalNumber [UID:0.0.DCMLN314]
Descriptor description

Any of the rational or irrational numbers.

Descriptor target IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.DCMLN314
Unit

n/a

Table 63. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Column Name Range Mean Minimum Q1 Median Q3 Maximum Total Missing Zero Blank Distinct
Iberiensis 1 - 4 7.87 0 0.1 0.1 0.2 99.2 286 0 ( 0.0% ) 14 ( 4.9% ) 0 ( 0.0% ) 20 ( 7.0% )
Table 64. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Column Name Completeness Uniqueness Most Common Value Least Common Value
Iberiensis
100.00%
6.99%
0.1 4.3

Continuous Data Distribution

Figure 59. Distribution of values in the column.

Outliers

Figure 60. Visualization of median, min, max, and outliers in the column.

Completeness

Figure 61. Visualization of completeness of the data in the column.

Uniqueness

Figure 62. Visualization of uniqueness of the data in the column.

Column: Ligustica

Table 65. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Parameter Content
Column name Ligustica
Description

Not specified by the data provider.

Data type Decimal number
Descriptor DecimalNumber [UID:0.0.DCMLN314]
Descriptor description

Any of the rational or irrational numbers.

Descriptor target IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.DCMLN314
Unit

n/a

Table 66. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Column Name Range Mean Minimum Q1 Median Q3 Maximum Total Missing Zero Blank Distinct
Ligustica 1 - 4 13.04 0 0.3 0.9 2.55 98.8 286 0 ( 0.0% ) 2 ( 0.7% ) 0 ( 0.0% ) 83 ( 29.0% )
Table 67. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Column Name Completeness Uniqueness Most Common Value Least Common Value
Ligustica
100.00%
29.02%
0.1 95.9

Continuous Data Distribution

Figure 63. Distribution of values in the column.

Outliers

Figure 64. Visualization of median, min, max, and outliers in the column.

Completeness

Figure 65. Visualization of completeness of the data in the column.

Uniqueness

Figure 66. Visualization of uniqueness of the data in the column.

Column: Macedonica

Table 68. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Parameter Content
Column name Macedonica
Description

Not specified by the data provider.

Data type Decimal number
Descriptor DecimalNumber [UID:0.0.DCMLN314]
Descriptor description

Any of the rational or irrational numbers.

Descriptor target IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.DCMLN314
Unit

n/a

Table 69. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Column Name Range Mean Minimum Q1 Median Q3 Maximum Total Missing Zero Blank Distinct
Macedonica 1 - 3 0.06 0 0 0 0 7.3 286 0 ( 0.0% ) 249 ( 87.1% ) 0 ( 0.0% ) 10 ( 3.5% )
Table 70. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Column Name Completeness Uniqueness Most Common Value Least Common Value
Macedonica
100.00%
3.50%
0 0.5

Data Distribution Top 20

Figure 67. Distribution of 20 most common values, from highest to lowest.

Continuous Data Distribution

Figure 68. Distribution of values in the column.

Completeness

Figure 69. Visualization of completeness of the data in the column.

Uniqueness

Figure 70. Visualization of uniqueness of the data in the column.

Column: Mellifera

Table 71. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Parameter Content
Column name Mellifera
Description

Not specified by the data provider.

Data type Decimal number
Descriptor DecimalNumber [UID:0.0.DCMLN314]
Descriptor description

Any of the rational or irrational numbers.

Descriptor target IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.DCMLN314
Unit

n/a

Table 72. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Column Name Range Mean Minimum Q1 Median Q3 Maximum Total Missing Zero Blank Distinct
Mellifera 1 - 4 26.59 0 0.2 1.45 63.7 98.9 286 0 ( 0.0% ) 30 ( 10.5% ) 0 ( 0.0% ) 141 ( 49.3% )
Table 73. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Column Name Completeness Uniqueness Most Common Value Least Common Value
Mellifera
100.00%
49.30%
0.1 98.6

Continuous Data Distribution

Figure 71. Distribution of values in the column.

Outliers

Figure 72. Visualization of median, min, max, and outliers in the column.

Completeness

Figure 73. Visualization of completeness of the data in the column.

Uniqueness

Figure 74. Visualization of uniqueness of the data in the column.

Column: Rodopica

Table 74. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Parameter Content
Column name Rodopica
Description

Not specified by the data provider.

Data type Decimal number
Descriptor DecimalNumber [UID:0.0.DCMLN314]
Descriptor description

Any of the rational or irrational numbers.

Descriptor target IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.DCMLN314
Unit

n/a

Table 75. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Column Name Range Mean Minimum Q1 Median Q3 Maximum Total Missing Zero Blank Distinct
Rodopica 1 - 3 0.01 0 0 0 0 0.8 286 0 ( 0.0% ) 266 ( 93.0% ) 0 ( 0.0% ) 5 ( 1.7% )
Table 76. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Column Name Completeness Uniqueness Most Common Value Least Common Value
Rodopica
100.00%
1.75%
0 0.8

Data Distribution Top 20

Figure 75. Distribution of 20 most common values, from highest to lowest.

Continuous Data Distribution

Figure 76. Distribution of values in the column.

Completeness

Figure 77. Visualization of completeness of the data in the column.

Uniqueness

Figure 78. Visualization of uniqueness of the data in the column.

Column: Ruttneri

Table 77. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Parameter Content
Column name Ruttneri
Description

Not specified by the data provider.

Data type Decimal number
Descriptor DecimalNumber [UID:0.0.DCMLN314]
Descriptor description

Any of the rational or irrational numbers.

Descriptor target IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.DCMLN314
Unit

n/a

Table 78. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Column Name Range Mean Minimum Q1 Median Q3 Maximum Total Missing Zero Blank Distinct
Ruttneri 1 - 4 0.44 0.1 0.1 0.2 0.3 54.7 286 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 15 ( 5.2% )
Table 79. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Column Name Completeness Uniqueness Most Common Value Least Common Value
Ruttneri
100.00%
5.24%
0.2 1.7

Data Distribution Top 20

Figure 79. Distribution of 20 most common values, from highest to lowest.

Continuous Data Distribution

Figure 80. Distribution of values in the column.

Outliers

Figure 81. Visualization of median, min, max, and outliers in the column.

Completeness

Figure 82. Visualization of completeness of the data in the column.

Uniqueness

Figure 83. Visualization of uniqueness of the data in the column.

Column: Max_predicted

Table 80. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Parameter Content
Column name Max_predicted
Description
Data type Decimal number
Descriptor DecimalNumber [UID:0.0.DCMLN314]
Descriptor description

Any of the rational or irrational numbers.

Descriptor target IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.DCMLN314
Unit

n/a

Table 81. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Column Name Range Mean Minimum Q1 Median Q3 Maximum Total Missing Zero Blank Distinct
Max_predicted 2 - 4 85.48 32.4 78.25 93.05 96.5 99.2 286 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 171 ( 59.8% )
Table 82. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Column Name Completeness Uniqueness Most Common Value Least Common Value
Max_predicted
100.00%
59.79%
99.2 95.5

Continuous Data Distribution

Figure 84. Distribution of values in the column.

Outliers

Figure 85. Visualization of median, min, max, and outliers in the column.

Completeness

Figure 86. Visualization of completeness of the data in the column.

Uniqueness

Figure 87. Visualization of uniqueness of the data in the column.

Column: Predicted

Table 83. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Parameter Content
Column name Predicted
Description

Not specified by the data provider.

Data type String
Descriptor Text [UID:0.0.TEXTA315]
Descriptor description

In computer programming, a string is traditionally a sequence of characters, either as a literal constant or as some kind of variable. The latter may allow its elements to be mutated and the length changed, or it may be fixed (after creation). A string is generally considered as a data type and is often implemented as an array data structure of bytes (or words) that stores a sequence of elements, typically characters, using some character encoding. String may also denote more general arrays or other sequence (or list) data types and structures.

Descriptor target IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.TEXTA315
Unit

n/a

Table 84. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Column Name Range Mean Minimum Q1 Median Q3 Maximum Total Missing Zero Blank Distinct
Predicted 5 - 10 n/a Adami n/a n/a n/a Ruttneri 286 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 7 ( 2.4% )
Table 85. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Column Name Completeness Uniqueness Most Common Value Least Common Value
Predicted
100.00%
2.45%
Carnica Ruttneri

Data Distribution Top 20

Figure 88. Distribution of 20 most common values, from highest to lowest.

Completeness

Figure 89. Visualization of completeness of the data in the column.

Uniqueness

Figure 90. Visualization of uniqueness of the data in the column.

Column: Reported

Table 86. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Parameter Content
Column name Reported
Description

Not specified by the data provider.

Data type String
Descriptor Text [UID:0.0.TEXTA315]
Descriptor description

In computer programming, a string is traditionally a sequence of characters, either as a literal constant or as some kind of variable. The latter may allow its elements to be mutated and the length changed, or it may be fixed (after creation). A string is generally considered as a data type and is often implemented as an array data structure of bytes (or words) that stores a sequence of elements, typically characters, using some character encoding. String may also denote more general arrays or other sequence (or list) data types and structures.

Descriptor target IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.TEXTA315
Unit

n/a

Table 87. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Column Name Range Mean Minimum Q1 Median Q3 Maximum Total Missing Zero Blank Distinct
Reported 5 - 12 n/a Adami n/a n/a n/a Not_assigned 286 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 6 ( 2.1% )
Table 88. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Column Name Completeness Uniqueness Most Common Value Least Common Value
Reported
100.00%
2.10%
Not_assigned Adami

Data Distribution Top 20

Figure 91. Distribution of 20 most common values, from highest to lowest.

Completeness

Figure 92. Visualization of completeness of the data in the column.

Uniqueness

Figure 93. Visualization of uniqueness of the data in the column.

Table: Data including Buckfast

Column: Buckfast

Table 89. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Parameter Content
Column name Buckfast
Description

Not specified by the data provider.

Data type Decimal number
Descriptor DecimalNumber [UID:0.0.DCMLN314]
Descriptor description

Any of the rational or irrational numbers.

Descriptor target IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.DCMLN314
Unit

n/a

Table 90. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Column Name Range Mean Minimum Q1 Median Q3 Maximum Total Missing Zero Blank Distinct
Buckfast 1 - 4 3.27 0 0.3 0.7 1.95 71.5 192 0 ( 0.0% ) 13 ( 6.8% ) 0 ( 0.0% ) 56 ( 29.2% )
Table 91. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Column Name Completeness Uniqueness Most Common Value Least Common Value
Buckfast
100.00%
29.17%
0.3 3.6

Continuous Data Distribution

Figure 94. Distribution of values in the column.

Outliers

Figure 95. Visualization of median, min, max, and outliers in the column.

Completeness

Figure 96. Visualization of completeness of the data in the column.

Uniqueness

Figure 97. Visualization of uniqueness of the data in the column.

Column: Carnica

Table 92. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Parameter Content
Column name Carnica
Description

Not specified by the data provider.

Data type Decimal number
Descriptor DecimalNumber [UID:0.0.DCMLN314]
Descriptor description

Any of the rational or irrational numbers.

Descriptor target IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.DCMLN314
Unit

n/a

Table 93. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Column Name Range Mean Minimum Q1 Median Q3 Maximum Total Missing Zero Blank Distinct
Carnica 1 - 4 38.05 0 0.525 15.1 84.075 98.2 192 0 ( 0.0% ) 9 ( 4.7% ) 0 ( 0.0% ) 132 ( 68.8% )
Table 94. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Column Name Completeness Uniqueness Most Common Value Least Common Value
Carnica
100.00%
68.75%
0.1 93.5

Continuous Data Distribution

Figure 98. Distribution of values in the column.

Outliers

Figure 99. Visualization of median, min, max, and outliers in the column.

Completeness

Figure 100. Visualization of completeness of the data in the column.

Uniqueness

Figure 101. Visualization of uniqueness of the data in the column.

Column: Carpatica

Table 95. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Parameter Content
Column name Carpatica
Description

Not specified by the data provider.

Data type Decimal number
Descriptor DecimalNumber [UID:0.0.DCMLN314]
Descriptor description

Any of the rational or irrational numbers.

Descriptor target IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.DCMLN314
Unit

n/a

Table 96. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Column Name Range Mean Minimum Q1 Median Q3 Maximum Total Missing Zero Blank Distinct
Carpatica 1 - 4 4.35 0 0.1 0.5 1.275 87.7 192 0 ( 0.0% ) 39 ( 20.3% ) 0 ( 0.0% ) 42 ( 21.9% )
Table 97. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Column Name Completeness Uniqueness Most Common Value Least Common Value
Carpatica
100.00%
21.88%
0 13.6

Continuous Data Distribution

Figure 102. Distribution of values in the column.

Outliers

Figure 103. Visualization of median, min, max, and outliers in the column.

Completeness

Figure 104. Visualization of completeness of the data in the column.

Uniqueness

Figure 105. Visualization of uniqueness of the data in the column.

Column: Caucasica

Table 98. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Parameter Content
Column name Caucasica
Description

Not specified by the data provider.

Data type Decimal number
Descriptor DecimalNumber [UID:0.0.DCMLN314]
Descriptor description

Any of the rational or irrational numbers.

Descriptor target IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.DCMLN314
Unit

n/a

Table 99. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Column Name Range Mean Minimum Q1 Median Q3 Maximum Total Missing Zero Blank Distinct
Caucasica 1 - 3 0.03 0 0 0 0 0.4 192 0 ( 0.0% ) 153 ( 79.7% ) 0 ( 0.0% ) 5 ( 2.6% )
Table 100. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Column Name Completeness Uniqueness Most Common Value Least Common Value
Caucasica
100.00%
2.60%
0 0.4

Data Distribution Top 20

Figure 106. Distribution of 20 most common values, from highest to lowest.

Continuous Data Distribution

Figure 107. Distribution of values in the column.

Completeness

Figure 108. Visualization of completeness of the data in the column.

Uniqueness

Figure 109. Visualization of uniqueness of the data in the column.

Column: Cecropia

Table 101. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Parameter Content
Column name Cecropia
Description

Not specified by the data provider.

Data type Decimal number
Descriptor DecimalNumber [UID:0.0.DCMLN314]
Descriptor description

Any of the rational or irrational numbers.

Descriptor target IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.DCMLN314
Unit

n/a

Table 102. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Column Name Range Mean Minimum Q1 Median Q3 Maximum Total Missing Zero Blank Distinct
Cecropia 1 - 3 0.09 0 0 0.1 0.1 0.6 192 0 ( 0.0% ) 74 ( 38.5% ) 0 ( 0.0% ) 7 ( 3.6% )
Table 103. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Column Name Completeness Uniqueness Most Common Value Least Common Value
Cecropia
100.00%
3.65%
0.1 0.5

Data Distribution Top 20

Figure 110. Distribution of 20 most common values, from highest to lowest.

Continuous Data Distribution

Figure 111. Distribution of values in the column.

Outliers

Figure 112. Visualization of median, min, max, and outliers in the column.

Completeness

Figure 113. Visualization of completeness of the data in the column.

Uniqueness

Figure 114. Visualization of uniqueness of the data in the column.

Column: Cypria

Table 104. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Parameter Content
Column name Cypria
Description

Not specified by the data provider.

Data type Decimal number
Descriptor DecimalNumber [UID:0.0.DCMLN314]
Descriptor description

Any of the rational or irrational numbers.

Descriptor target IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.DCMLN314
Unit

n/a

Table 105. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Column Name Range Mean Minimum Q1 Median Q3 Maximum Total Missing Zero Blank Distinct
Cypria 1 - 3 0.07 0 0 0 0.1 1.5 192 0 ( 0.0% ) 127 ( 66.1% ) 0 ( 0.0% ) 9 ( 4.7% )
Table 106. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Column Name Completeness Uniqueness Most Common Value Least Common Value
Cypria
100.00%
4.69%
0 1.1

Data Distribution Top 20

Figure 115. Distribution of 20 most common values, from highest to lowest.

Continuous Data Distribution

Figure 116. Distribution of values in the column.

Completeness

Figure 117. Visualization of completeness of the data in the column.

Uniqueness

Figure 118. Visualization of uniqueness of the data in the column.

Column: Iberiensis

Table 107. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Parameter Content
Column name Iberiensis
Description

Not specified by the data provider.

Data type Decimal number
Descriptor DecimalNumber [UID:0.0.DCMLN314]
Descriptor description

Any of the rational or irrational numbers.

Descriptor target IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.DCMLN314
Unit

n/a

Table 108. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Column Name Range Mean Minimum Q1 Median Q3 Maximum Total Missing Zero Blank Distinct
Iberiensis 1 - 4 5.39 0 0.1 0.1 0.175 99.3 192 0 ( 0.0% ) 35 ( 18.2% ) 0 ( 0.0% ) 15 ( 7.8% )
Table 109. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Column Name Completeness Uniqueness Most Common Value Least Common Value
Iberiensis
100.00%
7.81%
0.1 4.3

Continuous Data Distribution

Figure 119. Distribution of values in the column.

Outliers

Figure 120. Visualization of median, min, max, and outliers in the column.

Completeness

Figure 121. Visualization of completeness of the data in the column.

Uniqueness

Figure 122. Visualization of uniqueness of the data in the column.

Column: Ligustica

Table 110. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Parameter Content
Column name Ligustica
Description

Not specified by the data provider.

Data type Decimal number
Descriptor DecimalNumber [UID:0.0.DCMLN314]
Descriptor description

Any of the rational or irrational numbers.

Descriptor target IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.DCMLN314
Unit

n/a

Table 111. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Column Name Range Mean Minimum Q1 Median Q3 Maximum Total Missing Zero Blank Distinct
Ligustica 1 - 4 14.78 0 0.1 0.75 2.275 97.8 192 0 ( 0.0% ) 36 ( 18.8% ) 0 ( 0.0% ) 67 ( 34.9% )
Table 112. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Column Name Completeness Uniqueness Most Common Value Least Common Value
Ligustica
100.00%
34.90%
0 71.6

Continuous Data Distribution

Figure 123. Distribution of values in the column.

Outliers

Figure 124. Visualization of median, min, max, and outliers in the column.

Completeness

Figure 125. Visualization of completeness of the data in the column.

Uniqueness

Figure 126. Visualization of uniqueness of the data in the column.

Column: Macedonica

Table 113. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Parameter Content
Column name Macedonica
Description

Not specified by the data provider.

Data type Decimal number
Descriptor DecimalNumber [UID:0.0.DCMLN314]
Descriptor description

Any of the rational or irrational numbers.

Descriptor target IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.DCMLN314
Unit

n/a

Table 114. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Column Name Range Mean Minimum Q1 Median Q3 Maximum Total Missing Zero Blank Distinct
Macedonica 1 - 3 0.14 0 0 0.1 0.2 1.2 192 0 ( 0.0% ) 61 ( 31.8% ) 0 ( 0.0% ) 8 ( 4.2% )
Table 115. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Column Name Completeness Uniqueness Most Common Value Least Common Value
Macedonica
100.00%
4.17%
0.1 1

Data Distribution Top 20

Figure 127. Distribution of 20 most common values, from highest to lowest.

Continuous Data Distribution

Figure 128. Distribution of values in the column.

Outliers

Figure 129. Visualization of median, min, max, and outliers in the column.

Completeness

Figure 130. Visualization of completeness of the data in the column.

Uniqueness

Figure 131. Visualization of uniqueness of the data in the column.

Column: Mellifera

Table 116. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Parameter Content
Column name Mellifera
Description

Not specified by the data provider.

Data type Decimal number
Descriptor DecimalNumber [UID:0.0.DCMLN314]
Descriptor description

Any of the rational or irrational numbers.

Descriptor target IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.DCMLN314
Unit

n/a

Table 117. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Column Name Range Mean Minimum Q1 Median Q3 Maximum Total Missing Zero Blank Distinct
Mellifera 1 - 4 31.30 0 1.7 10.4 69.6 99.8 192 0 ( 0.0% ) 11 ( 5.7% ) 0 ( 0.0% ) 128 ( 66.7% )
Table 118. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Column Name Completeness Uniqueness Most Common Value Least Common Value
Mellifera
100.00%
66.67%
0 11.3

Continuous Data Distribution

Figure 132. Distribution of values in the column.

Outliers

Figure 133. Visualization of median, min, max, and outliers in the column.

Completeness

Figure 134. Visualization of completeness of the data in the column.

Uniqueness

Figure 135. Visualization of uniqueness of the data in the column.

Column: Rodopica

Table 119. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Parameter Content
Column name Rodopica
Description

Not specified by the data provider.

Data type Decimal number
Descriptor DecimalNumber [UID:0.0.DCMLN314]
Descriptor description

Any of the rational or irrational numbers.

Descriptor target IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.DCMLN314
Unit

n/a

Table 120. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Column Name Range Mean Minimum Q1 Median Q3 Maximum Total Missing Zero Blank Distinct
Rodopica 1 - 3 0.11 0 0 0.1 0.1 1.6 192 0 ( 0.0% ) 75 ( 39.1% ) 0 ( 0.0% ) 9 ( 4.7% )
Table 121. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Column Name Completeness Uniqueness Most Common Value Least Common Value
Rodopica
100.00%
4.69%
0 1.6

Data Distribution Top 20

Figure 136. Distribution of 20 most common values, from highest to lowest.

Continuous Data Distribution

Figure 137. Distribution of values in the column.

Outliers

Figure 138. Visualization of median, min, max, and outliers in the column.

Completeness

Figure 139. Visualization of completeness of the data in the column.

Uniqueness

Figure 140. Visualization of uniqueness of the data in the column.

Column: Ruttneri

Table 122. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Parameter Content
Column name Ruttneri
Description

Not specified by the data provider.

Data type Decimal number
Descriptor DecimalNumber [UID:0.0.DCMLN314]
Descriptor description

Any of the rational or irrational numbers.

Descriptor target IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.DCMLN314
Unit

n/a

Table 123. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Column Name Range Mean Minimum Q1 Median Q3 Maximum Total Missing Zero Blank Distinct
Ruttneri 1 - 3 0.49 0 0.1 0.1 0.3 54 192 0 ( 0.0% ) 15 ( 7.8% ) 0 ( 0.0% ) 14 ( 7.3% )
Table 124. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Column Name Completeness Uniqueness Most Common Value Least Common Value
Ruttneri
100.00%
7.29%
0.1 0.7

Continuous Data Distribution

Figure 141. Distribution of values in the column.

Outliers

Figure 142. Visualization of median, min, max, and outliers in the column.

Completeness

Figure 143. Visualization of completeness of the data in the column.

Uniqueness

Figure 144. Visualization of uniqueness of the data in the column.

Column: Siciliana

Table 125. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Parameter Content
Column name Siciliana
Description

Not specified by the data provider.

Data type Decimal number
Descriptor DecimalNumber [UID:0.0.DCMLN314]
Descriptor description

Any of the rational or irrational numbers.

Descriptor target IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.DCMLN314
Unit

n/a

Table 126. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Column Name Range Mean Minimum Q1 Median Q3 Maximum Total Missing Zero Blank Distinct
Siciliana 1 - 3 0.22 0 0.1 0.1 0.2 2.7 192 0 ( 0.0% ) 19 ( 9.9% ) 0 ( 0.0% ) 14 ( 7.3% )
Table 127. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Column Name Completeness Uniqueness Most Common Value Least Common Value
Siciliana
100.00%
7.29%
0.1 0.9

Continuous Data Distribution

Figure 145. Distribution of values in the column.

Outliers

Figure 146. Visualization of median, min, max, and outliers in the column.

Completeness

Figure 147. Visualization of completeness of the data in the column.

Uniqueness

Figure 148. Visualization of uniqueness of the data in the column.

Column: Max_predicted

Table 128. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Parameter Content
Column name Max_predicted
Description

Not specified by the data provider.

Data type Decimal number
Descriptor DecimalNumber [UID:0.0.DCMLN314]
Descriptor description

Any of the rational or irrational numbers.

Descriptor target IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.DCMLN314
Unit

n/a

Table 129. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Column Name Range Mean Minimum Q1 Median Q3 Maximum Total Missing Zero Blank Distinct
Max_predicted 2 - 4 83.56 33.9 73.175 91.85 97.675 99.8 192 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 134 ( 69.8% )
Table 130. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Column Name Completeness Uniqueness Most Common Value Least Common Value
Max_predicted
100.00%
69.79%
99.2 93.5

Continuous Data Distribution

Figure 149. Distribution of values in the column.

Outliers

Figure 150. Visualization of median, min, max, and outliers in the column.

Completeness

Figure 151. Visualization of completeness of the data in the column.

Uniqueness

Figure 152. Visualization of uniqueness of the data in the column.

Column: Predicted

Table 131. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Parameter Content
Column name Predicted
Description

Not specified by the data provider.

Data type String
Descriptor Text [UID:0.0.TEXTA315]
Descriptor description

In computer programming, a string is traditionally a sequence of characters, either as a literal constant or as some kind of variable. The latter may allow its elements to be mutated and the length changed, or it may be fixed (after creation). A string is generally considered as a data type and is often implemented as an array data structure of bytes (or words) that stores a sequence of elements, typically characters, using some character encoding. String may also denote more general arrays or other sequence (or list) data types and structures.

Descriptor target IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.TEXTA315
Unit

n/a

Table 132. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Column Name Range Mean Minimum Q1 Median Q3 Maximum Total Missing Zero Blank Distinct
Predicted 5 - 10 n/a Adami n/a n/a n/a Ruttneri 192 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 8 ( 4.2% )
Table 133. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Column Name Completeness Uniqueness Most Common Value Least Common Value
Predicted
100.00%
4.17%
Carnica Ruttneri

Data Distribution Top 20

Figure 153. Distribution of 20 most common values, from highest to lowest.

Completeness

Figure 154. Visualization of completeness of the data in the column.

Uniqueness

Figure 155. Visualization of uniqueness of the data in the column.

Column: Reported

Table 134. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Parameter Content
Column name Reported
Description

Not specified by the data provider.

Data type String
Descriptor Text [UID:0.0.TEXTA315]
Descriptor description

In computer programming, a string is traditionally a sequence of characters, either as a literal constant or as some kind of variable. The latter may allow its elements to be mutated and the length changed, or it may be fixed (after creation). A string is generally considered as a data type and is often implemented as an array data structure of bytes (or words) that stores a sequence of elements, typically characters, using some character encoding. String may also denote more general arrays or other sequence (or list) data types and structures.

Descriptor target IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.TEXTA315
Unit

n/a

Table 135. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Column Name Range Mean Minimum Q1 Median Q3 Maximum Total Missing Zero Blank Distinct
Reported 7 - 12 n/a Carnica n/a n/a n/a Not_assigned 192 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 5 ( 2.6% )
Table 136. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Column Name Completeness Uniqueness Most Common Value Least Common Value
Reported
100.00%
2.60%
Not_assigned Iberiensis

Data Distribution Top 20

Figure 156. Distribution of 20 most common values, from highest to lowest.

Completeness

Figure 157. Visualization of completeness of the data in the column.

Uniqueness

Figure 158. Visualization of uniqueness of the data in the column.

Column: Sample_ID

Table 137. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Parameter Content
Column name Sample_ID
Description

Identifer of the sample.

Data type String
Descriptor dwc:materialSampleID [UID:0.0.MTRLS489]
Descriptor description

An identifier for a material sample.

Descriptor target IRI http://rs.tdwg.org/dwc/terms/materialSampleID
Unit

n/a

Table 138. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Column Name Range Mean Minimum Q1 Median Q3 Maximum Total Missing Zero Blank Distinct
Sample_ID 2 - 8 n/a AFHKUMFR n/a n/a n/a ZZUPHAAK 192 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 192 ( 100.0% )
Table 139. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Column Name Completeness Uniqueness Most Common Value Least Common Value
Sample_ID
100.00%
100.00%
LGYKGAXM LGYKGAXM

Completeness

Figure 159. Visualization of completeness of the data in the column.

Uniqueness

Figure 160. Visualization of uniqueness of the data in the column.

Column: Sample_nr

Table 140. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Parameter Content
Column name Sample_nr
Description

Not specified by the data provider.

Data type String
Descriptor Text [UID:0.0.TEXTA315]
Descriptor description

In computer programming, a string is traditionally a sequence of characters, either as a literal constant or as some kind of variable. The latter may allow its elements to be mutated and the length changed, or it may be fixed (after creation). A string is generally considered as a data type and is often implemented as an array data structure of bytes (or words) that stores a sequence of elements, typically characters, using some character encoding. String may also denote more general arrays or other sequence (or list) data types and structures.

Descriptor target IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.TEXTA315
Unit

n/a

Table 141. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Column Name Range Mean Minimum Q1 Median Q3 Maximum Total Missing Zero Blank Distinct
Sample_nr 5 - 5 n/a B4806 n/a n/a n/a B5741 192 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 192 ( 100.0% )
Table 142. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Column Name Completeness Uniqueness Most Common Value Least Common Value
Sample_nr
100.00%
100.00%
B4806 B4806

Completeness

Figure 161. Visualization of completeness of the data in the column.

Uniqueness

Figure 162. Visualization of uniqueness of the data in the column.

Column: Type

Table 143. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Parameter Content
Column name Type
Description

Not specified by the data provider.

Data type String
Descriptor Text [UID:0.0.TEXTA315]
Descriptor description

In computer programming, a string is traditionally a sequence of characters, either as a literal constant or as some kind of variable. The latter may allow its elements to be mutated and the length changed, or it may be fixed (after creation). A string is generally considered as a data type and is often implemented as an array data structure of bytes (or words) that stores a sequence of elements, typically characters, using some character encoding. String may also denote more general arrays or other sequence (or list) data types and structures.

Descriptor target IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.TEXTA315
Unit

n/a

Table 144. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Column Name Range Mean Minimum Q1 Median Q3 Maximum Total Missing Zero Blank Distinct
Type 3 - 3 n/a TST n/a n/a n/a TST 192 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 1 ( 0.5% )
Table 145. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Column Name Completeness Uniqueness Most Common Value Least Common Value
Type
100.00%
0.52%
TST TST

Data Distribution Top 20

Figure 163. Distribution of 20 most common values, from highest to lowest.

Completeness

Figure 164. Visualization of completeness of the data in the column.

Uniqueness

Figure 165. Visualization of uniqueness of the data in the column.

Column: Pool_sample

Table 146. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Parameter Content
Column name Pool_sample
Description

Not specified by the data provider.

Data type String
Descriptor Text [UID:0.0.TEXTA315]
Descriptor description

In computer programming, a string is traditionally a sequence of characters, either as a literal constant or as some kind of variable. The latter may allow its elements to be mutated and the length changed, or it may be fixed (after creation). A string is generally considered as a data type and is often implemented as an array data structure of bytes (or words) that stores a sequence of elements, typically characters, using some character encoding. String may also denote more general arrays or other sequence (or list) data types and structures.

Descriptor target IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.TEXTA315
Unit

n/a

Table 147. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Column Name Range Mean Minimum Q1 Median Q3 Maximum Total Missing Zero Blank Distinct
Pool_sample 2 - 2 n/a Na n/a n/a n/a Na 192 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 1 ( 0.5% )
Table 148. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Column Name Completeness Uniqueness Most Common Value Least Common Value
Pool_sample
100.00%
0.52%
Na Na

Data Distribution Top 20

Figure 166. Distribution of 20 most common values, from highest to lowest.

Completeness

Figure 167. Visualization of completeness of the data in the column.

Uniqueness

Figure 168. Visualization of uniqueness of the data in the column.

Column: ssp_region

Table 149. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Parameter Content
Column name ssp_region
Description

Not specified by the data provider.

Data type String
Descriptor Text [UID:0.0.TEXTA315]
Descriptor description

In computer programming, a string is traditionally a sequence of characters, either as a literal constant or as some kind of variable. The latter may allow its elements to be mutated and the length changed, or it may be fixed (after creation). A string is generally considered as a data type and is often implemented as an array data structure of bytes (or words) that stores a sequence of elements, typically characters, using some character encoding. String may also denote more general arrays or other sequence (or list) data types and structures.

Descriptor target IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.TEXTA315
Unit

n/a

Table 150. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Column Name Range Mean Minimum Q1 Median Q3 Maximum Total Missing Zero Blank Distinct
ssp_region 2 - 2 n/a Na n/a n/a n/a Na 192 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 1 ( 0.5% )
Table 151. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Column Name Completeness Uniqueness Most Common Value Least Common Value
ssp_region
100.00%
0.52%
Na Na

Data Distribution Top 20

Figure 169. Distribution of 20 most common values, from highest to lowest.

Completeness

Figure 170. Visualization of completeness of the data in the column.

Uniqueness

Figure 171. Visualization of uniqueness of the data in the column.

Column: country_code

Table 152. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Parameter Content
Column name country_code
Description

ISO 3166-1 Alpha-2 country code of the country from which the sample was taken.

Data type String
Descriptor iso-3166:alpha-2CountryCode [UID:0.0.LPHCN4]
Descriptor description

A two-letter code that represents a country name, recommended as the general purpose code.

Descriptor target IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.LPHCN4
Unit

n/a

Table 153. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Column Name Range Mean Minimum Q1 Median Q3 Maximum Total Missing Zero Blank Distinct
country_code 2 - 2 n/a BE n/a n/a n/a UK 192 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 10 ( 5.2% )
Table 154. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Column Name Completeness Uniqueness Most Common Value Least Common Value
country_code
100.00%
5.21%
NL FR

Data Distribution Top 20

Figure 172. Distribution of 20 most common values, from highest to lowest.

Completeness

Figure 173. Visualization of completeness of the data in the column.

Uniqueness

Figure 174. Visualization of uniqueness of the data in the column.

Column: country

Table 155. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Parameter Content
Column name country
Description

Name of the country from which the sample was taken.

Data type String
Descriptor Text [UID:0.0.TEXTA315]
Descriptor description

In computer programming, a string is traditionally a sequence of characters, either as a literal constant or as some kind of variable. The latter may allow its elements to be mutated and the length changed, or it may be fixed (after creation). A string is generally considered as a data type and is often implemented as an array data structure of bytes (or words) that stores a sequence of elements, typically characters, using some character encoding. String may also denote more general arrays or other sequence (or list) data types and structures.

Descriptor target IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.TEXTA315
Unit

n/a

Table 156. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Column Name Range Mean Minimum Q1 Median Q3 Maximum Total Missing Zero Blank Distinct
country 5 - 15 n/a Belgium n/a n/a n/a the Netherla… 192 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 10 ( 5.2% )
Table 157. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Column Name Completeness Uniqueness Most Common Value Least Common Value
country
100.00%
5.21%
the Netherlands France

Data Distribution Top 20

Figure 175. Distribution of 20 most common values, from highest to lowest.

Completeness

Figure 176. Visualization of completeness of the data in the column.

Uniqueness

Figure 177. Visualization of uniqueness of the data in the column.

Column: hygienic

Table 158. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Parameter Content
Column name hygienic
Description

Not specified by the data provider.

Data type String
Descriptor Text [UID:0.0.TEXTA315]
Descriptor description

In computer programming, a string is traditionally a sequence of characters, either as a literal constant or as some kind of variable. The latter may allow its elements to be mutated and the length changed, or it may be fixed (after creation). A string is generally considered as a data type and is often implemented as an array data structure of bytes (or words) that stores a sequence of elements, typically characters, using some character encoding. String may also denote more general arrays or other sequence (or list) data types and structures.

Descriptor target IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.TEXTA315
Unit

n/a

Table 159. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Column Name Range Mean Minimum Q1 Median Q3 Maximum Total Missing Zero Blank Distinct
hygienic 2 - 2 n/a Na n/a n/a n/a Na 192 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 1 ( 0.5% )
Table 160. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Column Name Completeness Uniqueness Most Common Value Least Common Value
hygienic
100.00%
0.52%
Na Na

Data Distribution Top 20

Figure 178. Distribution of 20 most common values, from highest to lowest.

Completeness

Figure 179. Visualization of completeness of the data in the column.

Uniqueness

Figure 180. Visualization of uniqueness of the data in the column.

Column: wp

Table 161. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Parameter Content
Column name wp
Description

Not specified by the data provider. Presumably the Work Packege of the B-GOOD project.

Data type Integer number
Descriptor Integer [UID:0.0.NTGER313]
Descriptor description

A number with no fractional part, including the negative and positive numbers as well as zero.

Descriptor target IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.NTGER313
Unit

n/a

Table 162. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Column Name Range Mean Minimum Q1 Median Q3 Maximum Total Missing Zero Blank Distinct
wp 1 - 1 3.0 3 3 3 3 3 192 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 1 ( 0.5% )
Table 163. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Column Name Completeness Uniqueness Most Common Value Least Common Value
wp
100.00%
0.52%
3 3

Data Distribution Top 20

Figure 181. Distribution of 20 most common values, from highest to lowest.

Continuous Data Distribution

Figure 182. Distribution of values in the column.

Outliers

Figure 183. Visualization of median, min, max, and outliers in the column.

Completeness

Figure 184. Visualization of completeness of the data in the column.

Uniqueness

Figure 185. Visualization of uniqueness of the data in the column.

Column: species

Table 164. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Parameter Content
Column name species
Description

Name of the species analysed.

Data type String
Descriptor dwc:scientificName [UID:0.0.SCNTF503]
Descriptor description

The full scientific name, with authorship and date information if known.

Descriptor target IRI http://rs.tdwg.org/dwc/terms/scientificName
Unit

n/a

Table 165. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Column Name Range Mean Minimum Q1 Median Q3 Maximum Total Missing Zero Blank Distinct
species 14 - 14 n/a Apis mellife… n/a n/a n/a Apis mellife… 192 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 1 ( 0.5% )
Table 166. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Column Name Completeness Uniqueness Most Common Value Least Common Value
species
100.00%
0.52%
Apis mellifera Apis mellifera

Data Distribution Top 20

Figure 186. Distribution of 20 most common values, from highest to lowest.

Completeness

Figure 187. Visualization of completeness of the data in the column.

Uniqueness

Figure 188. Visualization of uniqueness of the data in the column.

Column: ssp_country

Table 167. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Parameter Content
Column name ssp_country
Description

Not specified by the data provider.

Data type String
Descriptor Text [UID:0.0.TEXTA315]
Descriptor description

In computer programming, a string is traditionally a sequence of characters, either as a literal constant or as some kind of variable. The latter may allow its elements to be mutated and the length changed, or it may be fixed (after creation). A string is generally considered as a data type and is often implemented as an array data structure of bytes (or words) that stores a sequence of elements, typically characters, using some character encoding. String may also denote more general arrays or other sequence (or list) data types and structures.

Descriptor target IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.TEXTA315
Unit

n/a

Table 168. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Column Name Range Mean Minimum Q1 Median Q3 Maximum Total Missing Zero Blank Distinct
ssp_country 2 - 2 n/a Na n/a n/a n/a Na 192 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 1 ( 0.5% )
Table 169. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Column Name Completeness Uniqueness Most Common Value Least Common Value
ssp_country
100.00%
0.52%
Na Na

Data Distribution Top 20

Figure 189. Distribution of 20 most common values, from highest to lowest.

Completeness

Figure 190. Visualization of completeness of the data in the column.

Uniqueness

Figure 191. Visualization of uniqueness of the data in the column.

Column: d/w

Table 170. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Parameter Content
Column name d/w
Description

Not specified by the data provider. Presumably the honey bee caste that was sampled.

Data type String
Descriptor Text [UID:0.0.TEXTA315]
Descriptor description

In computer programming, a string is traditionally a sequence of characters, either as a literal constant or as some kind of variable. The latter may allow its elements to be mutated and the length changed, or it may be fixed (after creation). A string is generally considered as a data type and is often implemented as an array data structure of bytes (or words) that stores a sequence of elements, typically characters, using some character encoding. String may also denote more general arrays or other sequence (or list) data types and structures.

Descriptor target IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.TEXTA315
Unit

n/a

Table 171. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Column Name Range Mean Minimum Q1 Median Q3 Maximum Total Missing Zero Blank Distinct
d/w 6 - 6 n/a worker n/a n/a n/a worker 192 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 1 ( 0.5% )
Table 172. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Column Name Completeness Uniqueness Most Common Value Least Common Value
d/w
100.00%
0.52%
worker worker

Data Distribution Top 20

Figure 192. Distribution of 20 most common values, from highest to lowest.

Completeness

Figure 193. Visualization of completeness of the data in the column.

Uniqueness

Figure 194. Visualization of uniqueness of the data in the column.

Column: branch

Table 173. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Parameter Content
Column name branch
Description

Not specified by the data provider.

Data type String
Descriptor Text [UID:0.0.TEXTA315]
Descriptor description

In computer programming, a string is traditionally a sequence of characters, either as a literal constant or as some kind of variable. The latter may allow its elements to be mutated and the length changed, or it may be fixed (after creation). A string is generally considered as a data type and is often implemented as an array data structure of bytes (or words) that stores a sequence of elements, typically characters, using some character encoding. String may also denote more general arrays or other sequence (or list) data types and structures.

Descriptor target IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.TEXTA315
Unit

n/a

Table 174. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Column Name Range Mean Minimum Q1 Median Q3 Maximum Total Missing Zero Blank Distinct
branch 1 - 1 n/a M n/a n/a n/a M 192 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 1 ( 0.5% )
Table 175. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Column Name Completeness Uniqueness Most Common Value Least Common Value
branch
100.00%
0.52%
M M

Data Distribution Top 20

Figure 195. Distribution of 20 most common values, from highest to lowest.

Completeness

Figure 196. Visualization of completeness of the data in the column.

Uniqueness

Figure 197. Visualization of uniqueness of the data in the column.

Column: Adami

Table 176. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Parameter Content
Column name Adami
Description

Not specified by the data provider.

Data type Decimal number
Descriptor DecimalNumber [UID:0.0.DCMLN314]
Descriptor description

Any of the rational or irrational numbers.

Descriptor target IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.DCMLN314
Unit

n/a

Table 177. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Column Name Range Mean Minimum Q1 Median Q3 Maximum Total Missing Zero Blank Distinct
Adami 1 - 4 1.53 0 0.1 0.2 0.4 63 192 0 ( 0.0% ) 37 ( 19.3% ) 0 ( 0.0% ) 23 ( 12.0% )
Table 178. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Column Name Completeness Uniqueness Most Common Value Least Common Value
Adami
100.00%
11.98%
0.2 42.9

Continuous Data Distribution

Figure 198. Distribution of values in the column.

Outliers

Figure 199. Visualization of median, min, max, and outliers in the column.

Completeness

Figure 200. Visualization of completeness of the data in the column.

Uniqueness

Figure 201. Visualization of uniqueness of the data in the column.

Column: Anatoliaca

Table 179. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Parameter Content
Column name Anatoliaca
Description

Not specified by the data provider.

Data type Decimal number
Descriptor DecimalNumber [UID:0.0.DCMLN314]
Descriptor description

Any of the rational or irrational numbers.

Descriptor target IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.DCMLN314
Unit

n/a

Table 180. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Column Name Range Mean Minimum Q1 Median Q3 Maximum Total Missing Zero Blank Distinct
Anatoliaca 1 - 3 0.05 0 0 0 0.1 1.6 192 0 ( 0.0% ) 139 ( 72.4% ) 0 ( 0.0% ) 9 ( 4.7% )
Table 181. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Column Name Completeness Uniqueness Most Common Value Least Common Value
Anatoliaca
100.00%
4.69%
0 0.9

Data Distribution Top 20

Figure 202. Distribution of 20 most common values, from highest to lowest.

Continuous Data Distribution

Figure 203. Distribution of values in the column.

Completeness

Figure 204. Visualization of completeness of the data in the column.

Uniqueness

Figure 205. Visualization of uniqueness of the data in the column.

Column: Armeniaca

Table 182. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Parameter Content
Column name Armeniaca
Description

Not specified by the data provider.

Data type Decimal number
Descriptor DecimalNumber [UID:0.0.DCMLN314]
Descriptor description

Any of the rational or irrational numbers.

Descriptor target IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.DCMLN314
Unit

n/a

Table 183. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Column Name Range Mean Minimum Q1 Median Q3 Maximum Total Missing Zero Blank Distinct
Armeniaca 1 - 3 0.03 0 0 0 0 0.4 192 0 ( 0.0% ) 154 ( 80.2% ) 0 ( 0.0% ) 5 ( 2.6% )
Table 184. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Column Name Completeness Uniqueness Most Common Value Least Common Value
Armeniaca
100.00%
2.60%
0 0.4

Data Distribution Top 20

Figure 206. Distribution of 20 most common values, from highest to lowest.

Continuous Data Distribution

Figure 207. Distribution of values in the column.

Completeness

Figure 208. Visualization of completeness of the data in the column.

Uniqueness

Figure 209. Visualization of uniqueness of the data in the column.