EU Pollinator Hub


Dataset Report
Unique identifier: BGDVR197.0.0
Title: B-GOOD Virus levels
Long title: Dataset from the B-GOOD project, containing Virus levels in honey bee pools of different sizes
Status: Quality Validated
Current Version: v. 1.0
Published: 2025-03-17
Reviewed by: Rubinigg Michael as Data scientist
Citation proposal:
B-GOOD Bee Health Data Portal 2025 Report of dataset B-GOOD Virus levels, v. 1.0 [BGDVR197.0.0]. EU Pollinator Hub. [2026-02-24] app.pollinatorhub.eu
Compliance with FAIR* principles
Findable
Accessible
Interoperable
Reusable
See https://www.go-fair.org/fair-principles for more information about FAIR principles
Data Quality
Good

This document is intended for use by collaborators of the EU Pollinator Hub and may be passed on with the express permission of the leader of the consortium and for the purpose determined by the leader of the consortium.

Document history

Release

Version v. 1.0 released on 2025-03-17. Reviewed by Rubinigg Michael.

Revision

Table 1. List of revisions made to the document. Identifier of revision (No); date of revision (Date); description of revision (Description); reason for revision (Reason).
No Date Description Reason
1 2025-03-17 00:03:00 Initial release. n/a

Abbreviations

ABPV
Acute Bee Paralysis Virus
BQCV
Black Queen Cell Virus
CBPV
Chronic Bee Paralysis Virus
CSV
Comma-Separated Values
DWV
Deformed Wing Virus
EU
European Union
EUPH
EU Pollinator Hub
INRAE
Institut national de recherche pour l'agriculture, l'alimentation et l'environnement (National Research Institute for Agriculture, Food and Environment)
SBV
Sackbrood Virus

Executive summary

Data overview:

The dataset contains data on the quantification of five viruses (DWV, BQCV, ABPV, CBPV, SBV) in honey bee samples, in pools of different sizes and according to the sampling frame. It was published by Godeau UG, Pioz MP, Dievart VD, Alaux CA, Bonjour-Dalmon ABD (INRAE) on the B-GOOD Bee Health Data Portal as part of the B-GOOD project (grant agreement 817622), funded under the EU Horizon 2020 Research and Innovation Programme.

Data value:

The objectives of the B-GOOD project were: (1) Facilitate decision making for beekeepers and other stakeholders by establishing ready-to-use tools for operationalising the Health Status Index (HSI); (2) Test, standardise and validate methods for measuring and reporting selected indicators affecting bee health; (3) Explore the various socio-economic and ecological factors beyond bee health; (4) Foster an EU community to collect and share knowledge related to honey bees and their environment; (5) Engender a lasting learning and innovation system (LIS); (6) Minimise the impact of biotic and abiotic stressors.

Data description:

n/a

Data application:

Currently, the data integrated from the B-GOOD Bee Health Data Portal contains major issues and does not fully comply with the FAIR Guiding Principles for scientific data management and stewardship applied on the EU Pollinator Hub. More descriptive information about the context, quality and condition, or characteristics of the data (e.g. protocols, measurement devices used, units of the captured data, or any other details about the study) must be provided. More metadata in the form of accurate explanations of all variable names must be provided.

Unresolved issues:

n/a

Introduction

n/a

Material and methods

Data acquisition

All raw data files were downloaded from the B-GOOD Bee Health Data Portal on 2024-09-26 18:16:38.

List of raw data obtained from the data provider.

  1. File BGOOD_Virus_Data_2018_2020 Pool size.xlsx, accessed on 2024-09-26 18:16:38, provided by B-GOOD Bee Health Data Portal

Metadata was obtained from the dataset's web page.

Table 2. List of raw data and metadata files included in the dataset. Identifier of table row (No); name of the file (File); the type of the file (Type); file contains data (D); file contains metadata (M); date of upload of the file to the EU Pollinator Hub (Arrival); number of data points contained within the file (if applicable); uploaded file size.
No File Type D M Arrival Data points File size
1 BGOOD_Virus_Data_2018_2020 Pool size_PREP_MR_241102.csv CSV - Comma separated values Yes No 2024-11-02 14:11:28 10,590 74.06 KiB

Data preparation

The file in the zip-archive was extracted using File Explorer (Microsoft Corporation, version 22H2).

The file BGOOD_Virus_Data_2018_2020 Pool size.xlsx was opened with MS Excel (Microsoft Corporation, version 2409). The worksheets were exported to data files in CSV format (UTF-8 encoding) and imported into Notepad++ (version 8.7), where missing values were substituted by {NULL} using regular expressions. Dates were parsed to the required YYYY-MM-DD format using the Python script ParseDates.py.
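The substitution and date-parsing steps described above could be sketched in Python as follows. This is not the ParseDates.py script used for the dataset, which is not reproduced in this report. The function names, the example row and the assumed input date format (DD/MM/YYYY) are illustrative assumptions; only the {NULL} placeholder, the #DIV/0! marker and the YYYY-MM-DD target format come from this report.

```python
from datetime import datetime

NULL_TOKEN = "{NULL}"               # placeholder named in this report
MISSING_MARKERS = {"", "#DIV/0!"}   # empty cells and Excel division errors


def normalise_cell(value: str) -> str:
    """Return NULL_TOKEN for missing cells, otherwise the stripped value."""
    stripped = value.strip()
    return NULL_TOKEN if stripped in MISSING_MARKERS else stripped


def parse_date(value: str, input_format: str = "%d/%m/%Y") -> str:
    """Convert a date string to YYYY-MM-DD.

    The default input_format is an assumption; the format of the dates
    in the raw file is not documented in this report.
    """
    return datetime.strptime(value.strip(), input_format).strftime("%Y-%m-%d")


# Invented example row: sample code, hive, date, sampling site, pool size, Cq.
row = ["A1-01", "154", "10/04/2018", "", "1", "#DIV/0!"]
cleaned = [normalise_cell(cell) for cell in row]
cleaned[2] = parse_date(cleaned[2])
print(cleaned)
# ['A1-01', '154', '2018-04-10', '{NULL}', '1', '{NULL}']
```

Keeping the missing-value markers in one set makes it easy to add further markers if the data provider uses others.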

Data was then exported to the respective preparatory files and uploaded to the EU Pollinator Hub according to SOP-017 (Dataset integration).

Data validation

No data validation was performed.

Data analysis

No data analysis was performed.

Data description

Dataset

Table 3. Summary of tables belonging to the dataset. Table row identifier (No); name of the table (Table); description of the table (Description).
No Table Description
1 Virus Data Data on the quantification of the viruses in pools of different sizes and according to the sampling frame.
Table 4. Standardised metadata of the dataset. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Unique identifier BGDVR197.0.0
Title B-GOOD Virus levels
Long title Dataset from the B-GOOD project, containing Virus levels in honey bee pools of different sizes
Target IRI https://app.pollinatorhub.eu/dataset-discovery/BGDVR197.0.0
Licence CC BY-NC-ND 4.0
DOI n/a
Created 2024-11-02
Published 2025-03-17
Contact n/a
Keywords ABPV, Apis mellifera, BQCV, CBPV, DWV, SBV, honey bee
Data collection years n/a
Regions, the data was collected in n/a
Abstract

Dataset containing data on the quantification of 5 viruses in honey bees (DWV, BQCV, ABPV, CBPV, SBV) in pools of different sizes (1, 30 or 100 bees) and according to the sampling frame (brood vs. storage) in honey bee samples. It was published by Godeau UG, Pioz MP, Dievart VD, Alaux CA, Bonjour-Dalmon ABD (INRAE) on the B-GOOD Bee Health Data Portal as part of the B-GOOD project (grant agreement 817622), funded under the EU Horizon 2020 Research and Innovation Programme.

Table 5. Standardised metadata of the data provider B-GOOD Bee Health Data Portal. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Name B-GOOD Bee Health Data Portal
Url
Acronym B-GOOD
IRI https://app.pollinatorhub.eu/data-providers/b-good-bee-health-data-portal
Address https://b-good-project.eu
Country Belgium
Contact b-good-project.eu
Description

Project funded by the EU Horizon 2020 Research and Innovation Programme under grant agreement No 817622. Project website: https://b-good-project.eu

Tables

Virus Data

Table 6. Standardised metadata of the table. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Unique identifier BGDVR197.VRSDT483.0
Name Virus Data
Target IRI https://app.pollinatorhub.eu/dataset-discovery/parts/BGDVR197.VRSDT483.0
Table Type File
Licence CC BY-NC-ND 4.0
Description

Data on the quantification of the viruses in pools of different sizes and according to the sampling frame.


Metadata

n/a
Table 7. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Column Name Column Description Datatype Descriptor Unit
BM Sample Code

Sample code.

String dwc:materialSampleID [0.0.MTRLS489]

n/a

Hive ID

Identifier of the hive from which the sample was taken.

String pms:beehiveID [0.0.HVEID216]

n/a

Sampling Date

Date at which the sample was taken.

Date iso-8601:calendarDate [0.0.DATEA317]

n/a

Sampling Site

Type of frame (Brood frame, Storage frame) from which the sample was taken.

String Text [0.0.TEXTA315]

n/a

Pool size (nb bees)

Number of individual honey bees in the sample.

Integer number Integer [0.0.NTGER313]

no.

DWV Cq mean

Not specified by the data provider. Presumably the average Cq-value (Ct value) for the infection load with the Deformed Wing Virus (DWV).

Decimal number pms:quantificationCycle [0.0.QNTFC467]

no.

DWV Nb copies/bee

Not specified by the data provider. Presumably the number of copies of the analysed sequence for Deformed Wing Virus (DWV) detected in the sample.

Integer number Integer [0.0.NTGER313]

n/a

SBV Cq mean

Not specified by the data provider. Presumably the average Cq-value (Ct value) for the infection load with the Sackbrood Virus (SBV).

Decimal number pms:quantificationCycle [0.0.QNTFC467]

no.

SBV Nb copies/bee

Not specified by the data provider. Presumably the number of copies of the analysed sequence for Sackbrood Virus (SBV) detected in the sample.

Integer number Integer [0.0.NTGER313]

n/a

BQCV Cq mean

Not specified by the data provider. Presumably the average Cq-value (Ct value) for the infection load with the Black Queen Cell Virus (BQCV).

Decimal number pms:quantificationCycle [0.0.QNTFC467]

no.

BQCV Nb copies/bee

Not specified by the data provider. Presumably the number of copies of the analysed sequence for Black Queen Cell Virus (BQCV) detected in the sample.

Integer number Integer [0.0.NTGER313]

n/a

CBPV Cq mean

Not specified by the data provider. Presumably the average Cq-value (Ct value) for the infection load with the Chronic Bee Paralysis Virus (CBPV).

Decimal number pms:quantificationCycle [0.0.QNTFC467]

no.

CBPV Nb copies/bee

Not specified by the data provider. Presumably the number of copies of the analysed sequence for Chronic Bee Paralysis Virus (CBPV) detected in the sample.

Integer number Integer [0.0.NTGER313]

n/a

ABPV Cq mean

Not specified by the data provider. Presumably the average Cq-value (Ct value) for the infection load with the Acute Bee Paralysis Virus (ABPV).

Decimal number pms:quantificationCycle [0.0.QNTFC467]

no.

ABPV Nb copies/bee

Not specified by the data provider. Presumably the number of copies of the analysed sequence for Acute Bee Paralysis Virus (ABPV) detected in the sample.

Integer number Integer [0.0.NTGER313]

n/a

Metadata of individual tables can be found in Annex 1.

Descriptive measures

Table 8. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Column Name Range Mean Minimum Q1 Median Q3 Maximum Total Missing Zero Blank Distinct
BM Sample Code 4 - 6 n/a 06.Apr n/a n/a n/a Jun.30 706 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 706 ( 100.0% )
Hive ID 1 - 12 n/a 6 n/a n/a n/a 93(b)-46(h) 706 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 18 ( 2.5% )
Sampling Date 10 - 10 n/a 2018-04-10 n/a n/a n/a 2020-10-16 706 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 5 ( 0.7% )
Sampling Site 0 - 13 n/a Brood frame n/a n/a n/a Storage fram… 706 351 ( 49.7% ) 0 ( 0.0% ) 0 ( 0.0% ) 3 ( 0.4% )
Pool size (nb bees) 1 - 3 26.6 1 1 1 30 100 706 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 5 ( 0.7% )
DWV Cq mean 3 - 5 19.934 3.59 12.535 20.88 26.455 39.73 706 1 ( 0.1% ) 0 ( 0.0% ) 0 ( 0.0% ) 610 ( 86.4% )
DWV Nb copies/bee 4 - 15 2,428,369,430,152.8 2,320 23,900,000 1,540,000,000 444,500,000,000 107,000,000,000,000 706 1 ( 0.1% ) 0 ( 0.0% ) 0 ( 0.0% ) 656 ( 92.9% )
SBV Cq mean 4 - 5 26.613 7.63 22.33 27.99 31.3025 39.27 706 8 ( 1.1% ) 0 ( 0.0% ) 0 ( 0.0% ) 593 ( 84.0% )
SBV Nb copies/bee 3 - 13 11,032,210,469.1 705 187,750 2,445,000 70,900,000 4,310,000,000,000 706 8 ( 1.1% ) 0 ( 0.0% ) 0 ( 0.0% ) 637 ( 90.2% )
BQCV Cq mean 4 - 5 22.718 4.85 19.135 23.825 26.295 35.54 706 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 568 ( 80.5% )
BQCV Nb copies/bee 5 - 14 53,648,532,226.6 15,700 12,600,000 59,350,000 1,187,500,000 24,400,000,000,000 706 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 624 ( 88.4% )
CBPV Cq mean 1 - 5 33.389 4 32.5175 34.08 35.2625 39.96 706 164 ( 23.2% ) 0 ( 0.0% ) 0 ( 0.0% ) 410 ( 58.1% )
CBPV Nb copies/bee 3 - 12 1,426,840,228.1 480 25,050 78,500 384,000 508,000,000,000 706 164 ( 23.2% ) 0 ( 0.0% ) 0 ( 0.0% ) 480 ( 68.0% )
ABPV Cq mean 1 - 5 27.212 2 23.175 28.42 32.015 39.37 706 57 ( 8.1% ) 0 ( 0.0% ) 0 ( 0.0% ) 553 ( 78.3% )
ABPV Nb copies/bee 3 - 14 221,083,836,715.4 989 178,500 3,130,000 111,500,000 37,700,000,000,000 706 57 ( 8.1% ) 0 ( 0.0% ) 0 ( 0.0% ) 607 ( 86.0% )
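As a rough illustration of how measures like those in Table 8 can be derived for a numeric column, the following Python sketch computes the mean, quartiles and the missing and distinct counts from a list of cell values. The helper name, the example values and the choice of quartile method (inclusive linear interpolation) are assumptions; this report does not state which method the EU Pollinator Hub applies.

```python
import statistics

NULL_TOKEN = "{NULL}"  # placeholder used for missing values in this report


def describe_column(cells):
    """Summarise a column of string cells, treating {NULL} as missing."""
    values = [float(c) for c in cells if c != NULL_TOKEN]
    # Quartile method is an assumption, not the documented EUPH method.
    q1, median, q3 = statistics.quantiles(values, n=4, method="inclusive")
    return {
        "mean": statistics.fmean(values),
        "min": min(values),
        "q1": q1,
        "median": median,
        "q3": q3,
        "max": max(values),
        "total": len(cells),
        "missing": len(cells) - len(values),
        "distinct": len(set(cells)),  # NULL counts as one distinct value
    }


cells = ["1", "1", "30", "100", NULL_TOKEN]  # invented example values
summary = describe_column(cells)
print(summary)
```

Counting {NULL} as a distinct value mirrors the Table 8 caption, which describes Distinct as including NULL, Zero and blank values.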

Quality measures

Table 9. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Column Name Completeness Uniqueness Most Common Value Least Common Value
BM Sample Code 100.00% 100.00% A1-01 A1-01
Hive ID 100.00% 2.55% 154 438
Sampling Date 100.00% 0.71% 2020-10-16 2019-04-04
Sampling Site 50.28% 0.42% n/a Storage frame
Pool size (nb bees) 100.00% 0.71% 1 98
DWV Cq mean 99.86% 86.40% 17.55 32.01
DWV Nb copies/bee 99.86% 92.92% 10200000 1470000
SBV Cq mean 98.87% 83.99% n/a 25.85
SBV Nb copies/bee 98.87% 90.23% 1450000 5130000
BQCV Cq mean 100.00% 80.45% 26.09 19.65
BQCV Nb copies/bee 100.00% 88.39% 32300000 890000000
CBPV Cq mean 76.77% 58.07% n/a 38.87
CBPV Nb copies/bee 76.77% 67.99% 13600 18300
ABPV Cq mean 91.93% 78.33% n/a 28.35
ABPV Nb copies/bee 91.93% 85.98% 33400 5870000
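The percentages in Table 9 appear to follow from the counts in Table 8: completeness matches the share of non-missing records, and uniqueness matches the share of distinct values (counting NULL as a value) among all records. The Python sketch below reproduces a few of the reported figures under that assumption; the function names are illustrative and not part of any EU Pollinator Hub tooling.

```python
def completeness(total: int, missing: int) -> float:
    """Percentage of records that are not missing."""
    return round(100 * (total - missing) / total, 2)


def uniqueness(total: int, distinct: int) -> float:
    """Percentage of distinct values among all records."""
    return round(100 * distinct / total, 2)


# Counts taken from Table 8 (706 records per column).
print(f"{completeness(706, 351):.2f}%")  # Sampling Site -> 50.28%
print(f"{completeness(706, 164):.2f}%")  # CBPV Cq mean  -> 76.77%
print(f"{uniqueness(706, 18):.2f}%")     # Hive ID       -> 2.55%
print(f"{uniqueness(706, 610):.2f}%")    # DWV Cq mean   -> 86.40%
```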

Changes made to preparatory file

None

Changes made to data

  1. Missing values (351 occurrences) were replaced by {NULL}.
  2. Fields containing {#DIV/0!} (460 occurrences) were replaced by {NULL}.

Unresolved issues

  1. Columns DWV Cq mean, DWV Nb copies/bee, SBV Cq mean, SBV Nb copies/bee, CBPV Cq mean, CBPV Nb copies/bee, ABPV Cq mean, ABPV Nb copies/bee contain 460 occurrences of {#DIV/0!}. The data provider is requested to re-validate the data.
  2. In column BM Sample Code it seems that in 30 records a date instead of a sample number has been stored, probably the consequence of wrong column formatting by Excel. The data provider is requested to re-validate the data.
  3. For column DWV Cq mean it may be guessed, but it is not explicitly stated what it describes. The data provider is requested to make this information available.
  4. For column DWV Nb copies/bee it may be guessed, but it is not explicitly stated what it describes. The data provider is requested to make this information available.
  5. For column SBV Cq mean it may be guessed, but it is not explicitly stated what it describes. The data provider is requested to make this information available.
  6. For column SBV Nb copies/bee it may be guessed, but it is not explicitly stated what it describes. The data provider is requested to make this information available.
  7. For column BQCV Cq mean it may be guessed, but it is not explicitly stated what it describes. The data provider is requested to make this information available.
  8. For column BQCV Nb copies/bee it may be guessed, but it is not explicitly stated what it describes. The data provider is requested to make this information available.
  9. For column CBPV Cq mean it may be guessed, but it is not explicitly stated what it describes. The data provider is requested to make this information available.
  10. For column CBPV Nb copies/bee it may be guessed, but it is not explicitly stated what it describes. The data provider is requested to make this information available.
  11. For column ABPV Cq mean it may be guessed, but it is not explicitly stated what it describes. The data provider is requested to make this information available.
  12. For column ABPV Nb copies/bee it may be guessed, but it is not explicitly stated what it describes. The data provider is requested to make this information available.
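Unresolved issue 2 could be screened for programmatically. The following Python sketch flags sample codes that look like the date-like strings reported for column BM Sample Code (06.Apr, Jun.30). The pattern and the function name are assumptions based on those two examples only, and the original sample codes cannot be recovered this way.

```python
import re

MONTHS = "Jan|Feb|Mar|Apr|May|Jun|Jul|Aug|Sep|Oct|Nov|Dec"
# Matches "06.Apr" (number.month) or "Jun.30" (month.number) style values.
DATE_LIKE = re.compile(
    rf"^(?:\d{{1,2}}\.(?:{MONTHS})|(?:{MONTHS})\.\d{{1,2}})$"
)


def looks_date_converted(code: str) -> bool:
    """Return True if a sample code resembles an Excel-converted date."""
    return DATE_LIKE.match(code.strip()) is not None


codes = ["A1-01", "06.Apr", "Jun.30"]  # values that appear in this report
suspect = [c for c in codes if looks_date_converted(c)]
print(suspect)
# ['06.Apr', 'Jun.30']
```

Records flagged in this way would still need to be checked against the original .xlsx file by the data provider.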

References

  1. Godeau U., Pioz M., Dievart V., Alaux C., Bonjour-Dalmon A. 2023 Virus levels in bee pools of different sizes. B-GOOD Bee Health Data Portal. [2024-11-02] beehealthdata.org

Annex 1: Table column reports

Table: Virus Data

Column: BM Sample Code

Table 10. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Parameter Content
Column name BM Sample Code
Description

Sample code.

Data type String
Descriptor dwc:materialSampleID [UID:0.0.MTRLS489]
Descriptor description

An identifier for a material sample.

Descriptor target IRI http://rs.tdwg.org/dwc/terms/materialSampleID
Unit

n/a

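Table 11. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).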
Column Name Range Mean Minimum Q1 Median Q3 Maximum Total Missing Zero Blank Distinct
BM Sample Code 4 - 6 n/a 06.Apr n/a n/a n/a Jun.30 706 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 706 ( 100.0% )
Table 12. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Column Name Completeness Uniqueness Most Common Value Least Common Value
BM Sample Code 100.00% 100.00% A1-01 A1-01

Completeness

Figure 1. Visualization of completeness of the data in the column.

Uniqueness

Figure 2. Visualization of uniqueness of the data in the column.

Column: Hive ID

Table 13. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Parameter Content
Column name Hive ID
Description

Identifier of the hive from which the sample was taken.

Data type String
Descriptor pms:beehiveID [UID:0.0.HVEID216]
Descriptor description

A beehive ID is a unique sequence of characters associated with a beehive, which is specific to a dataset, to an apiary or to a beekeeper.

Descriptor target IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.HVEID216
Unit

n/a

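Table 14. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).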
Column Name Range Mean Minimum Q1 Median Q3 Maximum Total Missing Zero Blank Distinct
Hive ID 1 - 12 n/a 6 n/a n/a n/a 93(b)-46(h) 706 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 18 ( 2.5% )
Table 15. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Column Name Completeness Uniqueness Most Common Value Least Common Value
Hive ID 100.00% 2.55% 154 438

Data Distribution Top 20

Figure 3. Distribution of 20 most common values, from highest to lowest.

Completeness

Figure 4. Visualization of completeness of the data in the column.

Uniqueness

Figure 5. Visualization of uniqueness of the data in the column.

Column: Sampling Date

Table 16. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Parameter Content
Column name Sampling Date
Description

Date at which the sample was taken.

Data type Date
Descriptor iso-8601:calendarDate [UID:0.0.DATEA317]
Descriptor description

particular calendar day [...] represented by its calendar year [...], its calendar month [...] and its calendar day of month [...]

Descriptor target IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.DATEA317
Unit

n/a

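Table 17. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).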
Column Name Range Mean Minimum Q1 Median Q3 Maximum Total Missing Zero Blank Distinct
Sampling Date 10 - 10 n/a 2018-04-10 n/a n/a n/a 2020-10-16 706 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 5 ( 0.7% )
Table 18. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Column Name Completeness Uniqueness Most Common Value Least Common Value
Sampling Date 100.00% 0.71% 2020-10-16 2019-04-04

Data Distribution Top 20

Figure 6. Distribution of 20 most common values, from highest to lowest.

Completeness

Figure 7. Visualization of completeness of the data in the column.

Uniqueness

Figure 8. Visualization of uniqueness of the data in the column.

Column: Sampling Site

Table 19. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Parameter Content
Column name Sampling Site
Description

Type of frame (Brood frame, Storage frame) from which the sample was taken.

Data type String
Descriptor Text [UID:0.0.TEXTA315]
Descriptor description

In computer programming, a string is traditionally a sequence of characters, either as a literal constant or as some kind of variable. The latter may allow its elements to be mutated and the length changed, or it may be fixed (after creation). A string is generally considered as a data type and is often implemented as an array data structure of bytes (or words) that stores a sequence of elements, typically characters, using some character encoding. String may also denote more general arrays or other sequence (or list) data types and structures.

Descriptor target IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.TEXTA315
Unit

n/a

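Table 20. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).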
Column Name Range Mean Minimum Q1 Median Q3 Maximum Total Missing Zero Blank Distinct
Sampling Site 0 - 13 n/a Brood frame n/a n/a n/a Storage fram… 706 351 ( 49.7% ) 0 ( 0.0% ) 0 ( 0.0% ) 3 ( 0.4% )
Table 21. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Column Name Completeness Uniqueness Most Common Value Least Common Value
Sampling Site 50.28% 0.42% n/a Storage frame

Data Distribution Top 20

Figure 9. Distribution of 20 most common values, from highest to lowest.

Completeness

Figure 10. Visualization of completeness of the data in the column.

Uniqueness

Figure 11. Visualization of uniqueness of the data in the column.

Column: Pool size (nb bees)

Table 22. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Parameter Content
Column name Pool size (nb bees)
Description

Number of individual honey bees in the sample.

Data type Integer number
Descriptor Integer [UID:0.0.NTGER313]
Descriptor description

A number with no fractional part, including the negative and positive numbers as well as zero.

Descriptor target IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.NTGER313
Unit

no.

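Table 23. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).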
Column Name Range Mean Minimum Q1 Median Q3 Maximum Total Missing Zero Blank Distinct
Pool size (nb bees) 1 - 3 26.6 1 1 1 30 100 706 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 5 ( 0.7% )
Table 24. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Column Name Completeness Uniqueness Most Common Value Least Common Value
Pool size (nb bees) 100.00% 0.71% 1 98

Data Distribution Top 20

Figure 12. Distribution of 20 most common values, from highest to lowest.

Continuous Data Distribution

Figure 13. Distribution of values in the column.

Outliers

Figure 14. Visualization of median, min, max, and outliers in the column.

Completeness

Figure 15. Visualization of completeness of the data in the column.

Uniqueness

Figure 16. Visualization of uniqueness of the data in the column.

Column: DWV Cq mean

Table 25. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Parameter Content
Column name DWV Cq mean
Description

Not specified by the data provider. Presumably the average Cq-value (Ct value) for the infection load with the Deformed Wing Virus (DWV).

Data type Decimal number
Descriptor pms:quantificationCycle [UID:0.0.QNTFC467]
Descriptor description

Depending on the real-time instrument, either threshold cycle (Ct), crossing point (Cp) or a take-off point (Top) are used to refer to the same quantification cycle value (Cq): the fractional PCR cycle at which the target is quantified in a given sample. It was proposed to use the term quantification cycle (Cq) in accordance with the data standard RDML (Real-Time PCR Data Markup Language)

Descriptor target IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.QNTFC467
Unit

no.

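Table 26. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).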
Column Name Range Mean Minimum Q1 Median Q3 Maximum Total Missing Zero Blank Distinct
DWV Cq mean 3 - 5 19.934 3.59 12.535 20.88 26.455 39.73 706 1 ( 0.1% ) 0 ( 0.0% ) 0 ( 0.0% ) 610 ( 86.4% )
Table 27. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Column Name Completeness Uniqueness Most Common Value Least Common Value
DWV Cq mean 99.86% 86.40% 17.55 32.01

Continuous Data Distribution

Figure 17. Distribution of values in the column.

Outliers

Figure 18. Visualization of median, min, max, and outliers in the column.

Completeness

Figure 19. Visualization of completeness of the data in the column.

Uniqueness

Figure 20. Visualization of uniqueness of the data in the column.

Column: DWV Nb copies/bee

Table 28. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Parameter Content
Column name DWV Nb copies/bee
Description

Not specified by the data provider. Presumably the number of copies of the analysed sequence for Deformed Wing Virus (DWV) detected in the sample.

Data type Integer number
Descriptor Integer [UID:0.0.NTGER313]
Descriptor description

A number with no fractional part, including the negative and positive numbers as well as zero.

Descriptor target IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.NTGER313
Unit

n/a

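Table 29. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).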
Column Name Range Mean Minimum Q1 Median Q3 Maximum Total Missing Zero Blank Distinct
DWV Nb copies/bee 4 - 15 2,428,369,430,152.8 2,320 23,900,000 1,540,000,000 444,500,000,000 107,000,000,000,000 706 1 ( 0.1% ) 0 ( 0.0% ) 0 ( 0.0% ) 656 ( 92.9% )
Table 30. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Column Name Completeness Uniqueness Most Common Value Least Common Value
DWV Nb copies/bee 99.86% 92.92% 10200000 1470000

Continuous Data Distribution

Figure 21. Distribution of values in the column.

Outliers

Figure 22. Visualization of median, min, max, and outliers in the column.

Completeness

Figure 23. Visualization of completeness of the data in the column.

Uniqueness

Figure 24. Visualization of uniqueness of the data in the column.

Column: SBV Cq mean

Table 31. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Parameter Content
Column name SBV Cq mean
Description

Not specified by the data provider. Presumably the average Cq-value (Ct value) for the infection load with the Sackbrood Virus (SBV).

Data type Decimal number
Descriptor pms:quantificationCycle [UID:0.0.QNTFC467]
Descriptor description

Depending on the real-time instrument, either threshold cycle (Ct), crossing point (Cp) or a take-off point (Top) are used to refer to the same quantification cycle value (Cq): the fractional PCR cycle at which the target is quantified in a given sample. It was proposed to use the term quantification cycle (Cq) in accordance with the data standard RDML (Real-Time PCR Data Markup Language)

Descriptor target IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.QNTFC467
Unit

no.

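Table 32. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).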
Column Name Range Mean Minimum Q1 Median Q3 Maximum Total Missing Zero Blank Distinct
SBV Cq mean 4 - 5 26.613 7.63 22.33 27.99 31.3025 39.27 706 8 ( 1.1% ) 0 ( 0.0% ) 0 ( 0.0% ) 593 ( 84.0% )
Table 33. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Column Name Completeness Uniqueness Most Common Value Least Common Value
SBV Cq mean 98.87% 83.99% n/a 25.85

Continuous Data Distribution

Figure 25. Distribution of values in the column.

Outliers

Figure 26. Visualization of median, min, max, and outliers in the column.

Completeness

Figure 27. Visualization of completeness of the data in the column.

Uniqueness

Figure 28. Visualization of uniqueness of the data in the column.

Column: SBV Nb copies/bee

Table 34. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Parameter Content
Column name SBV Nb copies/bee
Description

Not specified by the data provider. Presumably the number of copies of the analysed Sacbrood Virus (SBV) sequence detected per bee in the sample.

Data type Integer number
Descriptor Integer [UID:0.0.NTGER313]
Descriptor description

A number with no fractional part, including the negative and positive numbers as well as zero.

Descriptor target IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.NTGER313
Unit

n/a

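Table 35. Statistical analysis of the column. Column name (Column Name); presumably the minimum and maximum number of characters per value (Range); arithmetic mean (Mean); minimum (Minimum); first quartile (Q1); median (Median); third quartile (Q3); maximum (Maximum); number of entries (Total); number and share of missing entries (Missing), zero values (Zero), and blank entries (Blank); number and share of distinct values (Distinct).
Column Name | Range | Mean | Minimum | Q1 | Median | Q3 | Maximum | Total | Missing | Zero | Blank | Distinct
SBV Nb copies/bee | 3 - 13 | 11,032,210,469.1 | 705 | 187,750 | 2,445,000 | 70,900,000 | 4,310,000,000,000 | 706 | 8 (1.1%) | 0 (0.0%) | 0 (0.0%) | 637 (90.2%)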
Table 36. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Column Name | Completeness | Uniqueness | Most Common Value | Least Common Value
SBV Nb copies/bee | 98.87% | 90.23% | 1450000 | 5130000
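The summary values for SBV Nb copies/bee span roughly ten orders of magnitude, and the mean lies far above the median. A minimal sketch of how a reader might put these figures on a log10 scale is given below. The numbers are copied from the statistical table above; the log transformation is a suggestion for inspection, not something the report itself applies.

```python
import math

# Summary values reported for "SBV Nb copies/bee"
summary = {
    "Minimum": 705,
    "Q1": 187_750,
    "Median": 2_445_000,
    "Q3": 70_900_000,
    "Maximum": 4_310_000_000_000,
}
mean = 11_032_210_469.1

# log10 puts values that differ by orders of magnitude on a comparable scale
log_summary = {name: math.log10(value) for name, value in summary.items()}
for name, value in log_summary.items():
    print(f"{name:8s} log10 = {value:.2f}")

# A mean far above the median indicates a strongly right-skewed distribution
print(f"mean / median = {mean / summary['Median']:.0f}")
```

Here the mean exceeds the median by more than three orders of magnitude, so the median and the quartiles describe a typical sample better than the mean does.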

Continuous Data Distribution

Figure 29. Distribution of values in the column.

Outliers

Figure 30. Visualization of median, min, max, and outliers in the column.

Completeness

Figure 31. Visualization of completeness of the data in the column.

Uniqueness

Figure 32. Visualization of uniqueness of the data in the column.

Column: BQCV Cq mean

Table 37. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Parameter Content
Column name BQCV Cq mean
Description

Not specified by the data provider. Presumably the mean Cq value (Ct value) indicating the infection load with Black Queen Cell Virus (BQCV).

Data type Decimal number
Descriptor pms:quantificationCycle [UID:0.0.QNTFC467]
Descriptor description

Depending on the real-time PCR instrument, the terms threshold cycle (Ct), crossing point (Cp), and take-off point (TOP) all refer to the same quantification cycle value (Cq): the fractional PCR cycle at which the target is quantified in a given sample. The term quantification cycle (Cq) was proposed in accordance with the RDML (Real-Time PCR Data Markup Language) data standard.

Descriptor target IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.QNTFC467
Unit

no.

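Table 38. Statistical analysis of the column. Column name (Column Name); presumably the minimum and maximum number of characters per value (Range); arithmetic mean (Mean); minimum (Minimum); first quartile (Q1); median (Median); third quartile (Q3); maximum (Maximum); number of entries (Total); number and share of missing entries (Missing), zero values (Zero), and blank entries (Blank); number and share of distinct values (Distinct).
Column Name | Range | Mean | Minimum | Q1 | Median | Q3 | Maximum | Total | Missing | Zero | Blank | Distinct
BQCV Cq mean | 4 - 5 | 22.718 | 4.85 | 19.135 | 23.825 | 26.295 | 35.54 | 706 | 0 (0.0%) | 0 (0.0%) | 0 (0.0%) | 568 (80.5%)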
Table 39. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Column Name | Completeness | Uniqueness | Most Common Value | Least Common Value
BQCV Cq mean | 100.00% | 80.45% | 26.09 | 19.65

Continuous Data Distribution

Figure 33. Distribution of values in the column.

Outliers

Figure 34. Visualization of median, min, max, and outliers in the column.
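The report does not state how outliers are identified for the figure above. One common convention, assumed here purely for illustration, is Tukey's rule, which flags values lying more than 1.5 interquartile ranges below Q1 or above Q3. The sketch applies it to the quartiles reported for BQCV Cq mean; the function name is hypothetical.

```python
def tukey_fences(q1: float, q3: float, k: float = 1.5) -> tuple[float, float]:
    """Return the (lower, upper) fences of Tukey's rule for the given quartiles."""
    iqr = q3 - q1
    return q1 - k * iqr, q3 + k * iqr


# Quartiles and extremes reported for "BQCV Cq mean"
low, high = tukey_fences(q1=19.135, q3=26.295)
minimum, maximum = 4.85, 35.54

print(f"fences: {low:.3f} to {high:.3f}")
print("minimum below lower fence:", minimum < low)   # True
print("maximum above upper fence:", maximum > high)  # False
```

Under this assumed rule the reported minimum of 4.85 would count as a low outlier, while the maximum of 35.54 stays within the upper fence.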

Completeness

Figure 35. Visualization of completeness of the data in the column.

Uniqueness

Figure 36. Visualization of uniqueness of the data in the column.

Column: BQCV Nb copies/bee

Table 40. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Parameter Content
Column name BQCV Nb copies/bee
Description

Not specified by the data provider. Presumably the number of copies of the analysed Black Queen Cell Virus (BQCV) sequence detected per bee in the sample.

Data type Integer number
Descriptor Integer [UID:0.0.NTGER313]
Descriptor description

A number with no fractional part, including the negative and positive numbers as well as zero.

Descriptor target IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.NTGER313
Unit

n/a

Table 41. Statistical analysis of the column. Column name (Column Name); presumably the minimum and maximum number of characters per value (Range); arithmetic mean (Mean); minimum (Minimum); first quartile (Q1); median (Median); third quartile (Q3); maximum (Maximum); number of entries (Total); number and share of missing entries (Missing), zero values (Zero), and blank entries (Blank); number and share of distinct values (Distinct).
Column Name | Range | Mean | Minimum | Q1 | Median | Q3 | Maximum | Total | Missing | Zero | Blank | Distinct
BQCV Nb copies/bee | 5 - 14 | 53,648,532,226.6 | 15,700 | 12,600,000 | 59,350,000 | 1,187,500,000 | 24,400,000,000,000 | 706 | 0 (0.0%) | 0 (0.0%) | 0 (0.0%) | 624 (88.4%)
Table 42. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Column Name | Completeness | Uniqueness | Most Common Value | Least Common Value
BQCV Nb copies/bee | 100.00% | 88.39% | 32300000 | 890000000

Continuous Data Distribution

Figure 37. Distribution of values in the column.

Outliers

Figure 38. Visualization of median, min, max, and outliers in the column.

Completeness

Figure 39. Visualization of completeness of the data in the column.

Uniqueness

Figure 40. Visualization of uniqueness of the data in the column.

Column: CBPV Cq mean

Table 43. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Parameter Content
Column name CBPV Cq mean
Description

Not specified by the data provider. Presumably the mean Cq value (Ct value) indicating the infection load with Chronic Bee Paralysis Virus (CBPV).

Data type Decimal number
Descriptor pms:quantificationCycle [UID:0.0.QNTFC467]
Descriptor description

Depending on the real-time PCR instrument, the terms threshold cycle (Ct), crossing point (Cp), and take-off point (TOP) all refer to the same quantification cycle value (Cq): the fractional PCR cycle at which the target is quantified in a given sample. The term quantification cycle (Cq) was proposed in accordance with the RDML (Real-Time PCR Data Markup Language) data standard.

Descriptor target IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.QNTFC467
Unit

no.

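Table 44. Statistical analysis of the column. Column name (Column Name); presumably the minimum and maximum number of characters per value (Range); arithmetic mean (Mean); minimum (Minimum); first quartile (Q1); median (Median); third quartile (Q3); maximum (Maximum); number of entries (Total); number and share of missing entries (Missing), zero values (Zero), and blank entries (Blank); number and share of distinct values (Distinct).
Column Name | Range | Mean | Minimum | Q1 | Median | Q3 | Maximum | Total | Missing | Zero | Blank | Distinct
CBPV Cq mean | 1 - 5 | 33.389 | 4 | 32.5175 | 34.08 | 35.2625 | 39.96 | 706 | 164 (23.2%) | 0 (0.0%) | 0 (0.0%) | 410 (58.1%)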
Table 45. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Column Name | Completeness | Uniqueness | Most Common Value | Least Common Value
CBPV Cq mean | 76.77% | 58.07% | n/a | 38.87

Continuous Data Distribution

Figure 41. Distribution of values in the column.

Outliers

Figure 42. Visualization of median, min, max, and outliers in the column.

Completeness

Figure 43. Visualization of completeness of the data in the column.

Uniqueness

Figure 44. Visualization of uniqueness of the data in the column.

Column: CBPV Nb copies/bee

Table 46. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Parameter Content
Column name CBPV Nb copies/bee
Description

Not specified by the data provider. Presumably the number of copies of the analysed Chronic Bee Paralysis Virus (CBPV) sequence detected per bee in the sample.

Data type Integer number
Descriptor Integer [UID:0.0.NTGER313]
Descriptor description

A number with no fractional part, including the negative and positive numbers as well as zero.

Descriptor target IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.NTGER313
Unit

n/a

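Table 47. Statistical analysis of the column. Column name (Column Name); presumably the minimum and maximum number of characters per value (Range); arithmetic mean (Mean); minimum (Minimum); first quartile (Q1); median (Median); third quartile (Q3); maximum (Maximum); number of entries (Total); number and share of missing entries (Missing), zero values (Zero), and blank entries (Blank); number and share of distinct values (Distinct).
Column Name | Range | Mean | Minimum | Q1 | Median | Q3 | Maximum | Total | Missing | Zero | Blank | Distinct
CBPV Nb copies/bee | 3 - 12 | 1,426,840,228.1 | 480 | 25,050 | 78,500 | 384,000 | 508,000,000,000 | 706 | 164 (23.2%) | 0 (0.0%) | 0 (0.0%) | 480 (68.0%)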
Table 48. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Column Name | Completeness | Uniqueness | Most Common Value | Least Common Value
CBPV Nb copies/bee | 76.77% | 67.99% | 13600 | 18300
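The Range entries in the statistical tables (for example 3 - 12 for CBPV Nb copies/bee) are not explained in the report. They appear consistent with the number of characters in the shortest and longest stored value. The sketch below checks this reading against the reported minimum and maximum of the four copies/bee columns; the interpretation and the helper name `digit_range` are assumptions.

```python
def digit_range(minimum: int, maximum: int) -> str:
    """Format the digit counts of the smallest and largest value as 'a - b'."""
    return f"{len(str(minimum))} - {len(str(maximum))}"


# (Minimum, Maximum, Range) as reported in the statistical tables
reported = {
    "SBV Nb copies/bee": (705, 4_310_000_000_000, "3 - 13"),
    "BQCV Nb copies/bee": (15_700, 24_400_000_000_000, "5 - 14"),
    "CBPV Nb copies/bee": (480, 508_000_000_000, "3 - 12"),
    "ABPV Nb copies/bee": (989, 37_700_000_000_000, "3 - 14"),
}

for column, (lo, hi, expected) in reported.items():
    computed = digit_range(lo, hi)
    print(f"{column}: computed {computed}, reported {expected}, match={computed == expected}")
```

All four columns match under this reading. For integer columns the shortest and longest values coincide with the minimum and maximum, which is why the check can rely on the summary values alone.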

Continuous Data Distribution

Figure 45. Distribution of values in the column.

Outliers

Figure 46. Visualization of median, min, max, and outliers in the column.

Completeness

Figure 47. Visualization of completeness of the data in the column.

Uniqueness

Figure 48. Visualization of uniqueness of the data in the column.

Column: ABPV Cq mean

Table 49. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Parameter Content
Column name ABPV Cq mean
Description

Not specified by the data provider. Presumably the mean Cq value (Ct value) indicating the infection load with Acute Bee Paralysis Virus (ABPV).

Data type Decimal number
Descriptor pms:quantificationCycle [UID:0.0.QNTFC467]
Descriptor description

Depending on the real-time PCR instrument, the terms threshold cycle (Ct), crossing point (Cp), and take-off point (TOP) all refer to the same quantification cycle value (Cq): the fractional PCR cycle at which the target is quantified in a given sample. The term quantification cycle (Cq) was proposed in accordance with the RDML (Real-Time PCR Data Markup Language) data standard.

Descriptor target IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.QNTFC467
Unit

no.

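Table 50. Statistical analysis of the column. Column name (Column Name); presumably the minimum and maximum number of characters per value (Range); arithmetic mean (Mean); minimum (Minimum); first quartile (Q1); median (Median); third quartile (Q3); maximum (Maximum); number of entries (Total); number and share of missing entries (Missing), zero values (Zero), and blank entries (Blank); number and share of distinct values (Distinct).
Column Name | Range | Mean | Minimum | Q1 | Median | Q3 | Maximum | Total | Missing | Zero | Blank | Distinct
ABPV Cq mean | 1 - 5 | 27.212 | 2 | 23.175 | 28.42 | 32.015 | 39.37 | 706 | 57 (8.1%) | 0 (0.0%) | 0 (0.0%) | 553 (78.3%)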
Table 51. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Column Name | Completeness | Uniqueness | Most Common Value | Least Common Value
ABPV Cq mean | 91.93% | 78.33% | n/a | 28.35

Continuous Data Distribution

Figure 49. Distribution of values in the column.

Outliers

Figure 50. Visualization of median, min, max, and outliers in the column.

Completeness

Figure 51. Visualization of completeness of the data in the column.

Uniqueness

Figure 52. Visualization of uniqueness of the data in the column.

Column: ABPV Nb copies/bee

Table 52. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Parameter Content
Column name ABPV Nb copies/bee
Description

Not specified by the data provider. Presumably the number of copies of the analysed Acute Bee Paralysis Virus (ABPV) sequence detected per bee in the sample.

Data type Integer number
Descriptor Integer [UID:0.0.NTGER313]
Descriptor description

A number with no fractional part, including the negative and positive numbers as well as zero.

Descriptor target IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.NTGER313
Unit

n/a

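Table 53. Statistical analysis of the column. Column name (Column Name); presumably the minimum and maximum number of characters per value (Range); arithmetic mean (Mean); minimum (Minimum); first quartile (Q1); median (Median); third quartile (Q3); maximum (Maximum); number of entries (Total); number and share of missing entries (Missing), zero values (Zero), and blank entries (Blank); number and share of distinct values (Distinct).
Column Name | Range | Mean | Minimum | Q1 | Median | Q3 | Maximum | Total | Missing | Zero | Blank | Distinct
ABPV Nb copies/bee | 3 - 14 | 221,083,836,715.4 | 989 | 178,500 | 3,130,000 | 111,500,000 | 37,700,000,000,000 | 706 | 57 (8.1%) | 0 (0.0%) | 0 (0.0%) | 607 (86.0%)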
Table 54. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Column Name | Completeness | Uniqueness | Most Common Value | Least Common Value
ABPV Nb copies/bee | 91.93% | 85.98% | 33400 | 5870000
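The statistical tables in this section report the same set of summary measures for every column. A minimal sketch of how such a summary could be derived from raw column values is shown below. It assumes that missing entries are excluded before the statistics are computed and that quartiles use linear interpolation between neighbouring values; the report confirms neither. The function name and the toy values are hypothetical and not taken from the dataset.

```python
import statistics


def summarize(values):
    """Summarise a column in which None marks a missing entry (assumed convention)."""
    present = [v for v in values if v is not None]
    # The "inclusive" method interpolates linearly between the sorted data points
    q1, median, q3 = statistics.quantiles(present, n=4, method="inclusive")
    return {
        "Mean": statistics.fmean(present),
        "Minimum": min(present),
        "Q1": q1,
        "Median": median,
        "Q3": q3,
        "Maximum": max(present),
        "Total": len(values),
        "Missing": len(values) - len(present),
        "Distinct": len(set(present)),
    }


# Hypothetical toy values, not taken from the dataset
toy = [1000, None, 2000, 2000, 4000, 8000]
for name, value in summarize(toy).items():
    print(f"{name}: {value}")
```

Applied to the actual column values, a function of this kind would yield the Mean, quartile, Total, Missing, and Distinct entries, provided the assumptions above match the procedure used by the EU Pollinator Hub.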

Continuous Data Distribution

Figure 53. Distribution of values in the column.

Outliers

Figure 54. Visualization of median, min, max, and outliers in the column.

Completeness

Figure 55. Visualization of completeness of the data in the column.

Uniqueness

Figure 56. Visualization of uniqueness of the data in the column.