Table: data 2021 all countries

[UID:BGDWT180.DTLLC375.0]
Data

Data Files

Main dataset files related to this table.

File: weather 2021_PREP_MR_241014.csv
Description: n/a
File type: CSV - Comma separated values
File size: 54.66 MiB

Supplemental Files

Any supplemental files, not containing data.

Licence: This file contains dataset licensing information. This is a generated file.
Table Metadata: This file contains file column metadata in CSV format. This is a generated file.
About

Description

This table contains data recorded in 2021 by all weather stations in the different countries (BE, CH, DE, FR, NL, PT, RO, UK).

Table structure

n/a

Changes made to preparatory file

  1. Data reporting atmospheric pressure in the raw data files was either provided in Pa or in hPa. Values provided in hPa were transformed to Pa in order to store all values in a common unit while maintaining the raw data.
  2. Data reporting wind speed in the raw data files was either provided in m/s or in km/h. Values provided in km/h were transformed to m/s in order to store all values in a common unit while maintaining the raw data.
  3. Data reporting wind gust in the raw data files was either provided in m/s or in km/h. Values provided in km/h were transformed to m/s in order to store all values in a common unit while maintaining the raw data.
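
The three conversions listed above can be sketched as follows. This is a minimal illustration under stated assumptions, not the procedure actually applied to the preparatory file: the helper names (`hpa_to_pa`, `kmh_to_ms`, `harmonise`) and the rule format are hypothetical, and only the wind column names in the usage lines are taken from this page.

```python
def hpa_to_pa(value_hpa):
    """Convert hectopascal to pascal (1 hPa = 100 Pa)."""
    return value_hpa * 100.0


def kmh_to_ms(value_kmh):
    """Convert km/h to m/s (1 km/h = 1000 m / 3600 s)."""
    return value_kmh * 1000.0 / 3600.0


def harmonise(record, rules):
    """Fill the common-unit column from the raw-unit column when it is empty.

    `rules` is a list of (source_column, target_column, converter) tuples.
    The raw value is left untouched, so the original data is retained.
    """
    out = dict(record)  # work on a copy, never mutate the input
    for source_col, target_col, convert in rules:
        if out.get(target_col) is None and out.get(source_col) is not None:
            out[target_col] = convert(out[source_col])
    return out


# Hypothetical rule set; pressure columns would be added the same way
# with hpa_to_pa once their exact names are confirmed.
WIND_RULES = [
    ("wind_speed_kmh", "wind_speed_ms", kmh_to_ms),
    ("wind_gust_kmh", "wind_gust_ms", kmh_to_ms),
]

example = harmonise({"wind_speed_kmh": 36.0, "wind_speed_ms": None}, WIND_RULES)
```

Keeping the raw column next to the converted one mirrors the stated aim of storing all values in a common unit while maintaining the raw data.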

Changes made to data in table

No changes were made in the data file.

Unresolved issues

  1. In the columns temperature, temperature_min, temperature_max, feelslike and dew_point, 9,095 records from the device located in Romania contain values > 100°C. These records must be revised by the data provider.
  2. The description of the data (metadata) is largely incomplete and does not allow a clear standardisation of the data.
    • For column temperature it is unclear how the temperature is reported in the individual raw data files: either measured at a single point in time during the reporting interval (for example, at the very end) or calculated as the arithmetic mean of all measurements, or of a subset of measurements, in that interval.
    • For column feelslike it is unclear which algorithm was used to calculate the values reported in the raw data files.
    • For column dew_point it is unclear which algorithm was used to calculate the values reported in the raw data files.
    • For column RH it is unclear how the relative humidity is reported in the individual raw data files: either measured at a single point in time during the reporting interval (for example, at the very end) or calculated as the arithmetic mean of all measurements, or of a subset of measurements, in that interval.
    • For columns atmpressure_pa and atmpressureh_pa it is unclear how the atmospheric pressure is reported in the individual raw data files: either measured at a single point in time during the reporting interval (for example, at the very end) or calculated as the arithmetic mean of all measurements, or of a subset of measurements, in that interval.
    • For data reported in columns atmpressure_sealevel_Pa and atmpressure_sealevel_hPa it is unclear which algorithm was used to calculate the values reported in the raw data files.
    • For data reported in column atmpressure_grndlevel_hPa it is unclear what is reported in the raw data file.
    • For data reported in column rain_counter it is unclear what is reported in the raw data file.
    • For data reported in column rain_max it is unclear what is reported in the raw data file.
    • For data reported in column valid_ticks it is unclear what is reported in the raw data file.
    • For columns wind_speed_ms and wind_speed_kmh it is unclear how the wind speed is reported in the individual raw data files: either measured at a single point in time during the reporting interval (for example, at the very end) or calculated as the arithmetic mean of all measurements, or of a subset of measurements, in that interval.
    • For columns wind_gust_ms and wind_gust_kmh it is unclear how the wind gust is reported in the individual raw data files: either measured at a single point in time during the reporting interval (for example, at the very end) or calculated as the arithmetic mean of all measurements, or of a subset of measurements, in that interval.
    • For column wind_deg it is unclear how the wind direction is reported in the individual raw data files: either measured at a single point in time during the reporting interval (for example, at the very end) or calculated as the arithmetic mean of all measurements, or of a subset of measurements, in that interval.
    • For column irradiance it is unclear how the solar irradiance is reported in the individual raw data files: either measured at a single point in time during the reporting interval (for example, at the very end) or calculated as the arithmetic mean of all measurements, or of a subset of measurements, in that interval.
    • For column energy density it is unclear how the rate of solar radiation is reported in the individual raw data files: either measured at a single point in time during the reporting interval (for example, at the very end) or calculated as the arithmetic mean of all measurements, or of a subset of measurements, in that interval.
    • For data reported in column clouds it is unclear what is reported in the raw data file.
    • For data reported in column visibility it is unclear what is reported in the raw data file.
    • For data reported in column carbon_dioxide it is unclear what is reported in the raw data file. Since this column does not contain data in either of the two tables, it could also be deleted.
    • For data reported in column payload it is unclear what is reported in the raw data file.
    • For data reported in column time_sync_error_s it is unclear what is reported in the raw data file.
    • For data reported in column seq_number_modem it is unclear what is reported in the raw data file.
    • For data reported in column seq_number_firmware it is unclear what is reported in the raw data file. Since this column does not contain data in either of the two tables, it could also be deleted.
    • For data reported in column temperature_wetbulb_stull2011_C it is unclear what is reported in the raw data file.
  3. The raw data files acquired in 2021 contain:
    • 8 quadruplicate records for the same date and time in raw data files from the device in Romania. These records must be revised by the data provider.
    • 76 triplicate records for the same date and time (3 x 23 records in raw data files from the device in Switzerland, 3 x 53 records in raw data files from the device in Romania). These records must be revised by the data provider.
    • 1763 duplicate records for the same date and time (2 x 293 records in raw data files from the device in Switzerland, 2 x 1470 records in a raw data file from the device in Romania). These records must be revised by the data provider.
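
Two of the issues listed above, temperature values > 100°C and repeated records for the same date and time, lend themselves to a simple automated check. The following is a minimal sketch, not the procedure used to produce the counts on this page. The temperature column names and the 100°C threshold are taken from the list above; the record keys `device` and `timestamp`, and both function names, are hypothetical.

```python
from collections import Counter

# Column names as listed under "Unresolved issues", item 1.
TEMPERATURE_COLUMNS = (
    "temperature",
    "temperature_min",
    "temperature_max",
    "feelslike",
    "dew_point",
)
MAX_PLAUSIBLE_C = 100.0  # threshold quoted in item 1


def has_implausible_temperature(record, limit=MAX_PLAUSIBLE_C):
    """Return True if any temperature column of the record exceeds `limit`."""
    for column in TEMPERATURE_COLUMNS:
        value = record.get(column)
        if value is not None and value > limit:
            return True
    return False


def multiplicity_summary(records, key_fields=("device", "timestamp")):
    """Count how many keys occur 2, 3, 4, ... times.

    Returns a Counter mapping multiplicity -> number of distinct keys,
    for example {2: n_duplicates, 3: n_triplicates}. The key fields are
    assumed names; adapt them to the actual device and date/time columns.
    """
    per_key = Counter(tuple(r[f] for f in key_fields) for r in records)
    return Counter(n for n in per_key.values() if n > 1)


# Tiny hand-made sample, not real data from the table.
sample = [
    {"device": "RO", "timestamp": "2021-01-01T00:00", "temperature": 120.0},
    {"device": "RO", "timestamp": "2021-01-01T00:00", "temperature": 5.0},
    {"device": "CH", "timestamp": "2021-01-01T00:00", "temperature": 4.0},
]
flagged = [r for r in sample if has_implausible_temperature(r)]
summary = multiplicity_summary(sample)
```

Grouping on device plus date and time, rather than on date and time alone, avoids counting simultaneous readings from different stations as duplicates.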

References

  1. Dooremalen C. 2022 Bee Health Data Portal - Dataset. B-GOOD Bee Health Data Portal. [2024-10-15] beehealthdata.org

Properties

Dataset

Unique identifier: [BGDWT180.DTLLC375.0]
EUPH IRI: https://app.pollinatorhub.eu/dataset-discovery/parts/BGDWT180.DTLLC375.0
Table Type: File
Licence in: n/a
Columns: 55
Rows: 184,065
Data points: 10,123,575
Published: 2025-03-17
Metrics

Total views: 76
Total downloads: 11