Table: data 2020 all countries

[UID:BGDWT180.DTLLC377.0]
Data

Data Files

Main dataset files related to this table.

Columns

Description File Details
weather 2020_PREP_MR_241014.csv
n/a
File type: CSV - Comma seperated values
File size: 43.88 MiB

Supplemental Files

Any supplemental files, not containing data.

Columns

Description File Details
Licence
This file contains dataset licencing information.
This is a generated file.
Table Metadata
This file contains file column metadata in csv format.
This is a generated file.
About

Description

This table contains data from all weather stations in the different countries (BE, CH, DE, FR, NL, PT, RO, UK) from 2020

Table structure

n/a

Changes made to preparatory file

  1. Data reporting atmospheric pressure in the raw data files was either provided in Pa or in hPa. Values provided in hPa were transformed to Pa in order to store all values in a common unit while maintaining the raw data.
  2. Data reporting wind speed in the raw data files was either provided in m/s or in km/h. Values provided in km/h were transformed to m/s in order to store all values in a common unit while maintaining the raw data.
  3. Data reporting wind gust in the raw data files was either provided in m/s or in km/h. Values provided in km/h were transformed to m/s in order to store all values in a common unit while maintaining the raw data.

Changes made to data in table

No changes were made in the data file.

Unresolved issues

  1. In columns temperature, temperature_min, temperature_max, feelslike and dew_point obtained from the device located in Romania 21.694 records contain values > 100°C. These records must be revised by the data provider.
  2. In 3 records from the device in Belgium values are out of range. These records must be revised by the data provider.
  3. The description of the data (metadata) is largely incomplete and allows no clear standardisation of the data.
    • For column temperature it is unclear, how the temperature is reported in the single raw data files (either measured at any time during this interval (for example, at the very end) or calculated as arithmetic mean from all measurements or from a subset of measurements in this interval).
    • For column feelslike it is unclear which algorithm was used to calculate the values reported in the raw data files.
    • For column dewpoint it is unclear which algorithm was used to calculate the values reported in the raw data files.
    • For column RH it is unclear, how the relative humidity is reported in the single raw data files (either measured at any time during this interval (for example, at the very end) or calculated as arithmetic mean from all measurements or from a subset of measurements in this interval).
    • For column atmpressure_pa and atmpressureh_pa it is unclear, how the atmospheric pressure is reported in the single raw data files (either measured at any time during this interval (for example, at the very end) or calculated as arithmetic mean from all measurements or from a subset of measurements in this interval).
    • For data reported in column atmpressure_sealevel_Pa and atmpressure_sealevel_hPa it is unclear which algorithm was used to calculate the values reported in the raw data files.
    • For data reported in column atmpressure_grndlevel_hPa it is unclear what is reported in the ray data file.
    • For data reported in column rain_counter it is unclear what is reported in the raw data file.
    • For data reported in column rain_max it is unclear what is reported in the raw data file.
    • For data reported in column valid_ticks it is unclear what is reported in the raw data file.
    • For column wind_speed_ms and wind_speed_kmh it is unclear, how the wind speed is reported in the single raw data files (either measured at any time during this interval (for example, at the very end) or calculated as arithmetic mean from all measurements or from a subset of measurements in this interval).
    • For column wind_gust_ms and wind_gust_kmh it is unclear, how the wind gust is reported in the single raw data files (either measured at any time during this interval (for example, at the very end) or calculated as arithmetic mean from all measurements or from a subset of measurements in this interval).
    • For column wind_deg it is unclear, how the wind direction is reported in the single raw data files (either measured at any time during this interval (for example, at the very end) or calculated as arithmetic mean from all measurements or from a subset of measurements in this interval).
    • For column irradiance it is unclear, how the solar irradiance is reported in the single raw data files (either measured at any time during this interval (for example, at the very end) or calculated as arithmetic mean from all measurements or from a subset of measurements in this interval).
    • For column energy density it is unclear, how the rate of solar radiation is reported in the single raw data files (either measured at any time during this interval (for example, at the very end) or calculated as arithmetic mean from all measurements or from a subset of measurements in this interval).
    • For data reported in column clouds it is unclear what is reported in the raw data file.
    • For data reported in column visibility it is unclear what is reported in the raw data file.
    • For data reported in column carbon_dioxide it is unclear what is reported in the raw data file. Since this column does not contain data in either of the two tables, it could also be deleted.
    • For data reported in column payload it is unclear what is reported in the raw data file.
    • For data reported in column time_sync_error_s it is unclear what is reported in the raw data file.
    • For data reported in column seq_number_modem it is unclear what is reported in the raw data file.
    • For data reported in column seq_number_firmware it is unclear what is reported in the raw data file. Since this column does not contain data in either of the two tables, it could also be deleted.
    • For data reported in column temperature_wetbulb_stull2011_C it is unclear what is reported in the raw data file.
  4. Data in raw data files acquired in 2020 contain:
    • 9 triplicate records for the same date and time (3 x 8 records in raw data files from the device in Switzerland, 3 x 1 record in a raw data file from the device in Romania). These records must be revised by the data provider.
    • 413 duplicate records for the same date and time (2 x 272 records in raw data files from the device in Switzerland, 2 x 141 records in a raw data file from the device in Romania). These records must be revised by the data provider.

References

  1. Dooremalen C. 2022 Bee Health Data Portal - Dataset. B-GOOD Bee Health Data Portal. [2024-10-15] beehealthdata.org
Metadata

Column Metadata

Columns

Descriptive Measures

Column profiling

Columns

Quality Measures

Data quality measures

Columns

Profiling Charts
Samples

Data samples

Properties

Dataset

Unique identifier

[BGDWT180.DTLLC377.0]

EUPH IRI

https://app.pollinatorhub.eu/dataset-discovery/parts/BGDWT180.DTLLC377.0

Table Type

File

Licence in

n/a

Columns

55

Rows

145,838

Data points

8,021,090

Published

2025-03-17
Share
Metrics

Total views

53

Total downloads

5