EU Pollinator Hub BEELIFE EUROPEAN
BEEKEEPING COORDINATION

Avenue Louise 209/7, 1050 Brussels, Belgium
info@pollinatorhub.eu • www.pollinatorhub.eu • +32 (0) 486 973 920
Dataset Report
UID: BGDWT180.0.0
Name: B-GOOD Weather Data
Title: Dataset from the B-GOOD project, containing the relevant weather data at the study sites.
Status: Approved
Version: v. 1.0
Date: 2024-11-21
Author: Rubinigg Michael
Citation proposal:
Rubinigg M. 2024 Report of dataset B-GOOD Weather Data, v. 1.0 [BGDWT180.0.0]. EU Pollinator Hub. [2025-03-29] app.pollinatorhub.eu
Compliance with FAIR* principles
Findable
Accessible
Interoperable
Reusable
See https://www.go-fair.org/fair-principles for more information about FAIR principles
Data Quality
Requires major revision
This document is intended for use by collaborators of the EU Pollinator Hub and may be passed on with the express permission of the leader of the consortium and for the purpose determined by the leader of the consortium.

Document History

Release

Version v. 1.0 released on 2025-03-29. Written by Rubinigg Michael. Reviewed by Rubinigg Michael.

Revision

Table 1. List of revisions made to the document. Identifier of revision (No); date of revision (Date); description of revision (Description); reason for revision (Reason).
No Date Description Reason
1 2025-03-29 10:03:14 Initial release. N.A.

Abbreviations

CSV
Comma-Separated Values
EU
European Union
EUPH
EU Pollinator Hub
WR
Stichting Wageningen Research (Wageningen Research foundation)

Executive Summary

Data overview:

The data was published by van Dooremalen C (WR) on the B-GOOD Bee Health Data Portal as part of the B-GOOD project (grant agreement 817622), funded under the EU Horizon 2020 Research and Innovation Programme. The dataset contains...

Data value:

The objectives of the B-GOOD project were: (1) Facilitate decision making for beekeepers and other stakeholders by establishing ready-to-use tools for operationalising the HSI; (2) Test, standardise and validate methods for measuring and reporting selected indicators affecting bee health; (3) Explore the various socio-economic and ecological factors beyond bee health; (4) Foster an EU community to collect and share knowledge related to honey bees and their environment; (5) Engender a lasting learning and innovation system (LIS); (6) Minimise the impact of biotic and abiotic stressors.

Data description:

n/a

Data application:

Currently, the data integrated from the B-GOOD Bee Health Data Portal contains major issues and does not comply with the FAIR Guiding Principles for scientific data management and stewardship applied on the EU Pollinator Hub. More descriptive information about the context, quality and condition, or characteristics of the data (e.g. protocols, measurement devices used, units of the captured data, or any other details about the study) must be provided. More metadata in the form of accurate and relevant attributes must also be provided (e.g. a description of the scope of the data, any particularities or limitations that other users should be aware of, the date of generation or collection of the data, the lab conditions, who prepared the data, the parameter settings, the name and version of the software used, whether the data is raw or processed, and an explanation of all variable names that are not self-explanatory). The dataset requires major revisions by the data provider.

Introduction

n/a

Material and Methods

Data Acquisition

All raw data files were downloaded from the B-GOOD Bee Health Data Portal on 2024-10-11.

List of raw data obtained from the data provider.

  1. Archive weather-data-2020.zip accessed on 2024-10-11 06:33:06, provided by B-GOOD Bee Health Data Portal
  2. Archive weather-data-2021.zip accessed on 2024-10-11 06:33:06, provided by B-GOOD Bee Health Data Portal
  3. File meta-data-weather-tier-1-b-good.xlsx accessed on 2024-10-11 06:33:06, provided by B-GOOD Bee Health Data Portal

Metadata was obtained from the dataset's web page.

Table 2. List of raw data and metadata files included in the dataset. Identifier of table row (No); name of the file (File); the type of the file (Type); file contains data (D); file contains metadata (M); date of upload of the file to the EU Pollinator Hub (Arrival); number of data points contained within the file (if applicable); uploaded file size.
No File Type D M Arrival Data points File size
1 weather 2021_PREP_MR_241014.csv CSV - Comma-separated values Yes No 2024-10-15 16:10:17 10,123,575 54.66 MiB
2 weather 2020_PREP_MR_241014.csv CSV - Comma-separated values Yes No 2024-10-15 16:10:12 8,021,090 43.88 MiB
3 meta-data-weather-tier-1-b-good.xlsx Miscellaneous No Yes 2024-10-15 18:10:38 n/a 186.62 KiB

Data Preparation

All files in the zip-archives were extracted using File Explorer (Microsoft Corporation, version 22H2).

Each raw data file was imported into MS Excel (Microsoft Corporation, version 2409), where a first assessment of the existing data was made. Based on this assessment, a data mapping file was constructed in which each column in the raw data files was assigned to a column with a common name (header), definition, unit and data type, which applied to the presumed content of each single column in the raw data files. The metadata file meta-data-weather-tier-1-b-good.xlsx was used as a guideline. Subsequently, each data column header in the raw data file was substituted by the relevant common column header.
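The header substitution described above could be sketched as follows. This is a minimal illustration only: the report does not list the raw column headers, so the raw names in HEADER_MAP and the helper rename_headers are hypothetical, and the actual mapping was done in MS Excel rather than in code.

```python
import csv
import io

# Hypothetical mapping from raw headers to common headers. The raw names
# are invented for illustration and are not taken from the raw data files.
HEADER_MAP = {
    "Temp (C)": "temperature",
    "Humidity": "RH",
    "Pressure [hPa]": "atmpressure_hpa",
}


def rename_headers(raw_csv_text, header_map):
    """Return CSV text whose header row uses the common column names.

    Headers without an entry in header_map are kept unchanged.
    """
    rows = list(csv.reader(io.StringIO(raw_csv_text)))
    if not rows:
        return raw_csv_text
    rows[0] = [header_map.get(name, name) for name in rows[0]]
    out = io.StringIO()
    csv.writer(out, lineterminator="\n").writerows(rows)
    return out.getvalue()


sample = "Temp (C),Humidity,station\n12.5,80,A\n"
renamed = rename_headers(sample, HEADER_MAP)
print(renamed)
```

Keeping the mapping in a single dictionary makes it easy to review each raw header against the common name, definition and unit recorded in the data mapping file.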

All processed raw data files were then exported from MS Excel in CSV format (UTF-8 encoding) and imported into an SQL database (MariaDB Foundation, server version 10.4.32) running in an XAMPP environment (BitRock, version 5.2.1). Depending on the year of data acquisition, the records were then divided into one table that contained only data from 2020 and one table that contained data from 2021 (including 2 records from 2022). Each table was then exported to the preparatory files weather 2020_PREP_MR_241014.csv and weather 2021_PREP_MR_241014.csv, respectively, which were subsequently imported into the EU Pollinator Hub.

Data was then exported to the respective preparatory files and uploaded to the EU Pollinator Hub according to SOP-017 (Dataset integration).
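The division of records by year of acquisition could look roughly like the sketch below. It assumes that each record carries a datetime string in the form YYYY-MM-DD HH:MM:SS and that records dated after 2021 were kept in the 2021 table. The report states only that the 2021 table includes 2 records from 2022, not the rule that placed them there, so the boundary_year logic is an assumption. The function name split_by_year is hypothetical.

```python
from collections import defaultdict
from datetime import datetime


def split_by_year(records, boundary_year=2021):
    """Group records into tables keyed by year of acquisition.

    Assumption: records dated after boundary_year are assigned to the
    boundary_year table, mirroring a 2021 table that also holds a few
    records from 2022.
    """
    tables = defaultdict(list)
    for record in records:
        year = datetime.strptime(record["datetime"], "%Y-%m-%d %H:%M:%S").year
        tables[min(year, boundary_year)].append(record)
    return dict(tables)


records = [
    {"datetime": "2020-06-01 12:00:00", "country": "NL"},
    {"datetime": "2021-02-01 02:00:00", "country": "BE"},
    {"datetime": "2022-01-01 00:00:00", "country": "RO"},
]
tables = split_by_year(records)
print({year: len(rows) for year, rows in tables.items()})
```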

Data Validation

No data validation was performed.

Data Analysis

No data analysis was performed.

Data Description

Dataset

Table 3. Summary of tables belonging to the dataset. Table row identifier (No); name of the table (Table); description of the table (Description).
No Table Description
1 data 2021 all countries This table contains data from all weather stations in the different countries (BE, CH, DE, FR, NL, PT, RO, UK)…
2 data 2020 all countries This table contains data from all weather stations in the different countries (BE, CH, DE, FR, NL, PT, RO, UK)…
Table 4. Standardised metadata of the dataset. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
UID BGDWT180.0.0
Name B-GOOD Weather Data
Title Dataset from the B-GOOD project, containing the relevant weather data at the study sites.
IRI https://app.pollinatorhub.eu/dataset-discovery/BGDWT180.0.0
Licence CC BY-NC-ND 4.0
DOI n/a
Creation date 2024-10-10
Publishing date 2025-03-17
Contact information n/a
Keywords Apis mellifera, honey bee, weather
Data collection years n/a
Regions, the data was collected in Belgique/België, Deutschland, France, Nederland, Portugal, România, Schweiz/Suisse/Svizzera, United Kingdom
Description

The dataset contains data from weather stations in the vicinity of the test apiaries located in Belgium, Switzerland, Germany, France, the Netherlands, Portugal, Romania and the United Kingdom in 2020 and 2021. It was published by van Dooremalen C (WR) on the B-GOOD Bee Health Data Portal as part of the B-GOOD project (grant agreement 817622), funded under the EU Horizon 2020 Research and Innovation Programme.

Table 5. Standardised metadata of the data provider B-GOOD Bee Health Data Portal. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Name B-GOOD Bee Health Data Portal
URL
Acronym B-GOOD
IRI https://app.pollinatorhub.eu/data-providers/b-good-bee-health-data-portal
Address https://b-good-project.eu
Country Belgium
Contact information b-good-project.eu
Description

Project funded by the EU Horizon 2020 Research and Innovation Programme under grant agreement No 817622. Project website: https://b-good-project.eu

Tables

data 2021 all countries

Table 6. Standardised metadata of the table. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
UID BGDWT180.DTLLC375.0
Name data 2021 all countries
IRI https://app.pollinatorhub.eu/dataset-discovery/parts/BGDWT180.DTLLC375.0
Type File
Licence CC BY-NC-ND 4.0
Description

Table data 2021 all countries contains 184,065 records (54.67 MB) from 9 distinct raw data files, 8 distinct countries and 4 distinct device types assessing different parameters in different units and with different methods.

Metadata

n/a
Table 7. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Name Description Data type Descriptor Unit
raw_data

Name of the raw data file, obtained from the data provider.

String Text [0.0.TEXTA315]

n/a

country

NUTS level 0 code of the location in which the weather station was based.

String nuts2021Code [0.0.NTSCD55]

n/a

device_type

Type/model/name of the device/system that was used to acquire the data.

String Text [0.0.TEXTA315]

n/a

DeviceID

Identifier of the device/system that was used to acquire the data.

String applianceID [0.0.PPLNC488]

n/a

unix_time

Date and time in Unix time format, presumably giving the time at which data has been transmitted.

Integer number unixTime [0.0.NXTME469]

n/a

timezone

Offset from UTC of the local time at the location of the device, given in seconds.

String utcOffset [0.0.TCFFS470]

s

datetime

Date and time in ISO 8601 format, presumably giving the time at which data has been transmitted.

Date and Time calendarDateAndTime [0.0.DTNDT319]

n/a

year

Calendar year, presumably giving the time at which data has been transmitted.

Integer number year [0.0.YEARA340]

year

month

Calendar month, presumably giving the time at which data has been transmitted.

Integer number calendarMonth [0.0.CLNDR376]

n/a

day

Calendar day of month, presumably giving the time at which data has been transmitted.

Integer number day [0.0.DAYAB382]

n/a

hour

Clock hour, presumably giving the time at which data has been transmitted.

Integer number clock hour [0.0.HRFDY386]

n/a

location_name

Name of the location in which the device/system is located.

String Text [0.0.TEXTA315]

n/a

location_lat

Geographic latitude of the location in which the device/system is located, given in WGS84 format in decimal degrees.

Decimal number decimalLatitude [0.0.LTTDE333]

°

location_long

Geographic longitude of the location in which the device/system is located, given in WGS84 format in decimal degrees.

Decimal number decimalLongitude [0.0.LNGTD332]

°

temperature

Not sufficiently specified by the data provider. Temperature measured in the relevant interval, either at any time during this interval (for example, at the very end) or calculated as arithmetic mean from all measurements or from a subset of measurements in this interval.

Decimal number temperature [0.0.TMPRT394]

°C

temperature_min

Minimum temperature measured in the relevant interval.

Decimal number temperature [0.0.TMPRT394]

°C

temperature_max

Maximum temperature measured in the relevant interval.

Decimal number temperature [0.0.TMPRT394]

°C

feels_like

Temperature adjusted for the human perception of weather (apparent temperature), reported for the relevant interval.

Decimal number temperature [0.0.TMPRT394]

°C

dew_point

Temperature of dew point, measured in the relevant interval.

Decimal number temperature [0.0.TMPRT394]

°C

RH

Not sufficiently specified by the data provider. Relative humidity measured in the relevant interval, either at any time during this interval (for example, at the very end) or calculated as arithmetic mean from all measurements or from a subset of measurements in this interval.

Decimal number relativeHumidity [0.0.RLTVH395]

%

atmpressure_pa

Not sufficiently specified by the data provider. Atmospheric pressure measured in the relevant interval, either at any time during this interval (for example, at the very end) or calculated as arithmetic mean from all measurements or from a subset of measurements in this interval. Measured in or converted to Pa.

Decimal number atmosphericPressure [0.0.TMSPH396]

Pa

atmpressure_hpa

Not sufficiently specified by the data provider. Atmospheric pressure measured in the relevant interval, either at any time during this interval (for example, at the very end) or calculated as arithmetic mean from all measurements or from a subset of measurements in this interval. Measured in hPa.

Decimal number atmosphericPressure [0.0.TMSPH396]

hPa

atmpressure_sealevel_Pa

Not sufficiently specified by the data provider. Equivalent sea level atmospheric pressure, presumably based on the atmospheric pressure given in column atmpressure_pa. Expressed in Pa.

Decimal number atmosphericPressure [0.0.TMSPH396]

Pa

atmpressure_sealevel_hPa

Equivalent sea level atmospheric pressure, presumably based on the atmospheric pressure given in column atmpressure_hpa. Expressed in hPa.

Decimal number atmosphericPressure [0.0.TMSPH396]

hPa

atmpressure_grndlevel

Not sufficiently specified by the data provider. Ground level atmospheric pressure, presumably based on the atmospheric pressure given in column atmpressure_hpa. Expressed in hPa.

Decimal number atmosphericPressure [0.0.TMSPH396]

hPa

rain

Not sufficiently specified by the data provider. Cumulative rainfall, presumably measured in the relevant measurement interval as the sum of all measurements.

Decimal number rainfall [0.0.RNFLL471]

mm

rain_intval

Duration of the interval for which the rainfall in column rain is reported.

Integer number Integer [0.0.NTGER313]

min

rain_3h

Cumulative rainfall, measured in the three hours preceding and including the relevant interval.

Decimal number rainfall [0.0.RNFLL471]

mm

snow

Not sufficiently specified by the data provider. Cumulative snowfall, presumably measured in the relevant measurement interval as the sum of all measurements.

Decimal number snowfall [0.0.SNWFL472]

mm

snow_3h

Cumulative snowfall, measured in the three hours preceding and including the relevant interval.

Decimal number snowfall [0.0.SNWFL472]

mm

rain_counter

No information is provided on this parameter.

Decimal number DecimalNumber [0.0.DCMLN314]

n/a

rain_max

No information is provided on this parameter; presumably the maximum rainfall intensity in the relevant interval, given in mm/h.

Decimal number rainfallIntensity [0.0.RNFLL473]

mm h-1

rain_gauge

Rain gauge resolution, defined as the minimum amount of rain that a rain gauge can register, given in mm.

Decimal number DecimalNumber [0.0.DCMLN314]

mm

valid_ticks

No information is provided on this parameter.

Integer number Integer [0.0.NTGER313]

n/a

wind_speed_ms

Not sufficiently specified by the data provider. Wind speed measured in the relevant interval, either at any time during this interval (for example, at the very end) for a period of 2 min or calculated as arithmetic mean from all measurements or from a subset of measurements in this interval. Given in or transformed to m/s.

Decimal number windSpeed [0.0.WNDSP474]

m s-1

wind_speed_kmh

Not sufficiently specified by the data provider. Wind speed measured in the relevant interval, either at any time during this interval (for example, at the very end) for a period of 2 min or calculated as arithmetic mean from all measurements or from a subset of measurements in this interval. Given in km/h.

Decimal number windSpeed [0.0.WNDSP474]

km h-1

wind_deg

Not sufficiently specified by the data provider. Wind direction measured in the relevant interval, either at any time during this interval (for example, at the very end) or calculated as arithmetic mean from all measurements or from a subset of measurements in this interval. Given in degrees.

Integer number windDirection [0.0.WNDDR475]

°

wind_gust_ms

Wind gust measured in the relevant interval. Given in or transformed to m/s.

Decimal number windGust [0.0.WNDGS476]

m s-1

wind_gust_kmh

Wind gust measured in the relevant interval. Given in km/h.

Decimal number windGust [0.0.WNDGS476]

km h-1

irradiance

Not sufficiently specified by the data provider. Solar irradiance measured in the relevant interval, either at any time during this interval (for example, at the very end) or calculated as arithmetic mean from all measurements or from a subset of measurements in this interval. No information is provided on the range of wavelengths that are measured.

Decimal number solarIrradiance [0.0.SLRRR477]

W m-2

energy_density

Not sufficiently specified by the data provider. Rate of solar radiation measured in the relevant interval, either at any time during this interval (for example, at the very end) or calculated as arithmetic mean from all measurements or from a subset of measurements in this interval. No information is provided on the range of wavelengths that are measured.

Decimal number rateOfSolarRadiation [0.0.RTFSL478]

J cm-2

irradiance_max

Maximum solar irradiance measured in the relevant interval.

Decimal number solarIrradiance [0.0.SLRRR477]

W m-2

clouds

No information is provided on this parameter; presumably the fraction of the sky that is covered with clouds.

Integer number Integer [0.0.NTGER313]

%

visibility

No information is provided on this parameter; presumably the visibility in the atmosphere for the human eye, given in m.

Integer number Integer [0.0.NTGER313]

m

carbon_dioxide

No information is provided on this parameter.

Integer number Integer [0.0.NTGER313]

n/a

weather_id

Internal weather condition code adopted by the provider of the weather data.

Integer number Integer [0.0.NTGER313]

n/a

weather_main

Internal main group for the description of the weather adopted by the provider of the weather data.

String Text [0.0.TEXTA315]

n/a

weather_description

Internal subgroup for the description of the weather adopted by the provider of the weather data.

String Text [0.0.TEXTA315]

n/a

weather_icon

Internal code for icons describing the weather conditions, adopted by the provider of the weather data.

String Text [0.0.TEXTA315]

n/a

battery

Voltage of the batteries of the device/system.

Decimal number DecimalNumber [0.0.DCMLN314]

V

payload

No information is provided on this parameter.

String Text [0.0.TEXTA315]

n/a

time_sync_error_s

No information is provided on this parameter.

Integer number Integer [0.0.NTGER313]

s

seq_number_modem

No information is provided on this parameter.

Integer number Integer [0.0.NTGER313]

n/a

seq_number_firmware

No information is provided on this parameter.

Integer number Integer [0.0.NTGER313]

n/a

temperature_wetbulb_stull2011_C

No information is provided on this parameter.

Decimal number DecimalNumber [0.0.DCMLN314]

°C

Metadata of individual tables can be found in Annex 1.
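The relationship between the columns unix_time, timezone (UTC offset in seconds) and datetime could be checked with a sketch like the one below. The report does not state whether datetime holds UTC or local time, so the helper formats a timestamp at any given offset and leaves that choice to the caller. The function name to_iso is hypothetical; the two example timestamps are the minimum and maximum of unix_time reported in Table 8.

```python
from datetime import datetime, timedelta, timezone


def to_iso(unix_time, utc_offset_s=0):
    """Format a Unix timestamp as 'YYYY-MM-DD HH:MM:SS' at a fixed UTC offset.

    utc_offset_s is the offset from UTC in seconds, as in column timezone.
    """
    tz = timezone(timedelta(seconds=utc_offset_s))
    return datetime.fromtimestamp(unix_time, tz).strftime("%Y-%m-%d %H:%M:%S")


# Minimum unix_time in the 2021 table, shown in UTC.
print(to_iso(1609459200))
# Maximum unix_time in the 2021 table, shown with a +3600 s offset.
print(to_iso(1640991600, 3600))
```

Comparing the output with the datetime column for records that carry both values would show whether datetime was stored in UTC or in local time, which the metadata currently leaves open.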

Descriptive Measures

Table 8. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Name Length Mean Min Q1 Median Q3 Max Total Missing Zero Blank Distinct
raw_data 24 - 45 n/a BE_weather d… n/a n/a n/a RO_weather d… 184,065 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 9 ( 0.0% )
country 2 - 2 n/a BE n/a n/a n/a UK 184,065 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 8 ( 0.0% )
device_type 8 - 18 n/a Climatik n/a n/a n/a Weatherhelix 184,065 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 4 ( 0.0% )
DeviceID 3 - 3 176.1 164 164 182 182 219 184,065 70,705 ( 38.4% ) 0 ( 0.0% ) 0 ( 0.0% ) 5 ( 0.0% )
unix_time 10 - 10 1,625,279,669.9 1,609,459,200 1,617,282,000 1,625,288,400 1,633,305,600 1,640,991,600 184,065 157,111 ( 85.4% ) 0 ( 0.0% ) 0 ( 0.0% ) 8,761 ( 4.8% )
timezone 1 - 5 5,754.5 0 3,600 7,200 7,200 10,800 184,065 157,111 ( 85.4% ) 3,552 ( 1.9% ) 0 ( 0.0% ) 5 ( 0.0% )
datetime 19 - 19 n/a 2021-01-01 0… n/a n/a n/a 2022-01-01 0… 184,065 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 69,084 ( 37.5% )
year 4 - 4 2,021.0 2,021 2,021 2,021 2,021 2,021 184,065 175,305 ( 95.2% ) 0 ( 0.0% ) 0 ( 0.0% ) 2 ( 0.0% )
month 1 - 2 6.5 1 4 7 10 12 184,065 175,305 ( 95.2% ) 0 ( 0.0% ) 0 ( 0.0% ) 13 ( 0.0% )
day 1 - 2 15.7 1 8 16 23 31 184,065 175,305 ( 95.2% ) 0 ( 0.0% ) 0 ( 0.0% ) 32 ( 0.0% )
hour 1 - 2 12.5 1 6.25 12.5 18.75 24 184,065 175,305 ( 95.2% ) 0 ( 0.0% ) 0 ( 0.0% ) 25 ( 0.0% )
location_name 0 - 15 n/a 84007004 n/a n/a n/a UCLUJ 184,065 148,351 ( 80.6% ) 0 ( 0.0% ) 0 ( 0.0% ) 4 ( 0.0% )
location_lat 9 - 9 48.8408064 46.759188 46.759188 46.967707 52.947616 52.947616 184,065 157,111 ( 85.4% ) 0 ( 0.0% ) 0 ( 0.0% ) 4 ( 0.0% )
location_long 8 - 9 10.1038098 -1.068273 -1.068273 7.399013 23.570373 23.570373 184,065 157,111 ( 85.4% ) 0 ( 0.0% ) 0 ( 0.0% ) 4 ( 0.0% )
temperature 1 - 6 29.957 -16 5.6 12.28 19.1 306.62 184,065 51,088 ( 27.8% ) 4,716 ( 2.6% ) 0 ( 0.0% ) 5,949 ( 3.2% )
temperature_min 1 - 22 27.400 -16.1 4.1 10.4 17.1 305.5 184,065 43,751 ( 23.8% ) 4,807 ( 2.6% ) 0 ( 0.0% ) 4,185 ( 2.3% )
temperature_max 1 - 22 28.123 -15.8 4.8 11 17.6 309.72 184,065 43,751 ( 23.8% ) 4,729 ( 2.6% ) 0 ( 0.0% ) 4,202 ( 2.3% )
feels_like 1 - 6 100.874 -14.04 5.84 15.09 273.52 307.69 184,065 157,111 ( 85.4% ) 11 ( 0.0% ) 0 ( 0.0% ) 6,391 ( 3.5% )
dew_point 1 - 6 24.514 -16.14 0 0.1 10.4 295.86 184,065 63,485 ( 34.5% ) 51,174 ( 27.8% ) 0 ( 0.0% ) 5,643 ( 3.1% )
RH 1 - 4 76.44 0 67 83 92 100 184,065 0 ( 0.0% ) 4,536 ( 2.5% ) 0 ( 0.0% ) 799 ( 0.4% )
atmpressure_pa 1 - 9 98,166.113 0 100,400 101,300 102,190 120,060 184,065 43,751 ( 23.8% ) 4,400 ( 2.4% ) 0 ( 0.0% ) 8,123 ( 4.4% )
atmpressure_hpa 1 - 9 411.51953 0 0 0 1,016.78545 1,094 184,065 98,476 ( 53.5% ) 51,088 ( 27.8% ) 0 ( 0.0% ) 6,602 ( 3.6% )
atmpressure_sealevel_Pa 1 - 5 10,867.4 0 0 0 0 99,640 184,065 116,331 ( 63.2% ) 60,187 ( 32.7% ) 0 ( 0.0% ) 855 ( 0.5% )
atmpressure_sealevel_hPa 1 - 6 125.538 0 0 0 0 996.4 184,065 125,430 ( 68.1% ) 51,088 ( 27.8% ) 0 ( 0.0% ) 855 ( 0.5% )
atmpressure_grndlevel 1 - 1 0.0 0 0 0 0 0 184,065 174,966 ( 95.1% ) 9,099 ( 4.9% ) 0 ( 0.0% ) 2 ( 0.0% )
rain 1 - 5 65.435 0 0 24 133 255 184,065 15,196 ( 8.3% ) 58,461 ( 31.8% ) 0 ( 0.0% ) 584 ( 0.3% )
rain_intval 2 - 2 29.2 10 10 10 60 60 184,065 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 2 ( 0.0% )
rain_3h 1 - 1 0.0 0 0 0 0 0 184,065 174,966 ( 95.1% ) 9,099 ( 4.9% ) 0 ( 0.0% ) 2 ( 0.0% )
snow 1 - 5 0.026 0 0 0 0 23.37 184,065 174,640 ( 94.9% ) 8,809 ( 4.8% ) 0 ( 0.0% ) 91 ( 0.0% )
snow_3h 1 - 1 0.0 0 0 0 0 0 184,065 174,966 ( 95.1% ) 9,099 ( 4.9% ) 0 ( 0.0% ) 2 ( 0.0% )
rain_counter 1 - 1 0.0 0 0 0 0 0 184,065 78,252 ( 42.5% ) 105,813 ( 57.5% ) 0 ( 0.0% ) 2 ( 0.0% )
rain_max 1 - 1 0.0 0 0 0 0 0 184,065 70,705 ( 38.4% ) 113,360 ( 61.6% ) 0 ( 0.0% ) 2 ( 0.0% )
rain_gauge 1 - 3 0.19 0 0.2 0.2 0.2 0.2 184,065 78,252 ( 42.5% ) 5,653 ( 3.1% ) 0 ( 0.0% ) 3 ( 0.0% )
valid_ticks 1 - 1 0.0 0 0 0 0 0 184,065 78,252 ( 42.5% ) 105,813 ( 57.5% ) 0 ( 0.0% ) 2 ( 0.0% )
wind_speed_ms 1 - 5 1.197 0 0 0 2.06 32.4 184,065 122,120 ( 66.3% ) 34,953 ( 19.0% ) 0 ( 0.0% ) 968 ( 0.5% )
wind_speed_kmh 1 - 1 0.0 0 0 0 0 9 184,065 149,074 ( 81.0% ) 34,208 ( 18.6% ) 0 ( 0.0% ) 7 ( 0.0% )
wind_deg 1 - 3 131.1 0 59 115 189 360 184,065 122,120 ( 66.3% ) 2,904 ( 1.6% ) 0 ( 0.0% ) 362 ( 0.2% )
wind_gust_ms 1 - 5 5.880 0 0 0.89 10.8 90 184,065 133,547 ( 72.6% ) 24,955 ( 13.6% ) 0 ( 0.0% ) 158 ( 0.1% )
wind_gust_kmh 1 - 2 2.2 0 0 1 4 25 184,065 149,074 ( 81.0% ) 15,788 ( 8.6% ) 0 ( 0.0% ) 18 ( 0.0% )
irradiance 1 - 4 101.3 0 0 2 98 1,152 184,065 35,714 ( 19.4% ) 46,186 ( 25.1% ) 0 ( 0.0% ) 1,077 ( 0.6% )
energy_density 1 - 3 65.8 0 0 3 113 371 184,065 175,305 ( 95.2% ) 4,110 ( 2.2% ) 0 ( 0.0% ) 361 ( 0.2% )
irradiance_max 1 - 4 112.9 0 0 2 106 1,394 184,065 70,705 ( 38.4% ) 29,147 ( 15.8% ) 0 ( 0.0% ) 641 ( 0.3% )
clouds 1 - 3 52.6 0 20 75 90 100 184,065 157,111 ( 85.4% ) 5,471 ( 3.0% ) 0 ( 0.0% ) 102 ( 0.1% )
visibility 1 - 5 8,692.5 0 9,999 10,000 10,000 10,000 184,065 162,812 ( 88.5% ) 565 ( 0.3% ) 0 ( 0.0% ) 66 ( 0.0% )
carbon_dioxide 0 - 0 n/a n/a n/a n/a 184,065 184,065 ( 100.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 1 ( 0.0% )
weather_id 3 - 3 730.1 200 701 800 803 804 184,065 157,111 ( 85.4% ) 0 ( 0.0% ) 0 ( 0.0% ) 31 ( 0.0% )
weather_main 0 - 12 n/a Clear n/a n/a n/a Thunderstorm 184,065 157,111 ( 85.4% ) 0 ( 0.0% ) 0 ( 0.0% ) 11 ( 0.0% )
weather_description 0 - 28 n/a broken cloud… n/a n/a n/a very heavy r… 184,065 157,111 ( 85.4% ) 0 ( 0.0% ) 0 ( 0.0% ) 32 ( 0.0% )
weather_icon 0 - 3 n/a 01d n/a n/a n/a 50n 184,065 157,111 ( 85.4% ) 0 ( 0.0% ) 0 ( 0.0% ) 19 ( 0.0% )
battery 1 - 4 4.121 3.25 4.15 4.15 4.15 4.25 184,065 70,705 ( 38.4% ) 0 ( 0.0% ) 0 ( 0.0% ) 22 ( 0.0% )
payload 0 - 23 n/a x4aea4003933… n/a n/a n/a x731e0005ff2… 184,065 78,252 ( 42.5% ) 0 ( 0.0% ) 0 ( 0.0% ) 100,133 ( 54.4% )
time_sync_error_s 1 - 4 17.3 -210 12 14 15 235 184,065 78,252 ( 42.5% ) 41 ( 0.0% ) 0 ( 0.0% ) 231 ( 0.1% )
seq_number_modem 1 - 5 2,116.5 0 1,089 2,091 3,107 10,822 184,065 78,252 ( 42.5% ) 26 ( 0.0% ) 0 ( 0.0% ) 5,263 ( 2.9% )
seq_number_firmware 0 - 0 n/a n/a n/a n/a 184,065 184,065 ( 100.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 1 ( 0.0% )
temperature_wetbulb_stull2011_C 1 - 6 0.005 -12.61 0 0 0 10.92 184,065 125,430 ( 68.1% ) 51,097 ( 27.8% ) 0 ( 0.0% ) 1,718 ( 0.9% )
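The counts reported in Table 8 (Total, Missing, Zero, Blank, Distinct) could be derived per column along the lines of the following sketch. It assumes that NULL is represented as None, that a blank value is a non-empty string consisting only of whitespace, and that Distinct counts NULL as one value, as the table caption states. The function name column_counts is hypothetical and the sample values are invented.

```python
def column_counts(values):
    """Return the Table 8 style counts for one column.

    Assumptions: None stands for NULL; a blank value is a non-empty string
    made up only of whitespace; distinct values include None.
    """
    total = len(values)
    missing = sum(v is None for v in values)
    zero = sum(isinstance(v, (int, float)) and v == 0 for v in values)
    blank = sum(isinstance(v, str) and v != "" and v.strip() == "" for v in values)
    distinct = len(set(values))

    def pct(n):
        # Percentage of all records, rounded to one decimal place.
        return round(100.0 * n / total, 1) if total else 0.0

    return {
        "total": total,
        "missing": (missing, pct(missing)),
        "zero": (zero, pct(zero)),
        "blank": (blank, pct(blank)),
        "distinct": (distinct, pct(distinct)),
    }


sample_column = [None, 0, 0.0, 1.5, 1.5, 2]
counts = column_counts(sample_column)
print(counts)
```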

Quality Measures

Table 9. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Name Completeness Uniqueness Most Common Value Least Common Value
raw_data
100.00%
0.00%
NL_weather data 2021.csv BE_weather data 2021.csv
country
100.00%
0.00%
NL BE
device_type
100.00%
0.00%
Weatherhelix Climatik
DeviceID
61.59%
0.00%
182 171
unix_time
14.64%
4.76%
1612144800 1609462800
timezone
14.64%
0.00%
7200 0
datetime
100.00%
37.53%
2021-02-01 02:00:00 2021-01-07 11:50:00
year
4.76%
0.00%
2021 2021
month
4.76%
0.01%
1 2
day
4.76%
0.02%
1 31
hour
4.76%
0.01%
1 1
location_name
19.40%
0.00%
n/a 84007004
location_lat
14.64%
0.00%
n/a 52.947616
location_long
14.64%
0.00%
n/a -1.068273
temperature
72.24%
3.23%
n/a -1.95
temperature_min
76.23%
2.27%
n/a 0.15
temperature_max
76.23%
2.28%
n/a 1.82
feels_like
14.64%
3.47%
n/a -4.17
dew_point
65.51%
3.07%
n/a -5.22
RH
100.00%
0.43%
0 14
atmpressure_pa
76.23%
4.41%
n/a 97555
atmpressure_hpa
46.50%
3.59%
n/a 1064
atmpressure_sealevel_Pa
36.80%
0.46%
0 95600
atmpressure_sealevel_hPa
31.86%
0.46%
n/a 956
atmpressure_grndlevel
4.94%
0.00%
0 0
rain
91.74%
0.32%
0 1.82
rain_intval
100.00%
0.00%
10 60
rain_3h
4.94%
0.00%
0 0
snow
5.12%
0.05%
n/a 0.94
snow_3h
4.94%
0.00%
0 0
rain_counter
57.49%
0.00%
0 0
rain_max
61.59%
0.00%
0 0
rain_gauge
57.49%
0.00%
0.2 0
valid_ticks
57.49%
0.00%
0 0
wind_speed_ms
33.65%
0.53%
n/a 12.07
wind_speed_kmh
19.01%
0.00%
0 6
wind_deg
33.65%
0.20%
0 339
wind_gust_ms
27.45%
0.09%
n/a 20.1
wind_gust_kmh
19.01%
0.01%
0 25
irradiance
80.60%
0.59%
0 1088
energy_density
4.76%
0.20%
0 348
irradiance_max
61.59%
0.35%
0 1198
clouds
14.64%
0.06%
75 70
visibility
11.55%
0.04%
10000 450
carbon_dioxide
0.00%
0.00%
n/a n/a
weather_id
14.64%
0.02%
800 721
weather_main
14.64%
0.01%
n/a Haze
weather_description
14.64%
0.02%
n/a haze
weather_icon
14.64%
0.01%
n/a 11n
battery
61.59%
0.01%
4.15 3.35
payload
57.49%
54.40%
n/a x7302010fb928f0020001ff
time_sync_error_s
57.49%
0.13%
14 -83
seq_number_modem
57.49%
2.86%
1275 4096
seq_number_firmware
0.00%
0.00%
n/a n/a
temperature_wetbulb_stull2011_C
31.86%
0.93%
n/a -5.12
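The report does not define completeness and uniqueness explicitly. Comparing Table 8 and Table 9 suggests that completeness is the share of records that are not NULL and that uniqueness is the number of distinct values divided by the number of records. The sketch below rests on that inferred reading; the function names are hypothetical, and the inputs are counts taken from Table 8.

```python
def completeness(total, missing):
    """Share of records with a non-NULL value, in percent (inferred definition)."""
    return 100.0 * (total - missing) / total


def uniqueness(total, distinct):
    """Distinct values relative to all records, in percent (inferred definition)."""
    return 100.0 * distinct / total


TOTAL = 184065  # records in table data 2021 all countries

# DeviceID: 70,705 missing values according to Table 8.
print(round(completeness(TOTAL, 70705), 2))
# unix_time: 8,761 distinct values according to Table 8.
print(round(uniqueness(TOTAL, 8761), 2))
```

Under this reading, the values computed for DeviceID and unix_time agree with the 61.59% completeness and 4.76% uniqueness listed in Table 9.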

Changes made to preparatory file

  1. Data reporting atmospheric pressure in the raw data files was either provided in Pa or in hPa. Values provided in hPa were transformed to Pa in order to store all values in a common unit while maintaining the raw data.
  2. Data reporting wind speed in the raw data files was either provided in m/s or in km/h. Values provided in km/h were transformed to m/s in order to store all values in a common unit while maintaining the raw data.
  3. Data reporting wind gust in the raw data files was either provided in m/s or in km/h. Values provided in km/h were transformed to m/s in order to store all values in a common unit while maintaining the raw data.
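The three conversions listed above amount to multiplying hPa by 100 to obtain Pa and dividing km/h by 3.6 to obtain m/s. A minimal sketch follows, assuming that the converted value is written to the Pa or m/s column only when that column is empty and that the original hPa or km/h value is left untouched. The report states only that the raw data was maintained, so this fill rule and the function name harmonise_units are assumptions.

```python
def harmonise_units(record):
    """Return a copy of record with Pa and m/s columns filled from hPa and km/h.

    Assumption: a target column is filled only when it is empty, and the
    source column keeps its raw value.
    """
    out = dict(record)
    if out.get("atmpressure_pa") is None and out.get("atmpressure_hpa") is not None:
        out["atmpressure_pa"] = out["atmpressure_hpa"] * 100.0  # 1 hPa = 100 Pa
    pairs = (
        ("wind_speed_ms", "wind_speed_kmh"),
        ("wind_gust_ms", "wind_gust_kmh"),
    )
    for ms_col, kmh_col in pairs:
        if out.get(ms_col) is None and out.get(kmh_col) is not None:
            out[ms_col] = out[kmh_col] / 3.6  # 1 m/s = 3.6 km/h
    return out


raw = {"atmpressure_hpa": 1013.0, "wind_speed_kmh": 9, "wind_gust_kmh": None}
converted = harmonise_units(raw)
print(converted)
```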

Changes made to data

No changes were made in the data file.

Unresolved issues

  1. In columns temperature, temperature_min, temperature_max, feels_like and dew_point, 9,095 records from the device located in Romania contain values > 100 °C. These records must be revised by the data provider.
  2. The description of the data (metadata) is largely incomplete and does not allow a clear standardisation of the data.
    • For column temperature it is unclear, how the temperature is reported in the single raw data files (either measured at any time during this interval (for example, at the very end) or calculated as arithmetic mean from all measurements or from a subset of measurements in this interval).
    • For column feels_like it is unclear which algorithm was used to calculate the values reported in the raw data files.
    • For column dew_point it is unclear which algorithm was used to calculate the values reported in the raw data files.
    • For column RH it is unclear, how the relative humidity is reported in the single raw data files (either measured at any time during this interval (for example, at the very end) or calculated as arithmetic mean from all measurements or from a subset of measurements in this interval).
    • For columns atmpressure_pa and atmpressure_hpa it is unclear, how the atmospheric pressure is reported in the single raw data files (either measured at any time during this interval (for example, at the very end) or calculated as arithmetic mean from all measurements or from a subset of measurements in this interval).
    • For data reported in columns atmpressure_sealevel_Pa and atmpressure_sealevel_hPa it is unclear which algorithm was used to calculate the values reported in the raw data files.
    • For data reported in column atmpressure_grndlevel it is unclear what is reported in the raw data file.
    • For data reported in column rain_counter it is unclear what is reported in the raw data file.
    • For data reported in column rain_max it is unclear what is reported in the raw data file.
    • For data reported in column valid_ticks it is unclear what is reported in the raw data file.
    • For column wind_speed_ms and wind_speed_kmh it is unclear, how the wind speed is reported in the single raw data files (either measured at any time during this interval (for example, at the very end) or calculated as arithmetic mean from all measurements or from a subset of measurements in this interval).
    • For column wind_gust_ms and wind_gust_kmh it is unclear, how the wind gust is reported in the single raw data files (either measured at any time during this interval (for example, at the very end) or calculated as arithmetic mean from all measurements or from a subset of measurements in this interval).
    • For column wind_deg it is unclear, how the wind direction is reported in the single raw data files (either measured at any time during this interval (for example, at the very end) or calculated as arithmetic mean from all measurements or from a subset of measurements in this interval).
    • For column irradiance it is unclear, how the solar irradiance is reported in the single raw data files (either measured at any time during this interval (for example, at the very end) or calculated as arithmetic mean from all measurements or from a subset of measurements in this interval).
    • For column energy density it is unclear, how the rate of solar radiation is reported in the single raw data files (either measured at any time during this interval (for example, at the very end) or calculated as arithmetic mean from all measurements or from a subset of measurements in this interval).
    • For data reported in column clouds is is unclear what is reported in the raw data file.
    • For data reported in column visibility is is unclear what is reported in the raw data file.
    • For data reported in column carbon_dioxide is is unclear what is reported in the raw data file. Since this column does not contain data in either of the two tables, it could also be deleted.
    • For data reported in column payload is is unclear what is reported in the raw data file.
    • For data reported in column time_sync_error_s is is unclear what is reported in the raw data file.
    • For data reported in column seq_number_modem is is unclear what is reported in the raw data file.
    • For data reported in column seq_number_firmware is is unclear what is reported in the raw data file. Since this column does not contain data in either of the two tables, it could also be deleted.
    • For data reported in column temperature_wetbulb_stull2011_C is is unclear what is reported in the ray data file.
  3. Data in raw data files acquired in 2021 contain:
    • 8 quadruplicate records for the same date and time in raw data files from the device in Romania. These records must be revised by the data provider.
    • 76 triplicate records for the same date and time (3 x 23 records in raw data files from the device in Switzerland, 3 x 53 records in raw data files from the device in Romania). These records must be revised by the data provider.
    • 1763 duplicate records for the same date and time (2 x 293 records in raw data files from the device in Switzerland, 2 x 1470 records in a raw data file from the device in Romania). These records must be revised by the data provider.
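
The repeated records listed above can be located programmatically. The following Python sketch is illustrative only and is not the procedure used for this report. It assumes that a repeated record is one that shares the same raw_data and datetime values with at least one other record; the function name count_repeated_timestamps and the sample records are hypothetical.

```python
from collections import Counter


def count_repeated_timestamps(records):
    """Return {multiplicity: number of (file, timestamp) keys seen that often}.

    Assumption: two records are repeats of each other when they share
    the same raw_data and datetime values.
    """
    # Count how often each (raw data file, timestamp) combination occurs.
    per_key = Counter((r["raw_data"], r["datetime"]) for r in records)
    # Keep only combinations occurring more than once, tallied by multiplicity.
    return dict(Counter(n for n in per_key.values() if n > 1))


# Hypothetical sample: one duplicate in A.csv and one triplicate in B.csv.
sample = [
    {"raw_data": "A.csv", "datetime": "2021-01-01 00:00:00"},
    {"raw_data": "A.csv", "datetime": "2021-01-01 00:00:00"},
    {"raw_data": "A.csv", "datetime": "2021-01-01 00:10:00"},
    {"raw_data": "B.csv", "datetime": "2021-01-01 00:00:00"},
    {"raw_data": "B.csv", "datetime": "2021-01-01 00:00:00"},
    {"raw_data": "B.csv", "datetime": "2021-01-01 00:00:00"},
]
print(count_repeated_timestamps(sample))  # {2: 1, 3: 1}
```

A key of 2 corresponds to duplicate records, 3 to triplicate records and 4 to quadruplicate records.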

data 2020 all countries

Table 10. Standardised metadata of the table. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
UID BGDWT180.DTLLC377.0
Name data 2020 all countries
IRI https://app.pollinatorhub.eu/dataset-discovery/parts/BGDWT180.DTLLC377.0
Type File
Licence CC BY-NC-ND 4.0
Description

Table data 2020 all countries contains 145,838 records (43.88 MB) from 9 distinct raw data files, 8 distinct countries and 4 distinct device types assessing different parameters in different units and with different methods.


Metadata

n/a
Table 11. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Name Description Data type Descriptor Unit
raw_data

Name of the raw data file, obtained from the data provider.

String Text [0.0.TEXTA315]

n/a

country

NUTS 2021 level 0 code of the location in which the weather station was based.

String nuts2021Code [0.0.NTSCD55]

n/a

device_type

Type, model or name of the device/system used to acquire the data.

String Text [0.0.TEXTA315]

n/a

DeviceID

Identifier of the device/system used to acquire the data.

String applianceID [0.0.PPLNC488]

n/a

unix_time

Date and time in Unix time format, presumably giving the time at which data has been transmitted.

Integer number unixTime [0.0.NXTME469]

n/a

timezone

Offset from UTC of the local time at the location of the device, given in seconds.

String utcOffset [0.0.TCFFS470]

s

datetime

Date and time in ISO 8601 format, presumably giving the time at which data has been transmitted.

Date and Time calendarDateAndTime [0.0.DTNDT319]

n/a

year

Calendar year, presumably giving the time at which data has been transmitted.

Integer number year [0.0.YEARA340]

year

month

Calendar month, presumably giving the time at which data has been transmitted.

Integer number calendarMonth [0.0.CLNDR376]

n/a

day

Calendar day of month, presumably giving the time at which data has been transmitted.

Integer number day [0.0.DAYAB382]

n/a

hour

Clock hour, presumably giving the time at which data has been transmitted.

Integer number clock hour [0.0.HRFDY386]

n/a

location_name

Name of the location in which the device/system is located.

String Text [0.0.TEXTA315]

n/a

location_lat

Geographic latitude of the location in which the device/system is located, given in WGS84 format in decimal degrees.

Decimal number decimalLatitude [0.0.LTTDE333]

°

location_long

Geographic longitude of the location in which the device/system is located, given in WGS84 format in decimal degrees.

Decimal number decimalLongitude [0.0.LNGTD332]

°

temperature

Not sufficiently specified by the data provider. Temperature measured in the relevant interval, either at any time during this interval (for example, at the very end) or calculated as arithmetic mean from all measurements or from a subset of measurements in this interval.

Decimal number temperature [0.0.TMPRT394]

°C

temperature_min

Minimum temperature measured in the relevant interval.

Decimal number temperature [0.0.TMPRT394]

°C

temperature_max

Maximum temperature measured in the relevant interval.

Decimal number temperature [0.0.TMPRT394]

°C

feels_like

Apparent temperature, that is, the temperature as perceived by humans, reported for the relevant interval.

Decimal number temperature [0.0.TMPRT394]

°C

dew_point

Temperature of dew point, measured in the relevant interval.

Decimal number temperature [0.0.TMPRT394]

°C

RH

Not sufficiently specified by the data provider. Relative humidity measured in the relevant interval, either at any time during this interval (for example, at the very end) or calculated as arithmetic mean from all measurements or from a subset of measurements in this interval.

Decimal number relativeHumidity [0.0.RLTVH395]

%

atmpressure_pa

Not sufficiently specified by the data provider. Atmospheric pressure measured in the relevant interval, either at any time during this interval (for example, at the very end) or calculated as arithmetic mean from all measurements or from a subset of measurements in this interval. Measured in or converted to Pa.

Decimal number atmosphericPressure [0.0.TMSPH396]

Pa

atmpressure_hpa

Not sufficiently specified by the data provider. Atmospheric pressure measured in the relevant interval, either at any time during this interval (for example, at the very end) or calculated as arithmetic mean from all measurements or from a subset of measurements in this interval. Measured in hPa.

Decimal number atmosphericPressure [0.0.TMSPH396]

hPa

atmpressure_sealevel_Pa

Not sufficiently specified by the data provider. Equivalent sea level atmospheric pressure, presumably based on the atmospheric pressure given in column atmpressure_pa. Expressed in Pa.

Decimal number atmosphericPressure [0.0.TMSPH396]

Pa

atmpressure_sealevel_hPa

Not sufficiently specified by the data provider. Equivalent sea level atmospheric pressure, presumably based on the atmospheric pressure given in column atmpressure_hpa. Expressed in hPa.

Decimal number atmosphericPressure [0.0.TMSPH396]

hPa

atmpressure_grndlevel

Not sufficiently specified by the data provider. Ground level atmospheric pressure, presumably based on the atmospheric pressure given in column atmpressure_hpa. Expressed in hPa.

Decimal number atmosphericPressure [0.0.TMSPH396]

hPa

rain

Not sufficiently specified by the data provider. Cumulative rainfall, presumably measured in the relevant measurement interval as the sum of all measurements.

Decimal number rainfall [0.0.RNFLL471]

mm

rain_intval

Duration of the interval for which the rainfall in column rain is reported.

Integer number Integer [0.0.NTGER313]

min

rain_3h

Cumulative rainfall, measured in the three hours preceding and including the relevant interval.

Decimal number rainfall [0.0.RNFLL471]

mm

snow

Not sufficiently specified by the data provider. Cumulative snowfall, presumably measured in the relevant measurement interval as the sum of all measurements.

Decimal number snowfall [0.0.SNWFL472]

mm

snow_3h

Cumulative snowfall, measured in the three hours preceding and including the relevant interval.

Decimal number snowfall [0.0.SNWFL472]

mm

rain_counter

No information is provided on this parameter.

Decimal number DecimalNumber [0.0.DCMLN314]

n/a

rain_max

No information is provided on this parameter. Presumably the maximum rainfall intensity in the relevant interval, given in mm/h.

Decimal number rainfallIntensity [0.0.RNFLL473]

mm h-1

rain_gauge

Rain gauge resolution, defined as the minimum amount of rain that a rain gauge can register, given in mm.

Decimal number DecimalNumber [0.0.DCMLN314]

mm

valid_ticks

No information is provided on this parameter.

Integer number Integer [0.0.NTGER313]

n/a

wind_speed_ms

Not sufficiently specified by the data provider. Wind speed measured in the relevant interval, either at any time during this interval (for example, at the very end) for a period of 2 min or calculated as arithmetic mean from all measurements or from a subset of measurements in this interval. Given in or transformed to m/s.

Decimal number windSpeed [0.0.WNDSP474]

m s-1

wind_speed_kmh

Not sufficiently specified by the data provider. Wind speed measured in the relevant interval, either at any time during this interval (for example, at the very end) for a period of 2 min or calculated as arithmetic mean from all measurements or from a subset of measurements in this interval. Given in km/h.

Decimal number windSpeed [0.0.WNDSP474]

km h-1

wind_deg

Not sufficiently specified by the data provider. Wind direction measured in the relevant interval, either at any time during this interval (for example, at the very end) or calculated as arithmetic mean from all measurements or from a subset of measurements in this interval. Given in degrees.

Integer number windDirection [0.0.WNDDR475]

°

wind_gust_ms

Wind gust measured in the relevant interval. Given in or transformed to m/s.

Decimal number windGust [0.0.WNDGS476]

m s-1

wind_gust_kmh

Wind gust measured in the relevant interval. Given in km/h.

Decimal number windGust [0.0.WNDGS476]

km h-1

irradiance

Not sufficiently specified by the data provider. Solar irradiance measured in the relevant interval, either at any time during this interval (for example, at the very end) or calculated as arithmetic mean from all measurements or from a subset of measurements in this interval. No information is provided on the range of wavelengths that are measured.

Decimal number solarIrradiance [0.0.SLRRR477]

W m-2

energy_density

Not sufficiently specified by the data provider. Rate of solar radiation measured in the relevant interval, either at any time during this interval (for example, at the very end) or calculated as arithmetic mean from all measurements or from a subset of measurements in this interval. No information is provided on the range of wavelengths that are measured.

Decimal number rateOfSolarRadiation [0.0.RTFSL478]

J cm-2

irradiance_max

Maximum solar irradiance measured in the relevant interval.

Decimal number solarIrradiance [0.0.SLRRR477]

W m-2

clouds

No information is provided on this parameter. Presumably the fraction of the sky that is covered with clouds.

Integer number Integer [0.0.NTGER313]

%

visibility

No information is provided on this parameter. Presumably the atmospheric visibility for the human eye, given in m.

Integer number Integer [0.0.NTGER313]

m

carbon_dioxide

No information is provided on this parameter.

Integer number Integer [0.0.NTGER313]

n/a

weather_id

Internal weather condition code adopted by the provider of the weather data.

Integer number Integer [0.0.NTGER313]

n/a

weather_main

Internal main group for the description of the weather adopted by the provider of the weather data.

String Text [0.0.TEXTA315]

n/a

weather_description

Internal subgroup for the description of the weather adopted by the provider of the weather data.

String Text [0.0.TEXTA315]

n/a

weather_icon

Internal code for icons describing the weather conditions, adopted by the provider of the weather data.

String Text [0.0.TEXTA315]

n/a

battery

Voltage of the batteries of the device/system.

Decimal number DecimalNumber [0.0.DCMLN314]

V

payload

No information is provided on this parameter.

String Text [0.0.TEXTA315]

n/a

time_sync_error_s

No information is provided on this parameter.

Integer number Integer [0.0.NTGER313]

s

seq_number_modem

No information is provided on this parameter.

Integer number Integer [0.0.NTGER313]

n/a

seq_number_firmware

No information is provided on this parameter.

String Text [0.0.TEXTA315]

n/a

temperature_wetbulb_stull2011_C

No information is provided on this parameter.

Decimal number DecimalNumber [0.0.DCMLN314]

°C

Metadata of individual tables can be found in Annex 1.
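
The relation between the columns unix_time, timezone and datetime can be illustrated with a short Python sketch. It is a sketch under stated assumptions, not a description of how the data provider derived the datetime column: it assumes that the value in timezone is added to UTC to obtain local time, and the function name to_iso_local is hypothetical. The report does not state whether datetime is expressed in local time or in UTC.

```python
from datetime import datetime, timedelta, timezone


def to_iso_local(unix_time, utc_offset_s):
    """Format a Unix timestamp as ISO 8601 using a fixed UTC offset.

    utc_offset_s is the offset from UTC in seconds; it may be passed as
    a string, since the timezone column is stored as String.
    """
    tz = timezone(timedelta(seconds=int(utc_offset_s)))
    return datetime.fromtimestamp(int(unix_time), tz).isoformat()


# Lowest unix_time value in Table 12, with an offset of 0 s.
print(to_iso_local(1577836800, 0))     # 2020-01-01T00:00:00+00:00
# Median unix_time value in Table 12, with an offset of 7200 s.
print(to_iso_local(1593662400, 7200))  # 2020-07-02T06:00:00+02:00
```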

Descriptive Measures

Table 12. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Name Length Mean Min Q1 Median Q3 Max Total Missing Zero Blank Distinct
raw_data 24 - 41 n/a BE_weather d… n/a n/a n/a RO_weather d… 145,838 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 9 ( 0.0% )
country 2 - 2 n/a BE n/a n/a n/a UK 145,838 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 8 ( 0.0% )
device_type 8 - 18 n/a Climatik n/a n/a n/a Weatherhelix 145,838 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 4 ( 0.0% )
DeviceID 3 - 3 172.3 164 164 171 182 182 145,838 77,990 ( 53.5% ) 0 ( 0.0% ) 0 ( 0.0% ) 4 ( 0.0% )
unix_time 10 - 10 1,593,661,661.1 1,577,836,800 1,585,742,400 1,593,662,400 1,601,571,600 1,609,455,600 145,838 119,055 ( 81.6% ) 0 ( 0.0% ) 0 ( 0.0% ) 8,785 ( 6.0% )
timezone 1 - 5 5,683.4 0 3,600 7,200 7,200 10,800 145,838 119,055 ( 81.6% ) 3,744 ( 2.6% ) 0 ( 0.0% ) 5 ( 0.0% )
datetime 19 - 19 n/a 2020-01-01 0… n/a n/a n/a 2020-12-31 2… 145,838 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 57,653 ( 39.5% )
year 4 - 4 2,020.0 2,020 2,020 2,020 2,020 2,020 145,838 137,055 ( 94.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 2 ( 0.0% )
month 1 - 2 6.5 1 4 7 10 12 145,838 137,055 ( 94.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 13 ( 0.0% )
day 1 - 2 15.8 1 8 16 23 31 145,838 137,055 ( 94.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 32 ( 0.0% )
hour 1 - 2 12.5 1 6 12 18 24 145,838 137,055 ( 94.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 25 ( 0.0% )
location_name 0 - 15 n/a 84007004 n/a n/a n/a UCLUJ 145,838 110,272 ( 75.6% ) 0 ( 0.0% ) 0 ( 0.0% ) 4 ( 0.0% )
location_lat 9 - 9 48.8594320 46.759188 46.759188 46.967707 52.947616 52.947616 145,838 119,055 ( 81.6% ) 0 ( 0.0% ) 0 ( 0.0% ) 4 ( 0.0% )
location_long 8 - 9 10.0120545 -1.068273 -1.068273 7.399013 23.570373 23.570373 145,838 119,055 ( 81.6% ) 0 ( 0.0% ) 0 ( 0.0% ) 4 ( 0.0% )
temperature 1 - 6 61.707 -100 9 15 25.215 305.93 145,838 22,661 ( 15.5% ) 58 ( 0.0% ) 0 ( 0.0% ) 6,424 ( 4.4% )
temperature_min 1 - 22 37.306928700000000276304 -100 7 12.19 19.2 303.46 145,838 51,207 ( 35.1% ) 112 ( 0.1% ) 0 ( 0.0% ) 4,708 ( 3.2% )
temperature_max 1 - 22 38.686732669999997824561 -100 7.9 12.9 20.3 307.7 145,838 51,207 ( 35.1% ) 45 ( 0.0% ) 0 ( 0.0% ) 4,444 ( 3.0% )
feels_like 1 - 6 100.737 -8.99 6.48 14.97 275.03 305.02 145,838 119,055 ( 81.6% ) 10 ( 0.0% ) 0 ( 0.0% ) 6,575 ( 4.5% )
dew_point 1 - 6 39.634 -7.78 6.55 10.4 14.3 295.57 145,838 66,737 ( 45.8% ) 31 ( 0.0% ) 0 ( 0.0% ) 5,393 ( 3.7% )
RH 1 - 4 76.55 0 64.8 82 91.8 100 145,838 0 ( 0.0% ) 3 ( 0.0% ) 0 ( 0.0% ) 826 ( 0.6% )
atmpressure_pa 5 - 9 100,229.409 50,000 98,850 100,570 101,700 104,900 145,838 38,440 ( 26.4% ) 0 ( 0.0% ) 0 ( 0.0% ) 20,205 ( 13.9% )
atmpressure_hpa 3 - 9 1,016.24168 970 1,009.8233 1,017.0732 1,023 1,049 145,838 96,394 ( 66.1% ) 0 ( 0.0% ) 0 ( 0.0% ) 18,771 ( 12.9% )
atmpressure_sealevel_Pa 1 - 6 72,438.0 0 0 101,055 101,945 104,020 145,838 114,105 ( 78.2% ) 9,072 ( 6.2% ) 0 ( 0.0% ) 1,220 ( 0.8% )
atmpressure_sealevel_hPa 3 - 7 1,014.375 977.45 1,008.6 1,016.4 1,021.25 1,040.2 145,838 123,177 ( 84.5% ) 0 ( 0.0% ) 0 ( 0.0% ) 1,219 ( 0.8% )
atmpressure_grndlevel 1 - 1 0.0 0 0 0 0 0 145,838 136,766 ( 93.8% ) 9,072 ( 6.2% ) 0 ( 0.0% ) 2 ( 0.0% )
rain 1 - 5 19.111 0 0 0 1 255 145,838 14,885 ( 10.2% ) 78,437 ( 53.8% ) 0 ( 0.0% ) 603 ( 0.4% )
rain_intval 2 - 2 32.4 10 10 10 60 60 145,838 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 2 ( 0.0% )
rain_3h 1 - 3 0.03 0 0 0 0 17 145,838 136,575 ( 93.6% ) 9,072 ( 6.2% ) 0 ( 0.0% ) 23 ( 0.0% )
snow 1 - 4 0.010 0 0 0 0 2 145,838 136,699 ( 93.7% ) 8,927 ( 6.1% ) 0 ( 0.0% ) 55 ( 0.0% )
snow_3h 1 - 3 0.00 0 0 0 0 3 145,838 136,743 ( 93.8% ) 9,072 ( 6.2% ) 0 ( 0.0% ) 9 ( 0.0% )
rain_counter 1 - 10 0.172750850 0 0 0 0 0.71372549 145,838 77,990 ( 53.5% ) 51,426 ( 35.3% ) 0 ( 0.0% ) 3 ( 0.0% )
rain_max 1 - 1 0.0 0 0 0 0 0 145,838 77,990 ( 53.5% ) 67,848 ( 46.5% ) 0 ( 0.0% ) 2 ( 0.0% )
rain_gauge 1 - 3 0.07 0 0 0 0.2 0.2 145,838 77,990 ( 53.5% ) 45,186 ( 31.0% ) 0 ( 0.0% ) 3 ( 0.0% )
valid_ticks 1 - 1 0.0 0 0 0 0 0 145,838 77,990 ( 53.5% ) 67,848 ( 46.5% ) 0 ( 0.0% ) 2 ( 0.0% )
wind_speed_ms 1 - 5 1.366 0 0 0.1 2.1 28.8 145,838 89,398 ( 61.3% ) 28,218 ( 19.3% ) 0 ( 0.0% ) 171 ( 0.1% )
wind_speed_kmh 1 - 1 0.1 0 0 0 0 8 145,838 116,181 ( 79.7% ) 27,685 ( 19.0% ) 0 ( 0.0% ) 7 ( 0.0% )
wind_deg 1 - 3 135.4 0 56 110 220 360 145,838 89,398 ( 61.3% ) 328 ( 0.2% ) 0 ( 0.0% ) 362 ( 0.2% )
wind_gust_ms 1 - 5 5.854 0 0 0 10.8 115.2 145,838 89,486 ( 61.4% ) 28,432 ( 19.5% ) 0 ( 0.0% ) 185 ( 0.1% )
wind_gust_kmh 1 - 2 2.6 0 0 1 4 32 145,838 116,181 ( 79.7% ) 11,511 ( 7.9% ) 0 ( 0.0% ) 18 ( 0.0% )
irradiance 1 - 4 100.0 0 0 2 84 1,126 145,838 35,566 ( 24.4% ) 31,131 ( 21.3% ) 0 ( 0.0% ) 1,061 ( 0.7% )
energy_density 1 - 3 67.6 0 0 3 115 364 145,838 137,055 ( 94.0% ) 4,130 ( 2.8% ) 0 ( 0.0% ) 363 ( 0.2% )
irradiance_max 1 - 4 82.3 0 2 2 70 1,218 145,838 77,990 ( 53.5% ) 12,335 ( 8.5% ) 0 ( 0.0% ) 563 ( 0.4% )
clouds 1 - 3 44.6 0 20 40 75 100 145,838 119,055 ( 81.6% ) 5,181 ( 3.6% ) 0 ( 0.0% ) 102 ( 0.1% )
visibility 1 - 5 8,920.7 0 9,999 10,000 10,000 10,000 145,838 119,156 ( 81.7% ) 155 ( 0.1% ) 0 ( 0.0% ) 67 ( 0.0% )
carbon_dioxide 0 - 0 n/a n/a n/a n/a 145,838 145,838 ( 100.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 1 ( 0.0% )
weather_id 3 - 3 727.0 200 701 800 802 804 145,838 119,055 ( 81.6% ) 0 ( 0.0% ) 0 ( 0.0% ) 28 ( 0.0% )
weather_main 0 - 12 n/a Clear n/a n/a n/a Thunderstorm 145,838 119,055 ( 81.6% ) 0 ( 0.0% ) 0 ( 0.0% ) 11 ( 0.0% )
weather_description 0 - 28 n/a broken cloud… n/a n/a n/a very heavy r… 145,838 119,055 ( 81.6% ) 0 ( 0.0% ) 0 ( 0.0% ) 29 ( 0.0% )
weather_icon 0 - 3 n/a 01d n/a n/a n/a 50n 145,838 119,055 ( 81.6% ) 0 ( 0.0% ) 0 ( 0.0% ) 19 ( 0.0% )
battery 1 - 4 4.169 3 4.15 4.15 4.2 4.25 145,838 65,223 ( 44.7% ) 0 ( 0.0% ) 0 ( 0.0% ) 6 ( 0.0% )
payload 0 - 23 n/a x n/a n/a n/a x7355810d052… 145,838 100,651 ( 69.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 44,675 ( 30.6% )
time_sync_error_s 1 - 4 8.8 -300 0 11 15 299 145,838 77,990 ( 53.5% ) 22,689 ( 15.6% ) 0 ( 0.0% ) 599 ( 0.4% )
seq_number_modem 1 - 4 1,829.8 0 959.25 1,726 2,545.75 4,095 145,838 133,622 ( 91.6% ) 2 ( 0.0% ) 0 ( 0.0% ) 4,097 ( 2.8% )
seq_number_firmware 0 - 0 n/a n/a n/a n/a 145,838 145,838 ( 100.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 1 ( 0.0% )
temperature_wetbulb_stull2011_C 1 - 5 10.667 -4.35 7 10.66 14.21 24.67 145,838 123,177 ( 84.5% ) 1 ( 0.0% ) 0 ( 0.0% ) 2,530 ( 1.7% )

Quality Measures

Table 13. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Name Completeness Uniqueness Most Common Value Least Common Value
raw_data 100.00% 0.01% PT_weather data 2020.csv FR_weather data 2020.xlsx
country 100.00% 0.01% PT FR
device_type 100.00% 0.00% Weatherhelix Climatik
DeviceID 46.52% 0.00% 164 171
unix_time 18.36% 6.02% 1581616800 1577836800
timezone 18.36% 0.00% 7200 0
datetime 100.00% 39.53% 2020-08-18 13:00:00 2020-07-27 13:40:00
year 6.02% 0.00% 2020 2020
month 6.02% 0.01% 1 2
day 6.02% 0.02% 1 31
hour 6.02% 0.02% 1 24
location_name 24.39% 0.00% n/a 84007004
location_lat 18.36% 0.00% n/a 52.947616
location_long 18.36% 0.00% n/a -1.068273
temperature 84.46% 4.40% n/a -1.92
temperature_min 64.89% 3.23% n/a -4.22
temperature_max 64.89% 3.05% n/a 7.35
feels_like 18.36% 4.51% n/a -3.26
dew_point 54.24% 3.70% n/a -2.47
RH 100.00% 0.57% 93 19.7
atmpressure_pa 73.64% 13.85% n/a 95835
atmpressure_hpa 33.90% 12.87% n/a 1011.8719
atmpressure_sealevel_Pa 21.76% 0.84% 0 103115
atmpressure_sealevel_hPa 15.54% 0.84% n/a 1031.15
atmpressure_grndlevel 6.22% 0.00% 0 0
rain 89.79% 0.41% 0 2.93
rain_intval 100.00% 0.00% 10 60
rain_3h 6.35% 0.02% n/a 7
snow 6.27% 0.04% n/a 1.85
snow_3h 6.24% 0.01% n/a 0.6
rain_counter 46.52% 0.00% n/a 0.71372549
rain_max 46.52% 0.00% 0 0
rain_gauge 46.52% 0.00% n/a 0.2
valid_ticks 46.52% 0.00% 0 0
wind_speed_ms 38.70% 0.12% n/a 13.41
wind_speed_kmh 20.34% 0.00% 0 8
wind_deg 38.70% 0.25% 230 318
wind_gust_ms 38.64% 0.13% n/a 18.9
wind_gust_kmh 20.34% 0.01% 0 32
irradiance 75.61% 0.73% 0 1053
energy_density 6.02% 0.25% 0 254
irradiance_max 46.52% 0.39% 2 960
clouds 18.36% 0.07% 75 78
visibility 18.30% 0.05% 10000 49
carbon_dioxide 0.00% 0.00% n/a n/a
weather_id 18.36% 0.02% 800 621
weather_main 18.36% 0.01% n/a Haze
weather_description 18.36% 0.02% n/a shower snow
weather_icon 18.36% 0.01% n/a 11n
battery 55.28% 0.00% n/a 3
payload 30.98% 30.63% n/a x7349c10d612e607a1701ff
time_sync_error_s 46.52% 0.41% 0 -82
seq_number_modem 8.38% 2.81% 438 76
seq_number_firmware 0.00% 0.00% n/a n/a
temperature_wetbulb_stull2011_C 15.54% 1.73% n/a 16.98
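
The completeness and uniqueness values in Table 13 appear to follow from the counts in Table 12. The Python sketch below reproduces the values for column unix_time. It assumes that completeness is the share of non-missing values and that uniqueness is the share of distinct values among all records; these definitions are inferred from the reported figures and are not stated in the report. The function names are hypothetical.

```python
def completeness_pct(total, missing):
    """Share of non-missing values in percent, rounded to two decimals."""
    return round(100 * (total - missing) / total, 2)


def uniqueness_pct(total, distinct):
    """Share of distinct values among all records in percent, rounded to two decimals."""
    return round(100 * distinct / total, 2)


# Counts for column unix_time, taken from Table 12.
total, missing, distinct = 145838, 119055, 8785
print(completeness_pct(total, missing))  # 18.36
print(uniqueness_pct(total, distinct))   # 6.02
```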

Changes made to preparatory file

  1. Data reporting atmospheric pressure in the raw data files was either provided in Pa or in hPa. Values provided in hPa were transformed to Pa in order to store all values in a common unit while maintaining the raw data.
  2. Data reporting wind speed in the raw data files was either provided in m/s or in km/h. Values provided in km/h were transformed to m/s in order to store all values in a common unit while maintaining the raw data.
  3. Data reporting wind gust in the raw data files was either provided in m/s or in km/h. Values provided in km/h were transformed to m/s in order to store all values in a common unit while maintaining the raw data.
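
The unit transformations described above rely on fixed conversion factors (1 hPa = 100 Pa; 1 km/h = 1000 m / 3600 s). A minimal Python sketch follows. The function names are hypothetical, and the code does not reproduce the tooling actually used to prepare the file.

```python
def hpa_to_pa(value_hpa):
    """Convert hectopascal to pascal (1 hPa = 100 Pa)."""
    return value_hpa * 100.0


def kmh_to_ms(value_kmh):
    """Convert kilometres per hour to metres per second (1 km/h = 1000 m / 3600 s)."""
    return value_kmh * 1000.0 / 3600.0


print(hpa_to_pa(1023))  # 102300.0
print(kmh_to_ms(36))    # 10.0
```

Keeping the original columns next to the converted ones, as done in the table, allows the conversion to be verified at any time.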

Changes made to data

No changes were made in the data file.

Unresolved issues

  1. In columns temperature, temperature_min, temperature_max, feels_like and dew_point, 21,694 records obtained from the device located in Romania contain values > 100 °C. These records must be revised by the data provider.
  2. In 3 records from the device in Belgium values are out of range. These records must be revised by the data provider.
  3. The description of the data (metadata) is largely incomplete and allows no clear standardisation of the data.
    • For column temperature it is unclear how the temperature is reported in the individual raw data files (either measured at a single point in time during the interval, for example at the very end, or calculated as the arithmetic mean of all measurements or of a subset of measurements in the interval).
    • For column feels_like it is unclear which algorithm was used to calculate the values reported in the raw data files.
    • For column dew_point it is unclear which algorithm was used to calculate the values reported in the raw data files.
    • For column RH it is unclear how the relative humidity is reported in the individual raw data files (either measured at a single point in time during the interval, for example at the very end, or calculated as the arithmetic mean of all measurements or of a subset of measurements in the interval).
    • For columns atmpressure_pa and atmpressure_hpa it is unclear how the atmospheric pressure is reported in the individual raw data files (either measured at a single point in time during the interval, for example at the very end, or calculated as the arithmetic mean of all measurements or of a subset of measurements in the interval).
    • For data reported in columns atmpressure_sealevel_Pa and atmpressure_sealevel_hPa it is unclear which algorithm was used to calculate the values reported in the raw data files.
    • For data reported in column atmpressure_grndlevel it is unclear what is reported in the raw data file.
    • For data reported in column rain_counter it is unclear what is reported in the raw data file.
    • For data reported in column rain_max it is unclear what is reported in the raw data file.
    • For data reported in column valid_ticks it is unclear what is reported in the raw data file.
    • For columns wind_speed_ms and wind_speed_kmh it is unclear how the wind speed is reported in the individual raw data files (either measured at a single point in time during the interval, for example at the very end, or calculated as the arithmetic mean of all measurements or of a subset of measurements in the interval).
    • For columns wind_gust_ms and wind_gust_kmh it is unclear how the wind gust is reported in the individual raw data files (either measured at a single point in time during the interval, for example at the very end, or calculated as the arithmetic mean of all measurements or of a subset of measurements in the interval).
    • For column wind_deg it is unclear how the wind direction is reported in the individual raw data files (either measured at a single point in time during the interval, for example at the very end, or calculated as the arithmetic mean of all measurements or of a subset of measurements in the interval).
    • For column irradiance it is unclear how the solar irradiance is reported in the individual raw data files (either measured at a single point in time during the interval, for example at the very end, or calculated as the arithmetic mean of all measurements or of a subset of measurements in the interval).
    • For column energy_density it is unclear how the rate of solar radiation is reported in the individual raw data files (either measured at a single point in time during the interval, for example at the very end, or calculated as the arithmetic mean of all measurements or of a subset of measurements in the interval).
    • For data reported in column clouds it is unclear what is reported in the raw data file.
    • For data reported in column visibility it is unclear what is reported in the raw data file.
    • For data reported in column carbon_dioxide it is unclear what is reported in the raw data file. Since this column does not contain data in either of the two tables, it could also be deleted.
    • For data reported in column payload it is unclear what is reported in the raw data file.
    • For data reported in column time_sync_error_s it is unclear what is reported in the raw data file.
    • For data reported in column seq_number_modem it is unclear what is reported in the raw data file.
    • For data reported in column seq_number_firmware it is unclear what is reported in the raw data file. Since this column does not contain data in either of the two tables, it could also be deleted.
    • For data reported in column temperature_wetbulb_stull2011_C it is unclear what is reported in the raw data file.
  4. Data in raw data files acquired in 2020 contain:
    • 9 triplicate records for the same date and time (3 x 8 records in raw data files from the device in Switzerland, 3 x 1 record in a raw data file from the device in Romania). These records must be revised by the data provider.
    • 413 duplicate records for the same date and time (2 x 272 records in raw data files from the device in Switzerland, 2 x 141 records in a raw data file from the device in Romania). These records must be revised by the data provider.
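
Records with implausible temperature values, such as those described under issue 1, can be flagged with a simple range check. The Python sketch below is illustrative only. It takes the limit of 100 °C from issue 1; the function name flag_implausible and the sample records are hypothetical, and the check says nothing about the cause of the values.

```python
TEMPERATURE_COLUMNS = (
    "temperature", "temperature_min", "temperature_max",
    "feels_like", "dew_point",
)


def flag_implausible(records, limit_c=100.0):
    """Return (record index, offending columns) for values above limit_c.

    Missing values (None or absent keys) are skipped, not flagged.
    """
    flagged = []
    for index, record in enumerate(records):
        columns = [
            name for name in TEMPERATURE_COLUMNS
            if record.get(name) is not None and record[name] > limit_c
        ]
        if columns:
            flagged.append((index, columns))
    return flagged


# Hypothetical records using quartile and maximum values from Table 12.
sample = [
    {"temperature": 15.0, "temperature_min": 12.19, "temperature_max": 20.3},
    {"temperature": 305.93, "temperature_min": 303.46, "temperature_max": 307.7},
    {"temperature": None, "dew_point": 295.57},
]
print(flag_implausible(sample))
# [(1, ['temperature', 'temperature_min', 'temperature_max']), (2, ['dew_point'])]
```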

References

  1. GFISCO 2024 GO FAIR initiative: Make your data & services FAIR. GO FAIR. [2024-10-01] www.go-fair.org

Annex 1: Table column reports

data 2021 all countries

raw_data

Table 14. Standardised metadata of the column. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Name raw_data
Description

Name of the raw data file, obtained from the data provider.

Data type String
Descriptor Text [UID:0.0.TEXTA315]
Descriptor description

In computer programming, a string is traditionally a sequence of characters, either as a literal constant or as some kind of variable. The latter may allow its elements to be mutated and the length changed, or it may be fixed (after creation). A string is generally considered as a data type and is often implemented as an array data structure of bytes (or words) that stores a sequence of elements, typically characters, using some character encoding. String may also denote more general arrays or other sequence (or list) data types and structures.

IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.TEXTA315
Unit

n/a

Table 15. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Name Length Mean Min Q1 Median Q3 Max Total Missing Zero Blank Distinct
raw_data 24 - 45 n/a BE_weather d… n/a n/a n/a RO_weather d… 184,065 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 9 ( 0.0% )
Table 16. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Name Completeness Uniqueness Most Common Value Least Common Value
raw_data 100.00% 0.00% NL_weather data 2021.csv BE_weather data 2021.csv

Data Distribution Top 20

Figure 1. Distribution of 20 most common values, from highest to lowest.

Completeness

Figure 2. Visualization of completeness of the data in the column.

Uniqueness

Figure 3. Visualization of uniqueness of the data in the column.

country

Table 17. Standardised metadata of the column. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Name country
Description

NUTS level 0 code of the location in which the weather station was based.

Data type String
Descriptor eurostat:nuts2021Code [UID:0.0.NTSCD55]
Descriptor description

A NUTS code defined in the NUTS classification 2021, valid from 2021-01-01 to 2023-12-31, containing 92 regions at NUTS level 1, 244 regions at NUTS level 2 and 1165 regions at NUTS level 3.

IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.NTSCD55
Unit

n/a

Table 18. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Name Length Mean Min Q1 Median Q3 Max Total Missing Zero Blank Distinct
country 2 - 2 n/a BE n/a n/a n/a UK 184,065 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 8 ( 0.0% )
Table 19. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Name Completeness Uniqueness Most Common Value Least Common Value
country 100.00% 0.00% NL BE
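
The values shown for raw_data (for example "NL_weather data 2021.csv") suggest that each source file name starts with the two-letter code also stored in the country column. The following minimal sketch checks this for a single record. It assumes that the prefix before the first underscore is the country code, which is inferred only from the values visible in Tables 15 to 19; the function names are hypothetical.

```python
def country_from_filename(raw_data):
    """Return the two-letter prefix before the first underscore."""
    prefix, sep, _rest = raw_data.partition("_")
    if not sep or len(prefix) != 2 or not prefix.isalpha():
        raise ValueError(f"no two-letter prefix in {raw_data!r}")
    return prefix.upper()


def is_consistent(raw_data, country):
    """True if the file-name prefix equals the value of the country column."""
    return country_from_filename(raw_data) == country


print(is_consistent("NL_weather data 2021.csv", "NL"))
```

Because raw_data has 9 distinct values and country has 8, at least one country is presumably represented by more than one source file, so the check is per record rather than a one-to-one mapping.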

Data Distribution Top 20

Figure 4. Distribution of 20 most common values, from highest to lowest.

Completeness

Figure 5. Visualization of completeness of the data in the column.

Uniqueness

Figure 6. Visualization of uniqueness of the data in the column.

device_type

Table 20. Standardised metadata of the column. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Name device_type
Description

Type, model or name of the device or system used to acquire the data.

Data type String
Descriptor Text [UID:0.0.TEXTA315]
Descriptor description

In computer programming, a string is traditionally a sequence of characters, either as a literal constant or as some kind of variable. The latter may allow its elements to be mutated and the length changed, or it may be fixed (after creation). A string is generally considered as a data type and is often implemented as an array data structure of bytes (or words) that stores a sequence of elements, typically characters, using some character encoding. String may also denote more general arrays or other sequence (or list) data types and structures.

IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.TEXTA315
Unit

n/a

Table 21. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Name Length Mean Min Q1 Median Q3 Max Total Missing Zero Blank Distinct
device_type 8 - 18 n/a Climatik n/a n/a n/a Weatherhelix 184,065 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 4 ( 0.0% )
Table 22. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Name Completeness Uniqueness Most Common Value Least Common Value
device_type 100.00% 0.00% Weatherhelix Climatik

Data Distribution Top 20

Figure 7. Distribution of 20 most common values, from highest to lowest.

Completeness

Figure 8. Visualization of completeness of the data in the column.

Uniqueness

Figure 9. Visualization of uniqueness of the data in the column.

DeviceID

Table 23. Standardised metadata of the column. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Name DeviceID
Description

Identifier of the device or system used to acquire the data.

Data type String
Descriptor pms:applianceID [UID:0.0.PPLNC488]
Descriptor description

Unique sequence of characters associated with an appliance within a dataset.

IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.PPLNC488
Unit

n/a

Table 24. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Name Length Mean Min Q1 Median Q3 Max Total Missing Zero Blank Distinct
DeviceID 3 - 3 176.1 164 164 182 182 219 184,065 70,705 ( 38.4% ) 0 ( 0.0% ) 0 ( 0.0% ) 5 ( 0.0% )
Table 25. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Name Completeness Uniqueness Most Common Value Least Common Value
DeviceID 61.59% 0.00% 182 171
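
The completeness values in the quality tables are consistent with the share of non-missing records reported in the content tables. As a sketch, assuming completeness is (Total - Missing) / Total expressed as a percentage and rounded to two decimals (the report does not state the formula; the function name is hypothetical):

```python
def completeness(total, missing):
    """Percentage of records that are not missing, rounded to two decimals."""
    return round(100 * (total - missing) / total, 2)


# DeviceID: 184,065 records, 70,705 of them missing (Table 24)
print(completeness(184_065, 70_705))
```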

Data Distribution Top 20

Figure 10. Distribution of 20 most common values, from highest to lowest.

Continuous Data Distribution

Figure 11. Distribution of values in the column.

Outliers

Figure 12. Visualization of median, min, max, and outliers in the column.

Completeness

Figure 13. Visualization of completeness of the data in the column.

Uniqueness

Figure 14. Visualization of uniqueness of the data in the column.

unix_time

Table 26. Standardised metadata of the column. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Name unix_time
Description

Date and time in Unix time format, presumably giving the time at which data has been transmitted.

Data type Integer number
Descriptor pms:unixTime [UID:0.0.NXTME469]
Descriptor description

Unix time is a date and time representation widely used in computing. It measures time by the number of non-leap seconds that have elapsed since 00:00:00 UTC on 1 January 1970, the Unix epoch. [...] Unix time is sometimes referred to as Epoch time. This can be misleading since Unix time is not the only time system based on an epoch and the Unix epoch is not the only epoch used by other time systems.

IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.NXTME469
Unit

n/a

Table 27. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Name Length Mean Min Q1 Median Q3 Max Total Missing Zero Blank Distinct
unix_time 10 - 10 1,625,279,669.9 1,609,459,200 1,617,282,000 1,625,288,400 1,633,305,600 1,640,991,600 184,065 157,111 ( 85.4% ) 0 ( 0.0% ) 0 ( 0.0% ) 8,761 ( 4.8% )
Table 28. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Name Completeness Uniqueness Most Common Value Least Common Value
unix_time 14.64% 4.76% 1612144800 1609462800
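
To make the Unix time values easier to read, they can be converted to calendar dates in UTC. The sketch below uses only the Python standard library; the function name is hypothetical.

```python
from datetime import datetime, timezone


def unix_to_utc(seconds):
    """Convert seconds since the Unix epoch to a timezone-aware UTC datetime."""
    return datetime.fromtimestamp(int(seconds), tz=timezone.utc)


# Min and Max of the unix_time column (Table 27)
print(unix_to_utc(1_609_459_200).isoformat())
print(unix_to_utc(1_640_991_600).isoformat())
```

Under this conversion the column spans 2021-01-01 00:00 UTC to 2021-12-31 23:00 UTC.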

Continuous Data Distribution

Figure 15. Distribution of values in the column.

Outliers

Figure 16. Visualization of median, min, max, and outliers in the column.

Completeness

Figure 17. Visualization of completeness of the data in the column.

Uniqueness

Figure 18. Visualization of uniqueness of the data in the column.

timezone

Table 29. Standardised metadata of the column. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Name timezone
Description

Offset from UTC of the local time at the location of the device, given in seconds.

Data type String
Descriptor bipm:utcOffset [UID:0.0.TCFFS470]
Descriptor description

The UTC offset is the difference in hours and minutes between Coordinated Universal Time (UTC) and local solar time, at a particular place. This difference is expressed with respect to UTC and is generally shown in the format ±[hh]:[mm], ±[hh][mm], or ±[hh].

IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.TCFFS470
Unit

s

Table 30. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Name Length Mean Min Q1 Median Q3 Max Total Missing Zero Blank Distinct
timezone 1 - 5 5,754.5 0 3,600 7,200 7,200 10,800 184,065 157,111 ( 85.4% ) 3,552 ( 1.9% ) 0 ( 0.0% ) 5 ( 0.0% )
Table 31. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Name Completeness Uniqueness Most Common Value Least Common Value
timezone 14.64% 0.00% 7200 0
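
The descriptor expresses UTC offsets as ±[hh]:[mm], whereas the column stores them in seconds. The following sketch converts between the two notations and applies an offset to a Unix time. The function names are hypothetical, and the report does not state whether the datetime column holds UTC or local time, so the last step is only an illustration of the arithmetic.

```python
from datetime import datetime, timedelta, timezone


def format_offset(offset_seconds):
    """Render an offset in seconds as ±hh:mm."""
    seconds = int(offset_seconds)  # the column is typed as String
    sign = "-" if seconds < 0 else "+"
    hours, remainder = divmod(abs(seconds), 3600)
    return f"{sign}{hours:02d}:{remainder // 60:02d}"


def to_local(unix_seconds, offset_seconds):
    """Return the local datetime for a Unix time and an offset in seconds."""
    tz = timezone(timedelta(seconds=int(offset_seconds)))
    return datetime.fromtimestamp(int(unix_seconds), tz=tz)


print(format_offset(7200))
print(to_local(1_640_991_600, 3600).isoformat())
```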

Data Distribution Top 20

Figure 19. Distribution of 20 most common values, from highest to lowest.

Continuous Data Distribution

Figure 20. Distribution of values in the column.

Outliers

Figure 21. Visualization of median, min, max, and outliers in the column.

Completeness

Figure 22. Visualization of completeness of the data in the column.

Uniqueness

Figure 23. Visualization of uniqueness of the data in the column.

datetime

Table 32. Standardised metadata of the column. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Name datetime
Description

Date and time in ISO 8601 format, presumably giving the time at which data has been transmitted.

Data type Date and Time
Descriptor iso-8601:calendarDateAndTime [UID:0.0.DTNDT319]
Descriptor description

date and time representation [...] that includes all the time scale components [...] associated with the expression [...]

IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.DTNDT319
Unit

n/a

Table 33. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Name Length Mean Min Q1 Median Q3 Max Total Missing Zero Blank Distinct
datetime 19 - 19 n/a 2021-01-01 0… n/a n/a n/a 2022-01-01 0… 184,065 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 69,084 ( 37.5% )
Table 34. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Name Completeness Uniqueness Most Common Value Least Common Value
datetime 100.00% 37.53% 2021-02-01 02:00:00 2021-01-07 11:50:00

Completeness

Figure 24. Visualization of completeness of the data in the column.

Uniqueness

Figure 25. Visualization of uniqueness of the data in the column.

year

Table 35. Standardised metadata of the column. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Name year
Description

Calendar year, presumably giving the time at which data has been transmitted.

Data type Integer number
Descriptor dwc:year [UID:0.0.YEARA340]
Descriptor description

A term from the Darwin Core standard:

The four-digit year in which the dwc:Event occurred, according to the Common Era Calendar.

IRI http://rs.tdwg.org/dwc/terms/year
Unit

year

Table 36. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Name Length Mean Min Q1 Median Q3 Max Total Missing Zero Blank Distinct
year 4 - 4 2,021.0 2,021 2,021 2,021 2,021 2,021 184,065 175,305 ( 95.2% ) 0 ( 0.0% ) 0 ( 0.0% ) 2 ( 0.0% )
Table 37. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Name Completeness Uniqueness Most Common Value Least Common Value
year 4.76% 0.00% 2021 2021

Data Distribution Top 20

Figure 26. Distribution of 20 most common values, from highest to lowest.

Continuous Data Distribution

Figure 27. Distribution of values in the column.

Outliers

Figure 28. Visualization of median, min, max, and outliers in the column.

Completeness

Figure 29. Visualization of completeness of the data in the column.

Uniqueness

Figure 30. Visualization of uniqueness of the data in the column.

month

Table 38. Standardised metadata of the column. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Name month
Description

Calendar month, presumably giving the time at which data has been transmitted.

Data type Integer number
Descriptor iso-8601:calendarMonth [UID:0.0.CLNDR376]
Descriptor description

time scale unit [...] resulting from a defined division of a calendar year [...], each containing a specific number of calendar days [...] Note 1 to entry: A calendar month is in common parlance often referred to as month, however in this document calendar month and month have different definitions.

IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.CLNDR376
Unit

n/a

Table 39. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Name Length Mean Min Q1 Median Q3 Max Total Missing Zero Blank Distinct
month 1 - 2 6.5 1 4 7 10 12 184,065 175,305 ( 95.2% ) 0 ( 0.0% ) 0 ( 0.0% ) 13 ( 0.0% )
Table 40. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Name Completeness Uniqueness Most Common Value Least Common Value
month 4.76% 0.01% 1 2
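
The Distinct count includes NULL, which explains why month has 13 distinct values (12 calendar months plus NULL). The uniqueness values are consistent with Distinct / Total expressed as a percentage. A small sketch under that assumption (the formula is not stated in the report; the function names are hypothetical):

```python
def distinct_including_null(values):
    """Count distinct values, treating None (NULL) as a value of its own."""
    return len(set(values))


def uniqueness(distinct, total):
    """Distinct values as a percentage of all records, two decimals."""
    return round(100 * distinct / total, 2)


# Toy column: every month once, plus repeated values and NULLs
months = list(range(1, 13)) + [1, None, None]
print(distinct_including_null(months))
print(uniqueness(13, 184_065))
```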

Data Distribution Top 20

Figure 31. Distribution of 20 most common values, from highest to lowest.

Continuous Data Distribution

Figure 32. Distribution of values in the column.

Outliers

Figure 33. Visualization of median, min, max, and outliers in the column.

Completeness

Figure 34. Visualization of completeness of the data in the column.

Uniqueness

Figure 35. Visualization of uniqueness of the data in the column.

day

Table 41. Standardised metadata of the column. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Name day
Description

Calendar day of month, presumably giving the time at which data has been transmitted.

Data type Integer number
Descriptor dwc:day [UID:0.0.DAYAB382]
Descriptor description

A term from the Darwin Core standard:

The integer day of the month on which the dwc:Event occurred.

IRI http://rs.tdwg.org/dwc/terms/day
Unit

n/a

Table 42. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Name Length Mean Min Q1 Median Q3 Max Total Missing Zero Blank Distinct
day 1 - 2 15.7 1 8 16 23 31 184,065 175,305 ( 95.2% ) 0 ( 0.0% ) 0 ( 0.0% ) 32 ( 0.0% )
Table 43. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Name Completeness Uniqueness Most Common Value Least Common Value
day 4.76% 0.02% 1 31

Data Distribution Top 20

Figure 36. Distribution of 20 most common values, from highest to lowest.

Data Distribution Bottom 20

Figure 37. Distribution of 20 least common values, from lowest to highest.

Outliers

Figure 38. Visualization of median, min, max, and outliers in the column.

Completeness

Figure 39. Visualization of completeness of the data in the column.

Uniqueness

Figure 40. Visualization of uniqueness of the data in the column.

hour

Table 44. Standardised metadata of the column. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Name hour
Description

Clock hour, presumably giving the time at which data has been transmitted.

Data type Integer number
Descriptor iso-8601:clock hour [UID:0.0.HRFDY386]
Descriptor description

time scale unit [...] whose duration [...] is one hour [...] Clock hour is in common parlance often referred to as hour, however in this document clock hour and hour have different definitions.

IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.HRFDY386
Unit

n/a

Table 45. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Name Length Mean Min Q1 Median Q3 Max Total Missing Zero Blank Distinct
hour 1 - 2 12.5 1 6.25 12.5 18.75 24 184,065 175,305 ( 95.2% ) 0 ( 0.0% ) 0 ( 0.0% ) 25 ( 0.0% )
Table 46. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Name Completeness Uniqueness Most Common Value Least Common Value
hour 4.76% 0.01% 1 1
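
According to Table 45 the hour column ranges from 1 to 24 rather than 0 to 23, so it cannot be passed directly to most date libraries. The sketch below combines year, month, day and hour into one timestamp, assuming that the hour counts the hours elapsed since midnight, so that hour 24 is 00:00 of the following day. This interpretation is an assumption and is not confirmed by the data provider; the function name is hypothetical.

```python
from datetime import datetime, timedelta


def compose_timestamp(year, month, day, hour):
    """Build a datetime from date parts and an hour in the range 1 to 24."""
    if not 1 <= hour <= 24:
        raise ValueError(f"hour out of range: {hour}")
    # Adding the hour as a duration lets hour 24 roll over to the next day.
    return datetime(year, month, day) + timedelta(hours=hour)


print(compose_timestamp(2021, 12, 31, 24))
```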

Data Distribution Top 20

Figure 41. Distribution of 20 most common values, from highest to lowest.

Data Distribution Bottom 20

Figure 42. Distribution of 20 least common values, from lowest to highest.

Outliers

Figure 43. Visualization of median, min, max, and outliers in the column.

Completeness

Figure 44. Visualization of completeness of the data in the column.

Uniqueness

Figure 45. Visualization of uniqueness of the data in the column.

location_name

Table 47. Standardised metadata of the column. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Name location_name
Description

Name of the location in which the device/system is located.

Data type String
Descriptor Text [UID:0.0.TEXTA315]
Descriptor description

In computer programming, a string is traditionally a sequence of characters, either as a literal constant or as some kind of variable. The latter may allow its elements to be mutated and the length changed, or it may be fixed (after creation). A string is generally considered as a data type and is often implemented as an array data structure of bytes (or words) that stores a sequence of elements, typically characters, using some character encoding. String may also denote more general arrays or other sequence (or list) data types and structures.

IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.TEXTA315
Unit

n/a

Table 48. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Name Length Mean Min Q1 Median Q3 Max Total Missing Zero Blank Distinct
location_name 0 - 15 n/a 84007004 n/a n/a n/a UCLUJ 184,065 148,351 ( 80.6% ) 0 ( 0.0% ) 0 ( 0.0% ) 4 ( 0.0% )
Table 49. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Name Completeness Uniqueness Most Common Value Least Common Value
location_name 19.40% 0.00% n/a 84007004

Data Distribution Top 20

Figure 46. Distribution of 20 most common values, from highest to lowest.

Completeness

Figure 47. Visualization of completeness of the data in the column.

Uniqueness

Figure 48. Visualization of uniqueness of the data in the column.

location_lat

Table 50. Standardised metadata of the column. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Name location_lat
Description

Geographic latitude of the location in which the device/system is located, given in WGS84 format in decimal degrees.

Data type Decimal number
Descriptor dwc:decimalLatitude [UID:0.0.LTTDE333]
Descriptor description

A term from the Darwin Core standard:

The geographic latitude (in decimal degrees, using the spatial reference system given in dwc:geodeticDatum) of the geographic center of a dcterms:Location. Positive values are north of the Equator, negative values are south of it. Legal values lie between -90 and 90, inclusive.

IRI http://rs.tdwg.org/dwc/terms/decimalLatitude
Unit

°

Table 51. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Name Length Mean Min Q1 Median Q3 Max Total Missing Zero Blank Distinct
location_lat 9 - 9 48.8408064 46.759188 46.759188 46.967707 52.947616 52.947616 184,065 157,111 ( 85.4% ) 0 ( 0.0% ) 0 ( 0.0% ) 4 ( 0.0% )
Table 52. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Name Completeness Uniqueness Most Common Value Least Common Value
location_lat 14.64% 0.00% n/a 52.947616

Data Distribution Top 20

Figure 49. Distribution of 20 most common values, from highest to lowest.

Continuous Data Distribution

Figure 50. Distribution of values in the column.

Outliers

Figure 51. Visualization of median, min, max, and outliers in the column.

Completeness

Figure 52. Visualization of completeness of the data in the column.

Uniqueness

Figure 53. Visualization of uniqueness of the data in the column.

location_long

Table 53. Standardised metadata of the column. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Name location_long
Description

Geographic longitude of the location in which the device/system is located, given in WGS84 format in decimal degrees.

Data type Decimal number
Descriptor dwc:decimalLongitude [UID:0.0.LNGTD332]
Descriptor description

A term from the Darwin Core standard:

The geographic longitude (in decimal degrees, using the spatial reference system given in dwc:geodeticDatum) of the geographic center of a dcterms:Location. Positive values are east of the Greenwich Meridian, negative values are west of it. Legal values lie between -180 and 180, inclusive.

IRI http://rs.tdwg.org/dwc/terms/decimalLongitude
Unit

°

Table 54. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Name Length Mean Min Q1 Median Q3 Max Total Missing Zero Blank Distinct
location_long 8 - 9 10.1038098 -1.068273 -1.068273 7.399013 23.570373 23.570373 184,065 157,111 ( 85.4% ) 0 ( 0.0% ) 0 ( 0.0% ) 4 ( 0.0% )
Table 55. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Name Completeness Uniqueness Most Common Value Least Common Value
location_long 14.64% 0.00% n/a -1.068273
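
The Darwin Core descriptors define the legal ranges for both coordinates (-90 to 90 for latitude, -180 to 180 for longitude). A minimal range check could look as follows; the function name is hypothetical, and the pairing of the example latitude and longitude is assumed, since the report lists the two columns separately.

```python
def is_valid_coordinate(latitude, longitude):
    """True if both values lie within the legal Darwin Core ranges."""
    return -90 <= latitude <= 90 and -180 <= longitude <= 180


# Values taken from Tables 51 and 54 (pairing assumed)
print(is_valid_coordinate(52.947616, -1.068273))
print(is_valid_coordinate(46.759188, 23.570373))
```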

Data Distribution Top 20

Figure 54. Distribution of 20 most common values, from highest to lowest.

Continuous Data Distribution

Figure 55. Distribution of values in the column.

Outliers

Figure 56. Visualization of median, min, max, and outliers in the column.

Completeness

Figure 57. Visualization of completeness of the data in the column.

Uniqueness

Figure 58. Visualization of uniqueness of the data in the column.

temperature

Table 56. Standardised metadata of the column. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Name temperature
Description

Not sufficiently specified by the data provider. Presumably the temperature for the relevant interval, either measured at a single point in time within the interval (for example, at the very end) or calculated as the arithmetic mean of all, or of a subset of, the measurements in the interval.

Data type Decimal number
Descriptor pms:temperature [UID:0.0.TMPRT394]
Descriptor description
IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.TMPRT394
Unit

°C

Table 57. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Name Length Mean Min Q1 Median Q3 Max Total Missing Zero Blank Distinct
temperature 1 - 6 29.957 -16 5.6 12.28 19.1 306.62 184,065 51,088 ( 27.8% ) 4,716 ( 2.6% ) 0 ( 0.0% ) 5,949 ( 3.2% )
Table 58. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Name Completeness Uniqueness Most Common Value Least Common Value
temperature 72.24% 3.23% n/a -1.95
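
The maximum of 306.62 in Table 57 is far outside the range of air temperatures in °C, while the quartiles are plausible. One possible explanation, which the report does not confirm, is that some records are given in kelvin. The sketch below flags such values and converts them; the threshold of 100 is a hypothetical cut-off chosen for illustration only, and the function names are hypothetical as well.

```python
KELVIN_OFFSET = 273.15
SUSPECT_THRESHOLD = 100.0  # hypothetical cut-off, not taken from the report


def looks_like_kelvin(value):
    """Flag values too high to be an air temperature in degrees Celsius."""
    return value > SUSPECT_THRESHOLD


def to_celsius_if_kelvin(value):
    """Convert flagged values from kelvin, leave all others unchanged."""
    if looks_like_kelvin(value):
        return round(value - KELVIN_OFFSET, 2)
    return value


print(to_celsius_if_kelvin(306.62))
print(to_celsius_if_kelvin(19.1))
```

Any such conversion should be confirmed with the data provider before it is applied to the dataset.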

Continuous Data Distribution

Figure 59. Distribution of values in the column.

Outliers

Figure 60. Visualization of median, min, max, and outliers in the column.

Completeness

Figure 61. Visualization of completeness of the data in the column.

Uniqueness

Figure 62. Visualization of uniqueness of the data in the column.

temperature_min

Table 59. Standardised metadata of the column. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Name temperature_min
Description

Minimum temperature measured in the relevant interval.

Data type Decimal number
Descriptor pms:temperature [UID:0.0.TMPRT394]
Descriptor description
IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.TMPRT394
Unit

°C

Table 60. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Name Length Mean Min Q1 Median Q3 Max Total Missing Zero Blank Distinct
temperature_min 1 - 22 27.399901 -16.1 4.1 10.4 17.1 305.5 184,065 43,751 ( 23.8% ) 4,807 ( 2.6% ) 0 ( 0.0% ) 4,185 ( 2.3% )
Table 61. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Name Completeness Uniqueness Most Common Value Least Common Value
temperature_min 76.23% 2.27% n/a 0.15

Continuous Data Distribution

Figure 63. Distribution of values in the column.

Outliers

Figure 64. Visualization of median, min, max, and outliers in the column.

Completeness

Figure 65. Visualization of completeness of the data in the column.

Uniqueness

Figure 66. Visualization of uniqueness of the data in the column.

temperature_max

Table 62. Standardised metadata of the column. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Name temperature_max
Description

Maximum temperature measured in the relevant interval.

Data type Decimal number
Descriptor pms:temperature [UID:0.0.TMPRT394]
Descriptor description
IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.TMPRT394
Unit

°C

Table 63. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Name Length Mean Min Q1 Median Q3 Max Total Missing Zero Blank Distinct
temperature_max 1 - 22 28.12269488 -15.8 4.8 11 17.6 309.72 184,065 43,751 ( 23.8% ) 4,729 ( 2.6% ) 0 ( 0.0% ) 4,202 ( 2.3% )
Table 64. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Name Completeness Uniqueness Most Common Value Least Common Value
temperature_max 76.23% 2.28% n/a 1.82

Continuous Data Distribution

Figure 67. Distribution of values in the column.

Outliers

Figure 68. Visualization of median, min, max, and outliers in the column.

Completeness

Figure 69. Visualization of completeness of the data in the column.

Uniqueness

Figure 70. Visualization of uniqueness of the data in the column.

feels_like

Table 65. Standardised metadata of the column. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Name feels_like
Description

Apparent ("feels like") temperature, which accounts for the human perception of weather, in the relevant interval.

Data type Decimal number
Descriptor pms:temperature [UID:0.0.TMPRT394]
Descriptor description
IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.TMPRT394
Unit

°C

Table 66. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Name Length Mean Min Q1 Median Q3 Max Total Missing Zero Blank Distinct
feels_like 1 - 6 100.874 -14.04 5.84 15.09 273.52 307.69 184,065 157,111 ( 85.4% ) 11 ( 0.0% ) 0 ( 0.0% ) 6,391 ( 3.5% )
Table 67. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Name Completeness Uniqueness Most Common Value Least Common Value
feels_like 14.64% 3.47% n/a -4.17

Continuous Data Distribution

Figure 71. Distribution of values in the column.

Outliers

Figure 72. Visualization of median, min, max, and outliers in the column.

Completeness

Figure 73. Visualization of completeness of the data in the column.

Uniqueness

Figure 74. Visualization of uniqueness of the data in the column.
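
The statistics in Table 66 (Q3 of 273.52 and Max of 307.69 against a Median of 15.09) are hard to reconcile with a single unit of °C. One possible explanation, which the data provider does not confirm, is that part of the records were stored in kelvin. The sketch below shows how such values could be flagged and converted during a revision. The 60 °C plausibility threshold and the function name are assumptions made for illustration.

```python
KELVIN_OFFSET = 273.15
# Assumption: no genuine air temperature at the study sites exceeds 60 °C,
# so larger values are treated as kelvin readings.
PLAUSIBLE_MAX_C = 60.0


def normalise_celsius(value):
    """Return the value in °C, converting suspected kelvin readings.

    None (NULL) is passed through unchanged.
    """
    if value is None:
        return None
    if value > PLAUSIBLE_MAX_C:
        return round(value - KELVIN_OFFSET, 2)
    return value


# Min, Q3 and Max reported for feels_like in Table 66.
for raw in (-14.04, 273.52, 307.69):
    print(raw, "->", normalise_celsius(raw))
```

Whether the converted values are correct can only be confirmed against the original source of the weather data.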

dew_point

Table 68. Standardised metadata of the column. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Name dew_point
Description

Dew point temperature measured in the relevant interval.

Data type Decimal number
Descriptor pms:temperature [UID:0.0.TMPRT394]
Descriptor description
IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.TMPRT394
Unit

°C

Table 69. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Name Length Mean Min Q1 Median Q3 Max Total Missing Zero Blank Distinct
dew_point 1 - 6 24.514 -16.14 0 0.1 10.4 295.86 184,065 63,485 ( 34.5% ) 51,174 ( 27.8% ) 0 ( 0.0% ) 5,643 ( 3.1% )
Table 70. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Name Completeness Uniqueness Most Common Value Least Common Value
dew_point 65.51% 3.07% n/a -5.22

Continuous Data Distribution

Figure 75. Distribution of values in the column.

Outliers

Figure 76. Visualization of median, min, max, and outliers in the column.

Completeness

Figure 77. Visualization of completeness of the data in the column.

Uniqueness

Figure 78. Visualization of uniqueness of the data in the column.

RH

Table 71. Standardised metadata of the column. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Name RH
Description

Not sufficiently specified by the data provider. Relative humidity measured in the relevant interval, either at any time during this interval (for example, at the very end) or calculated as arithmetic mean from all measurements or from a subset of measurements in this interval.

Data type Decimal number
Descriptor pms:relativeHumidity [UID:0.0.RLTVH395]
Descriptor description

Relative humidity (RH) (expressed as a percent) also measures water vapor, but RELATIVE to the temperature of the air. In other words, it is a measure of the actual amount of water vapor in the air compared to the total amount of vapor that can exist in the air at its current temperature.

IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.RLTVH395
Unit

%

Table 72. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Name Length Mean Min Q1 Median Q3 Max Total Missing Zero Blank Distinct
RH 1 - 4 76.44 0 67 83 92 100 184,065 0 ( 0.0% ) 4,536 ( 2.5% ) 0 ( 0.0% ) 799 ( 0.4% )
Table 73. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Name Completeness Uniqueness Most Common Value Least Common Value
RH 100.00% 0.43% 0 14

Continuous Data Distribution

Figure 79. Distribution of values in the column.

Outliers

Figure 80. Visualization of median, min, max, and outliers in the column.

Completeness

Figure 81. Visualization of completeness of the data in the column.

Uniqueness

Figure 82. Visualization of uniqueness of the data in the column.

atmpressure_pa

Table 74. Standardised metadata of the column. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Name atmpressure_pa
Description

Not sufficiently specified by the data provider. Atmospheric pressure measured in the relevant interval, either at any time during this interval (for example, at the very end) or calculated as arithmetic mean from all measurements or from a subset of measurements in this interval. Measured in or converted to Pa.

Data type Decimal number
Descriptor pms:atmosphericPressure [UID:0.0.TMSPH396]
Descriptor description

Atmospheric pressure, also known as air pressure or barometric pressure (after the barometer), is the pressure within the atmosphere of Earth.

IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.TMSPH396
Unit

Pa

Table 75. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Name Length Mean Min Q1 Median Q3 Max Total Missing Zero Blank Distinct
atmpressure_pa 1 - 9 98,166.113 0 100,400 101,300 102,190 120,060 184,065 43,751 ( 23.8% ) 4,400 ( 2.4% ) 0 ( 0.0% ) 8,123 ( 4.4% )
Table 76. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Name Completeness Uniqueness Most Common Value Least Common Value
atmpressure_pa 76.23% 4.41% n/a 97555

Continuous Data Distribution

Figure 83. Distribution of values in the column.

Outliers

Figure 84. Visualization of median, min, max, and outliers in the column.

Completeness

Figure 85. Visualization of completeness of the data in the column.

Uniqueness

Figure 86. Visualization of uniqueness of the data in the column.
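
Table 75 reports a minimum of 0 Pa and 4,400 zero values, although an atmospheric pressure of zero cannot occur at ground level. The sketch below assumes, without confirmation from the data provider, that these zeros are placeholders for missing measurements, and shows how completeness would change if they were counted as missing. The function names are hypothetical.

```python
def zero_to_missing(values):
    """Replace 0 with None, assuming 0 marks a missing measurement."""
    return [None if v == 0 else v for v in values]


def effective_completeness(total: int, missing: int, zero: int) -> float:
    """Completeness in percent when zeros are also counted as missing."""
    return 100.0 * (total - missing - zero) / total


# Counts reported for atmpressure_pa in Table 75.
print(f"{effective_completeness(184_065, 43_751, 4_400):.2f}%")  # 73.84%

# Hypothetical sample values, for illustration only.
print(zero_to_missing([0, 101300, None, 100400]))
```

The same adjustment could be applied to other columns in which zero is not a plausible measurement.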

atmpressure_hpa

Table 77. Standardised metadata of the column. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Name atmpressure_hpa
Description

Not sufficiently specified by the data provider. Atmospheric pressure measured in the relevant interval, either at any time during this interval (for example, at the very end) or calculated as arithmetic mean from all measurements or from a subset of measurements in this interval. Measured in hPa.

Data type Decimal number
Descriptor pms:atmosphericPressure [UID:0.0.TMSPH396]
Descriptor description

Atmospheric pressure, also known as air pressure or barometric pressure (after the barometer), is the pressure within the atmosphere of Earth.

IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.TMSPH396
Unit
Table 78. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Name Length Mean Min Q1 Median Q3 Max Total Missing Zero Blank Distinct
atmpressure_hpa 1 - 9 411.51953 0 0 0 1,016.78545 1,094 184,065 98,476 ( 53.5% ) 51,088 ( 27.8% ) 0 ( 0.0% ) 6,602 ( 3.6% )
Table 79. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Name Completeness Uniqueness Most Common Value Least Common Value
atmpressure_hpa 46.50% 3.59% n/a 1064

Continuous Data Distribution

Figure 87. Distribution of values in the column.

Completeness

Figure 88. Visualization of completeness of the data in the column.

Uniqueness

Figure 89. Visualization of uniqueness of the data in the column.
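
The columns atmpressure_pa and atmpressure_hpa describe the same quantity in different units (1 hPa = 100 Pa). The sketch below shows one way a reuser might merge them into a single hPa value per record. It assumes that each record is available as a dictionary keyed by column name and that NULL and 0 both mean "no measurement"; neither assumption is confirmed by the data provider, and the function name is hypothetical.

```python
PA_PER_HPA = 100.0


def pressure_hpa(row: dict):
    """Return atmospheric pressure in hPa from whichever column is populated.

    Prefers atmpressure_hpa, falls back to atmpressure_pa divided by 100.
    Values of None or 0 are treated as missing (assumption).
    """
    hpa = row.get("atmpressure_hpa")
    if hpa:
        return float(hpa)
    pa = row.get("atmpressure_pa")
    if pa:
        return pa / PA_PER_HPA
    return None


# Hypothetical records, for illustration only.
print(pressure_hpa({"atmpressure_pa": 101300, "atmpressure_hpa": None}))
print(pressure_hpa({"atmpressure_pa": None, "atmpressure_hpa": 1016.78545}))
print(pressure_hpa({"atmpressure_pa": 0, "atmpressure_hpa": 0}))
```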

atmpressure_sealevel_Pa

Table 80. Standardised metadata of the column. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Name atmpressure_sealevel_Pa
Description

Not sufficiently specified by the data provider. Equivalent sea level atmospheric pressure, presumably based on the atmospheric pressure given in column atmpressure_pa. Expressed in Pa.

Data type Decimal number
Descriptor pms:atmosphericPressure [UID:0.0.TMSPH396]
Descriptor description

Atmospheric pressure, also known as air pressure or barometric pressure (after the barometer), is the pressure within the atmosphere of Earth.

IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.TMSPH396
Unit

Pa

Table 81. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Name Length Mean Min Q1 Median Q3 Max Total Missing Zero Blank Distinct
atmpressure_sealevel_Pa 1 - 5 10,867.4 0 0 0 0 99,640 184,065 116,331 ( 63.2% ) 60,187 ( 32.7% ) 0 ( 0.0% ) 855 ( 0.5% )
Table 82. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Name Completeness Uniqueness Most Common Value Least Common Value
atmpressure_sealevel_Pa 36.80% 0.46% 0 95600

Continuous Data Distribution

Figure 90. Distribution of values in the column.

Completeness

Figure 91. Visualization of completeness of the data in the column.

Uniqueness

Figure 92. Visualization of uniqueness of the data in the column.

atmpressure_sealevel_hPa

Table 83. Standardised metadata of the column. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Name atmpressure_sealevel_hPa
Description

Equivalent sea level atmospheric pressure, presumably based on the atmospheric pressure given in column atmpressure_hpa. Expressed in hPa.

Data type Decimal number
Descriptor pms:atmosphericPressure [UID:0.0.TMSPH396]
Descriptor description

Atmospheric pressure, also known as air pressure or barometric pressure (after the barometer), is the pressure within the atmosphere of Earth.

IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.TMSPH396
Unit
Table 84. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Name Length Mean Min Q1 Median Q3 Max Total Missing Zero Blank Distinct
atmpressure_sealevel_hPa 1 - 6 125.538 0 0 0 0 996.4 184,065 125,430 ( 68.1% ) 51,088 ( 27.8% ) 0 ( 0.0% ) 855 ( 0.5% )
Table 85. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Name Completeness Uniqueness Most Common Value Least Common Value
atmpressure_sealevel_hPa 31.86% 0.46% n/a 956

Continuous Data Distribution

Figure 93. Distribution of values in the column.

Completeness

Figure 94. Visualization of completeness of the data in the column.

Uniqueness

Figure 95. Visualization of uniqueness of the data in the column.

atmpressure_grndlevel

Table 86. Standardised metadata of the column. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Name atmpressure_grndlevel
Description

Not sufficiently specified by the data provider. Ground level atmospheric pressure, presumably based on the atmospheric pressure given in column atmpressure_hpa. Expressed in hPa.

Data type Decimal number
Descriptor pms:atmosphericPressure [UID:0.0.TMSPH396]
Descriptor description

Atmospheric pressure, also known as air pressure or barometric pressure (after the barometer), is the pressure within the atmosphere of Earth.

IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.TMSPH396
Unit
Table 87. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Name Length Mean Min Q1 Median Q3 Max Total Missing Zero Blank Distinct
atmpressure_grndlevel 1 - 1 0.0 0 0 0 0 0 184,065 174,966 ( 95.1% ) 9,099 ( 4.9% ) 0 ( 0.0% ) 2 ( 0.0% )
Table 88. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Name Completeness Uniqueness Most Common Value Least Common Value
atmpressure_grndlevel 4.94% 0.00% 0 0

Data Distribution Top 20

Figure 96. Distribution of 20 most common values, from highest to lowest.

Continuous Data Distribution

Figure 97. Distribution of values in the column.

Completeness

Figure 98. Visualization of completeness of the data in the column.

Uniqueness

Figure 99. Visualization of uniqueness of the data in the column.

rain

Table 89. Standardised metadata of the column. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Name rain
Description

Not sufficiently specified by the data provider. Cumulative rainfall, presumably measured in the relevant measurement interval as the sum of all measurements.

Data type Decimal number
Descriptor pms:rainfall [UID:0.0.RNFLL471]
Descriptor description

The amount of precipitation of any type (including the liquid equivalent of frozen hydrometeors); usually taken as that amount measured by means of a rain gauge (thus a small, varying amount of direct condensation is included). A more accurate term would be precipitation or precipitation amount. However, the broad use of "rainfall" is firmly established in meteorology, especially in hydrologic and climatological literature. Its best utilization would confine it to liquid precipitation, and so would provide a distinction between precipitation immediately accessible to soil and streams and that delayed in storage as snow or ice on the earth's surface.

IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.RNFLL471
Unit

mm

Table 90. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Name Length Mean Min Q1 Median Q3 Max Total Missing Zero Blank Distinct
rain 1 - 5 65.435 0 0 24 133 255 184,065 15,196 ( 8.3% ) 58,461 ( 31.8% ) 0 ( 0.0% ) 584 ( 0.3% )
Table 91. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Name Completeness Uniqueness Most Common Value Least Common Value
rain 91.74% 0.32% 0 1.82

Continuous Data Distribution

Figure 100. Distribution of values in the column.

Outliers

Figure 101. Visualization of median, min, max, and outliers in the column.

Completeness

Figure 102. Visualization of completeness of the data in the column.

Uniqueness

Figure 103. Visualization of uniqueness of the data in the column.

rain_intval

Table 92. Standardised metadata of the column. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Name rain_intval
Description

Duration of the interval for which the rainfall in column rain is reported.

Data type Integer number
Descriptor Integer [UID:0.0.NTGER313]
Descriptor description

A number with no fractional part, including the negative and positive numbers as well as zero.

IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.NTGER313
Unit

min

Table 93. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Name Length Mean Min Q1 Median Q3 Max Total Missing Zero Blank Distinct
rain_intval 2 - 2 29.2 10 10 10 60 60 184,065 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 2 ( 0.0% )
Table 94. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Name Completeness Uniqueness Most Common Value Least Common Value
rain_intval 100.00% 0.00% 10 60

Data Distribution Top 20

Figure 104. Distribution of 20 most common values, from highest to lowest.

Continuous Data Distribution

Figure 105. Distribution of values in the column.

Outliers

Figure 106. Visualization of median, min, max, and outliers in the column.

Completeness

Figure 107. Visualization of completeness of the data in the column.

Uniqueness

Figure 108. Visualization of uniqueness of the data in the column.
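
Because rain_intval takes two values (10 and 60 minutes), rainfall totals from different records are not directly comparable. The sketch below converts a total into an hourly rate. It assumes that rain holds the cumulative rainfall in mm over rain_intval minutes, which the data provider has not confirmed; the function name and sample values are hypothetical.

```python
def rain_rate_mm_per_h(rain_mm, interval_min):
    """Convert rainfall accumulated over interval_min minutes to mm per hour.

    Returns None when the rainfall value is missing (NULL).
    """
    if rain_mm is None:
        return None
    if interval_min <= 0:
        raise ValueError("interval_min must be positive")
    return rain_mm * 60.0 / interval_min


# Hypothetical values: the same 0.2 mm total reported for both interval lengths.
print(rain_rate_mm_per_h(0.2, 10))
print(rain_rate_mm_per_h(0.2, 60))
```

Normalising to a common time base in this way only makes sense once the unit and meaning of the rain column have been clarified with the data provider.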

rain_3h

Table 95. Standardised metadata of the column. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Name rain_3h
Description

Cumulative rainfall, measured in the three hours preceding and including the relevant interval.

Data type Decimal number
Descriptor pms:rainfall [UID:0.0.RNFLL471]
Descriptor description

The amount of precipitation of any type (including the liquid equivalent of frozen hydrometeors); usually taken as that amount measured by means of a rain gauge (thus a small, varying amount of direct condensation is included). A more accurate term would be precipitation or precipitation amount. However, the broad use of "rainfall" is firmly established in meteorology, especially in hydrologic and climatological literature. Its best utilization would confine it to liquid precipitation, and so would provide a distinction between precipitation immediately accessible to soil and streams and that delayed in storage as snow or ice on the earth's surface.

IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.RNFLL471
Unit

mm

Table 96. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Name Length Mean Min Q1 Median Q3 Max Total Missing Zero Blank Distinct
rain_3h 1 - 1 0.0 0 0 0 0 0 184,065 174,966 ( 95.1% ) 9,099 ( 4.9% ) 0 ( 0.0% ) 2 ( 0.0% )
Table 97. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Name Completeness Uniqueness Most Common Value Least Common Value
rain_3h 4.94% 0.00% 0 0

Data Distribution Top 20

Figure 109. Distribution of 20 most common values, from highest to lowest.

Continuous Data Distribution

Figure 110. Distribution of values in the column.

Completeness

Figure 111. Visualization of completeness of the data in the column.

Uniqueness

Figure 112. Visualization of uniqueness of the data in the column.

snow

Table 98. Standardised metadata of the column. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Name snow
Description

Not sufficiently specified by the data provider. Cumulative snowfall, presumably measured in the relevant measurement interval as the sum of all measurements.

Data type Decimal number
Descriptor pms:snowfall [UID:0.0.SNWFL472]
Descriptor description

Precipitation composed of white or translucent ice crystals, chiefly in complex branch hexagonal form and often agglomerated into snowflakes. For weather-observing purposes, the intensity of snow is characterized as 1) light when the visibility is 1 km (5/8 statute mile) or more; 2) moderate when the visibility is less than 1 km (5/8 statute mile) but not less than 1/2 km (5/16 statute mile); and 3) heavy when the visibility is less than 1/2 km (5/16 statute mile).

IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.SNWFL472
Unit

mm

Table 99. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Name Length Mean Min Q1 Median Q3 Max Total Missing Zero Blank Distinct
snow 1 - 5 0.026 0 0 0 0 23.37 184,065 174,640 ( 94.9% ) 8,809 ( 4.8% ) 0 ( 0.0% ) 91 ( 0.0% )
Table 100. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Name Completeness Uniqueness Most Common Value Least Common Value
snow 5.12% 0.05% n/a 0.94

Data Distribution Top 20

Figure 113. Distribution of 20 most common values, from highest to lowest.

Data Distribution Bottom 20

Figure 114. Distribution of 20 least common values, from lowest to highest.

Completeness

Figure 115. Visualization of completeness of the data in the column.

Uniqueness

Figure 116. Visualization of uniqueness of the data in the column.

snow_3h

Table 101. Standardised metadata of the column. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Name snow_3h
Description

Cumulative snowfall, measured in the three hours preceding and including the relevant interval.

Data type Decimal number
Descriptor pms:snowfall [UID:0.0.SNWFL472]
Descriptor description

Precipitation composed of white or translucent ice crystals, chiefly in complex branch hexagonal form and often agglomerated into snowflakes. For weather-observing purposes, the intensity of snow is characterized as 1) light when the visibility is 1 km (5/8 statute mile) or more; 2) moderate when the visibility is less than 1 km (5/8 statute mile) but not less than 1/2 km (5/16 statute mile); and 3) heavy when the visibility is less than 1/2 km (5/16 statute mile).

IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.SNWFL472
Unit

mm

Table 102. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Name Length Mean Min Q1 Median Q3 Max Total Missing Zero Blank Distinct
snow_3h 1 - 1 0.0 0 0 0 0 0 184,065 174,966 ( 95.1% ) 9,099 ( 4.9% ) 0 ( 0.0% ) 2 ( 0.0% )
Table 103. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Name Completeness Uniqueness Most Common Value Least Common Value
snow_3h 4.94% 0.00% 0 0

Data Distribution Top 20

Figure 117. Distribution of 20 most common values, from highest to lowest.

Continuous Data Distribution

Figure 118. Distribution of values in the column.

Completeness

Figure 119. Visualization of completeness of the data in the column.

Uniqueness

Figure 120. Visualization of uniqueness of the data in the column.

rain_counter

Table 104. Standardised metadata of the column. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Name rain_counter
Description

No information is provided on this parameter.

Data type Decimal number
Descriptor DecimalNumber [UID:0.0.DCMLN314]
Descriptor description

Any of the rational or irrational numbers.

IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.DCMLN314
Unit

n/a

Table 105. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Name Length Mean Min Q1 Median Q3 Max Total Missing Zero Blank Distinct
rain_counter 1 - 1 0.0 0 0 0 0 0 184,065 78,252 ( 42.5% ) 105,813 ( 57.5% ) 0 ( 0.0% ) 2 ( 0.0% )
Table 106. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Name Completeness Uniqueness Most Common Value Least Common Value
rain_counter 57.49% 0.00% 0 0

Data Distribution Top 20

Figure 121. Distribution of 20 most common values, from highest to lowest.

Continuous Data Distribution

Figure 122. Distribution of values in the column.

Completeness

Figure 123. Visualization of completeness of the data in the column.

Uniqueness

Figure 124. Visualization of uniqueness of the data in the column.

rain_max

Table 107. Standardised metadata of the column. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Name rain_max
Description

No information is provided on this parameter. Presumably the maximum rainfall intensity in the relevant interval, given in mm/h.

Data type Decimal number
Descriptor pms:rainfallIntensity [UID:0.0.RNFLL473]
Descriptor description

The rate of precipitation, usually expressed in millimeters or inches per hour.

IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.RNFLL473
Unit

mm h⁻¹

Table 108. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Name Length Mean Min Q1 Median Q3 Max Total Missing Zero Blank Distinct
rain_max 1 - 1 0.0 0 0 0 0 0 184,065 70,705 ( 38.4% ) 113,360 ( 61.6% ) 0 ( 0.0% ) 2 ( 0.0% )
Table 109. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Name Completeness Uniqueness Most Common Value Least Common Value
rain_max 61.59% 0.00% 0 0

Data Distribution Top 20

Figure 125. Distribution of 20 most common values, from highest to lowest.

Continuous Data Distribution

Figure 126. Distribution of values in the column.

Completeness

Figure 127. Visualization of completeness of the data in the column.

Uniqueness

Figure 128. Visualization of uniqueness of the data in the column.

rain_gauge

Table 110. Standardised metadata of the column. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Name rain_gauge
Description

Rain gauge resolution, defined as the minimum amount of rain that a rain gauge can register, given in mm.

Data type Decimal number
Descriptor DecimalNumber [UID:0.0.DCMLN314]
Descriptor description

Any of the rational or irrational numbers.

IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.DCMLN314
Unit

mm
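The resolution can be related to a rainfall amount as follows. This is a minimal sketch that assumes a tipping-bucket gauge, where each tip registers one resolution step. The report does not state the gauge type or how tips are recorded, so the function names, the `tips` argument and the interval length are hypothetical. The default of 0.2 mm is the most common value reported for this column.

```python
def rainfall_depth_mm(tips: int, resolution_mm: float = 0.2) -> float:
    """Rainfall depth in mm, assuming each tip equals one resolution step."""
    if tips < 0:
        raise ValueError("tips must be non-negative")
    return tips * resolution_mm


def rainfall_intensity_mm_h(depth_mm: float, interval_minutes: float) -> float:
    """Mean rainfall intensity in mm/h over an interval of the given length."""
    return depth_mm * 60.0 / interval_minutes


depth = rainfall_depth_mm(5)                 # five tips at 0.2 mm each
print(rainfall_intensity_mm_h(depth, 10.0))  # hypothetical 10 min interval
```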

Table 111. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Name Length Mean Min Q1 Median Q3 Max Total Missing Zero Blank Distinct
rain_gauge 1 - 3 0.19 0 0.2 0.2 0.2 0.2 184,065 78,252 ( 42.5% ) 5,653 ( 3.1% ) 0 ( 0.0% ) 3 ( 0.0% )
Table 112. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Name Completeness Uniqueness Most Common Value Least Common Value
rain_gauge 57.49% 0.00% 0.2 0

Data Distribution Top 20

Figure 129. Distribution of 20 most common values, from highest to lowest.

Continuous Data Distribution

Figure 130. Distribution of values in the column.

Outliers

Figure 131. Visualization of median, min, max, and outliers in the column.

Completeness

Figure 132. Visualization of completeness of the data in the column.

Uniqueness

Figure 133. Visualization of uniqueness of the data in the column.

valid_ticks

Table 113. Standardised metadata of the column. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Name valid_ticks
Description

No information is provided on this parameter.

Data type Integer number
Descriptor Integer [UID:0.0.NTGER313]
Descriptor description

A number with no fractional part, including the negative and positive numbers as well as zero.

IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.NTGER313
Unit

n/a

Table 114. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Name Length Mean Min Q1 Median Q3 Max Total Missing Zero Blank Distinct
valid_ticks 1 - 1 0.0 0 0 0 0 0 184,065 78,252 ( 42.5% ) 105,813 ( 57.5% ) 0 ( 0.0% ) 2 ( 0.0% )
Table 115. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Name Completeness Uniqueness Most Common Value Least Common Value
valid_ticks 57.49% 0.00% 0 0

Data Distribution Top 20

Figure 134. Distribution of 20 most common values, from highest to lowest.

Continuous Data Distribution

Figure 135. Distribution of values in the column.

Completeness

Figure 136. Visualization of completeness of the data in the column.

Uniqueness

Figure 137. Visualization of uniqueness of the data in the column.

wind_speed_ms

Table 116. Standardised metadata of the column. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Name wind_speed_ms
Description

Not sufficiently specified by the data provider. Wind speed in the relevant interval, either measured for a period of 2 min at some point during the interval (for example, at the very end) or calculated as the arithmetic mean of all measurements, or of a subset of measurements, in the interval. Given in or transformed to m/s.

Data type Decimal number
Descriptor pms:windSpeed [UID:0.0.WNDSP474]
Descriptor description

The rate at which air is moving horizontally past a given point. It may be a 2-minute average speed (reported as wind speed) or an instantaneous speed (reported as a peak wind speed, wind gust, or squall).

IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.WNDSP474
Unit

m s-1

Table 117. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Name Length Mean Min Q1 Median Q3 Max Total Missing Zero Blank Distinct
wind_speed_ms 1 - 5 1.197 0 0 0 2.06 32.4 184,065 122,120 ( 66.3% ) 34,953 ( 19.0% ) 0 ( 0.0% ) 968 ( 0.5% )
Table 118. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Name Completeness Uniqueness Most Common Value Least Common Value
wind_speed_ms 33.65% 0.53% n/a 12.07

Continuous Data Distribution

Figure 138. Distribution of values in the column.

Completeness

Figure 139. Visualization of completeness of the data in the column.

Uniqueness

Figure 140. Visualization of uniqueness of the data in the column.

wind_speed_kmh

Table 119. Standardised metadata of the column. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Name wind_speed_kmh
Description

Not sufficiently specified by the data provider. Wind speed in the relevant interval, either measured for a period of 2 min at some point during the interval (for example, at the very end) or calculated as the arithmetic mean of all measurements, or of a subset of measurements, in the interval. Given in km/h.

Data type Decimal number
Descriptor pms:windSpeed [UID:0.0.WNDSP474]
Descriptor description

The rate at which air is moving horizontally past a given point. It may be a 2-minute average speed (reported as wind speed) or an instantaneous speed (reported as a peak wind speed, wind gust, or squall).

IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.WNDSP474
Unit

km h-1
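Wind speed is reported in two units, m/s (wind_speed_ms) and km/h (wind_speed_kmh). The report does not state whether one column was derived from the other. The sketch below shows only the standard conversion between the two units; the function names are hypothetical.

```python
MS_TO_KMH = 3.6  # 1 m/s = 3600 m/h = 3.6 km/h


def ms_to_kmh(speed_ms: float) -> float:
    """Convert a speed from m/s to km/h."""
    return speed_ms * MS_TO_KMH


def kmh_to_ms(speed_kmh: float) -> float:
    """Convert a speed from km/h to m/s."""
    return speed_kmh / MS_TO_KMH


print(ms_to_kmh(10.0))  # 10 m/s expressed in km/h
print(kmh_to_ms(9.0))   # 9 km/h expressed in m/s
```

Converting both columns to a common unit before comparing or merging them avoids mixing scales.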

Table 120. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Name Length Mean Min Q1 Median Q3 Max Total Missing Zero Blank Distinct
wind_speed_kmh 1 - 1 0.0 0 0 0 0 9 184,065 149,074 ( 81.0% ) 34,208 ( 18.6% ) 0 ( 0.0% ) 7 ( 0.0% )
Table 121. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Name Completeness Uniqueness Most Common Value Least Common Value
wind_speed_kmh 19.01% 0.00% 0 6

Data Distribution Top 20

Figure 141. Distribution of 20 most common values, from highest to lowest.

Continuous Data Distribution

Figure 142. Distribution of values in the column.

Completeness

Figure 143. Visualization of completeness of the data in the column.

Uniqueness

Figure 144. Visualization of uniqueness of the data in the column.

wind_deg

Table 122. Standardised metadata of the column. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Name wind_deg
Description

Not sufficiently specified by the data provider. Wind direction in the relevant interval, either measured at some point during the interval (for example, at the very end) or calculated as the arithmetic mean of all measurements, or of a subset of measurements, in the interval. Given in degrees.

Data type Integer number
Descriptor pms:windDirection [UID:0.0.WNDDR475]
Descriptor description

The true direction from which the wind is blowing at a given location (i.e., wind blowing from the north to the south is a north wind). It is normally measured in tens of degrees from 10 degrees clockwise through 360 degrees. North is 360 degrees. A wind direction of 0 degrees is only used when wind is calm.

IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.WNDDR475
Unit

°
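Wind direction is a circular quantity, so an arithmetic mean can be misleading: the arithmetic mean of 350° and 10° is 180°, although both values point roughly north. The following is a minimal sketch of a circular mean, given as an alternative for users who aggregate this column. It is not the method used by the data provider, which is unknown. It follows the descriptor convention that north is 360° and 0° denotes calm wind; the function name is hypothetical.

```python
import math


def circular_mean_deg(directions):
    """Circular mean of wind directions in degrees, in the range (0, 360].

    Records equal to 0 are treated as calm and ignored (descriptor convention).
    Returns 0.0 if all records are calm. The result is not meaningful when
    the directions cancel each other out (for example, 90 and 270).
    """
    angles = [math.radians(d) for d in directions if d != 0]
    if not angles:
        return 0.0
    sin_sum = sum(math.sin(a) for a in angles)
    cos_sum = sum(math.cos(a) for a in angles)
    mean = math.degrees(math.atan2(sin_sum, cos_sum)) % 360.0
    # Report north as 360 rather than 0, since 0 is reserved for calm wind.
    return 360.0 if mean == 0.0 else mean


print(circular_mean_deg([350, 10]))  # close to north, not 180
```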

Table 123. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Name Length Mean Min Q1 Median Q3 Max Total Missing Zero Blank Distinct
wind_deg 1 - 3 131.1 0 59 115 189 360 184,065 122,120 ( 66.3% ) 2,904 ( 1.6% ) 0 ( 0.0% ) 362 ( 0.2% )
Table 124. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Name Completeness Uniqueness Most Common Value Least Common Value
wind_deg 33.65% 0.20% 0 339

Data Distribution Top 20

Figure 145. Distribution of 20 most common values, from highest to lowest.

Data Distribution Bottom 20

Figure 146. Distribution of 20 least common values, from lowest to highest.

Outliers

Figure 147. Visualization of median, min, max, and outliers in the column.

Completeness

Figure 148. Visualization of completeness of the data in the column.

Uniqueness

Figure 149. Visualization of uniqueness of the data in the column.

wind_gust_ms

Table 125. Standardised metadata of the column. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Name wind_gust_ms
Description

Wind gust measured in the relevant interval. Given in or transformed to m/s.

Data type Decimal number
Descriptor pms:windGust [UID:0.0.WNDGS476]
Descriptor description

Rapid fluctuations in the wind speed with a variation of 10 knots or more between peaks and lulls. The speed of the gust will be the maximum instantaneous wind speed.

IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.WNDGS476
Unit

m s-1
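The descriptor definition can be illustrated with a short sketch. It assumes a series of instantaneous wind speed samples in m/s within one interval and approximates peaks and lulls by the maximum and minimum of the series. The report does not describe how the data provider derived the gust values, so this is a simplification for illustration only; the function name and the sample values are hypothetical.

```python
KNOT_IN_MS = 1852.0 / 3600.0  # 1 knot = 1852 m per hour


def gust_speed_ms(samples_ms, threshold_knots=10.0):
    """Return the gust speed in m/s, or None if no gust is detected.

    A gust is assumed when the difference between the highest and the lowest
    sample reaches the threshold; the gust speed is the highest sample.
    """
    if not samples_ms:
        return None
    peak, lull = max(samples_ms), min(samples_ms)
    if peak - lull >= threshold_knots * KNOT_IN_MS:
        return peak
    return None


print(gust_speed_ms([2.0, 3.0, 8.5]))  # variation exceeds 10 knots
print(gust_speed_ms([4.0, 5.0, 6.0]))  # variation below 10 knots
```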

Table 126. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Name Length Mean Min Q1 Median Q3 Max Total Missing Zero Blank Distinct
wind_gust_ms 1 - 5 5.880 0 0 0.89 10.8 90 184,065 133,547 ( 72.6% ) 24,955 ( 13.6% ) 0 ( 0.0% ) 158 ( 0.1% )
Table 127. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Name Completeness Uniqueness Most Common Value Least Common Value
wind_gust_ms 27.45% 0.09% n/a 20.1

Data Distribution Top 20

Figure 150. Distribution of 20 most common values, from highest to lowest.

Data Distribution Bottom 20

Figure 151. Distribution of 20 least common values, from lowest to highest.

Outliers

Figure 152. Visualization of median, min, max, and outliers in the column.

Completeness

Figure 153. Visualization of completeness of the data in the column.

Uniqueness

Figure 154. Visualization of uniqueness of the data in the column.

wind_gust_kmh

Table 128. Standardised metadata of the column. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Name wind_gust_kmh
Description

Wind gust measured in the relevant interval. Given in km/h.

Data type Decimal number
Descriptor pms:windGust [UID:0.0.WNDGS476]
Descriptor description

Rapid fluctuations in the wind speed with a variation of 10 knots or more between peaks and lulls. The speed of the gust will be the maximum instantaneous wind speed.

IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.WNDGS476
Unit

km h-1

Table 129. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Name Length Mean Min Q1 Median Q3 Max Total Missing Zero Blank Distinct
wind_gust_kmh 1 - 2 2.2 0 0 1 4 25 184,065 149,074 ( 81.0% ) 15,788 ( 8.6% ) 0 ( 0.0% ) 18 ( 0.0% )
Table 130. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Name Completeness Uniqueness Most Common Value Least Common Value
wind_gust_kmh 19.01% 0.01% 0 25

Data Distribution Top 20

Figure 155. Distribution of 20 most common values, from highest to lowest.

Continuous Data Distribution

Figure 156. Distribution of values in the column.

Outliers

Figure 157. Visualization of median, min, max, and outliers in the column.

Completeness

Figure 158. Visualization of completeness of the data in the column.

Uniqueness

Figure 159. Visualization of uniqueness of the data in the column.

irradiance

Table 131. Standardised metadata of the column. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Name irradiance
Description

Not sufficiently specified by the data provider. Solar irradiance in the relevant interval, either measured at some point during the interval (for example, at the very end) or calculated as the arithmetic mean of all measurements, or of a subset of measurements, in the interval. No information is provided on the range of wavelengths measured.

Data type Decimal number
Descriptor pms:solarIrradiance [UID:0.0.SLRRR477]
Descriptor description

Solar irradiance is the power per unit area (surface power density) received from the Sun in the form of electromagnetic radiation in the wavelength range of the measuring instrument. Solar irradiance is measured in watts per square metre (W/m2) in SI units.

IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.SLRRR477
Unit

W m-2

Table 132. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Name Length Mean Min Q1 Median Q3 Max Total Missing Zero Blank Distinct
irradiance 1 - 4 101.3 0 0 2 98 1,152 184,065 35,714 ( 19.4% ) 46,186 ( 25.1% ) 0 ( 0.0% ) 1,077 ( 0.6% )
Table 133. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Name Completeness Uniqueness Most Common Value Least Common Value
irradiance 80.60% 0.59% 0 1088

Continuous Data Distribution

Figure 160. Distribution of values in the column.

Outliers

Figure 161. Visualization of median, min, max, and outliers in the column.

Completeness

Figure 162. Visualization of completeness of the data in the column.

Uniqueness

Figure 163. Visualization of uniqueness of the data in the column.

energy_density

Table 134. Standardised metadata of the column. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Name energy_density
Description

Not sufficiently specified by the data provider. Rate of solar radiation in the relevant interval, either measured at some point during the interval (for example, at the very end) or calculated as the arithmetic mean of all measurements, or of a subset of measurements, in the interval. No information is provided on the range of wavelengths measured.

Data type Decimal number
Descriptor pms:rateOfSolarRadiation [UID:0.0.RTFSL478]
Descriptor description

The langley (Ly) is a unit of heat transmission, especially used to express the rate of solar radiation (or insolation) received by the earth.

IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.RTFSL478
Unit

J cm-2
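The units of irradiance (W m-2) and energy density (J cm-2) are related through the duration over which the irradiance acts. The sketch below assumes a constant mean irradiance over an interval of known length. The report does not document the interval length or whether energy_density was derived from irradiance, so the interval and the function names are hypothetical. The conversion to langley uses 1 Ly = 1 cal cm-2 = 4.184 J cm-2.

```python
J_PER_CM2_PER_LANGLEY = 4.184  # 1 Ly = 1 thermochemical calorie per cm2
CM2_PER_M2 = 10_000.0


def irradiance_to_energy_density(irradiance_w_m2: float, interval_s: float) -> float:
    """Energy density in J/cm2 from a mean irradiance (W/m2) over interval_s seconds."""
    return irradiance_w_m2 * interval_s / CM2_PER_M2


def j_cm2_to_langley(energy_j_cm2: float) -> float:
    """Convert an energy density from J/cm2 to langley."""
    return energy_j_cm2 / J_PER_CM2_PER_LANGLEY


energy = irradiance_to_energy_density(100.0, 600.0)  # hypothetical 10 min interval
print(energy, j_cm2_to_langley(energy))
```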

Table 135. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Name Length Mean Min Q1 Median Q3 Max Total Missing Zero Blank Distinct
energy_density 1 - 3 65.8 0 0 3 113 371 184,065 175,305 ( 95.2% ) 4,110 ( 2.2% ) 0 ( 0.0% ) 361 ( 0.2% )
Table 136. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Name Completeness Uniqueness Most Common Value Least Common Value
energy_density 4.76% 0.20% 0 348

Data Distribution Top 20

Figure 164. Distribution of 20 most common values, from highest to lowest.

Data Distribution Bottom 20

Figure 165. Distribution of 20 least common values, from lowest to highest.

Outliers

Figure 166. Visualization of median, min, max, and outliers in the column.

Completeness

Figure 167. Visualization of completeness of the data in the column.

Uniqueness

Figure 168. Visualization of uniqueness of the data in the column.

irradiance_max

Table 137. Standardised metadata of the column. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Name irradiance_max
Description

Maximum solar irradiance measured in the relevant interval.

Data type Decimal number
Descriptor pms:solarIrradiance [UID:0.0.SLRRR477]
Descriptor description

Solar irradiance is the power per unit area (surface power density) received from the Sun in the form of electromagnetic radiation in the wavelength range of the measuring instrument. Solar irradiance is measured in watts per square metre (W/m2) in SI units.

IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.SLRRR477
Unit

W m-2

Table 138. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Name Length Mean Min Q1 Median Q3 Max Total Missing Zero Blank Distinct
irradiance_max 1 - 4 112.9 0 0 2 106 1,394 184,065 70,705 ( 38.4% ) 29,147 ( 15.8% ) 0 ( 0.0% ) 641 ( 0.3% )
Table 139. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Name Completeness Uniqueness Most Common Value Least Common Value
irradiance_max 61.59% 0.35% 0 1198

Continuous Data Distribution

Figure 169. Distribution of values in the column.

Outliers

Figure 170. Visualization of median, min, max, and outliers in the column.

Completeness

Figure 171. Visualization of completeness of the data in the column.

Uniqueness

Figure 172. Visualization of uniqueness of the data in the column.

clouds

Table 140. Standardised metadata of the column. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Name clouds
Description

No information is provided on this parameter. It is presumably the part of the sky that is covered with clouds.

Data type Integer number
Descriptor Integer [UID:0.0.NTGER313]
Descriptor description

A number with no fractional part, including the negative and positive numbers as well as zero.

IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.NTGER313
Unit

%

Table 141. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Name Length Mean Min Q1 Median Q3 Max Total Missing Zero Blank Distinct
clouds 1 - 3 52.6 0 20 75 90 100 184,065 157,111 ( 85.4% ) 5,471 ( 3.0% ) 0 ( 0.0% ) 102 ( 0.1% )
Table 142. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Name Completeness Uniqueness Most Common Value Least Common Value
clouds 14.64% 0.06% 75 70

Data Distribution Top 20

Figure 173. Distribution of 20 most common values, from highest to lowest.

Data Distribution Bottom 20

Figure 174. Distribution of 20 least common values, from lowest to highest.

Outliers

Figure 175. Visualization of median, min, max, and outliers in the column.

Completeness

Figure 176. Visualization of completeness of the data in the column.

Uniqueness

Figure 177. Visualization of uniqueness of the data in the column.

visibility

Table 143. Standardised metadata of the column. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Name visibility
Description

No information is provided on this parameter. It is presumably the visibility in the atmosphere for the human eye, given in m.

Data type Integer number
Descriptor Integer [UID:0.0.NTGER313]
Descriptor description

A number with no fractional part, including the negative and positive numbers as well as zero.

IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.NTGER313
Unit

m

Table 144. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Name Length Mean Min Q1 Median Q3 Max Total Missing Zero Blank Distinct
visibility 1 - 5 8,692.5 0 9,999 10,000 10,000 10,000 184,065 162,812 ( 88.5% ) 565 ( 0.3% ) 0 ( 0.0% ) 66 ( 0.0% )
Table 145. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Name Completeness Uniqueness Most Common Value Least Common Value
visibility 11.55% 0.04% 10000 450

Data Distribution Top 20

Figure 178. Distribution of 20 most common values, from highest to lowest.

Data Distribution Bottom 20

Figure 179. Distribution of 20 least common values, from lowest to highest.

Outliers

Figure 180. Visualization of median, min, max, and outliers in the column.

Completeness

Figure 181. Visualization of completeness of the data in the column.

Uniqueness

Figure 182. Visualization of uniqueness of the data in the column.

carbon_dioxide

Table 146. Standardised metadata of the column. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Name carbon_dioxide
Description

No information is provided on this parameter.

Data type Integer number
Descriptor Integer [UID:0.0.NTGER313]
Descriptor description

A number with no fractional part, including the negative and positive numbers as well as zero.

IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.NTGER313
Unit

n/a

Table 147. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Name Length Mean Min Q1 Median Q3 Max Total Missing Zero Blank Distinct
carbon_dioxide 0 - 0 n/a n/a n/a n/a 184,065 184,065 ( 100.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 1 ( 0.0% )
Table 148. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Name Completeness Uniqueness Most Common Value Least Common Value
carbon_dioxide 0.00% 0.00% n/a n/a

Data Distribution Top 20

Figure 183. Distribution of 20 most common values, from highest to lowest.

Completeness

Figure 184. Visualization of completeness of the data in the column.

Uniqueness

Figure 185. Visualization of uniqueness of the data in the column.

weather_id

Table 149. Standardised metadata of the column. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Name weather_id
Description

Internal weather condition code adopted by the provider of the weather data.

Data type Integer number
Descriptor Integer [UID:0.0.NTGER313]
Descriptor description

A number with no fractional part, including the negative and positive numbers as well as zero.

IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.NTGER313
Unit

n/a

Table 150. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Name Length Mean Min Q1 Median Q3 Max Total Missing Zero Blank Distinct
weather_id 3 - 3 730.1 200 701 800 803 804 184,065 157,111 ( 85.4% ) 0 ( 0.0% ) 0 ( 0.0% ) 31 ( 0.0% )
Table 151. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Name Completeness Uniqueness Most Common Value Least Common Value
weather_id 14.64% 0.02% 800 721

Data Distribution Top 20

Figure 186. Distribution of 20 most common values, from highest to lowest.

Data Distribution Bottom 20

Figure 187. Distribution of 20 least common values, from lowest to highest.

Outliers

Figure 188. Visualization of median, min, max, and outliers in the column.

Completeness

Figure 189. Visualization of completeness of the data in the column.

Uniqueness

Figure 190. Visualization of uniqueness of the data in the column.

weather_main

Table 152. Standardised metadata of the column. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Name weather_main
Description

Internal main group for the description of the weather adopted by the provider of the weather data.

Data type String
Descriptor Text [UID:0.0.TEXTA315]
Descriptor description

In computer programming, a string is traditionally a sequence of characters, either as a literal constant or as some kind of variable. The latter may allow its elements to be mutated and the length changed, or it may be fixed (after creation). A string is generally considered as a data type and is often implemented as an array data structure of bytes (or words) that stores a sequence of elements, typically characters, using some character encoding. String may also denote more general arrays or other sequence (or list) data types and structures.

IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.TEXTA315
Unit

n/a

Table 153. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Name Length Mean Min Q1 Median Q3 Max Total Missing Zero Blank Distinct
weather_main 0 - 12 n/a Clear n/a n/a n/a Thunderstorm 184,065 157,111 ( 85.4% ) 0 ( 0.0% ) 0 ( 0.0% ) 11 ( 0.0% )
Table 154. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Name Completeness Uniqueness Most Common Value Least Common Value
weather_main
14.64%
0.01%
n/a Haze

Data Distribution Top 20

Figure 191. Distribution of 20 most common values, from highest to lowest.

Completeness

Figure 192. Visualization of completeness of the data in the column.

Uniqueness

Figure 193. Visualization of uniqueness of the data in the column.

weather_description

Table 155. Standardised metadata of the column. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Name weather_description
Description

Internal subgroup for the description of the weather adopted by the provider of the weather data.

Data type String
Descriptor Text [UID:0.0.TEXTA315]
Descriptor description

In computer programming, a string is traditionally a sequence of characters, either as a literal constant or as some kind of variable. The latter may allow its elements to be mutated and the length changed, or it may be fixed (after creation). A string is generally considered as a data type and is often implemented as an array data structure of bytes (or words) that stores a sequence of elements, typically characters, using some character encoding. String may also denote more general arrays or other sequence (or list) data types and structures.

IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.TEXTA315
Unit

n/a

Table 156. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Name Length Mean Min Q1 Median Q3 Max Total Missing Zero Blank Distinct
weather_description 0 - 28 n/a broken cloud… n/a n/a n/a very heavy r… 184,065 157,111 ( 85.4% ) 0 ( 0.0% ) 0 ( 0.0% ) 32 ( 0.0% )
Table 157. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Name Completeness Uniqueness Most Common Value Least Common Value
weather_description
14.64%
0.02%
n/a haze

Data Distribution Top 20

Figure 194. Distribution of 20 most common values, from highest to lowest.

Data Distribution Bottom 20

Figure 195. Distribution of 20 least common values, from lowest to highest.

Completeness

Figure 196. Visualization of completeness of the data in the column.

Uniqueness

Figure 197. Visualization of uniqueness of the data in the column.

weather_icon

Table 158. Standardised metadata of the column. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Name weather_icon
Description

Internal code for icons describing the weather conditions, adopted by the provider of the weather data.

Data type String
Descriptor Text [UID:0.0.TEXTA315]
Descriptor description

In computer programming, a string is traditionally a sequence of characters, either as a literal constant or as some kind of variable. The latter may allow its elements to be mutated and the length changed, or it may be fixed (after creation). A string is generally considered as a data type and is often implemented as an array data structure of bytes (or words) that stores a sequence of elements, typically characters, using some character encoding. String may also denote more general arrays or other sequence (or list) data types and structures.

IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.TEXTA315
Unit

n/a

Table 159. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Name Length Mean Min Q1 Median Q3 Max Total Missing Zero Blank Distinct
weather_icon 0 - 3 n/a 01d n/a n/a n/a 50n 184,065 157,111 ( 85.4% ) 0 ( 0.0% ) 0 ( 0.0% ) 19 ( 0.0% )
Table 160. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Name Completeness Uniqueness Most Common Value Least Common Value
weather_icon
14.64%
0.01%
n/a 11n

Data Distribution Top 20

Figure 198. Distribution of 20 most common values, from highest to lowest.

Completeness

Figure 199. Visualization of completeness of the data in the column.

Uniqueness

Figure 200. Visualization of uniqueness of the data in the column.

battery

Table 161. Standardised metadata of the column. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Name battery
Description

Voltage of the batteries of the device/system.

Data type Decimal number
Descriptor DecimalNumber [UID:0.0.DCMLN314]
Descriptor description

Any of the rational or irrational numbers.

IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.DCMLN314
Unit

V

Table 162. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Name Length Mean Min Q1 Median Q3 Max Total Missing Zero Blank Distinct
battery 1 - 4 4.121 3.25 4.15 4.15 4.15 4.25 184,065 70,705 ( 38.4% ) 0 ( 0.0% ) 0 ( 0.0% ) 22 ( 0.0% )
Table 163. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Name Completeness Uniqueness Most Common Value Least Common Value
battery
61.59%
0.01%
4.15 3.35

Data Distribution Top 20

Figure 201. Distribution of 20 most common values, from highest to lowest.

Data Distribution Bottom 20

Figure 202. Distribution of 20 least common values, from lowest to highest.

Outliers

Figure 203. Visualization of median, min, max, and outliers in the column.

Completeness

Figure 204. Visualization of completeness of the data in the column.

Uniqueness

Figure 205. Visualization of uniqueness of the data in the column.

payload

Table 164. Standardised metadata of the column. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Name payload
Description

No information is provided on this parameter.

Data type String
Descriptor Text [UID:0.0.TEXTA315]
Descriptor description

In computer programming, a string is traditionally a sequence of characters, either as a literal constant or as some kind of variable. The latter may allow its elements to be mutated and the length changed, or it may be fixed (after creation). A string is generally considered as a data type and is often implemented as an array data structure of bytes (or words) that stores a sequence of elements, typically characters, using some character encoding. String may also denote more general arrays or other sequence (or list) data types and structures.

IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.TEXTA315
Unit

n/a

Table 165. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Name Length Mean Min Q1 Median Q3 Max Total Missing Zero Blank Distinct
payload 0 - 23 n/a x4aea4003933… n/a n/a n/a x731e0005ff2… 184,065 78,252 ( 42.5% ) 0 ( 0.0% ) 0 ( 0.0% ) 100,133 ( 54.4% )
Table 166. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Name Completeness Uniqueness Most Common Value Least Common Value
payload
57.49%
54.40%
n/a x7302010fb928f0020001ff
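
The payload values look like hexadecimal strings with a leading x. The data provider gives no description, so this reading is an assumption. Under that assumption, the least common value from Table 166 can be converted to raw bytes as sketched below. The helper name is hypothetical and no interpretation of the individual bytes is implied.

```python
def payload_to_bytes(payload: str) -> bytes:
    """Strip the assumed leading 'x' marker and decode the remaining hex digits."""
    if not payload.startswith("x"):
        raise ValueError("expected a payload starting with 'x'")
    return bytes.fromhex(payload[1:])


raw = payload_to_bytes("x7302010fb928f0020001ff")
print(len(raw))      # 11
print(raw.hex(" "))  # 73 02 01 0f b9 28 f0 02 00 01 ff
```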

Completeness

Figure 206. Visualization of completeness of the data in the column.

Uniqueness

Figure 207. Visualization of uniqueness of the data in the column.

time_sync_error_s

Table 167. Standardised metadata of the column. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Name time_sync_error_s
Description

No information is provided on this parameter.

Data type Integer number
Descriptor Integer [UID:0.0.NTGER313]
Descriptor description

A number with no fractional part, including the negative and positive numbers as well as zero.

IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.NTGER313
Unit

s

Table 168. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Name Length Mean Min Q1 Median Q3 Max Total Missing Zero Blank Distinct
time_sync_error_s 1 - 4 17.3 -210 12 14 15 235 184,065 78,252 ( 42.5% ) 41 ( 0.0% ) 0 ( 0.0% ) 231 ( 0.1% )
Table 169. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Name Completeness Uniqueness Most Common Value Least Common Value
time_sync_error_s
57.49%
0.13%
14 -83
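
The report does not state how outliers are identified. One common convention, assumed here purely for illustration, is Tukey's rule, which flags values more than 1.5 times the interquartile range below Q1 or above Q3. Applied to the quartiles in Table 168, both the reported minimum and maximum would fall outside the fences:

```python
def tukey_fences(q1: float, q3: float, k: float = 1.5) -> tuple[float, float]:
    """Return (lower, upper) fences; values outside them are flagged as outliers."""
    iqr = q3 - q1
    return q1 - k * iqr, q3 + k * iqr


# Quartiles, minimum and maximum of time_sync_error_s from Table 168.
lower, upper = tukey_fences(q1=12, q3=15)
print(lower, upper)               # 7.5 19.5
print(-210 < lower, 235 > upper)  # True True
```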

Data Distribution Top 20

Figure 208. Distribution of 20 most common values, from highest to lowest.

Data Distribution Bottom 20

Figure 209. Distribution of 20 least common values, from lowest to highest.

Outliers

Figure 210. Visualization of median, min, max, and outliers in the column.

Completeness

Figure 211. Visualization of completeness of the data in the column.

Uniqueness

Figure 212. Visualization of uniqueness of the data in the column.

seq_number_modem

Table 170. Standardised metadata of the column. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Name seq_number_modem
Description

No information is provided on this parameter.

Data type Integer number
Descriptor Integer [UID:0.0.NTGER313]
Descriptor description

A number with no fractional part, including the negative and positive numbers as well as zero.

IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.NTGER313
Unit

n/a

Table 171. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Name Length Mean Min Q1 Median Q3 Max Total Missing Zero Blank Distinct
seq_number_modem 1 - 5 2,116.5 0 1,089 2,091 3,107 10,822 184,065 78,252 ( 42.5% ) 26 ( 0.0% ) 0 ( 0.0% ) 5,263 ( 2.9% )
Table 172. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Name Completeness Uniqueness Most Common Value Least Common Value
seq_number_modem
57.49%
2.86%
1275 4096

Continuous Data Distribution

Figure 213. Distribution of values in the column.

Outliers

Figure 214. Visualization of median, min, max, and outliers in the column.

Completeness

Figure 215. Visualization of completeness of the data in the column.

Uniqueness

Figure 216. Visualization of uniqueness of the data in the column.

seq_number_firmware

Table 173. Standardised metadata of the column. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Name seq_number_firmware
Description

No information is provided on this parameter.

Data type Integer number
Descriptor Integer [UID:0.0.NTGER313]
Descriptor description

A number with no fractional part, including the negative and positive numbers as well as zero.

IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.NTGER313
Unit

n/a

Table 174. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Name Length Mean Min Q1 Median Q3 Max Total Missing Zero Blank Distinct
seq_number_firmware 0 - 0 n/a n/a n/a n/a 184,065 184,065 ( 100.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 1 ( 0.0% )
Table 175. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Name Completeness Uniqueness Most Common Value Least Common Value
seq_number_firmware
0.00%
0.00%
n/a n/a

Data Distribution Top 20

Figure 217. Distribution of 20 most common values, from highest to lowest.

Completeness

Figure 218. Visualization of completeness of the data in the column.

Uniqueness

Figure 219. Visualization of uniqueness of the data in the column.

temperature_wetbulb_stull2011_C

Table 176. Standardised metadata of the column. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Name temperature_wetbulb_stull2011_C
Description

No information is provided on this parameter.

Data type Decimal number
Descriptor DecimalNumber [UID:0.0.DCMLN314]
Descriptor description

Any of the rational or irrational numbers.

IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.DCMLN314
Unit

°C

Table 177. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Name Length Mean Min Q1 Median Q3 Max Total Missing Zero Blank Distinct
temperature_wetbulb_stull2011_C 1 - 6 0.005 -12.61 0 0 0 10.92 184,065 125,430 ( 68.1% ) 51,097 ( 27.8% ) 0 ( 0.0% ) 1,718 ( 0.9% )
Table 178. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Name Completeness Uniqueness Most Common Value Least Common Value
temperature_wetbulb_stull2011_C
31.86%
0.93%
n/a -5.12
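
Although no description is provided, the column name suggests that the values may have been derived with the empirical wet-bulb formula published by Stull (2011), which estimates wet-bulb temperature from air temperature and relative humidity at standard sea-level pressure. This is an assumption based on the name only. A minimal sketch of that formula, with hypothetical function and parameter names, is:

```python
import math


def wet_bulb_stull(temp_c: float, rh_pct: float) -> float:
    """Wet-bulb temperature (°C) from air temperature (°C) and relative humidity (%).

    Empirical fit from Stull (2011); assumed, not confirmed, to be the method
    behind the column temperature_wetbulb_stull2011_C.
    """
    return (
        temp_c * math.atan(0.151977 * math.sqrt(rh_pct + 8.313659))
        + math.atan(temp_c + rh_pct)
        - math.atan(rh_pct - 1.676331)
        + 0.00391838 * rh_pct ** 1.5 * math.atan(0.023101 * rh_pct)
        - 4.686035
    )


print(round(wet_bulb_stull(20.0, 50.0), 1))  # 13.7
```

Recomputing the column from temperature and RH with such a function could help the data provider confirm whether this is indeed the algorithm used.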

Continuous Data Distribution

Figure 220. Distribution of values in the column.

Completeness

Figure 221. Visualization of completeness of the data in the column.

Uniqueness

Figure 222. Visualization of uniqueness of the data in the column.

Changes made to preparatory file

  1. Atmospheric pressure in the raw data files was provided either in Pa or in hPa. Values provided in hPa were converted to Pa so that all values are stored in a common unit, while the raw values were retained.
  2. Wind speed in the raw data files was provided either in m/s or in km/h. Values provided in km/h were converted to m/s so that all values are stored in a common unit, while the raw values were retained.
  3. Wind gust in the raw data files was provided either in m/s or in km/h. Values provided in km/h were converted to m/s so that all values are stored in a common unit, while the raw values were retained.
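
The unit conversions listed above rest on two fixed factors (1 hPa = 100 Pa; 1 km/h = 1000 m / 3600 s). A minimal sketch, with illustrative function names that are not taken from the preparatory file:

```python
def hpa_to_pa(value_hpa: float) -> float:
    """Convert hectopascal to pascal (1 hPa = 100 Pa)."""
    return value_hpa * 100.0


def kmh_to_ms(value_kmh: float) -> float:
    """Convert kilometres per hour to metres per second (divide by 3.6)."""
    return value_kmh / 3.6


print(hpa_to_pa(1013.25))  # 101325.0
print(kmh_to_ms(36.0))     # 10.0
```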

Changes made to data

No changes were made in the data file.

Unresolved issues

  1. In columns temperature, temperature_min, temperature_max, feelslike and dew_point, 9,095 records from the device located in Romania contain values > 100°C. These records must be revised by the data provider.
  2. The description of the data (metadata) is largely incomplete and does not allow a clear standardisation of the data.
    • For column temperature it is unclear how the temperature is reported in the individual raw data files: either measured at a single point in time during the reporting interval (for example, at the very end), or calculated as the arithmetic mean of all measurements or of a subset of measurements in this interval.
    • For column feelslike it is unclear which algorithm was used to calculate the values reported in the raw data files.
    • For column dew_point it is unclear which algorithm was used to calculate the values reported in the raw data files.
    • For column RH it is unclear how the relative humidity is reported in the individual raw data files: either measured at a single point in time during the reporting interval (for example, at the very end), or calculated as the arithmetic mean of all measurements or of a subset of measurements in this interval.
    • For columns atmpressure_pa and atmpressureh_pa it is unclear how the atmospheric pressure is reported in the individual raw data files: either measured at a single point in time during the reporting interval (for example, at the very end), or calculated as the arithmetic mean of all measurements or of a subset of measurements in this interval.
    • For data reported in columns atmpressure_sealevel_Pa and atmpressure_sealevel_hPa it is unclear which algorithm was used to calculate the values reported in the raw data files.
    • For data reported in column atmpressure_grndlevel_hPa it is unclear what is reported in the raw data file.
    • For data reported in column rain_counter it is unclear what is reported in the raw data file.
    • For data reported in column rain_max it is unclear what is reported in the raw data file.
    • For data reported in column valid_ticks it is unclear what is reported in the raw data file.
    • For columns wind_speed_ms and wind_speed_kmh it is unclear how the wind speed is reported in the individual raw data files: either measured at a single point in time during the reporting interval (for example, at the very end), or calculated as the arithmetic mean of all measurements or of a subset of measurements in this interval.
    • For columns wind_gust_ms and wind_gust_kmh it is unclear how the wind gust is reported in the individual raw data files: either measured at a single point in time during the reporting interval (for example, at the very end), or calculated as the arithmetic mean of all measurements or of a subset of measurements in this interval.
    • For column wind_deg it is unclear how the wind direction is reported in the individual raw data files: either measured at a single point in time during the reporting interval (for example, at the very end), or calculated as the arithmetic mean of all measurements or of a subset of measurements in this interval.
    • For column irradiance it is unclear how the solar irradiance is reported in the individual raw data files: either measured at a single point in time during the reporting interval (for example, at the very end), or calculated as the arithmetic mean of all measurements or of a subset of measurements in this interval.
    • For column energy density it is unclear how the rate of solar radiation is reported in the individual raw data files: either measured at a single point in time during the reporting interval (for example, at the very end), or calculated as the arithmetic mean of all measurements or of a subset of measurements in this interval.
    • For data reported in column clouds it is unclear what is reported in the raw data file.
    • For data reported in column visibility it is unclear what is reported in the raw data file.
    • For data reported in column carbon_dioxide it is unclear what is reported in the raw data file. Since this column does not contain data in either of the two tables, it could also be deleted.
    • For data reported in column payload it is unclear what is reported in the raw data file.
    • For data reported in column time_sync_error_s it is unclear what is reported in the raw data file.
    • For data reported in column seq_number_modem it is unclear what is reported in the raw data file.
    • For data reported in column seq_number_firmware it is unclear what is reported in the raw data file. Since this column does not contain data in either of the two tables, it could also be deleted.
    • For data reported in column temperature_wetbulb_stull2011_C it is unclear what is reported in the raw data file.
  3. Data in raw data files acquired in 2021 contain:
    • 8 quadruplicate records for the same date and time in raw data files from the device in Romania. These records must be revised by the data provider.
    • 76 triplicate records for the same date and time (3 x 23 records in raw data files from the device in Switzerland, 3 x 53 records in raw data files from the device in Romania). These records must be revised by the data provider.
    • 1763 duplicate records for the same date and time (2 x 293 records in raw data files from the device in Switzerland, 2 x 1470 records in a raw data file from the device in Romania). These records must be revised by the data provider.
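
Two of the unresolved issues listed above, implausible temperatures and repeated records for the same date and time, can be screened for mechanically. The sketch below is illustrative only: the sample records are hypothetical, the tuple layout (country, timestamp, temperature) is an assumption, and the 100°C limit simply mirrors the threshold quoted in issue 1.

```python
from collections import Counter

# Hypothetical records: (country, timestamp, temperature in °C).
records = [
    ("RO", "2021-06-01 12:00", 21.4),
    ("RO", "2021-06-01 12:00", 21.4),
    ("RO", "2021-06-01 12:10", 655.3),
    ("CH", "2021-06-01 12:00", 18.9),
    ("CH", "2021-06-01 12:00", 18.9),
    ("CH", "2021-06-01 12:00", 19.0),
]


def multiplicity_summary(rows):
    """Count how many (country, timestamp) keys occur 2, 3, 4, ... times."""
    per_key = Counter((country, ts) for country, ts, _ in rows)
    return Counter(n for n in per_key.values() if n > 1)


def implausible_temperatures(rows, limit_c=100.0):
    """Return rows whose temperature exceeds the plausibility limit."""
    return [row for row in rows if row[2] > limit_c]


print(dict(multiplicity_summary(records)))  # {2: 1, 3: 1}
print(implausible_temperatures(records))    # [('RO', '2021-06-01 12:10', 655.3)]
```

Grouping by multiplicity yields counts in the same form as the duplicate, triplicate and quadruplicate figures reported above.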

data 2020 all countries

raw_data

Table 179. Standardised metadata of the column. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Name raw_data
Description

Name of the raw data file, obtained from the data provider.

Data type String
Descriptor Text [UID:0.0.TEXTA315]
Descriptor description

In computer programming, a string is traditionally a sequence of characters, either as a literal constant or as some kind of variable. The latter may allow its elements to be mutated and the length changed, or it may be fixed (after creation). A string is generally considered as a data type and is often implemented as an array data structure of bytes (or words) that stores a sequence of elements, typically characters, using some character encoding. String may also denote more general arrays or other sequence (or list) data types and structures.

IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.TEXTA315
Unit

n/a

Table 180. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Name Length Mean Min Q1 Median Q3 Max Total Missing Zero Blank Distinct
raw_data 24 - 41 n/a BE_weather d… n/a n/a n/a RO_weather d… 145,838 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 9 ( 0.0% )
Table 181. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Name Completeness Uniqueness Most Common Value Least Common Value
raw_data
100.00%
0.01%
PT_weather data 2020.csv FR_weather data 2020.xlsx

Data Distribution Top 20

Figure 223. Distribution of 20 most common values, from highest to lowest.

Completeness

Figure 224. Visualization of completeness of the data in the column.

Uniqueness

Figure 225. Visualization of uniqueness of the data in the column.

country

Table 182. Standardised metadata of the column. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Name country
Description

NUTS 2021 level 0 code of the location in which the weather station was based.

Data type String
Descriptor eurostat:nuts2021Code [UID:0.0.NTSCD55]
Descriptor description

A NUTS code defined in the NUTS classification 2021, valid from 2021-01-01 to 2023-12-31, containing 92 regions at NUTS level 1, 244 regions at NUTS level 2 and 1165 regions at NUTS level 3.

IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.NTSCD55
Unit

n/a

Table 183. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Name Length Mean Min Q1 Median Q3 Max Total Missing Zero Blank Distinct
country 2 - 2 n/a BE n/a n/a n/a UK 145,838 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 8 ( 0.0% )
Table 184. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Name Completeness Uniqueness Most Common Value Least Common Value
country
100.00%
0.01%
PT FR

Data Distribution Top 20

Figure 226. Distribution of 20 most common values, from highest to lowest.

Completeness

Figure 227. Visualization of completeness of the data in the column.

Uniqueness

Figure 228. Visualization of uniqueness of the data in the column.

device_type

Table 185. Standardised metadata of the column. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Name device_type
Description

Type/model/name of the device/system used to acquire the data.

Data type String
Descriptor Text [UID:0.0.TEXTA315]
Descriptor description

In computer programming, a string is traditionally a sequence of characters, either as a literal constant or as some kind of variable. The latter may allow its elements to be mutated and the length changed, or it may be fixed (after creation). A string is generally considered as a data type and is often implemented as an array data structure of bytes (or words) that stores a sequence of elements, typically characters, using some character encoding. String may also denote more general arrays or other sequence (or list) data types and structures.

IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.TEXTA315
Unit

n/a

Table 186. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Name Length Mean Min Q1 Median Q3 Max Total Missing Zero Blank Distinct
device_type 8 - 18 n/a Climatik n/a n/a n/a Weatherhelix 145,838 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 4 ( 0.0% )
Table 187. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Name Completeness Uniqueness Most Common Value Least Common Value
device_type
100.00%
0.00%
Weatherhelix Climatik

Data Distribution Top 20

Figure 229. Distribution of 20 most common values, from highest to lowest.

Completeness

Figure 230. Visualization of completeness of the data in the column.

Uniqueness

Figure 231. Visualization of uniqueness of the data in the column.

DeviceID

Table 188. Standardised metadata of the column. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Name DeviceID
Description

Identifier of the device/system used to acquire the data.

Data type String
Descriptor pms:applianceID [UID:0.0.PPLNC488]
Descriptor description

Unique sequence of characters associated with an appliance within a dataset.

IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.PPLNC488
Unit

n/a

Table 189. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Name Length Mean Min Q1 Median Q3 Max Total Missing Zero Blank Distinct
DeviceID 3 - 3 172.3 164 164 171 182 182 145,838 77,990 ( 53.5% ) 0 ( 0.0% ) 0 ( 0.0% ) 4 ( 0.0% )
Table 190. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Name Completeness Uniqueness Most Common Value Least Common Value
DeviceID 46.52% 0.00% 164 171
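
The completeness and uniqueness percentages in the quality tables appear to follow directly from the counts in the content tables. The report does not state the formulas, so the following Python sketch is an assumption that reproduces the figures reported for DeviceID (Tables 189 and 190); the function names are illustrative only.

```python
def completeness(total: int, missing: int) -> float:
    """Share of non-missing records, in percent (assumed definition)."""
    return round(100 * (total - missing) / total, 2)


def uniqueness(total: int, distinct: int) -> float:
    """Share of distinct values among all records, in percent (assumed definition)."""
    return round(100 * distinct / total, 2)


# Counts taken from the DeviceID row of Table 189.
print(completeness(145_838, 77_990))
print(uniqueness(145_838, 4))
```

Under these assumed definitions, a column with few distinct values in a large table rounds to a uniqueness of 0.00%, which matches the entries reported for categorical columns such as DeviceID.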

Data Distribution Top 20

Figure 232. Distribution of 20 most common values, from highest to lowest.

Continuous Data Distribution

Figure 233. Distribution of values in the column.

Outliers

Figure 234. Visualization of median, min, max, and outliers in the column.

Completeness

Figure 235. Visualization of completeness of the data in the column.

Uniqueness

Figure 236. Visualization of uniqueness of the data in the column.

unix_time

Table 191. Standardised metadata of the column. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Name unix_time
Description

Date and time in Unix time format, presumably giving the time at which data has been transmitted.

Data type Integer number
Descriptor pms:unixTime [UID:0.0.NXTME469]
Descriptor description

Unix time is a date and time representation widely used in computing. It measures time by the number of non-leap seconds that have elapsed since 00:00:00 UTC on 1 January 1970, the Unix epoch. [...] Unix time is sometimes referred to as Epoch time. This can be misleading since Unix time is not the only time system based on an epoch and the Unix epoch is not the only epoch used by other time systems.

IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.NXTME469
Unit

n/a

Table 192. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Name Length Mean Min Q1 Median Q3 Max Total Missing Zero Blank Distinct
unix_time 10 - 10 1,593,661,661.1 1,577,836,800 1,585,742,400 1,593,662,400 1,601,571,600 1,609,455,600 145,838 119,055 ( 81.6% ) 0 ( 0.0% ) 0 ( 0.0% ) 8,785 ( 6.0% )
Table 193. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Name Completeness Uniqueness Most Common Value Least Common Value
unix_time 18.36% 6.02% 1581616800 1577836800
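
As an illustration of the Unix time definition above, the following Python sketch converts the minimum, most common, and maximum values reported for this column (Tables 192 and 193) into UTC date and time. The helper name unix_to_utc is hypothetical and not part of the dataset or of any EUPH tooling.

```python
from datetime import datetime, timezone


def unix_to_utc(seconds: int) -> datetime:
    """Convert seconds since 1970-01-01 00:00:00 UTC to a timezone-aware UTC datetime."""
    return datetime.fromtimestamp(seconds, tz=timezone.utc)


# Min, most common, and max values reported for unix_time.
for value in (1577836800, 1581616800, 1609455600):
    print(value, unix_to_utc(value).isoformat())
```

Passing tz=timezone.utc explicitly avoids a silent conversion to the local time zone of the machine running the code.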

Continuous Data Distribution

Figure 237. Distribution of values in the column.

Outliers

Figure 238. Visualization of median, min, max, and outliers in the column.

Completeness

Figure 239. Visualization of completeness of the data in the column.

Uniqueness

Figure 240. Visualization of uniqueness of the data in the column.

timezone

Table 194. Standardised metadata of the column. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Name timezone
Description

Offset from UTC of the local time at the device's location, given in seconds.

Data type String
Descriptor bipm:utcOffset [UID:0.0.TCFFS470]
Descriptor description

The UTC offset is the difference in hours and minutes between Coordinated Universal Time (UTC) and local solar time, at a particular place. This difference is expressed with respect to UTC and is generally shown in the format ±[hh]:[mm], ±[hh][mm], or ±[hh].

IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.TCFFS470
Unit

s

Table 195. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Name Length Mean Min Q1 Median Q3 Max Total Missing Zero Blank Distinct
timezone 1 - 5 5,683.4 0 3,600 7,200 7,200 10,800 145,838 119,055 ( 81.6% ) 3,744 ( 2.6% ) 0 ( 0.0% ) 5 ( 0.0% )
Table 196. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Name Completeness Uniqueness Most Common Value Least Common Value
timezone 18.36% 0.00% 7200 0
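
The column stores the offset in seconds, whereas the descriptor refers to the ±[hh]:[mm] notation. The Python sketch below shows one possible conversion between the two and how an offset could be combined with a unix_time value to obtain local time. It assumes that the stored strings parse as integers; both function names are hypothetical.

```python
from datetime import datetime, timedelta, timezone


def format_utc_offset(offset_seconds: int) -> str:
    """Render an offset in seconds as a ±hh:mm string."""
    sign = "-" if offset_seconds < 0 else "+"
    hours, remainder = divmod(abs(offset_seconds), 3600)
    return f"{sign}{hours:02d}:{remainder // 60:02d}"


def to_local_time(unix_time: int, offset_seconds: int) -> datetime:
    """Combine Unix time with a fixed UTC offset (no daylight-saving rules applied)."""
    tz = timezone(timedelta(seconds=offset_seconds))
    return datetime.fromtimestamp(unix_time, tz=tz)


# Offsets between the reported minimum and maximum; the column is typed as String,
# so values are converted with int() first (assumption).
for raw in ("0", "3600", "7200", "10800"):
    print(raw, format_utc_offset(int(raw)))

print(to_local_time(1581616800, 7200).isoformat())
```

A fixed offset carries no daylight-saving information, so each record needs its own offset value rather than one offset per location.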

Data Distribution Top 20

Figure 241. Distribution of 20 most common values, from highest to lowest.

Continuous Data Distribution

Figure 242. Distribution of values in the column.

Outliers

Figure 243. Visualization of median, min, max, and outliers in the column.

Completeness

Figure 244. Visualization of completeness of the data in the column.

Uniqueness

Figure 245. Visualization of uniqueness of the data in the column.

datetime

Table 197. Standardised metadata of the column. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Name datetime
Description

Date and time in ISO 8601 format, presumably giving the time at which data has been transmitted.

Data type Date and Time
Descriptor iso-8601:calendarDateAndTime [UID:0.0.DTNDT319]
Descriptor description

date and time representation [...] that includes all the time scale components [...] associated with the expression [...]

IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.DTNDT319
Unit

n/a

Table 198. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Name Length Mean Min Q1 Median Q3 Max Total Missing Zero Blank Distinct
datetime 19 - 19 n/a 2020-01-01 0… n/a n/a n/a 2020-12-31 2… 145,838 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 57,653 ( 39.5% )
Table 199. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Name Completeness Uniqueness Most Common Value Least Common Value
datetime 100.00% 39.53% 2020-08-18 13:00:00 2020-07-27 13:40:00
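
The reported length of 19 characters and the sample values suggest the layout YYYY-MM-DD hh:mm:ss, with a space instead of the "T" separator and without a time zone designator. The Python sketch below parses such a value under that assumption; the function name is hypothetical.

```python
from datetime import datetime

DATETIME_FORMAT = "%Y-%m-%d %H:%M:%S"  # assumed layout, inferred from the sample values


def parse_datetime(text: str) -> datetime:
    """Parse a 19-character timestamp into a naive datetime (no time zone attached)."""
    return datetime.strptime(text, DATETIME_FORMAT)


# Most common value reported in Table 199.
ts = parse_datetime("2020-08-18 13:00:00")
print(ts.year, ts.month, ts.day, ts.hour)
```

Because the values carry no offset, the result is a naive datetime; whether it represents UTC or local time is not specified in this report.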

Completeness

Figure 246. Visualization of completeness of the data in the column.

Uniqueness

Figure 247. Visualization of uniqueness of the data in the column.

year

Table 200. Standardised metadata of the column. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Name year
Description

Calendar year, presumably giving the time at which data has been transmitted.

Data type Integer number
Descriptor dwc:year [UID:0.0.YEARA340]
Descriptor description

A term from the Darwin Core standard:

The four-digit year in which the dwc:Event occurred, according to the Common Era Calendar.

IRI http://rs.tdwg.org/dwc/terms/year
Unit

year

Table 201. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Name Length Mean Min Q1 Median Q3 Max Total Missing Zero Blank Distinct
year 4 - 4 2,020.0 2,020 2,020 2,020 2,020 2,020 145,838 137,055 ( 94.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 2 ( 0.0% )
Table 202. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Name Completeness Uniqueness Most Common Value Least Common Value
year 6.02% 0.00% 2020 2020

Data Distribution Top 20

Figure 248. Distribution of 20 most common values, from highest to lowest.

Continuous Data Distribution

Figure 249. Distribution of values in the column.

Outliers

Figure 250. Visualization of median, min, max, and outliers in the column.

Completeness

Figure 251. Visualization of completeness of the data in the column.

Uniqueness

Figure 252. Visualization of uniqueness of the data in the column.

month

Table 203. Standardised metadata of the column. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Name month
Description

Calendar month, presumably giving the time at which data has been transmitted.

Data type Integer number
Descriptor iso-8601:calendarMonth [UID:0.0.CLNDR376]
Descriptor description

time scale unit [...] resulting from a defined division of a calendar year [...], each containing a specific number of calendar days [...] Note 1 to entry: A calendar month is in common parlance often referred to as month, however in this document calendar month and month have different definitions.

IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.CLNDR376
Unit

n/a

Table 204. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Name Length Mean Min Q1 Median Q3 Max Total Missing Zero Blank Distinct
month 1 - 2 6.5 1 4 7 10 12 145,838 137,055 ( 94.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 13 ( 0.0% )
Table 205. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Name Completeness Uniqueness Most Common Value Least Common Value
month 6.02% 0.01% 1 2

Data Distribution Top 20

Figure 253. Distribution of 20 most common values, from highest to lowest.

Continuous Data Distribution

Figure 254. Distribution of values in the column.

Outliers

Figure 255. Visualization of median, min, max, and outliers in the column.

Completeness

Figure 256. Visualization of completeness of the data in the column.

Uniqueness

Figure 257. Visualization of uniqueness of the data in the column.

day

Table 206. Standardised metadata of the column. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Name day
Description

Calendar day of month, presumably giving the time at which data has been transmitted.

Data type Integer number
Descriptor dwc:day [UID:0.0.DAYAB382]
Descriptor description

A term from the Darwin Core standard:

The integer day of the month on which the dwc:Event occurred.

IRI http://rs.tdwg.org/dwc/terms/day
Unit

n/a

Table 207. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Name Length Mean Min Q1 Median Q3 Max Total Missing Zero Blank Distinct
day 1 - 2 15.8 1 8 16 23 31 145,838 137,055 ( 94.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 32 ( 0.0% )
Table 208. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Name Completeness Uniqueness Most Common Value Least Common Value
day 6.02% 0.02% 1 31

Data Distribution Top 20

Figure 258. Distribution of 20 most common values, from highest to lowest.

Data Distribution Bottom 20

Figure 259. Distribution of 20 least common values, from lowest to highest.

Outliers

Figure 260. Visualization of median, min, max, and outliers in the column.

Completeness

Figure 261. Visualization of completeness of the data in the column.

Uniqueness

Figure 262. Visualization of uniqueness of the data in the column.

hour

Table 209. Standardised metadata of the column. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Name hour
Description

Clock hour, presumably giving the time at which data has been transmitted.

Data type Integer number
Descriptor iso-8601:clock hour [UID:0.0.HRFDY386]
Descriptor description

time scale unit [...] whose duration [...] is one hour [...] Clock hour is in common parlance often referred to as hour, however in this document clock hour and hour have different definitions.

IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.HRFDY386
Unit

n/a

Table 210. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Name Length Mean Min Q1 Median Q3 Max Total Missing Zero Blank Distinct
hour 1 - 2 12.5 1 6 12 18 24 145,838 137,055 ( 94.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 25 ( 0.0% )
Table 211. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Name Completeness Uniqueness Most Common Value Least Common Value
hour 6.02% 0.02% 1 24

Data Distribution Top 20

Figure 263. Distribution of 20 most common values, from highest to lowest.

Data Distribution Bottom 20

Figure 264. Distribution of 20 least common values, from lowest to highest.

Outliers

Figure 265. Visualization of median, min, max, and outliers in the column.

Completeness

Figure 266. Visualization of completeness of the data in the column.

Uniqueness

Figure 267. Visualization of uniqueness of the data in the column.

location_name

Table 212. Standardised metadata of the column. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Name location_name
Description

Name of the location in which the device/system is located.

Data type String
Descriptor Text [UID:0.0.TEXTA315]
Descriptor description

In computer programming, a string is traditionally a sequence of characters, either as a literal constant or as some kind of variable. The latter may allow its elements to be mutated and the length changed, or it may be fixed (after creation). A string is generally considered as a data type and is often implemented as an array data structure of bytes (or words) that stores a sequence of elements, typically characters, using some character encoding. String may also denote more general arrays or other sequence (or list) data types and structures.

IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.TEXTA315
Unit

n/a

Table 213. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Name Length Mean Min Q1 Median Q3 Max Total Missing Zero Blank Distinct
location_name 0 - 15 n/a 84007004 n/a n/a n/a UCLUJ 145,838 110,272 ( 75.6% ) 0 ( 0.0% ) 0 ( 0.0% ) 4 ( 0.0% )
Table 214. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Name Completeness Uniqueness Most Common Value Least Common Value
location_name 24.39% 0.00% n/a 84007004

Data Distribution Top 20

Figure 268. Distribution of 20 most common values, from highest to lowest.

Completeness

Figure 269. Visualization of completeness of the data in the column.

Uniqueness

Figure 270. Visualization of uniqueness of the data in the column.

location_lat

Table 215. Standardised metadata of the column. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Name location_lat
Description

Geographic latitude of the location in which the device/system is located, given in WGS84 format in decimal degrees.

Data type Decimal number
Descriptor dwc:decimalLatitude [UID:0.0.LTTDE333]
Descriptor description

A term from the Darwin Core standard:

The geographic latitude (in decimal degrees, using the spatial reference system given in dwc:geodeticDatum) of the geographic center of a dcterms:Location. Positive values are north of the Equator, negative values are south of it. Legal values lie between -90 and 90, inclusive.

IRI http://rs.tdwg.org/dwc/terms/decimalLatitude
Unit

°

Table 216. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Name Length Mean Min Q1 Median Q3 Max Total Missing Zero Blank Distinct
location_lat 9 - 9 48.8594320 46.759188 46.759188 46.967707 52.947616 52.947616 145,838 119,055 ( 81.6% ) 0 ( 0.0% ) 0 ( 0.0% ) 4 ( 0.0% )
Table 217. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Name Completeness Uniqueness Most Common Value Least Common Value
location_lat 18.36% 0.00% n/a 52.947616

Data Distribution Top 20

Figure 271. Distribution of 20 most common values, from highest to lowest.

Continuous Data Distribution

Figure 272. Distribution of values in the column.

Outliers

Figure 273. Visualization of median, min, max, and outliers in the column.

Completeness

Figure 274. Visualization of completeness of the data in the column.

Uniqueness

Figure 275. Visualization of uniqueness of the data in the column.

location_long

Table 218. Standardised metadata of the column. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Name location_long
Description

Geographic longitude of the location in which the device/system is located, given in WGS84 format in decimal degrees.

Data type Decimal number
Descriptor dwc:decimalLongitude [UID:0.0.LNGTD332]
Descriptor description

A term from the Darwin Core standard:

The geographic longitude (in decimal degrees, using the spatial reference system given in dwc:geodeticDatum) of the geographic center of a dcterms:Location. Positive values are east of the Greenwich Meridian, negative values are west of it. Legal values lie between -180 and 180, inclusive.

IRI http://rs.tdwg.org/dwc/terms/decimalLongitude
Unit

°

Table 219. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Name Length Mean Min Q1 Median Q3 Max Total Missing Zero Blank Distinct
location_long 8 - 9 10.0120545 -1.068273 -1.068273 7.399013 23.570373 23.570373 145,838 119,055 ( 81.6% ) 0 ( 0.0% ) 0 ( 0.0% ) 4 ( 0.0% )
Table 220. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Name Completeness Uniqueness Most Common Value Least Common Value
location_long 18.36% 0.00% n/a -1.068273
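
The Darwin Core descriptors for location_lat and location_long define the legal ranges as -90 to 90 and -180 to 180, inclusive. A minimal Python sketch of such a range check, applied to the minimum and maximum values reported in Tables 216 and 219, could look as follows; the function names are hypothetical.

```python
def is_valid_latitude(value: float) -> bool:
    """True if the value lies within the legal range for decimal latitude."""
    return -90.0 <= value <= 90.0


def is_valid_longitude(value: float) -> bool:
    """True if the value lies within the legal range for decimal longitude."""
    return -180.0 <= value <= 180.0


# Reported minimum and maximum of each column.
print(all(is_valid_latitude(v) for v in (46.759188, 52.947616)))
print(all(is_valid_longitude(v) for v in (-1.068273, 23.570373)))
```

A range check only confirms that the coordinates are legal, not that they correspond to the actual study sites.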

Data Distribution Top 20

Figure 276. Distribution of 20 most common values, from highest to lowest.

Continuous Data Distribution

Figure 277. Distribution of values in the column.

Outliers

Figure 278. Visualization of median, min, max, and outliers in the column.

Completeness

Figure 279. Visualization of completeness of the data in the column.

Uniqueness

Figure 280. Visualization of uniqueness of the data in the column.

temperature

Table 221. Standardised metadata of the column. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Name temperature
Description

Not sufficiently specified by the data provider. Temperature measured in the relevant interval, either at any time during this interval (for example, at the very end) or calculated as arithmetic mean from all measurements or from a subset of measurements in this interval.

Data type Decimal number
Descriptor pms:temperature [UID:0.0.TMPRT394]
Descriptor description
IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.TMPRT394
Unit

°C

Table 222. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Name Length Mean Min Q1 Median Q3 Max Total Missing Zero Blank Distinct
temperature 1 - 6 61.707 -100 9 15 25.215 305.93 145,838 22,661 ( 15.5% ) 58 ( 0.0% ) 0 ( 0.0% ) 6,424 ( 4.4% )
Table 223. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Name Completeness Uniqueness Most Common Value Least Common Value
temperature 84.46% 4.40% n/a -1.92
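
The reported minimum of -100 and maximum of 305.93 fall outside what would be expected for air temperature in °C. The report does not explain these values. The Python sketch below is one possible screening step, under the explicit assumptions that -100 is a placeholder for a missing reading and that values near 300 might have been recorded in kelvin; the thresholds and all names are hypothetical.

```python
from typing import Optional

SENTINEL = -100.0            # assumed placeholder value, not confirmed by the data provider
PLAUSIBLE_C = (-60.0, 60.0)  # assumed plausible range for air temperature in °C
KELVIN_OFFSET = 273.15


def classify_temperature(value: Optional[float]) -> str:
    """Assign a screening label to a single temperature value."""
    if value is None:
        return "missing"
    if value == SENTINEL:
        return "sentinel"
    low, high = PLAUSIBLE_C
    if low <= value <= high:
        return "plausible_celsius"
    if low <= value - KELVIN_OFFSET <= high:
        return "possibly_kelvin"
    return "implausible"


# Min, Q1, median, Q3, and max reported in Table 222, plus a missing value.
for v in (-100, 9, 15, 25.215, 305.93, None):
    print(v, classify_temperature(v))
```

The labels only flag records for review; any conversion or removal of values would need confirmation from the data provider.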

Continuous Data Distribution

Figure 281. Distribution of values in the column.

Outliers

Figure 282. Visualization of median, min, max, and outliers in the column.

Completeness

Figure 283. Visualization of completeness of the data in the column.

Uniqueness

Figure 284. Visualization of uniqueness of the data in the column.

temperature_min

Table 224. Standardised metadata of the column. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Name temperature_min
Description

Minimum temperature measured in the relevant interval.

Data type Decimal number
Descriptor pms:temperature [UID:0.0.TMPRT394]
Descriptor description
IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.TMPRT394
Unit

°C

Table 225. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Name Length Mean Min Q1 Median Q3 Max Total Missing Zero Blank Distinct
temperature_min 1 - 22 37.3069287 -100 7 12.19 19.2 303.46 145,838 51,207 ( 35.1% ) 112 ( 0.1% ) 0 ( 0.0% ) 4,708 ( 3.2% )
Table 226. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Name Completeness Uniqueness Most Common Value Least Common Value
temperature_min 64.89% 3.23% n/a -4.22

Continuous Data Distribution

Figure 285. Distribution of values in the column.

Outliers

Figure 286. Visualization of median, min, max, and outliers in the column.

Completeness

Figure 287. Visualization of completeness of the data in the column.

Uniqueness

Figure 288. Visualization of uniqueness of the data in the column.

temperature_max

Table 227. Standardised metadata of the column. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Name temperature_max
Description

Maximum temperature measured in the relevant interval.

Data type Decimal number
Descriptor pms:temperature [UID:0.0.TMPRT394]
Descriptor description
IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.TMPRT394
Unit

°C

Table 228. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Name Length Mean Min Q1 Median Q3 Max Total Missing Zero Blank Distinct
temperature_max 1 - 22 38.68673267 -100 7.9 12.9 20.3 307.7 145,838 51,207 ( 35.1% ) 45 ( 0.0% ) 0 ( 0.0% ) 4,444 ( 3.0% )
Table 229. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Name Completeness Uniqueness Most Common Value Least Common Value
temperature_max 64.89% 3.05% n/a 7.35

Continuous Data Distribution

Figure 289. Distribution of values in the column.

Outliers

Figure 290. Visualization of median, min, max, and outliers in the column.

Completeness

Figure 291. Visualization of completeness of the data in the column.

Uniqueness

Figure 292. Visualization of uniqueness of the data in the column.

feels_like

Table 230. Standardised metadata of the column. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Name feels_like
Description

Temperature adjusted to account for human perception of the weather (apparent or "feels like" temperature) in the relevant interval.

Data type Decimal number
Descriptor pms:temperature [UID:0.0.TMPRT394]
Descriptor description
IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.TMPRT394
Unit

°C

Table 231. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Name Length Mean Min Q1 Median Q3 Max Total Missing Zero Blank Distinct
feels_like 1 - 6 100.737 -8.99 6.48 14.97 275.03 305.02 145,838 119,055 ( 81.6% ) 10 ( 0.0% ) 0 ( 0.0% ) 6,575 ( 4.5% )
Table 232. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Name Completeness Uniqueness Most Common Value Least Common Value
feels_like 18.36% 4.51% n/a -3.26

Continuous Data Distribution

Figure 293. Distribution of values in the column.

Outliers

Figure 294. Visualization of median, min, max, and outliers in the column.

Completeness

Figure 295. Visualization of completeness of the data in the column.

Uniqueness

Figure 296. Visualization of uniqueness of the data in the column.

dew_point

Table 233. Standardised metadata of the column. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Name dew_point
Description

Dew point temperature, measured in the relevant interval.

Data type Decimal number
Descriptor pms:temperature [UID:0.0.TMPRT394]
Descriptor description
IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.TMPRT394
Unit

°C

Table 234. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Name Length Mean Min Q1 Median Q3 Max Total Missing Zero Blank Distinct
dew_point 1 - 6 39.634 -7.78 6.55 10.4 14.3 295.57 145,838 66,737 ( 45.8% ) 31 ( 0.0% ) 0 ( 0.0% ) 5,393 ( 3.7% )
Table 235. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Name Completeness Uniqueness Most Common Value Least Common Value
dew_point 54.24% 3.70% n/a -2.47

Continuous Data Distribution

Figure 297. Distribution of values in the column.

Outliers

Figure 298. Visualization of median, min, max, and outliers in the column.

Completeness

Figure 299. Visualization of completeness of the data in the column.

Uniqueness

Figure 300. Visualization of uniqueness of the data in the column.

RH

Table 236. Standardised metadata of the column. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Name RH
Description

Not sufficiently specified by the data provider. Relative humidity measured in the relevant interval, either at any time during this interval (for example, at the very end) or calculated as arithmetic mean from all measurements or from a subset of measurements in this interval.

Data type Decimal number
Descriptor pms:relativeHumidity [UID:0.0.RLTVH395]
Descriptor description

Relative humidity (RH) (expressed as a percent) also measures water vapor, but RELATIVE to the temperature of the air. In other words, it is a measure of the actual amount of water vapor in the air compared to the total amount of vapor that can exist in the air at its current temperature.

IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.RLTVH395
Unit

%

Table 237. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Name Length Mean Min Q1 Median Q3 Max Total Missing Zero Blank Distinct
RH 1 - 4 76.55 0 64.8 82 91.8 100 145,838 0 ( 0.0% ) 3 ( 0.0% ) 0 ( 0.0% ) 826 ( 0.6% )
Table 238. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Name Completeness Uniqueness Most Common Value Least Common Value
RH 100.00% 0.57% 93 19.7

Continuous Data Distribution

Figure 301. Distribution of values in the column.

Outliers

Figure 302. Visualization of median, min, max, and outliers in the column.

Completeness

Figure 303. Visualization of completeness of the data in the column.

Uniqueness

Figure 304. Visualization of uniqueness of the data in the column.

atmpressure_pa

Table 239. Standardised metadata of the column. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Name atmpressure_pa
Description

Not sufficiently specified by the data provider. Atmospheric pressure measured in the relevant interval, either at any time during this interval (for example, at the very end) or calculated as arithmetic mean from all measurements or from a subset of measurements in this interval. Measured in or converted to Pa.

Data type Decimal number
Descriptor pms:atmosphericPressure [UID:0.0.TMSPH396]
Descriptor description

Atmospheric pressure, also known as air pressure or barometric pressure (after the barometer), is the pressure within the atmosphere of Earth.

IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.TMSPH396
Unit

Pa

Table 240. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Name Length Mean Min Q1 Median Q3 Max Total Missing Zero Blank Distinct
atmpressure_pa 5 - 9 100,229.409 50,000 98,850 100,570 101,700 104,900 145,838 38,440 ( 26.4% ) 0 ( 0.0% ) 0 ( 0.0% ) 20,205 ( 13.9% )
Table 241. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Name Completeness Uniqueness Most Common Value Least Common Value
atmpressure_pa 73.64% 13.85% n/a 95835

Continuous Data Distribution

Figure 305. Distribution of values in the column.

Outliers

Figure 306. Visualization of median, min, max, and outliers in the column.

Completeness

Figure 307. Visualization of completeness of the data in the column.

Uniqueness

Figure 308. Visualization of uniqueness of the data in the column.

atmpressure_hpa

Table 242. Standardised metadata of the column. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Name atmpressure_hpa
Description

Not sufficiently specified by the data provider. Atmospheric pressure measured in the relevant interval, either at any time during this interval (for example, at the very end) or calculated as arithmetic mean from all measurements or from a subset of measurements in this interval. Measured in hPa.

Data type Decimal number
Descriptor pms:atmosphericPressure [UID:0.0.TMSPH396]
Descriptor description

Atmospheric pressure, also known as air pressure or barometric pressure (after the barometer), is the pressure within the atmosphere of Earth.

IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.TMSPH396
Unit
Table 243. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Name Length Mean Min Q1 Median Q3 Max Total Missing Zero Blank Distinct
atmpressure_hpa 3 - 9 1,016.24168 970 1,009.8233 1,017.0732 1,023 1,049 145,838 96,394 ( 66.1% ) 0 ( 0.0% ) 0 ( 0.0% ) 18,771 ( 12.9% )
Table 244. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Name Completeness Uniqueness Most Common Value Least Common Value
atmpressure_hpa 33.90% 12.87% n/a 1011.8719
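Atmospheric pressure is reported in two columns, atmpressure_pa (Pa) and atmpressure_hpa (hPa), each with a substantial share of missing values. Since 1 hPa equals 100 Pa, the two can be brought to a common unit. The sketch below assumes that a record may carry either value and prefers the hPa value when both are present. That preference is an illustrative choice, not something the data provider specifies, and the function names are hypothetical.

```python
from typing import Optional

PA_PER_HPA = 100.0  # 1 hPa = 100 Pa


def pa_to_hpa(pa: float) -> float:
    """Convert a pressure from pascal to hectopascal."""
    return pa / PA_PER_HPA


def station_pressure_hpa(pa: Optional[float], hpa: Optional[float]) -> Optional[float]:
    """Return one pressure value in hPa, or None if both inputs are missing."""
    if hpa is not None:
        return hpa
    if pa is not None:
        return pa_to_hpa(pa)
    return None


print(station_pressure_hpa(pa=101_700, hpa=None))  # 1017.0
print(station_pressure_hpa(pa=None, hpa=1023))     # 1023
print(station_pressure_hpa(pa=None, hpa=None))     # None
```

Whether the two columns are mutually exclusive per record is not stated in the report, so any merged series should be checked against the source files.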

Continuous Data Distribution

Figure 309. Distribution of values in the column.

Outliers

Figure 310. Visualization of median, min, max, and outliers in the column.

Completeness

Figure 311. Visualization of completeness of the data in the column.

Uniqueness

Figure 312. Visualization of uniqueness of the data in the column.

atmpressure_sealevel_Pa

Table 245. Standardised metadata of the column. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Name atmpressure_sealevel_Pa
Description

Not sufficiently specified by the data provider. Equivalent sea level atmospheric pressure, presumably based on the atmospheric pressure given in column atmpressure_pa. Expressed in Pa.

Data type Decimal number
Descriptor pms:atmosphericPressure [UID:0.0.TMSPH396]
Descriptor description

Atmospheric pressure, also known as air pressure or barometric pressure (after the barometer), is the pressure within the atmosphere of Earth.

IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.TMSPH396
Unit

Pa

Table 246. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Name Length Mean Min Q1 Median Q3 Max Total Missing Zero Blank Distinct
atmpressure_sealevel_Pa 1 - 6 72,438.0 0 0 101,055 101,945 104,020 145,838 114,105 ( 78.2% ) 9,072 ( 6.2% ) 0 ( 0.0% ) 1,220 ( 0.8% )
Table 247. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Name Completeness Uniqueness Most Common Value Least Common Value
atmpressure_sealevel_Pa 21.76% 0.84% 0 103115
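A sea level pressure of 0 Pa is not physically plausible, yet Table 246 reports 9,072 zero values, enough to pull Q1 down to 0. The sketch below assumes these zeros are placeholders for missing measurements. That is an interpretation, not a statement by the data provider. It shows the effect on the median using a small illustrative sample, and the function names are hypothetical.

```python
from statistics import median


def zeros_to_missing(values):
    """Replace exact zeros with None (assumption: 0 marks a missing reading)."""
    return [None if v == 0 else v for v in values]


def median_ignoring_missing(values):
    """Median of the non-missing values, or None if there are none."""
    present = [v for v in values if v is not None]
    return median(present) if present else None


# Illustrative sample only, not taken from the dataset.
sample = [0, 0, 101_055, 101_945, None, 104_020]
print(median_ignoring_missing(sample))                    # 101055
print(median_ignoring_missing(zeros_to_missing(sample)))  # 101945

# Counts from Tables 246 and 249: non-missing, non-zero Pa values
# versus non-missing hPa values.
TOTAL = 145_838
nonzero_pa = TOTAL - 114_105 - 9_072
present_hpa = TOTAL - 123_177
print(nonzero_pa, present_hpa)  # 22661 22661
```

The number of non-zero values in atmpressure_sealevel_Pa equals the number of non-missing values in atmpressure_sealevel_hPa, which is consistent with the placeholder reading but does not prove it.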

Continuous Data Distribution

Figure 313. Distribution of values in the column.

Outliers

Figure 314. Visualization of median, min, max, and outliers in the column.

Completeness

Figure 315. Visualization of completeness of the data in the column.

Uniqueness

Figure 316. Visualization of uniqueness of the data in the column.

atmpressure_sealevel_hPa

Table 248. Standardised metadata of the column. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Name atmpressure_sealevel_hPa
Description

Not sufficiently specified by the data provider. Equivalent sea level atmospheric pressure, presumably based on the atmospheric pressure given in column atmpressure_hpa. Expressed in hPa.

Data type Decimal number
Descriptor pms:atmosphericPressure [UID:0.0.TMSPH396]
Descriptor description

Atmospheric pressure, also known as air pressure or barometric pressure (after the barometer), is the pressure within the atmosphere of Earth.

IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.TMSPH396
Unit
Table 249. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Name Length Mean Min Q1 Median Q3 Max Total Missing Zero Blank Distinct
atmpressure_sealevel_hPa 3 - 7 1,014.375 977.45 1,008.6 1,016.4 1,021.25 1,040.2 145,838 123,177 ( 84.5% ) 0 ( 0.0% ) 0 ( 0.0% ) 1,219 ( 0.8% )
Table 250. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Name Completeness Uniqueness Most Common Value Least Common Value
atmpressure_sealevel_hPa 15.54% 0.84% n/a 1031.15

Continuous Data Distribution

Figure 317. Distribution of values in the column.

Outliers

Figure 318. Visualization of median, min, max, and outliers in the column.

Completeness

Figure 319. Visualization of completeness of the data in the column.

Uniqueness

Figure 320. Visualization of uniqueness of the data in the column.

atmpressure_grndlevel

Table 251. Standardised metadata of the column. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Name atmpressure_grndlevel
Description

Not sufficiently specified by the data provider. Ground level atmospheric pressure, presumably based on the atmospheric pressure given in column atmpressure_hpa. Expressed in hPa.

Data type Decimal number
Descriptor pms:atmosphericPressure [UID:0.0.TMSPH396]
Descriptor description

Atmospheric pressure, also known as air pressure or barometric pressure (after the barometer), is the pressure within the atmosphere of Earth.

IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.TMSPH396
Unit
Table 252. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Name Length Mean Min Q1 Median Q3 Max Total Missing Zero Blank Distinct
atmpressure_grndlevel 1 - 1 0.0 0 0 0 0 0 145,838 136,766 ( 93.8% ) 9,072 ( 6.2% ) 0 ( 0.0% ) 2 ( 0.0% )
Table 253. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Name Completeness Uniqueness Most Common Value Least Common Value
atmpressure_grndlevel 6.22% 0.00% 0 0

Data Distribution Top 20

Figure 321. Distribution of 20 most common values, from highest to lowest.

Continuous Data Distribution

Figure 322. Distribution of values in the column.

Completeness

Figure 323. Visualization of completeness of the data in the column.

Uniqueness

Figure 324. Visualization of uniqueness of the data in the column.

rain

Table 254. Standardised metadata of the column. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Name rain
Description

Not sufficiently specified by the data provider. Cumulative rainfall, presumably measured in the relevant measurement interval as the sum of all measurements.

Data type Decimal number
Descriptor pms:rainfall [UID:0.0.RNFLL471]
Descriptor description

The amount of precipitation of any type (including the liquid equivalent of frozen hydrometeors); usually taken as that amount measured by means of a rain gauge (thus a small, varying amount of direct condensation is included). A more accurate term would be precipitation or precipitation amount. However, the broad use of "rainfall" is firmly established in meteorology, especially in hydrologic and climatological literature. Its best utilization would confine it to liquid precipitation, and so would provide a distinction between precipitation immediately accessible to soil and streams and that delayed in storage as snow or ice on the earth's surface.

IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.RNFLL471
Unit

mm

Table 255. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Name Length Mean Min Q1 Median Q3 Max Total Missing Zero Blank Distinct
rain 1 - 5 19.111 0 0 0 1 255 145,838 14,885 ( 10.2% ) 78,437 ( 53.8% ) 0 ( 0.0% ) 603 ( 0.4% )
Table 256. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Name Completeness Uniqueness Most Common Value Least Common Value
rain 89.79% 0.41% 0 2.93

Continuous Data Distribution

Figure 325. Distribution of values in the column.

Completeness

Figure 326. Visualization of completeness of the data in the column.

Uniqueness

Figure 327. Visualization of uniqueness of the data in the column.

rain_intval

Table 257. Standardised metadata of the column. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Name rain_intval
Description

Duration of the interval for which the rainfall in column rain is reported.

Data type Integer number
Descriptor Integer [UID:0.0.NTGER313]
Descriptor description

A number with no fractional part, including the negative and positive numbers as well as zero.

IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.NTGER313
Unit

min

Table 258. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Name Length Mean Min Q1 Median Q3 Max Total Missing Zero Blank Distinct
rain_intval 2 - 2 32.4 10 10 10 60 60 145,838 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 2 ( 0.0% )
Table 259. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Name Completeness Uniqueness Most Common Value Least Common Value
rain_intval 100.00% 0.00% 10 60
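Because rain is reported per interval and rain_intval takes two different values (10 and 60 min), rainfall amounts from different records are not directly comparable. A minimal sketch, assuming rain is the total in mm over the interval given by rain_intval, converts each record to an hourly rate in mm h-1. The function name is illustrative.

```python
def rain_rate_mm_per_h(rain_mm: float, interval_min: int) -> float:
    """Convert rainfall accumulated over interval_min minutes to mm per hour."""
    if interval_min <= 0:
        raise ValueError("interval_min must be positive")
    return rain_mm * 60.0 / interval_min


# 1 mm in a 10-minute interval corresponds to 6 mm per hour,
# while 1 mm in a 60-minute interval corresponds to 1 mm per hour.
print(rain_rate_mm_per_h(1.0, 10))  # 6.0
print(rain_rate_mm_per_h(1.0, 60))  # 1.0
```

Normalising to a common rate is one option. Summing 10-minute records into hourly totals is another, and which is appropriate depends on the analysis.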

Data Distribution Top 20

Figure 328. Distribution of 20 most common values, from highest to lowest.

Continuous Data Distribution

Figure 329. Distribution of values in the column.

Outliers

Figure 330. Visualization of median, min, max, and outliers in the column.

Completeness

Figure 331. Visualization of completeness of the data in the column.

Uniqueness

Figure 332. Visualization of uniqueness of the data in the column.

rain_3h

Table 260. Standardised metadata of the column. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Name rain_3h
Description

Cumulative rainfall, measured in the three hours preceding and including the relevant interval.

Data type Decimal number
Descriptor pms:rainfall [UID:0.0.RNFLL471]
Descriptor description

The amount of precipitation of any type (including the liquid equivalent of frozen hydrometeors); usually taken as that amount measured by means of a rain gauge (thus a small, varying amount of direct condensation is included). A more accurate term would be precipitation or precipitation amount. However, the broad use of "rainfall" is firmly established in meteorology, especially in hydrologic and climatological literature. Its best utilization would confine it to liquid precipitation, and so would provide a distinction between precipitation immediately accessible to soil and streams and that delayed in storage as snow or ice on the earth's surface.

IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.RNFLL471
Unit

mm

Table 261. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Name Length Mean Min Q1 Median Q3 Max Total Missing Zero Blank Distinct
rain_3h 1 - 3 0.03 0 0 0 0 17 145,838 136,575 ( 93.6% ) 9,072 ( 6.2% ) 0 ( 0.0% ) 23 ( 0.0% )
Table 262. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Name Completeness Uniqueness Most Common Value Least Common Value
rain_3h 6.35% 0.02% n/a 7
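With only 6.35% of records populated, rain_3h may need to be recomputed from the rain column where that is available. The sketch below assumes each record carries the end time of its interval and the rainfall in that interval, and sums all intervals ending within the three hours up to and including the current one. The record layout and the function name are hypothetical, and missing rainfall is counted as 0, which is itself an assumption.

```python
from datetime import datetime, timedelta


def rain_last_3h(records, window=timedelta(hours=3)):
    """records: list of (interval_end, rain_mm or None).

    Returns, for each record, the rainfall summed over all intervals
    ending in the half-open window (interval_end - window, interval_end].
    """
    totals = []
    for end_time, _ in records:
        total = sum(
            (mm or 0.0)
            for t, mm in records
            if end_time - window < t <= end_time
        )
        totals.append(total)
    return totals


start = datetime(2021, 6, 1, 0, 0)
hourly = [(start + timedelta(hours=h), mm)
          for h, mm in enumerate([1.0, 2.0, 0.0, 4.0, None, 1.0])]
print(rain_last_3h(hourly))  # [1.0, 3.0, 3.0, 6.0, 4.0, 5.0]
```

This quadratic loop is meant for illustration. For the full table a rolling-window implementation would be more efficient.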

Data Distribution Top 20

Figure 333. Distribution of 20 most common values, from highest to lowest.

Data Distribution Bottom 20

Figure 334. Distribution of 20 least common values, from lowest to highest.

Completeness

Figure 335. Visualization of completeness of the data in the column.

Uniqueness

Figure 336. Visualization of uniqueness of the data in the column.

snow

Table 263. Standardised metadata of the column. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Name snow
Description

Not sufficiently specified by the data provider. Cumulative snowfall, presumably measured in the relevant measurement interval as the sum of all measurements.

Data type Decimal number
Descriptor pms:snowfall [UID:0.0.SNWFL472]
Descriptor description

Precipitation composed of white or translucent ice crystals, chiefly in complex branch hexagonal form and often agglomerated into snowflakes. For weather-observing purposes, the intensity of snow is characterized as 1) light when the visibility is 1 km (5/8 statute mile) or more; 2) moderate when the visibility is less than 1 km (5/8 statute mile) but not less than 1/2 km (5/16 statute mile); and 3) heavy when the visibility is less than 1/2 km (5/16 statute mile).

IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.SNWFL472
Unit

mm

Table 264. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Name Length Mean Min Q1 Median Q3 Max Total Missing Zero Blank Distinct
snow 1 - 4 0.010 0 0 0 0 2 145,838 136,699 ( 93.7% ) 8,927 ( 6.1% ) 0 ( 0.0% ) 55 ( 0.0% )
Table 265. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Name Completeness Uniqueness Most Common Value Least Common Value
snow 6.27% 0.04% n/a 1.85

Data Distribution Top 20

Figure 337. Distribution of 20 most common values, from highest to lowest.

Data Distribution Bottom 20

Figure 338. Distribution of 20 least common values, from lowest to highest.

Completeness

Figure 339. Visualization of completeness of the data in the column.

Uniqueness

Figure 340. Visualization of uniqueness of the data in the column.

snow_3h

Table 266. Standardised metadata of the column. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Name snow_3h
Description

Cumulative snowfall, measured in the three hours preceding and including the relevant interval.

Data type Decimal number
Descriptor pms:snowfall [UID:0.0.SNWFL472]
Descriptor description

Precipitation composed of white or translucent ice crystals, chiefly in complex branch hexagonal form and often agglomerated into snowflakes. For weather-observing purposes, the intensity of snow is characterized as 1) light when the visibility is 1 km (5/8 statute mile) or more; 2) moderate when the visibility is less than 1 km (5/8 statute mile) but not less than 1/2 km (5/16 statute mile); and 3) heavy when the visibility is less than 1/2 km (5/16 statute mile).

IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.SNWFL472
Unit

mm

Table 267. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Name Length Mean Min Q1 Median Q3 Max Total Missing Zero Blank Distinct
snow_3h 1 - 3 0.00 0 0 0 0 3 145,838 136,743 ( 93.8% ) 9,072 ( 6.2% ) 0 ( 0.0% ) 9 ( 0.0% )
Table 268. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Name Completeness Uniqueness Most Common Value Least Common Value
snow_3h 6.24% 0.01% n/a 0.6

Data Distribution Top 20

Figure 341. Distribution of 20 most common values, from highest to lowest.

Continuous Data Distribution

Figure 342. Distribution of values in the column.

Completeness

Figure 343. Visualization of completeness of the data in the column.

Uniqueness

Figure 344. Visualization of uniqueness of the data in the column.

rain_counter

Table 269. Standardised metadata of the column. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Name rain_counter
Description

No information is provided on this parameter.

Data type Decimal number
Descriptor DecimalNumber [UID:0.0.DCMLN314]
Descriptor description

Any of the rational or irrational numbers.

IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.DCMLN314
Unit

n/a

Table 270. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Name Length Mean Min Q1 Median Q3 Max Total Missing Zero Blank Distinct
rain_counter 1 - 10 0.172750850 0 0 0 0 0.71372549 145,838 77,990 ( 53.5% ) 51,426 ( 35.3% ) 0 ( 0.0% ) 3 ( 0.0% )
Table 271. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Name Completeness Uniqueness Most Common Value Least Common Value
rain_counter 46.52% 0.00% n/a 0.71372549

Data Distribution Top 20

Figure 345. Distribution of 20 most common values, from highest to lowest.

Continuous Data Distribution

Figure 346. Distribution of values in the column.

Completeness

Figure 347. Visualization of completeness of the data in the column.

Uniqueness

Figure 348. Visualization of uniqueness of the data in the column.

rain_max

Table 272. Standardised metadata of the column. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Name rain_max
Description

No information is provided on this parameter. Presumably the maximum rainfall intensity in the relevant interval, given in mm/h.

Data type Decimal number
Descriptor pms:rainfallIntensity [UID:0.0.RNFLL473]
Descriptor description

The rate of precipitation, usually expressed in millimeters or inches per hour.

IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.RNFLL473
Unit

mm h-1

Table 273. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Name Length Mean Min Q1 Median Q3 Max Total Missing Zero Blank Distinct
rain_max 1 - 1 0.0 0 0 0 0 0 145,838 77,990 ( 53.5% ) 67,848 ( 46.5% ) 0 ( 0.0% ) 2 ( 0.0% )
Table 274. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Name Completeness Uniqueness Most Common Value Least Common Value
rain_max 46.52% 0.00% 0 0

Data Distribution Top 20

Figure 349. Distribution of 20 most common values, from highest to lowest.

Continuous Data Distribution

Figure 350. Distribution of values in the column.

Completeness

Figure 351. Visualization of completeness of the data in the column.

Uniqueness

Figure 352. Visualization of uniqueness of the data in the column.

rain_gauge

Table 275. Standardised metadata of the column. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Name rain_gauge
Description

Rain gauge resolution, defined as the minimum amount of rain that a rain gauge can register, given in mm.

Data type Decimal number
Descriptor DecimalNumber [UID:0.0.DCMLN314]
Descriptor description

Any of the rational or irrational numbers.

IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.DCMLN314
Unit

mm

Table 276. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Name Length Mean Min Q1 Median Q3 Max Total Missing Zero Blank Distinct
rain_gauge 1 - 3 0.07 0 0 0 0.2 0.2 145,838 77,990 ( 53.5% ) 45,186 ( 31.0% ) 0 ( 0.0% ) 3 ( 0.0% )
Table 277. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Name Completeness Uniqueness Most Common Value Least Common Value
rain_gauge 46.52% 0.00% n/a 0.2

Data Distribution Top 20

Figure 353. Distribution of 20 most common values, from highest to lowest.

Continuous Data Distribution

Figure 354. Distribution of values in the column.

Completeness

Figure 355. Visualization of completeness of the data in the column.

Uniqueness

Figure 356. Visualization of uniqueness of the data in the column.

valid_ticks

Table 278. Standardised metadata of the column. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Name valid_ticks
Description

No information is provided on this parameter.

Data type Integer number
Descriptor Integer [UID:0.0.NTGER313]
Descriptor description

A number with no fractional part, including the negative and positive numbers as well as zero.

IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.NTGER313
Unit

n/a

Table 279. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Name Length Mean Min Q1 Median Q3 Max Total Missing Zero Blank Distinct
valid_ticks 1 - 1 0.0 0 0 0 0 0 145,838 77,990 ( 53.5% ) 67,848 ( 46.5% ) 0 ( 0.0% ) 2 ( 0.0% )
Table 280. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Name Completeness Uniqueness Most Common Value Least Common Value
valid_ticks 46.52% 0.00% 0 0
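The Missing, Zero, Blank and Distinct counts described in the content analysis caption can be sketched as follows. This is a minimal illustration, not the procedure used by the EU Pollinator Hub. It assumes that NULL is represented as `None`, that a blank value is a string consisting only of whitespace, and that NULL counts as one distinct value. The function name `profile_column` is hypothetical.

```python
from typing import Dict, Iterable, Optional, Union

Value = Optional[Union[int, float, str]]


def profile_column(values: Iterable[Value]) -> Dict[str, int]:
    """Count total, missing, zero, blank and distinct values in one column."""
    items = list(values)
    return {
        "total": len(items),
        # NULL is represented here as Python's None (an assumption).
        "missing": sum(v is None for v in items),
        "zero": sum(isinstance(v, (int, float)) and v == 0 for v in items),
        # Strings made up only of whitespace count as blank.
        "blank": sum(isinstance(v, str) and v.isspace() for v in items),
        # NULL, zero and blank values each count towards the distinct total.
        "distinct": len(set(items)),
    }


# Toy column shaped like valid_ticks: only NULL and 0 occur.
print(profile_column([None, 0, 0, None, 0]))
```

A column that holds only NULL and 0, as valid_ticks does, yields two distinct values under these assumptions, which matches the Distinct count in Table 279.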

Data Distribution Top 20

Figure 357. Distribution of 20 most common values, from highest to lowest.

Continuous Data Distribution

Figure 358. Distribution of values in the column.

Completeness

Figure 359. Visualization of completeness of the data in the column.

Uniqueness

Figure 360. Visualization of uniqueness of the data in the column.

wind_speed_ms

Table 281. Standardised metadata of the column. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Name wind_speed_ms
Description

Not sufficiently specified by the data provider. Wind speed measured in the relevant interval, either at any time during this interval (for example, at the very end) for a period of 2 min or calculated as arithmetic mean from all measurements or from a subset of measurements in this interval. Given in or transformed to m/s.

Data type Decimal number
Descriptor pms:windSpeed [UID:0.0.WNDSP474]
Descriptor description

The rate at which air is moving horizontally past a given point. It may be a 2-minute average speed (reported as wind speed) or an instantaneous speed (reported as a peak wind speed, wind gust, or squall).

IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.WNDSP474
Unit

m s⁻¹

Table 282. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Name Length Mean Min Q1 Median Q3 Max Total Missing Zero Blank Distinct
wind_speed_ms 1 - 5 1.366 0 0 0.1 2.1 28.8 145,838 89,398 ( 61.3% ) 28,218 ( 19.3% ) 0 ( 0.0% ) 171 ( 0.1% )
Table 283. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Name Completeness Uniqueness Most Common Value Least Common Value
wind_speed_ms 38.70% 0.12% n/a 13.41

Data Distribution Top 20

Figure 361. Distribution of 20 most common values, from highest to lowest.

Data Distribution Bottom 20

Figure 362. Distribution of 20 least common values, from lowest to highest.

Outliers

Figure 363. Visualization of median, min, max, and outliers in the column.

Completeness

Figure 364. Visualization of completeness of the data in the column.

Uniqueness

Figure 365. Visualization of uniqueness of the data in the column.

wind_speed_kmh

Table 284. Standardised metadata of the column. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Name wind_speed_kmh
Description

Not sufficiently specified by the data provider. Wind speed measured in the relevant interval, either at any time during this interval (for example, at the very end) for a period of 2 min or calculated as arithmetic mean from all measurements or from a subset of measurements in this interval. Given in km/h.

Data type Decimal number
Descriptor pms:windSpeed [UID:0.0.WNDSP474]
Descriptor description

The rate at which air is moving horizontally past a given point. It may be a 2-minute average speed (reported as wind speed) or an instantaneous speed (reported as a peak wind speed, wind gust, or squall).

IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.WNDSP474
Unit

km h⁻¹

Table 285. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Name Length Mean Min Q1 Median Q3 Max Total Missing Zero Blank Distinct
wind_speed_kmh 1 - 1 0.1 0 0 0 0 8 145,838 116,181 ( 79.7% ) 27,685 ( 19.0% ) 0 ( 0.0% ) 7 ( 0.0% )
Table 286. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Name Completeness Uniqueness Most Common Value Least Common Value
wind_speed_kmh 20.34% 0.00% 0 8
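Wind speed is reported in two columns with different units, so users who combine them need a consistent conversion. The sketch below shows the standard relation 1 m/s = 3.6 km/h and a simple plausibility check on the reported maxima. The report does not state whether the two columns describe the same observations, so the check is only a cue to verify units before merging. The helper names are hypothetical.

```python
MS_TO_KMH = 3.6  # 1 m/s = 3600 m/h = 3.6 km/h


def ms_to_kmh(speed_ms: float) -> float:
    """Convert a wind speed from m/s to km/h."""
    return speed_ms * MS_TO_KMH


def kmh_to_ms(speed_kmh: float) -> float:
    """Convert a wind speed from km/h to m/s."""
    return speed_kmh / MS_TO_KMH


# Maxima reported in Tables 282 (wind_speed_ms) and 285 (wind_speed_kmh).
max_ms, max_kmh = 28.8, 8.0

# If both columns covered the same observations, the km/h maximum would be
# expected to be about 3.6 times the m/s maximum, not the other way round.
print("expected km/h for the reported m/s maximum:", ms_to_kmh(max_ms))
print("ratio of reported maxima (m/s column / km/h column):", max_ms / max_kmh)
```

The reported maximum of wind_speed_ms (28.8) is 3.6 times the reported maximum of wind_speed_kmh (8). This does not by itself show a conversion error, because the two columns have different numbers of missing values, but it is a reason to check the units against the original source before analysis.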

Data Distribution Top 20

Figure 366. Distribution of 20 most common values, from highest to lowest.

Continuous Data Distribution

Figure 367. Distribution of values in the column.

Completeness

Figure 368. Visualization of completeness of the data in the column.

Uniqueness

Figure 369. Visualization of uniqueness of the data in the column.

wind_deg

Table 287. Standardised metadata of the column. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Name wind_deg
Description

Not sufficiently specified by the data provider. Wind direction measured in the relevant interval, either at any time during this interval (for example, at the very end) or calculated as arithmetic mean from all measurements or from a subset of measurements in this interval. Given in degrees.

Data type Integer number
Descriptor pms:windDirection [UID:0.0.WNDDR475]
Descriptor description

The true direction from which the wind is blowing at a given location (i.e., wind blowing from the north to the south is a north wind). It is normally measured in tens of degrees from 10 degrees clockwise through 360 degrees. North is 360 degrees. A wind direction of 0 degrees is only used when wind is calm.

IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.WNDDR475
Unit

°

Table 288. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Name Length Mean Min Q1 Median Q3 Max Total Missing Zero Blank Distinct
wind_deg 1 - 3 135.4 0 56 110 220 360 145,838 89,398 ( 61.3% ) 328 ( 0.2% ) 0 ( 0.0% ) 362 ( 0.2% )
Table 289. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Name Completeness Uniqueness Most Common Value Least Common Value
wind_deg 38.70% 0.25% 230 318
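Because wind direction is an angle, an arithmetic mean such as the 135.4 reported in Table 288 can be misleading: the arithmetic mean of 350° and 10° is 180°, although both readings point close to north. A common alternative is the circular mean, sketched below. This is an illustration only and not a method described in the report. The function name is hypothetical, and the sketch assumes that calm records (coded 0 under the descriptor's convention) have been removed beforehand.

```python
import math
from typing import Sequence


def circular_mean_deg(directions: Sequence[float]) -> float:
    """Circular mean of wind directions in degrees, in the range [0, 360)."""
    # Average the unit vectors of all directions, then convert back to an angle.
    sin_sum = sum(math.sin(math.radians(d)) for d in directions)
    cos_sum = sum(math.cos(math.radians(d)) for d in directions)
    return math.degrees(math.atan2(sin_sum, cos_sum)) % 360.0


# Two readings on either side of north.
print(circular_mean_deg([350.0, 10.0]))  # close to north (near 0 or 360)
print(sum([350.0, 10.0]) / 2)            # arithmetic mean points south
```

Under the descriptor's convention, in which north is 360° and 0° is reserved for calm, a result near 0 would be reported as 360.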

Data Distribution Top 20

Figure 370. Distribution of 20 most common values, from highest to lowest.

Data Distribution Bottom 20

Figure 371. Distribution of 20 least common values, from lowest to highest.

Outliers

Figure 372. Visualization of median, min, max, and outliers in the column.

Completeness

Figure 373. Visualization of completeness of the data in the column.

Uniqueness

Figure 374. Visualization of uniqueness of the data in the column.

wind_gust_ms

Table 290. Standardised metadata of the column. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Name wind_gust_ms
Description

Wind gust measured in the relevant interval. Given in or transformed to m/s.

Data type Decimal number
Descriptor pms:windGust [UID:0.0.WNDGS476]
Descriptor description

Rapid fluctuations in the wind speed with a variation of 10 knots or more between peaks and lulls. The speed of the gust will be the maximum instantaneous wind speed.

IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.WNDGS476
Unit

m s⁻¹

Table 291. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Name Length Mean Min Q1 Median Q3 Max Total Missing Zero Blank Distinct
wind_gust_ms 1 - 5 5.854 0 0 0 10.8 115.2 145,838 89,486 ( 61.4% ) 28,432 ( 19.5% ) 0 ( 0.0% ) 185 ( 0.1% )
Table 292. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Name Completeness Uniqueness Most Common Value Least Common Value
wind_gust_ms 38.64% 0.13% n/a 18.9

Data Distribution Top 20

Figure 375. Distribution of 20 most common values, from highest to lowest.

Data Distribution Bottom 20

Figure 376. Distribution of 20 least common values, from lowest to highest.

Completeness

Figure 377. Visualization of completeness of the data in the column.

Uniqueness

Figure 378. Visualization of uniqueness of the data in the column.

wind_gust_kmh

Table 293. Standardised metadata of the column. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Name wind_gust_kmh
Description

Wind gust measured in the relevant interval. Given in km/h.

Data type Decimal number
Descriptor pms:windGust [UID:0.0.WNDGS476]
Descriptor description

Rapid fluctuations in the wind speed with a variation of 10 knots or more between peaks and lulls. The speed of the gust will be the maximum instantaneous wind speed.

IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.WNDGS476
Unit

km h⁻¹

Table 294. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Name Length Mean Min Q1 Median Q3 Max Total Missing Zero Blank Distinct
wind_gust_kmh 1 - 2 2.6 0 0 1 4 32 145,838 116,181 ( 79.7% ) 11,511 ( 7.9% ) 0 ( 0.0% ) 18 ( 0.0% )
Table 295. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Name Completeness Uniqueness Most Common Value Least Common Value
wind_gust_kmh 20.34% 0.01% 0 32

Data Distribution Top 20

Figure 379. Distribution of 20 most common values, from highest to lowest.

Continuous Data Distribution

Figure 380. Distribution of values in the column.

Outliers

Figure 381. Visualization of median, min, max, and outliers in the column.

Completeness

Figure 382. Visualization of completeness of the data in the column.

Uniqueness

Figure 383. Visualization of uniqueness of the data in the column.

irradiance

Table 296. Standardised metadata of the column. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Name irradiance
Description

Not sufficiently specified by the data provider. Solar irradiance measured in the relevant interval, either at any time during this interval (for example, at the very end) or calculated as arithmetic mean from all measurements or from a subset of measurements in this interval. No information is provided on the range of wavelengths that are measured.

Data type Decimal number
Descriptor pms:solarIrradiance [UID:0.0.SLRRR477]
Descriptor description

Solar irradiance is the power per unit area (surface power density) received from the Sun in the form of electromagnetic radiation in the wavelength range of the measuring instrument. Solar irradiance is measured in watts per square metre (W/m²) in SI units.

IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.SLRRR477
Unit

W m⁻²

Table 297. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Name Length Mean Min Q1 Median Q3 Max Total Missing Zero Blank Distinct
irradiance 1 - 4 100.0 0 0 2 84 1,126 145,838 35,566 ( 24.4% ) 31,131 ( 21.3% ) 0 ( 0.0% ) 1,061 ( 0.7% )
Table 298. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Name Completeness Uniqueness Most Common Value Least Common Value
irradiance 75.61% 0.73% 0 1053

Continuous Data Distribution

Figure 384. Distribution of values in the column.

Outliers

Figure 385. Visualization of median, min, max, and outliers in the column.

Completeness

Figure 386. Visualization of completeness of the data in the column.

Uniqueness

Figure 387. Visualization of uniqueness of the data in the column.

energy_density

Table 299. Standardised metadata of the column. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Name energy_density
Description

Not sufficiently specified by the data provider. Rate of solar radiation measured in the relevant interval, either at any time during this interval (for example, at the very end) or calculated as arithmetic mean from all measurements or from a subset of measurements in this interval. No information is provided on the range of wavelengths that are measured.

Data type Decimal number
Descriptor pms:rateOfSolarRadiation [UID:0.0.RTFSL478]
Descriptor description

The langley (Ly) is a unit of heat transmission, especially used to express the rate of solar radiation (or insolation) received by the earth.

IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.RTFSL478
Unit

J cm⁻²

Table 300. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Name Length Mean Min Q1 Median Q3 Max Total Missing Zero Blank Distinct
energy_density 1 - 3 67.6 0 0 3 115 364 145,838 137,055 ( 94.0% ) 4,130 ( 2.8% ) 0 ( 0.0% ) 363 ( 0.2% )
Table 301. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Name Completeness Uniqueness Most Common Value Least Common Value
energy_density 6.02% 0.25% 0 254
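The descriptor for energy_density refers to the langley, while the column unit is J cm⁻². The sketch below shows how the two relate (1 langley is one thermochemical calorie per cm², that is 4.184 J cm⁻²) and how a constant irradiance in W m⁻² translates into an energy density over a time interval. The report does not state the interval length or whether energy_density was derived from irradiance, so `interval_s` and both function names are hypothetical.

```python
J_PER_CM2_PER_LANGLEY = 4.184  # 1 langley = 1 thermochemical calorie per cm^2
CM2_PER_M2 = 10_000            # 1 m^2 = 10,000 cm^2


def langley_to_j_per_cm2(langley: float) -> float:
    """Convert an energy density from langley to J/cm^2."""
    return langley * J_PER_CM2_PER_LANGLEY


def irradiance_to_j_per_cm2(irradiance_w_m2: float, interval_s: float) -> float:
    """Energy per cm^2 delivered by a constant irradiance over interval_s seconds."""
    # W/m^2 * s gives J/m^2; dividing by 10,000 converts to J/cm^2.
    return irradiance_w_m2 * interval_s / CM2_PER_M2


# Example: a constant 100 W/m^2 over a hypothetical one-hour interval.
print(irradiance_to_j_per_cm2(100.0, 3600.0), "J/cm^2")
print(langley_to_j_per_cm2(1.0), "J/cm^2 per langley")
```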

Data Distribution Top 20

Figure 388. Distribution of 20 most common values, from highest to lowest.

Data Distribution Bottom 20

Figure 389. Distribution of 20 least common values, from lowest to highest.

Outliers

Figure 390. Visualization of median, min, max, and outliers in the column.

Completeness

Figure 391. Visualization of completeness of the data in the column.

Uniqueness

Figure 392. Visualization of uniqueness of the data in the column.

irradiance_max

Table 302. Standardised metadata of the column. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Name irradiance_max
Description

Maximum solar irradiance measured in the relevant interval.

Data type Decimal number
Descriptor pms:solarIrradiance [UID:0.0.SLRRR477]
Descriptor description

Solar irradiance is the power per unit area (surface power density) received from the Sun in the form of electromagnetic radiation in the wavelength range of the measuring instrument. Solar irradiance is measured in watts per square metre (W/m²) in SI units.

IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.SLRRR477
Unit

W m⁻²

Table 303. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Name Length Mean Min Q1 Median Q3 Max Total Missing Zero Blank Distinct
irradiance_max 1 - 4 82.3 0 2 2 70 1,218 145,838 77,990 ( 53.5% ) 12,335 ( 8.5% ) 0 ( 0.0% ) 563 ( 0.4% )
Table 304. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Name Completeness Uniqueness Most Common Value Least Common Value
irradiance_max 46.52% 0.39% 2 960

Continuous Data Distribution

Figure 393. Distribution of values in the column.

Outliers

Figure 394. Visualization of median, min, max, and outliers in the column.

Completeness

Figure 395. Visualization of completeness of the data in the column.

Uniqueness

Figure 396. Visualization of uniqueness of the data in the column.

clouds

Table 305. Standardised metadata of the column. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Name clouds
Description

No information is provided on this parameter. It presumably gives the percentage of the sky that is covered with clouds.

Data type Integer number
Descriptor Integer [UID:0.0.NTGER313]
Descriptor description

A number with no fractional part, including the negative and positive numbers as well as zero.

IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.NTGER313
Unit

%

Table 306. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Name Length Mean Min Q1 Median Q3 Max Total Missing Zero Blank Distinct
clouds 1 - 3 44.6 0 20 40 75 100 145,838 119,055 ( 81.6% ) 5,181 ( 3.6% ) 0 ( 0.0% ) 102 ( 0.1% )
Table 307. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Name Completeness Uniqueness Most Common Value Least Common Value
clouds 18.36% 0.07% 75 78

Data Distribution Top 20

Figure 397. Distribution of 20 most common values, from highest to lowest.

Data Distribution Bottom 20

Figure 398. Distribution of 20 least common values, from lowest to highest.

Outliers

Figure 399. Visualization of median, min, max, and outliers in the column.

Completeness

Figure 400. Visualization of completeness of the data in the column.

Uniqueness

Figure 401. Visualization of uniqueness of the data in the column.

visibility

Table 308. Standardised metadata of the column. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Name visibility
Description

No information is provided on this parameter. It presumably gives the atmospheric visibility for the human eye, in m.

Data type Integer number
Descriptor Integer [UID:0.0.NTGER313]
Descriptor description

A number with no fractional part, including the negative and positive numbers as well as zero.

IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.NTGER313
Unit

m

Table 309. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Name Length Mean Min Q1 Median Q3 Max Total Missing Zero Blank Distinct
visibility 1 - 5 8,920.7 0 9,999 10,000 10,000 10,000 145,838 119,156 ( 81.7% ) 155 ( 0.1% ) 0 ( 0.0% ) 67 ( 0.0% )
Table 310. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Name Completeness Uniqueness Most Common Value Least Common Value
visibility 18.30% 0.05% 10000 49

Data Distribution Top 20

Figure 402. Distribution of 20 most common values, from highest to lowest.

Data Distribution Bottom 20

Figure 403. Distribution of 20 least common values, from lowest to highest.

Outliers

Figure 404. Visualization of median, min, max, and outliers in the column.

Completeness

Figure 405. Visualization of completeness of the data in the column.

Uniqueness

Figure 406. Visualization of uniqueness of the data in the column.

carbon_dioxide

Table 311. Standardised metadata of the column. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Name carbon_dioxide
Description

No information is provided on this parameter.

Data type Integer number
Descriptor Integer [UID:0.0.NTGER313]
Descriptor description

A number with no fractional part, including the negative and positive numbers as well as zero.

IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.NTGER313
Unit

n/a

Table 312. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Name Length Mean Min Q1 Median Q3 Max Total Missing Zero Blank Distinct
carbon_dioxide 0 - 0 n/a n/a n/a n/a n/a n/a 145,838 145,838 ( 100.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 1 ( 0.0% )
Table 313. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Name Completeness Uniqueness Most Common Value Least Common Value
carbon_dioxide 0.00% 0.00% n/a n/a

Data Distribution Top 20

Figure 407. Distribution of 20 most common values, from highest to lowest.

Completeness

Figure 408. Visualization of completeness of the data in the column.

Uniqueness

Figure 409. Visualization of uniqueness of the data in the column.

weather_id

Table 314. Standardised metadata of the column. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Name weather_id
Description

Internal weather condition code adopted by the provider of the weather data.

Data type Integer number
Descriptor Integer [UID:0.0.NTGER313]
Descriptor description

A number with no fractional part, including the negative and positive numbers as well as zero.

IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.NTGER313
Unit

n/a

Table 315. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Name Length Mean Min Q1 Median Q3 Max Total Missing Zero Blank Distinct
weather_id 3 - 3 727.0 200 701 800 802 804 145,838 119,055 ( 81.6% ) 0 ( 0.0% ) 0 ( 0.0% ) 28 ( 0.0% )
Table 316. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Name Completeness Uniqueness Most Common Value Least Common Value
weather_id 18.36% 0.02% 800 621

Data Distribution Top 20

Figure 410. Distribution of 20 most common values, from highest to lowest.

Data Distribution Bottom 20

Figure 411. Distribution of 20 least common values, from lowest to highest.

Outliers

Figure 412. Visualization of median, min, max, and outliers in the column.

Completeness

Figure 413. Visualization of completeness of the data in the column.

Uniqueness

Figure 414. Visualization of uniqueness of the data in the column.

weather_main

Table 317. Standardised metadata of the column. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Name weather_main
Description

Internal main group for the description of the weather adopted by the provider of the weather data.

Data type String
Descriptor Text [UID:0.0.TEXTA315]
Descriptor description

In computer programming, a string is traditionally a sequence of characters, either as a literal constant or as some kind of variable. The latter may allow its elements to be mutated and the length changed, or it may be fixed (after creation). A string is generally considered as a data type and is often implemented as an array data structure of bytes (or words) that stores a sequence of elements, typically characters, using some character encoding. String may also denote more general arrays or other sequence (or list) data types and structures.

IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.TEXTA315
Unit

n/a

Table 318. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Name Length Mean Min Q1 Median Q3 Max Total Missing Zero Blank Distinct
weather_main 0 - 12 n/a Clear n/a n/a n/a Thunderstorm 145,838 119,055 ( 81.6% ) 0 ( 0.0% ) 0 ( 0.0% ) 11 ( 0.0% )
Table 319. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Name Completeness Uniqueness Most Common Value Least Common Value
weather_main 18.36% 0.01% n/a Haze

Data Distribution Top 20

Figure 415. Distribution of 20 most common values, from highest to lowest.

Completeness

Figure 416. Visualization of completeness of the data in the column.

Uniqueness

Figure 417. Visualization of uniqueness of the data in the column.

weather_description

Table 320. Standardised metadata of the column. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Name weather_description
Description

Internal subgroup for the description of the weather adopted by the provider of the weather data.

Data type String
Descriptor Text [UID:0.0.TEXTA315]
Descriptor description

In computer programming, a string is traditionally a sequence of characters, either as a literal constant or as some kind of variable. The latter may allow its elements to be mutated and the length changed, or it may be fixed (after creation). A string is generally considered as a data type and is often implemented as an array data structure of bytes (or words) that stores a sequence of elements, typically characters, using some character encoding. String may also denote more general arrays or other sequence (or list) data types and structures.

IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.TEXTA315
Unit

n/a

Table 321. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Name Length Mean Min Q1 Median Q3 Max Total Missing Zero Blank Distinct
weather_description 0 - 28 n/a broken cloud… n/a n/a n/a very heavy r… 145,838 119,055 ( 81.6% ) 0 ( 0.0% ) 0 ( 0.0% ) 29 ( 0.0% )
Table 322. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Name Completeness Uniqueness Most Common Value Least Common Value
weather_description 18.36% 0.02% n/a shower snow

Data Distribution Top 20

Figure 418. Distribution of 20 most common values, from highest to lowest.

Data Distribution Bottom 20

Figure 419. Distribution of 20 least common values, from lowest to highest.

Completeness

Figure 420. Visualization of completeness of the data in the column.

Uniqueness

Figure 421. Visualization of uniqueness of the data in the column.

weather_icon

Table 323. Standardised metadata of the column. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Name weather_icon
Description

Internal code for icons describing the weather conditions, adopted by the provider of the weather data.

Data type String
Descriptor Text [UID:0.0.TEXTA315]
Descriptor description

In computer programming, a string is traditionally a sequence of characters, either as a literal constant or as some kind of variable. The latter may allow its elements to be mutated and the length changed, or it may be fixed (after creation). A string is generally considered as a data type and is often implemented as an array data structure of bytes (or words) that stores a sequence of elements, typically characters, using some character encoding. String may also denote more general arrays or other sequence (or list) data types and structures.

IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.TEXTA315
Unit

n/a

Table 324. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Name Length Mean Min Q1 Median Q3 Max Total Missing Zero Blank Distinct
weather_icon 0 - 3 n/a 01d n/a n/a n/a 50n 145,838 119,055 ( 81.6% ) 0 ( 0.0% ) 0 ( 0.0% ) 19 ( 0.0% )
Table 325. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Name Completeness Uniqueness Most Common Value Least Common Value
weather_icon 18.36% 0.01% n/a 11n

Data Distribution Top 20

Figure 422. Distribution of 20 most common values, from highest to lowest.

Completeness

Figure 423. Visualization of completeness of the data in the column.

Uniqueness

Figure 424. Visualization of uniqueness of the data in the column.

battery

Table 326. Standardised metadata of the column. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Name battery
Description

Voltage of the batteries of the device/system.

Data type Decimal number
Descriptor DecimalNumber [UID:0.0.DCMLN314]
Descriptor description

Any of the rational or irrational numbers.

IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.DCMLN314
Unit

V

Table 327. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Name Length Mean Min Q1 Median Q3 Max Total Missing Zero Blank Distinct
battery 1 - 4 4.169 3 4.15 4.15 4.2 4.25 145,838 65,223 ( 44.7% ) 0 ( 0.0% ) 0 ( 0.0% ) 6 ( 0.0% )
Table 328. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Name Completeness Uniqueness Most Common Value Least Common Value
battery 55.28% 0.00% n/a 3

Data Distribution Top 20

Figure 425. Distribution of 20 most common values, from highest to lowest.

Continuous Data Distribution

Figure 426. Distribution of values in the column.

Outliers

Figure 427. Visualization of median, min, max, and outliers in the column.

Completeness

Figure 428. Visualization of completeness of the data in the column.

Uniqueness

Figure 429. Visualization of uniqueness of the data in the column.

payload

Table 329. Standardised metadata of the column. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Name payload
Description

No information is provided on this parameter.

Data type String
Descriptor Text [UID:0.0.TEXTA315]
Descriptor description

In computer programming, a string is traditionally a sequence of characters, either as a literal constant or as some kind of variable. The latter may allow its elements to be mutated and the length changed, or it may be fixed (after creation). A string is generally considered as a data type and is often implemented as an array data structure of bytes (or words) that stores a sequence of elements, typically characters, using some character encoding. String may also denote more general arrays or other sequence (or list) data types and structures.

IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.TEXTA315
Unit

n/a

Table 330. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Name Length Mean Min Q1 Median Q3 Max Total Missing Zero Blank Distinct
payload 0 - 23 n/a x n/a n/a n/a x7355810d052… 145,838 100,651 ( 69.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 44,675 ( 30.6% )
Table 331. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Name Completeness Uniqueness Most Common Value Least Common Value
payload 30.98% 30.63% n/a x7349c10d612e607a1701ff

Completeness

Figure 430. Visualization of completeness of the data in the column.

Uniqueness

Figure 431. Visualization of uniqueness of the data in the column.

time_sync_error_s

Table 332. Standardised metadata of the column. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Name time_sync_error_s
Description

No information is provided on this parameter.

Data type Integer number
Descriptor Integer [UID:0.0.NTGER313]
Descriptor description

A number with no fractional part, including the negative and positive numbers as well as zero.

IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.NTGER313
Unit

s

Table 333. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Name Length Mean Min Q1 Median Q3 Max Total Missing Zero Blank Distinct
time_sync_error_s 1 - 4 8.8 -300 0 11 15 299 145,838 77,990 ( 53.5% ) 22,689 ( 15.6% ) 0 ( 0.0% ) 599 ( 0.4% )
Table 334. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Name Completeness Uniqueness Most Common Value Least Common Value
time_sync_error_s 46.52% 0.41% 0 -82

Continuous Data Distribution

Figure 432. Distribution of values in the column.

Outliers

Figure 433. Visualization of median, min, max, and outliers in the column.

Completeness

Figure 434. Visualization of completeness of the data in the column.

Uniqueness

Figure 435. Visualization of uniqueness of the data in the column.

seq_number_modem

Table 335. Standardised metadata of the column. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Name seq_number_modem
Description

No information is provided on this parameter.

Data type Integer number
Descriptor Integer [UID:0.0.NTGER313]
Descriptor description

A number with no fractional part, including the negative and positive numbers as well as zero.

IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.NTGER313
Unit

n/a

Table 336. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Name Length Mean Min Q1 Median Q3 Max Total Missing Zero Blank Distinct
seq_number_modem 1 - 4 1,829.8 0 959.25 1,726 2,545.75 4,095 145,838 133,622 ( 91.6% ) 2 ( 0.0% ) 0 ( 0.0% ) 4,097 ( 2.8% )
Table 337. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Name Completeness Uniqueness Most Common Value Least Common Value
seq_number_modem 8.38% 2.81% 438 76

Continuous Data Distribution

Figure 436. Distribution of values in the column.

Outliers

Figure 437. Visualization of median, min, max, and outliers in the column.

Completeness

Figure 438. Visualization of completeness of the data in the column.

Uniqueness

Figure 439. Visualization of uniqueness of the data in the column.

seq_number_firmware

Table 338. Standardised metadata of the column. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Name seq_number_firmware
Description

No information is provided on this parameter.

Data type String
Descriptor Text [UID:0.0.TEXTA315]
Descriptor description

In computer programming, a string is traditionally a sequence of characters, either as a literal constant or as some kind of variable. The latter may allow its elements to be mutated and the length changed, or it may be fixed (after creation). A string is generally considered as a data type and is often implemented as an array data structure of bytes (or words) that stores a sequence of elements, typically characters, using some character encoding. String may also denote more general arrays or other sequence (or list) data types and structures.

IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.TEXTA315
Unit

n/a

Table 339. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Name Length Mean Min Q1 Median Q3 Max Total Missing Zero Blank Distinct
seq_number_firmware 0 - 0 n/a n/a n/a n/a 145,838 145,838 ( 100.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 1 ( 0.0% )
Table 340. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Name Completeness Uniqueness Most Common Value Least Common Value
seq_number_firmware 0.00% 0.00% n/a n/a

Data Distribution Top 20

Figure 440. Distribution of 20 most common values, from highest to lowest.

Completeness

Figure 441. Visualization of completeness of the data in the column.

Uniqueness

Figure 442. Visualization of uniqueness of the data in the column.

temperature_wetbulb_stull2011_C

Table 341. Standardised metadata of the column. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Name temperature_wetbulb_stull2011_C
Description

No information is provided on this parameter.

Data type Decimal number
Descriptor DecimalNumber [UID:0.0.DCMLN314]
Descriptor description

Any of the rational or irrational numbers.

IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.DCMLN314
Unit

°C

Table 342. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Name Length Mean Min Q1 Median Q3 Max Total Missing Zero Blank Distinct
temperature_wetbulb_stull2011_C 1 - 5 10.667 -4.35 7 10.66 14.21 24.67 145,838 123,177 ( 84.5% ) 1 ( 0.0% ) 0 ( 0.0% ) 2,530 ( 1.7% )
Table 343. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Name Completeness Uniqueness Most Common Value Least Common Value
temperature_wetbulb_stull2011_C 15.54% 1.73% n/a 16.98

Continuous Data Distribution

Figure 443. Distribution of values in the column.

Outliers

Figure 444. Visualization of median, min, max, and outliers in the column.

Completeness

Figure 445. Visualization of completeness of the data in the column.

Uniqueness

Figure 446. Visualization of uniqueness of the data in the column.
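The report provides no description for temperature_wetbulb_stull2011_C. The column name suggests, but does not confirm, that the values are wet-bulb temperatures derived with the empirical approximation published by Stull (2011), which estimates wet-bulb temperature from air temperature (°C) and relative humidity (%). The sketch below implements that approximation under this assumption only. The function name is hypothetical, and it is not known which input columns or which processing step the data provider actually used.

```python
import math


def wet_bulb_stull(temp_c: float, rh_pct: float) -> float:
    """Wet-bulb temperature in degrees Celsius after the Stull (2011) approximation.

    temp_c: air temperature in degrees Celsius
    rh_pct: relative humidity in percent (e.g. 50 for 50 %)
    Assumption: this is the formula the column name refers to.
    """
    return (
        temp_c * math.atan(0.151977 * math.sqrt(rh_pct + 8.313659))
        + math.atan(temp_c + rh_pct)
        - math.atan(rh_pct - 1.676331)
        + 0.00391838 * rh_pct ** 1.5 * math.atan(0.023101 * rh_pct)
        - 4.686035
    )


# Example: 20 degrees Celsius at 50 % relative humidity gives roughly 13.7.
print(round(wet_bulb_stull(20.0, 50.0), 1))
```

If the data provider confirms this derivation, recomputing the values from the temperature and RH columns would be one way to validate the column. Until then the sketch is illustrative only.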

Changes made to preparatory file

  1. Data reporting atmospheric pressure in the raw data files was either provided in Pa or in hPa. Values provided in hPa were transformed to Pa in order to store all values in a common unit while maintaining the raw data.
  2. Data reporting wind speed in the raw data files was either provided in m/s or in km/h. Values provided in km/h were transformed to m/s in order to store all values in a common unit while maintaining the raw data.
  3. Data reporting wind gust in the raw data files was either provided in m/s or in km/h. Values provided in km/h were transformed to m/s in order to store all values in a common unit while maintaining the raw data.
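
The three transformations listed above can be sketched as follows. This is a minimal illustration, assuming each record is a dictionary in which the raw value is kept under its original key and the harmonised value is written to a separate key. The helper names and the pressure key names in the usage example are hypothetical and do not describe the tooling actually used by the EUPH.

```python
from typing import Callable, Optional


def hpa_to_pa(value_hpa: float) -> float:
    """Convert hectopascal to pascal (1 hPa = 100 Pa)."""
    return value_hpa * 100.0


def kmh_to_ms(value_kmh: float) -> float:
    """Convert kilometres per hour to metres per second (1 km/h = 1000 m / 3600 s)."""
    return value_kmh * 1000.0 / 3600.0


def fill_common_unit(
    record: dict,
    raw_key: str,
    common_key: str,
    convert: Callable[[float], float],
) -> dict:
    """Return a copy of the record with common_key filled from raw_key.

    The raw value is left untouched, so the original data are maintained.
    An existing value under common_key is never overwritten.
    """
    result = dict(record)
    raw: Optional[float] = result.get(raw_key)
    if result.get(common_key) is None and raw is not None:
        result[common_key] = convert(raw)
    return result


# Usage with hypothetical key names for a record that reports km/h and hPa only.
row = {"wind_speed_kmh": 36.0, "wind_speed_ms": None, "pressure_hPa": 1013.25}
row = fill_common_unit(row, "wind_speed_kmh", "wind_speed_ms", kmh_to_ms)
row = fill_common_unit(row, "pressure_hPa", "pressure_Pa", hpa_to_pa)
print(row["wind_speed_ms"], row["pressure_Pa"])
```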

Changes made to data

No changes were made in the data file.

Unresolved issues

  1. In the columns temperature, temperature_min, temperature_max, feelslike and dew_point, obtained from the device located in Romania, 21,694 records contain values > 100°C. These records must be revised by the data provider.
  2. In 3 records from the device in Belgium, values are out of range. These records must be revised by the data provider.
  3. The description of the data (metadata) is largely incomplete and allows no clear standardisation of the data.
    • For column temperature it is unclear how the temperature is reported in the individual raw data files: either measured at a single point in time during the reporting interval (for example, at the very end), or calculated as the arithmetic mean of all measurements or of a subset of measurements in that interval.
    • For column feelslike it is unclear which algorithm was used to calculate the values reported in the raw data files.
    • For column dewpoint it is unclear which algorithm was used to calculate the values reported in the raw data files.
    • For column RH it is unclear how the relative humidity is reported in the individual raw data files: either measured at a single point in time during the reporting interval (for example, at the very end), or calculated as the arithmetic mean of all measurements or of a subset of measurements in that interval.
    • For columns atmpressure_pa and atmpressureh_pa it is unclear how the atmospheric pressure is reported in the individual raw data files: either measured at a single point in time during the reporting interval (for example, at the very end), or calculated as the arithmetic mean of all measurements or of a subset of measurements in that interval.
    • For data reported in column atmpressure_sealevel_Pa and atmpressure_sealevel_hPa it is unclear which algorithm was used to calculate the values reported in the raw data files.
    • For data reported in column atmpressure_grndlevel_hPa it is unclear what is reported in the raw data file.
    • For data reported in column rain_counter it is unclear what is reported in the raw data file.
    • For data reported in column rain_max it is unclear what is reported in the raw data file.
    • For data reported in column valid_ticks it is unclear what is reported in the raw data file.
    • For columns wind_speed_ms and wind_speed_kmh it is unclear how the wind speed is reported in the individual raw data files: either measured at a single point in time during the reporting interval (for example, at the very end), or calculated as the arithmetic mean of all measurements or of a subset of measurements in that interval.
    • For columns wind_gust_ms and wind_gust_kmh it is unclear how the wind gust is reported in the individual raw data files: either measured at a single point in time during the reporting interval (for example, at the very end), or calculated as the arithmetic mean of all measurements or of a subset of measurements in that interval.
    • For column wind_deg it is unclear how the wind direction is reported in the individual raw data files: either measured at a single point in time during the reporting interval (for example, at the very end), or calculated as the arithmetic mean of all measurements or of a subset of measurements in that interval.
    • For column irradiance it is unclear how the solar irradiance is reported in the individual raw data files: either measured at a single point in time during the reporting interval (for example, at the very end), or calculated as the arithmetic mean of all measurements or of a subset of measurements in that interval.
    • For column energy density it is unclear how the rate of solar radiation is reported in the individual raw data files: either measured at a single point in time during the reporting interval (for example, at the very end), or calculated as the arithmetic mean of all measurements or of a subset of measurements in that interval.
    • For data reported in column clouds it is unclear what is reported in the raw data file.
    • For data reported in column visibility it is unclear what is reported in the raw data file.
    • For data reported in column carbon_dioxide it is unclear what is reported in the raw data file. Since this column does not contain data in either of the two tables, it could also be deleted.
    • For data reported in column payload it is unclear what is reported in the raw data file.
    • For data reported in column time_sync_error_s it is unclear what is reported in the raw data file.
    • For data reported in column seq_number_modem it is unclear what is reported in the raw data file.
    • For data reported in column seq_number_firmware it is unclear what is reported in the raw data file. Since this column does not contain data in either of the two tables, it could also be deleted.
    • For data reported in column temperature_wetbulb_stull2011_C it is unclear what is reported in the raw data file.
  4. Data in raw data files acquired in 2020 contain:
    • 9 triplicate records for the same date and time (3 x 8 records in raw data files from the device in Switzerland, 3 x 1 record in a raw data file from the device in Romania). These records must be revised by the data provider.
    • 413 duplicate records for the same date and time (2 x 272 records in raw data files from the device in Switzerland, 2 x 141 records in a raw data file from the device in Romania). These records must be revised by the data provider.
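
Two of the issues listed above, implausible temperature values and repeated records for the same date and time, lend themselves to automated checks. The sketch below is one possible way to flag them, assuming records are dictionaries with hypothetical keys device and timestamp. The temperature column names are those given in item 1, and the 100°C limit is the threshold stated there. It does not reproduce the procedure used to obtain the counts in this report.

```python
from collections import Counter

# Column names as listed in item 1 of the unresolved issues.
TEMPERATURE_COLUMNS = (
    "temperature", "temperature_min", "temperature_max", "feelslike", "dew_point",
)


def implausible_temperature(records, limit_c=100.0):
    """Return the records in which any temperature column exceeds limit_c."""
    return [
        r for r in records
        if any(r.get(c) is not None and r[c] > limit_c for c in TEMPERATURE_COLUMNS)
    ]


def timestamp_multiplicity(records):
    """Count how many (device, timestamp) pairs occur 2, 3, ... times.

    The result maps multiplicity to the number of affected pairs,
    e.g. {2: 413, 3: 9} for 413 duplicates and 9 triplicates.
    """
    per_key = Counter((r["device"], r["timestamp"]) for r in records)
    return Counter(n for n in per_key.values() if n > 1)


# Small invented sample, for illustration only (not taken from the dataset).
sample = [
    {"device": "A", "timestamp": "2020-05-01T10:00", "temperature": 14.2},
    {"device": "A", "timestamp": "2020-05-01T10:00", "temperature": 14.2},
    {"device": "B", "timestamp": "2020-05-01T10:00", "temperature": 153.0},
    {"device": "B", "timestamp": "2020-05-01T10:05", "temperature": None},
]
print(timestamp_multiplicity(sample))
print(len(implausible_temperature(sample)))
```

Keying on the pair of device and timestamp rather than on the timestamp alone avoids flagging simultaneous readings from different study sites as duplicates.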