EU Pollinator Hub

, ,

Dataset Report
Unique identifier: CNTRS2.0.0
Title: Countries
Long title: EUPH reference dataset containing names and codes of countries, aggregates and subdivisions of countries
Status: Quality Validated
Current Version: v. 1.0
Published: 2023-01-26
Reviewed by:
Citation proposal:
EU Pollinator Hub 2023 Report of dataset Countries, v. 1.0 [CNTRS2.0.0]. EU Pollinator Hub. [2026-02-24] app.pollinatorhub.eu
Compliance with FAIR* principles
Findable
Accessible
Interoperable
Reusable
See https://www.go-fair.org/fair-principles for more information about FAIR principles
Data Quality
Under evaluation

Document history

Release

Version v. 1.0 released on 2023-01-26.

Revision

Table 1. List of revisions made to the document. Identifier of revision (No); date of revision (Date); description of revision (Description); reason for revision (Reason).
No Date Description Reason
1 2023-01-26 00:01:00 Initial release. n/a

Abbreviations

EUPH
EU Pollinator Hub
ISO
International Organization for Standardization
OBP
Online Browsing Platform
UID
Unique Identifier
UNSD
United Nations Statistics Division
n.a.
not available

Executive summary

Data overview:

The dataset contains ISO 3166 country codes published by and the Organization for Standardization (ISO) and the names of countries published by the United Nations Statistics Division (UNSD) of the entire world, which are currently in use, for internal use on the EU Pollinator Hub.

Data value:

The dataset is required by the EUPH for administrative purposes and fulfils an important role in data standardisation across datasets.

Data description:

The dataset contains 291 records of countries included in the ISO standard 3166-1:2020 - Part 1: Country code and ISO 3166-3:2020 - Part 3: Code for formerly used names of countries and 747 records of region names, sub-region names, intermediate region names, and 249 distinct countries or areas and their respective codes in three languages (English, French, Spanish) according to the Standard country or area codes for statistical use (M49) published by the United Nations Statistics Division (UNSD).

Data application:

This data will be used for the standardisation of data integrated on the EU Pollinator Hub (EUPH).

Unresolved issues:

n/a

Introduction

Standardisation is an important goal of the EU Pollinator Hub (EUPH). Existing standards are prioritised to achieve this goal. The Organization for Standardization (ISO) is an important source for internationally recognized standards, for some of which ISO grants free-of-charge use. ISO-3166 is an internationally recognised and adopted, free-of-charge standard for country codes and their subdivisions. Its purpose is to provide codes of letters and numbers, which can – and should – be used when referring to countries and their subdivisions. The standard does not include the name of the countries, which have been imported from the Standard country or area codes for statistical use (M49) of the United Nations Statistics Division (UNSD). The dataset contains ISO-3166 country codes published by and the Organization for Standardization (ISO) and the names of countries published by the United Nations Statistics Division (UNSD) of the entire world, which are currently in use, for internal use on the EU Pollinator Hub.

Material and methods

Data acquisition

Raw data, containing part of the data contained in “ISO 3166-1:2020 Codes for the representation of names of countries and their subdivisions — Part 1: Country code” published by the Organization for Standardization (ISO), was manually sampled from the web page Online Browsing Platform (OBP) [1] on 2022-11-14 after selecting ‘Country codes’ from the main list, ‘Officially assigned codes’, ‘Formerly used’, ‘Transitionally reserved’ and ‘Exceptionally reserved’ as Code type and following the individual link provided for each country contained in the list. ‘Indeterminately reserved’ and ‘Unassigned’ code types were not integrated in the dataset. Standard country or area codes for statistical use (M49) where downloaded from the Website of the United Nations Statistics Division (UNSD) on 2023-01-26 [2].

Table 2. List of raw data and metadata files included in the dataset. Identifier of table row (No); name of the file (File); the type of the file (Type); file contains data (D); file contains metadata (M); date of upload of the file to the EU Pollinator Hub (Arrival); number of data points contained within the file (if applicable); uploaded file size.
No File Type D M Arrival Data points File size
1 euph_000002_countries_table_iso3166-1-2020.csv CSV - Comma seperated values Yes No 2023-09-01 10:09:37 4,365 n/a
2 unsdm49_PREP_MR_230127.csv CSV - Comma seperated values Yes No 2025-02-12 17:02:27 10,959 69.19 KiB

Data preparation

Data from the ISO web page was copied into a Microsoft® Excel® worksheet (Microsoft Corporation, Version 2210 Build 16.0.15726.20070). Prepared data was then converted to CSV. Data from, USDN was downloaded as csv file. Both were imported for profiling into a SQL database (MariaDB foundation, Server-Version 10.4.24) running in a XAMPP environment (BitRock, version 3.3.0). Data was exported from the MySQL database to CSV format using utf-8 coding. Data profiling was performed according to SOP-005 Data profiling using phpMyAdmin (version 5.2.0).

Data validation

n.a.

Data analysis

n.a.

Data description

Dataset

Table 3. Summary of tables belonging to the dataset. Table row identifier (No); name of the table (Table); description of the table (Description).
No Table Description
1 ISO 3166:2020 Data in this table was obtained from the International Organization for Standardization (ISO) Data, an independent, non-governmental international organization with…
2 UNSD M49 Data in this table was obtained from the United Nations Statistics Division (UNSD), serving under the United Nations Department of…
Table 4. Standardised metadata of the dataset. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
interactions.single.uid CNTRS2.0.0
Title Countries
Long title EUPH reference dataset containing names and codes of countries, aggregates and subdivisions of countries
Target IRI https://app.pollinatorhub.eu/dataset-discovery/CNTRS2.0.0
interactions.single.section-details.licence EU Pollinator Hub
DOI n/a
Created 2023-01-26
Published 2023-01-26
Contact www.iso.org
Keywords n/a
Data collection years n/a
Regions, the data was collected in n/a
Abstract

The dataset contains standardised information on countries published by the Organization for Standardization (ISO) and the United Nations Statistics Division (UNSD).

Table 5. Standardised metadata of the data provider EU Pollinator Hub. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Name EU Pollinator Hub
Url
Acronym EUPH
IRI https://app.pollinatorhub.eu/data-providers/euph
Address
Country Belgium
Contact https://www.linkedin.com/company/beelife-european-beekeeping-coordination/ pollinatorhub.eu
Description

The EU Pollinator Hub (EUPH) is a data hub related to pollinators, which is provided by the European Food Safety Authority (EFSA).

Tables

ISO 3166:2020

Table 6. Standardised metadata of the dataset. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Unique identifier CNTRS2.ISOAB1.0
Name ISO 3166:2020
Target IRI https://app.pollinatorhub.eu/dataset-discovery/parts/CNTRS2.ISOAB1.0
Table Type File
Licence EU Pollinator Hub
Description

Data in this table was obtained from the International Organization for Standardization (ISO) Data, an independent, non-governmental international organization with a membership of 167 national standards bodies. It contains codes and names for the representation of names of countries and their subdivisions.

Data in this table was obtained from the International Organization for Standardization (ISO) Data, an independent, non-governmental international organization with a membership of 167 national standards bodies. It contains codes and names for the representation of names of countries and their subdivisions.

Metadata

Table 7. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Column Name Column Description Datatype Descriptor Unit
CountryId
String pms:recordID [0.0.RCRDD344]

n/a

NumericCode
String unsd:m49Area [0.0.MAREA519]

n/a

FullName

Full name of the country, as recorded in UNTERM.

String Text [0.0.TEXTA315]

n/a

ShortName
String iso-3166:shortUppercaseNameOfCountry [0.0.SHRTC3]

n/a

ShortNameLcEn
String iso-3166:shortLowercaseNameOfCountry [0.0.SHRTC10]

n/a

Alpha2Code
String iso-3166:alpha-2CountryCode [0.0.LPHCN4]

n/a

Alpha3Code
String iso-3166:alpha-3CountryCode [0.0.LPHCN5]

n/a

Alpha4Code
String iso-3166:alpha-4CountryCode [0.0.LPHCN841]

n/a

StartDate
Integer number pms:recordStartYearValidity [0.0.STRTR14]

n/a

EndDate
Integer number pms:recordEndYearValidity [0.0.NDDTE15]

n/a

Status

Information on the status of the country in the dataset.

String Text [0.0.TEXTA315]

n/a

AnnotationStatus

Annotation to the status of the country in the dataset.

String Text [0.0.TEXTA315]

n/a

AnnotationP1En

Remarks related to the country name, such as other widely-used country names, as defined in ISO 3166-1:2013, Codes for the representation of names of countries and their subdivisions — Part 1: Country codes.

String Text [0.0.TEXTA315]

n/a

AnnotationP2En

Remarks related to a country's subdivisions, as deinfed in ISO 3166-2:2013, Codes for the representation of names of countries and their subdivisions — Part 2: Country subdivision code.

String Text [0.0.TEXTA315]

n/a

AnnotationP3En

Remarks related to formerly used codes of a country, as defined in ISO 3166-3:2013, Codes for the representation of names of countries and their subdivisions — Part 3: Code for formerly used names of countries.

String Text [0.0.TEXTA315]

n/a

Metadata of individual tables can be found in Annex 1.

Descriptive measures

Table 8. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Column Name Range Mean Minimum Q1 Median Q3 Maximum Total Missing Zero Blank Distinct
CountryId 1 - 3 146.0 1 73 146 219 291 291 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 291 ( 100.0% )
NumericCode 1 - 4 n/a 4 n/a n/a n/a NULL 291 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 266 ( 91.4% )
FullName 4 - 56 n/a NULL n/a n/a n/a the United K… 291 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 174 ( 59.8% )
ShortName 4 - 52 n/a CHAD n/a n/a n/a UNITED KINGD… 291 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 291 ( 100.0% )
ShortNameLcEn 4 - 58 n/a Chad n/a n/a n/a United Kingd… 291 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 291 ( 100.0% )
Alpha2Code 2 - 2 n/a AC n/a n/a n/a ZW 291 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 285 ( 97.9% )
Alpha3Code 3 - 4 n/a ABW n/a n/a n/a NULL 291 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 285 ( 97.9% )
Alpha4Code 4 - 4 n/a AIDJ n/a n/a n/a ZRCD 291 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 32 ( 11.0% )
StartDate 4 - 10 n/a NULL n/a n/a n/a 1974-12-15 291 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 2 ( 0.7% )
EndDate 4 - 10 n/a NULL n/a n/a n/a 1993-06-15 291 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 14 ( 4.8% )
Status 13 - 23 n/a Formerly use… n/a n/a n/a Transitional… 291 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 4 ( 1.4% )
AnnotationStatus 2 - 172 n/a ye n/a n/a n/a Refers to Eu… 291 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 24 ( 8.2% )
AnnotationP1En 4 - 520 n/a NULL n/a n/a n/a WIPO uses th… 291 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 88 ( 30.2% )
AnnotationP2En 4 - 541 n/a NULL n/a n/a n/a BS 6879 give… 291 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 46 ( 15.8% )
AnnotationP3En 4 - 575 n/a NULL n/a n/a n/a Removed from… 291 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 65 ( 22.3% )

Quality measures

Table 9. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Column Name Completeness Uniqueness Most Common Value Least Common Value
CountryId
100.00%
100.00%
1 1
NumericCode
100.00%
91.41%
NULL 10
FullName
100.00%
59.79%
NULL the Republic of Bulgaria
ShortName
100.00%
100.00%
ANTARCTICA ANTARCTICA
ShortNameLcEn
100.00%
100.00%
Antarctica Antarctica
Alpha2Code
100.00%
97.94%
BY AQ
Alpha3Code
100.00%
97.94%
NULL ATA
Alpha4Code
100.00%
11.00%
NULL BQAQ
StartDate
100.00%
0.69%
NULL 1974-12-15
EndDate
100.00%
4.81%
NULL 1992-06-15
Status
100.00%
1.37%
Officially assigned Transitionally reserved
AnnotationStatus
100.00%
8.25%
NULL Includes: the islands Bonaire, Saint Eustatius and Saba
AnnotationP1En
100.00%
30.24%
NULL Territories south of 60° south latitude.
AnnotationP2En
100.00%
15.81%
NULL Remark: the forms used in the list are English-language forms provided by Myanmar.
AnnotationP3En
100.00%
22.34%
NULL French Southern and Antarctic Territories (FQ, ATF, --) are now part of Antarctica and French Southern Territories (TF, ATF, 260). See also code element FQHH. Dronning Maud Land (NQ, ATN, 216) is now part of Antarctica. See also code element NQAQ.

Changes made to preparatory file

None

Changes made to data

None

Unresolved issues

None

UNSD M49

Table 10. Standardised metadata of the dataset. Reported parameter (Parameter); content of the parameter (Content).
Parameter Content
Unique identifier CNTRS2.UNSDM2.0
Name UNSD M49
Target IRI https://app.pollinatorhub.eu/dataset-discovery/parts/CNTRS2.UNSDM2.0
Table Type File
Licence EU Pollinator Hub
Description

Data in this table was obtained from the United Nations Statistics Division (UNSD), serving under the United Nations Department of Economic and Social Affairs as the central mechanism within the Secretariat of the United Nations to supply the statistical needs and coordinating activities of the global statistical system. It contains names of region names, sub-region names, intermediate region names as well as countries or areas and their respective codes.

Data in this table was obtained from the United Nations Statistics Division (UNSD), serving under the United Nations Department of Economic and Social Affairs as the central mechanism within the Secretariat of the United Nations to supply the statistical needs and coordinating activities of the global statistical system. It contains names of region names, sub-region names, intermediate region names as well as countries or areas and their respective codes.

Metadata

n/a
Table 11. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Column Name Column Description Datatype Descriptor Unit
id
String pms:recordID [0.0.RCRDD344]

n/a

language
String iso-639:alpha-3LanguageCode [0.0.LPHLN107]

n/a

globalcode

M49 code of the highest level aggregate of countries, areas or geographic regions in the M49 standard, namely the world, used for statistical processing purposes by the Statistics Division of the United Nations Secretariat.

String Text [0.0.TEXTA315]

n/a

regioncode

M49 code of the second highest level aggregate of countries, areas or geographic regions in the M49 standard, namely the continents, used for statistical processing purposes by the Statistics Division of the United Nations Secretariat.

String Text [0.0.TEXTA315]

n/a

subregioncode

M49 code of the third highest level aggregate of countries, areas or geographic regions in the M49 standard, namely first order subdivisions of the continents, used for statistical processing purposes by the Statistics Division of the United Nations Secretariat.

String Text [0.0.TEXTA315]

n/a

intermregioncode

M49 code of the fourth highest level aggregate of countries in the M49 standard, namely second order subdivisions of the continents, used for statistical processing purposes by the Statistics Division of the United Nations Secretariat.

String Text [0.0.TEXTA315]

n/a

ldc

M49 code of the aggregate of least developed countries (LDC) in the M49 standard, namely subdivisions of subdivisions of continents, used for statistical processing purposes by the Statistics Division of the United Nations Secretariat.

String Text [0.0.TEXTA315]

n/a

lldc

M49 code of the aggregate of landlocked developing countries (LLDC) in the M49 standard, namely subdivisions of subdivisions of continents, used for statistical processing purposes by the Statistics Division of the United Nations Secretariat.

String Text [0.0.TEXTA315]

n/a

sids

M49 code of the aggregate of small island developing states (SIDS) in the M49 standard, namely subdivisions of subdivisions of continents, used for statistical processing purposes by the Statistics Division of the United Nations Secretariat.

String Text [0.0.TEXTA315]

n/a

area
String unsd:m49Area [0.0.MAREA519]

n/a

m49code
String unsd:m49Code [0.0.MCODE356]

n/a

isoalpha2
String iso-3166:alpha-2CountryCode [0.0.LPHCN4]

n/a

isoalpha3
String iso-3166:alpha-3CountryCode [0.0.LPHCN5]

n/a

Metadata of individual tables can be found in Annex 1.

Descriptive measures

Table 12. Content analysis of the table. Column name (Name); range of length of characters (Length); arithmetic mean of values in column (Mean); lowest value in column (Min); first quartile of values in column (Q1); median of values in column (Median); third quartile of values in column (Q3); highest value in column (Max); number of records (Total); number and percentage (between brackets) of all values containing NULL (Missing), the value 0 (Zero), exclusively blank characters (Blank), and of distinct values including NULL, Zero and blank (Distinct).
Column Name Range Mean Minimum Q1 Median Q3 Maximum Total Missing Zero Blank Distinct
id 1 - 3 422.0 1 211 422 633 843 843 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 843 ( 100.0% )
language 3 - 3 n/a 001 n/a n/a n/a spa 843 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 4 ( 0.5% )
globalcode 3 - 3 1.0 1 1 1 1 1 843 6 ( 0.7% ) 0 ( 0.0% ) 0 ( 0.0% ) 2 ( 0.2% )
regioncode 3 - 3 65.2 2 9 19 142 150 843 30 ( 3.6% ) 0 ( 0.0% ) 0 ( 0.0% ) 6 ( 0.7% )
subregioncode 3 - 3 183.6 15 54 154 202 419 843 81 ( 9.6% ) 0 ( 0.0% ) 0 ( 0.0% ) 18 ( 2.1% )
intermregioncode 3 - 3 16.5 5 11 14 29 29 843 528 ( 62.6% ) 0 ( 0.0% ) 0 ( 0.0% ) 8 ( 0.9% )
ldc 3 - 3 199.0 199 199 199 199 199 843 705 ( 83.6% ) 0 ( 0.0% ) 0 ( 0.0% ) 2 ( 0.2% )
lldc 3 - 3 432.0 432 432 432 432 432 843 747 ( 88.6% ) 0 ( 0.0% ) 0 ( 0.0% ) 2 ( 0.2% )
sids 3 - 3 722.0 722 722 722 722 722 843 684 ( 81.1% ) 0 ( 0.0% ) 0 ( 0.0% ) 2 ( 0.2% )
area 4 - 52 n/a Asia n/a n/a n/a United Kingd… 843 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 662 ( 78.5% )
m49code 3 - 3 396.5 1 155 392 626 894 843 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 281 ( 33.3% )
isoalpha2 0 - 2 n/a n/a n/a n/a ZW 843 99 ( 11.7% ) 0 ( 0.0% ) 0 ( 0.0% ) 249 ( 29.5% )
isoalpha3 0 - 3 n/a n/a n/a n/a ZWE 843 99 ( 11.7% ) 0 ( 0.0% ) 0 ( 0.0% ) 249 ( 29.5% )

Quality measures

Table 13. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Column Name Completeness Uniqueness Most Common Value Least Common Value
id
100.00%
100.00%
1 1
language
100.00%
0.47%
eng 001
globalcode
99.29%
0.24%
001 null
regioncode
96.44%
0.71%
002 null
subregioncode
90.39%
2.14%
202 021
intermregioncode
37.37%
0.95%
null 018
ldc
16.37%
0.24%
null 199
lldc
11.39%
0.24%
null 432
sids
18.86%
0.24%
null 722
area
100.00%
78.53%
Burundi Algeria
m49code
100.00%
33.33%
012 012
isoalpha2
88.26%
29.54%
null DZ
isoalpha3
88.26%
29.54%
null DZA

Changes made to preparatory file

None

Changes made to data

None

Unresolved issues

None

References

There are no sources in the current document.

Annex 1: Table column reports

Table: ISO 3166:2020

Column: CountryId

Table 14. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Parameter Content
Column name CountryId
Description
Data type String
Descriptor pms:recordID [UID:0.0.RCRDD344]
Descriptor description

Unique sequence of integers associated with a record within a certain table.

Descriptor target IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.RCRDD344
Unit

n/a

Table 15. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Column Name Range Mean Minimum Q1 Median Q3 Maximum Total Missing Zero Blank Distinct
CountryId 1 - 3 146.0 1 73 146 219 291 291 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 291 ( 100.0% )
Table 16. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Column Name Completeness Uniqueness Most Common Value Least Common Value
CountryId
100.00%
100.00%
1 1

Continuous Data Distribution

Figure 1. Distribution of values in the column.

Outliers

Figure 2. Visualization of median, min, max, and outliers in the column.

Completeness

Figure 3. Visualization of completeness of the data in the column.

Uniqueness

Figure 4. Visualization of uniqueness of the data in the column.

Column: NumericCode

Table 17. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Parameter Content
Column name NumericCode
Description
Data type String
Descriptor unsd:m49Area [UID:0.0.MAREA519]
Descriptor description

Name of the countries or areas or geographic regions in the M49 standard used for statistical processing purposes by the Statistics Division of the United Nations Secretariat.

Descriptor target IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.MAREA519
Unit

n/a

Table 18. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Column Name Range Mean Minimum Q1 Median Q3 Maximum Total Missing Zero Blank Distinct
NumericCode 1 - 4 n/a 4 n/a n/a n/a NULL 291 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 266 ( 91.4% )
Table 19. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Column Name Completeness Uniqueness Most Common Value Least Common Value
NumericCode
100.00%
91.41%
NULL 10

Completeness

Figure 5. Visualization of completeness of the data in the column.

Uniqueness

Figure 6. Visualization of uniqueness of the data in the column.

Column: FullName

Table 20. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Parameter Content
Column name FullName
Description

Full name of the country, as recorded in UNTERM.

Data type String
Descriptor Text [UID:0.0.TEXTA315]
Descriptor description

In computer programming, a string is traditionally a sequence of characters, either as a literal constant or as some kind of variable. The latter may allow its elements to be mutated and the length changed, or it may be fixed (after creation). A string is generally considered as a data type and is often implemented as an array data structure of bytes (or words) that stores a sequence of elements, typically characters, using some character encoding. String may also denote more general arrays or other sequence (or list) data types and structures.

Descriptor target IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.TEXTA315
Unit

n/a

Table 21. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Column Name Range Mean Minimum Q1 Median Q3 Maximum Total Missing Zero Blank Distinct
FullName 4 - 56 n/a NULL n/a n/a n/a the United K… 291 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 174 ( 59.8% )
Table 22. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Column Name Completeness Uniqueness Most Common Value Least Common Value
FullName
100.00%
59.79%
NULL the Republic of Bulgaria

Completeness

Figure 7. Visualization of completeness of the data in the column.

Uniqueness

Figure 8. Visualization of uniqueness of the data in the column.

Column: ShortName

Table 23. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Parameter Content
Column name ShortName
Description
Data type String
Descriptor iso-3166:shortUppercaseNameOfCountry [UID:0.0.SHRTC3]
Descriptor description

[...] Short form of the country name [in capital letters], distinctive word first. [...] In language of the ISO 3166 standard. [...] This item might be inverted, allowing the distinctive word to appear first, so that items can be easily found in an alphabetical list. See [Annex F]https://www.iso.org/obp/ui/en/#iso:std:iso:3166:-1:ed-4:v1:en:sec:F), principles F.2.

Descriptor target IRI https://app.pollinatorhub.eu/vocabulary/descriptors/CNTRS2.0.SHRTC3
Unit

n/a

Table 24. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Column Name Range Mean Minimum Q1 Median Q3 Maximum Total Missing Zero Blank Distinct
ShortName 4 - 52 n/a CHAD n/a n/a n/a UNITED KINGD… 291 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 291 ( 100.0% )
Table 25. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Column Name Completeness Uniqueness Most Common Value Least Common Value
ShortName
100.00%
100.00%
ANTARCTICA ANTARCTICA

Completeness

Figure 9. Visualization of completeness of the data in the column.

Uniqueness

Figure 10. Visualization of uniqueness of the data in the column.

Column: ShortNameLcEn

Table 26. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Parameter Content
Column name ShortNameLcEn
Description
Data type String
Descriptor iso-3166:shortLowercaseNameOfCountry [UID:0.0.SHRTC10]
Descriptor description

Short form of the country name, distinctive word first, based on official short form in UNTERM. [...] In language of the ISO 3166 standard. [...] This item might be inverted, listed with its articles if any, allowing an alphabetical order on the distinctive word. See Annex F, principles F.1.

Descriptor target IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.SHRTC10
Unit

n/a

Table 27. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Column Name Range Mean Minimum Q1 Median Q3 Maximum Total Missing Zero Blank Distinct
ShortNameLcEn 4 - 58 n/a Chad n/a n/a n/a United Kingd… 291 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 291 ( 100.0% )
Table 28. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Column Name Completeness Uniqueness Most Common Value Least Common Value
ShortNameLcEn
100.00%
100.00%
Antarctica Antarctica

Completeness

Figure 11. Visualization of completeness of the data in the column.

Uniqueness

Figure 12. Visualization of uniqueness of the data in the column.

Column: Alpha2Code

Table 29. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Parameter Content
Column name Alpha2Code
Description
Data type String
Descriptor iso-3166:alpha-2CountryCode [UID:0.0.LPHCN4]
Descriptor description

A two-letter code that represents a country name, recommended as the general purpose code.

Descriptor target IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.LPHCN4
Unit

n/a

Table 30. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Column Name Range Mean Minimum Q1 Median Q3 Maximum Total Missing Zero Blank Distinct
Alpha2Code 2 - 2 n/a AC n/a n/a n/a ZW 291 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 285 ( 97.9% )
Table 31. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Column Name Completeness Uniqueness Most Common Value Least Common Value
Alpha2Code
100.00%
97.94%
BY AQ

Completeness

Figure 13. Visualization of completeness of the data in the column.

Uniqueness

Figure 14. Visualization of uniqueness of the data in the column.

Column: Alpha3Code

Table 32. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Parameter Content
Column name Alpha3Code
Description
Data type String
Descriptor iso-3166:alpha-3CountryCode [UID:0.0.LPHCN5]
Descriptor description

A three-letter code that represents a country name, which is usually more closely related to the country name.

Descriptor target IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.LPHCN5
Unit

n/a

Table 33. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Column Name Range Mean Minimum Q1 Median Q3 Maximum Total Missing Zero Blank Distinct
Alpha3Code 3 - 4 n/a ABW n/a n/a n/a NULL 291 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 285 ( 97.9% )
Table 34. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Column Name Completeness Uniqueness Most Common Value Least Common Value
Alpha3Code
100.00%
97.94%
NULL ATA

Completeness

Figure 15. Visualization of completeness of the data in the column.

Uniqueness

Figure 16. Visualization of uniqueness of the data in the column.

Column: Alpha4Code

Table 35. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Parameter Content
Column name Alpha4Code
Description
Data type String
Descriptor iso-3166:alpha-4CountryCode [UID:0.0.LPHCN841]
Descriptor description

A four-letter code that represents a country name that is no longer in use. The structure depends on the reason why the country name was removed from ISO 3166-1 and added to ISO 3166-3.

Descriptor target IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.LPHCN841
Unit

n/a

Table 36. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Column Name Range Mean Minimum Q1 Median Q3 Maximum Total Missing Zero Blank Distinct
Alpha4Code 4 - 4 n/a AIDJ n/a n/a n/a ZRCD 291 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 32 ( 11.0% )
Table 37. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Column Name Completeness Uniqueness Most Common Value Least Common Value
Alpha4Code
100.00%
11.00%
NULL BQAQ

Completeness

Figure 17. Visualization of completeness of the data in the column.

Uniqueness

Figure 18. Visualization of uniqueness of the data in the column.

Column: StartDate

Table 38. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Parameter Content
Column name StartDate
Description
Data type Integer number
Descriptor pms:recordStartYearValidity [UID:0.0.STRTR14]
Descriptor description

Year in which the item reported in a record becomes valid.

Descriptor target IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.STRTR14
Unit

n/a

Table 39. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Column Name Range Mean Minimum Q1 Median Q3 Maximum Total Missing Zero Blank Distinct
StartDate 4 - 10 n/a NULL n/a n/a n/a 1974-12-15 291 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 2 ( 0.7% )
Table 40. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Column Name Completeness Uniqueness Most Common Value Least Common Value
StartDate
100.00%
0.69%
NULL 1974-12-15

Data Distribution Top 20

Figure 19. Distribution of 20 most common values, from highest to lowest.

Completeness

Figure 20. Visualization of completeness of the data in the column.

Uniqueness

Figure 21. Visualization of uniqueness of the data in the column.

Column: EndDate

Table 41. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Parameter Content
Column name EndDate
Description
Data type Integer number
Descriptor pms:recordEndYearValidity [UID:0.0.NDDTE15]
Descriptor description

Year in which the item reported in a record loses its validity.

Descriptor target IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.NDDTE15
Unit

n/a

Table 42. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Column Name Range Mean Minimum Q1 Median Q3 Maximum Total Missing Zero Blank Distinct
EndDate 4 - 10 n/a NULL n/a n/a n/a 1993-06-15 291 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 14 ( 4.8% )
Table 43. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Column Name Completeness Uniqueness Most Common Value Least Common Value
EndDate
100.00%
4.81%
NULL 1992-06-15

Data Distribution Top 20

Figure 22. Distribution of 20 most common values, from highest to lowest.

Completeness

Figure 23. Visualization of completeness of the data in the column.

Uniqueness

Figure 24. Visualization of uniqueness of the data in the column.

Column: Status

Table 44. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Parameter Content
Column name Status
Description

Information on the status of the country in the dataset.

Data type String
Descriptor Text [UID:0.0.TEXTA315]
Descriptor description

In computer programming, a string is traditionally a sequence of characters, either as a literal constant or as some kind of variable. The latter may allow its elements to be mutated and the length changed, or it may be fixed (after creation). A string is generally considered as a data type and is often implemented as an array data structure of bytes (or words) that stores a sequence of elements, typically characters, using some character encoding. String may also denote more general arrays or other sequence (or list) data types and structures.

Descriptor target IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.TEXTA315
Unit

n/a

Table 45. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Column Name Range Mean Minimum Q1 Median Q3 Maximum Total Missing Zero Blank Distinct
Status 13 - 23 n/a Formerly use… n/a n/a n/a Transitional… 291 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 4 ( 1.4% )
Table 46. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Column Name Completeness Uniqueness Most Common Value Least Common Value
Status
100.00%
1.37%
Officially assigned Transitionally reserved

Data Distribution Top 20

Figure 25. Distribution of 20 most common values, from highest to lowest.

Completeness

Figure 26. Visualization of completeness of the data in the column.

Uniqueness

Figure 27. Visualization of uniqueness of the data in the column.

Column: AnnotationStatus

Table 47. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Parameter Content
Column name AnnotationStatus
Description

Annotation to the status of the country in the dataset.

Data type String
Descriptor Text [UID:0.0.TEXTA315]
Descriptor description

In computer programming, a string is traditionally a sequence of characters, either as a literal constant or as some kind of variable. The latter may allow its elements to be mutated and the length changed, or it may be fixed (after creation). A string is generally considered as a data type and is often implemented as an array data structure of bytes (or words) that stores a sequence of elements, typically characters, using some character encoding. String may also denote more general arrays or other sequence (or list) data types and structures.

Descriptor target IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.TEXTA315
Unit

n/a

Table 48. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Column Name Range Mean Minimum Q1 Median Q3 Maximum Total Missing Zero Blank Distinct
AnnotationStatus 2 - 172 n/a ye n/a n/a n/a Refers to Eu… 291 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 24 ( 8.2% )
Table 49. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Column Name Completeness Uniqueness Most Common Value Least Common Value
AnnotationStatus
100.00%
8.25%
NULL Includes: the islands Bonaire, Saint Eustatius and Saba

Completeness

Figure 28. Visualization of completeness of the data in the column.

Uniqueness

Figure 29. Visualization of uniqueness of the data in the column.

Column: AnnotationP1En

Table 50. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Parameter Content
Column name AnnotationP1En
Description

Remarks related to the country name, such as other widely-used country names, as defined in ISO 3166-1:2013, Codes for the representation of names of countries and their subdivisions — Part 1: Country codes.

Data type String
Descriptor Text [UID:0.0.TEXTA315]
Descriptor description

In computer programming, a string is traditionally a sequence of characters, either as a literal constant or as some kind of variable. The latter may allow its elements to be mutated and the length changed, or it may be fixed (after creation). A string is generally considered as a data type and is often implemented as an array data structure of bytes (or words) that stores a sequence of elements, typically characters, using some character encoding. String may also denote more general arrays or other sequence (or list) data types and structures.

Descriptor target IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.TEXTA315
Unit

n/a

Table 51. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Column Name Range Mean Minimum Q1 Median Q3 Maximum Total Missing Zero Blank Distinct
AnnotationP1En 4 - 520 n/a NULL n/a n/a n/a WIPO uses th… 291 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 88 ( 30.2% )
Table 52. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Column Name Completeness Uniqueness Most Common Value Least Common Value
AnnotationP1En
100.00%
30.24%
NULL Territories south of 60° south latitude.

Completeness

Figure 30. Visualization of completeness of the data in the column.

Uniqueness

Figure 31. Visualization of uniqueness of the data in the column.

Column: AnnotationP2En

Table 53. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Parameter Content
Column name AnnotationP2En
Description

Remarks related to a country's subdivisions, as deinfed in ISO 3166-2:2013, Codes for the representation of names of countries and their subdivisions — Part 2: Country subdivision code.

Data type String
Descriptor Text [UID:0.0.TEXTA315]
Descriptor description

In computer programming, a string is traditionally a sequence of characters, either as a literal constant or as some kind of variable. The latter may allow its elements to be mutated and the length changed, or it may be fixed (after creation). A string is generally considered as a data type and is often implemented as an array data structure of bytes (or words) that stores a sequence of elements, typically characters, using some character encoding. String may also denote more general arrays or other sequence (or list) data types and structures.

Descriptor target IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.TEXTA315
Unit

n/a

Table 54. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Column Name Range Mean Minimum Q1 Median Q3 Maximum Total Missing Zero Blank Distinct
AnnotationP2En 4 - 541 n/a NULL n/a n/a n/a BS 6879 give… 291 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 46 ( 15.8% )
Table 55. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Column Name Completeness Uniqueness Most Common Value Least Common Value
AnnotationP2En
100.00%
15.81%
NULL Remark: the forms used in the list are English-language forms provided by Myanmar.

Completeness

Figure 32. Visualization of completeness of the data in the column.

Uniqueness

Figure 33. Visualization of uniqueness of the data in the column.

Column: AnnotationP3En

Table 56. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Parameter Content
Column name AnnotationP3En
Description

Remarks related to formerly used codes of a country, as defined in ISO 3166-3:2013, Codes for the representation of names of countries and their subdivisions — Part 3: Code for formerly used names of countries.

Data type String
Descriptor Text [UID:0.0.TEXTA315]
Descriptor description

In computer programming, a string is traditionally a sequence of characters, either as a literal constant or as some kind of variable. The latter may allow its elements to be mutated and the length changed, or it may be fixed (after creation). A string is generally considered as a data type and is often implemented as an array data structure of bytes (or words) that stores a sequence of elements, typically characters, using some character encoding. String may also denote more general arrays or other sequence (or list) data types and structures.

Descriptor target IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.TEXTA315
Unit

n/a

Table 57. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Column Name Range Mean Minimum Q1 Median Q3 Maximum Total Missing Zero Blank Distinct
AnnotationP3En 4 - 575 n/a NULL n/a n/a n/a Removed from… 291 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 65 ( 22.3% )
Table 58. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Column Name Completeness Uniqueness Most Common Value Least Common Value
AnnotationP3En
100.00%
22.34%
NULL French Southern and Antarctic Territories (FQ, ATF, --) are now part of Antarctica and French Southern Territories (TF, ATF, 260). See also code element FQHH. Dronning Maud Land (NQ, ATN, 216) is now part of Antarctica. See also code element NQAQ.

Completeness

Figure 34. Visualization of completeness of the data in the column.

Uniqueness

Figure 35. Visualization of uniqueness of the data in the column.

Table: UNSD M49

Column: id

Table 59. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Parameter Content
Column name id
Description
Data type String
Descriptor pms:recordID [UID:0.0.RCRDD344]
Descriptor description

Unique sequence of integers associated with a record within a certain table.

Descriptor target IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.RCRDD344
Unit

n/a

Table 60. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Column Name Range Mean Minimum Q1 Median Q3 Maximum Total Missing Zero Blank Distinct
id 1 - 3 422.0 1 211 422 633 843 843 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 843 ( 100.0% )
Table 61. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Column Name Completeness Uniqueness Most Common Value Least Common Value
id
100.00%
100.00%
1 1

Continuous Data Distribution

Figure 36. Distribution of values in the column.

Outliers

Figure 37. Visualization of median, min, max, and outliers in the column.

Completeness

Figure 38. Visualization of completeness of the data in the column.

Uniqueness

Figure 39. Visualization of uniqueness of the data in the column.

Column: language

Table 62. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Parameter Content
Column name language
Description
Data type String
Descriptor iso-639:alpha-3LanguageCode [UID:0.0.LPHLN107]
Descriptor description

ISO 639-3 is a set of codes that defines three-letter identifiers for all known human languages. At the core of ISO 639-3 are the individual languages already accounted for in ISO 639-2. The large number of living languages in the initial inventory of ISO 639-3 beyond those already included in ISO 639-2 was derived primarily from Ethnologue (15th edition). Additional extinct, historic, and constructed languages were obtained from the Linguist List.

Descriptor target IRI https://app.pollinatorhub.eu/vocabulary/descriptors/LNGGE20.ISOAB47.SLPHD107
Unit

n/a

Table 63. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Column Name Range Mean Minimum Q1 Median Q3 Maximum Total Missing Zero Blank Distinct
language 3 - 3 n/a 001 n/a n/a n/a spa 843 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 4 ( 0.5% )
Table 64. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Column Name Completeness Uniqueness Most Common Value Least Common Value
language
100.00%
0.47%
eng 001

Data Distribution Top 20

Figure 40. Distribution of 20 most common values, from highest to lowest.

Completeness

Figure 41. Visualization of completeness of the data in the column.

Uniqueness

Figure 42. Visualization of uniqueness of the data in the column.

Column: globalcode

Table 65. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Parameter Content
Column name globalcode
Description

M49 code of the highest level aggregate of countries, areas or geographic regions in the M49 standard, namely the world, used for statistical processing purposes by the Statistics Division of the United Nations Secretariat.

Data type String
Descriptor Text [UID:0.0.TEXTA315]
Descriptor description

In computer programming, a string is traditionally a sequence of characters, either as a literal constant or as some kind of variable. The latter may allow its elements to be mutated and the length changed, or it may be fixed (after creation). A string is generally considered as a data type and is often implemented as an array data structure of bytes (or words) that stores a sequence of elements, typically characters, using some character encoding. String may also denote more general arrays or other sequence (or list) data types and structures.

Descriptor target IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.TEXTA315
Unit

n/a

Table 66. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Column Name Range Mean Minimum Q1 Median Q3 Maximum Total Missing Zero Blank Distinct
globalcode 3 - 3 1.0 1 1 1 1 1 843 6 ( 0.7% ) 0 ( 0.0% ) 0 ( 0.0% ) 2 ( 0.2% )
Table 67. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Column Name Completeness Uniqueness Most Common Value Least Common Value
globalcode
99.29%
0.24%
001 null

Data Distribution Top 20

Figure 43. Distribution of 20 most common values, from highest to lowest.

Continuous Data Distribution

Figure 44. Distribution of values in the column.

Outliers

Figure 45. Visualization of median, min, max, and outliers in the column.

Completeness

Figure 46. Visualization of completeness of the data in the column.

Uniqueness

Figure 47. Visualization of uniqueness of the data in the column.

Column: regioncode

Table 68. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Parameter Content
Column name regioncode
Description

M49 code of the second highest level aggregate of countries, areas or geographic regions in the M49 standard, namely the continents, used for statistical processing purposes by the Statistics Division of the United Nations Secretariat.

Data type String
Descriptor Text [UID:0.0.TEXTA315]
Descriptor description

In computer programming, a string is traditionally a sequence of characters, either as a literal constant or as some kind of variable. The latter may allow its elements to be mutated and the length changed, or it may be fixed (after creation). A string is generally considered as a data type and is often implemented as an array data structure of bytes (or words) that stores a sequence of elements, typically characters, using some character encoding. String may also denote more general arrays or other sequence (or list) data types and structures.

Descriptor target IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.TEXTA315
Unit

n/a

Table 69. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Column Name Range Mean Minimum Q1 Median Q3 Maximum Total Missing Zero Blank Distinct
regioncode 3 - 3 65.2 2 9 19 142 150 843 30 ( 3.6% ) 0 ( 0.0% ) 0 ( 0.0% ) 6 ( 0.7% )
Table 70. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Column Name Completeness Uniqueness Most Common Value Least Common Value
regioncode
96.44%
0.71%
002 null

Data Distribution Top 20

Figure 48. Distribution of 20 most common values, from highest to lowest.

Continuous Data Distribution

Figure 49. Distribution of values in the column.

Outliers

Figure 50. Visualization of median, min, max, and outliers in the column.

Completeness

Figure 51. Visualization of completeness of the data in the column.

Uniqueness

Figure 52. Visualization of uniqueness of the data in the column.

Column: subregioncode

Table 71. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Parameter Content
Column name subregioncode
Description

M49 code of the third highest level aggregate of countries, areas or geographic regions in the M49 standard, namely first order subdivisions of the continents, used for statistical processing purposes by the Statistics Division of the United Nations Secretariat.

Data type String
Descriptor Text [UID:0.0.TEXTA315]
Descriptor description

In computer programming, a string is traditionally a sequence of characters, either as a literal constant or as some kind of variable. The latter may allow its elements to be mutated and the length changed, or it may be fixed (after creation). A string is generally considered as a data type and is often implemented as an array data structure of bytes (or words) that stores a sequence of elements, typically characters, using some character encoding. String may also denote more general arrays or other sequence (or list) data types and structures.

Descriptor target IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.TEXTA315
Unit

n/a

Table 72. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Column Name Range Mean Minimum Q1 Median Q3 Maximum Total Missing Zero Blank Distinct
subregioncode 3 - 3 183.6 15 54 154 202 419 843 81 ( 9.6% ) 0 ( 0.0% ) 0 ( 0.0% ) 18 ( 2.1% )
Table 73. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Column Name Completeness Uniqueness Most Common Value Least Common Value
subregioncode
90.39%
2.14%
202 021

Data Distribution Top 20

Figure 53. Distribution of 20 most common values, from highest to lowest.

Continuous Data Distribution

Figure 54. Distribution of values in the column.

Outliers

Figure 55. Visualization of median, min, max, and outliers in the column.

Completeness

Figure 56. Visualization of completeness of the data in the column.

Uniqueness

Figure 57. Visualization of uniqueness of the data in the column.

Column: intermregioncode

Table 74. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Parameter Content
Column name intermregioncode
Description

M49 code of the fourth highest level aggregate of countries in the M49 standard, namely second order subdivisions of the continents, used for statistical processing purposes by the Statistics Division of the United Nations Secretariat.

Data type String
Descriptor Text [UID:0.0.TEXTA315]
Descriptor description

In computer programming, a string is traditionally a sequence of characters, either as a literal constant or as some kind of variable. The latter may allow its elements to be mutated and the length changed, or it may be fixed (after creation). A string is generally considered as a data type and is often implemented as an array data structure of bytes (or words) that stores a sequence of elements, typically characters, using some character encoding. String may also denote more general arrays or other sequence (or list) data types and structures.

Descriptor target IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.TEXTA315
Unit

n/a

Table 75. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Column Name Range Mean Minimum Q1 Median Q3 Maximum Total Missing Zero Blank Distinct
intermregioncode 3 - 3 16.5 5 11 14 29 29 843 528 ( 62.6% ) 0 ( 0.0% ) 0 ( 0.0% ) 8 ( 0.9% )
Table 76. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Column Name Completeness Uniqueness Most Common Value Least Common Value
intermregioncode
37.37%
0.95%
null 018

Data Distribution Top 20

Figure 58. Distribution of 20 most common values, from highest to lowest.

Continuous Data Distribution

Figure 59. Distribution of values in the column.

Outliers

Figure 60. Visualization of median, min, max, and outliers in the column.

Completeness

Figure 61. Visualization of completeness of the data in the column.

Uniqueness

Figure 62. Visualization of uniqueness of the data in the column.

Column: ldc

Table 77. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Parameter Content
Column name ldc
Description

M49 code of the aggregate of least developed countries (LDC) in the M49 standard, namely subdivisions of subdivisions of continents, used for statistical processing purposes by the Statistics Division of the United Nations Secretariat.

Data type String
Descriptor Text [UID:0.0.TEXTA315]
Descriptor description

In computer programming, a string is traditionally a sequence of characters, either as a literal constant or as some kind of variable. The latter may allow its elements to be mutated and the length changed, or it may be fixed (after creation). A string is generally considered as a data type and is often implemented as an array data structure of bytes (or words) that stores a sequence of elements, typically characters, using some character encoding. String may also denote more general arrays or other sequence (or list) data types and structures.

Descriptor target IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.TEXTA315
Unit

n/a

Table 78. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Column Name Range Mean Minimum Q1 Median Q3 Maximum Total Missing Zero Blank Distinct
ldc 3 - 3 199.0 199 199 199 199 199 843 705 ( 83.6% ) 0 ( 0.0% ) 0 ( 0.0% ) 2 ( 0.2% )
Table 79. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Column Name Completeness Uniqueness Most Common Value Least Common Value
ldc
16.37%
0.24%
null 199

Data Distribution Top 20

Figure 63. Distribution of 20 most common values, from highest to lowest.

Continuous Data Distribution

Figure 64. Distribution of values in the column.

Outliers

Figure 65. Visualization of median, min, max, and outliers in the column.

Completeness

Figure 66. Visualization of completeness of the data in the column.

Uniqueness

Figure 67. Visualization of uniqueness of the data in the column.

Column: lldc

Table 80. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Parameter Content
Column name lldc
Description

M49 code of the aggregate of landlocked developing countries (LLDC) in the M49 standard, namely subdivisions of subdivisions of continents, used for statistical processing purposes by the Statistics Division of the United Nations Secretariat.

Data type String
Descriptor Text [UID:0.0.TEXTA315]
Descriptor description

In computer programming, a string is traditionally a sequence of characters, either as a literal constant or as some kind of variable. The latter may allow its elements to be mutated and the length changed, or it may be fixed (after creation). A string is generally considered as a data type and is often implemented as an array data structure of bytes (or words) that stores a sequence of elements, typically characters, using some character encoding. String may also denote more general arrays or other sequence (or list) data types and structures.

Descriptor target IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.TEXTA315
Unit

n/a

Table 81. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Column Name Range Mean Minimum Q1 Median Q3 Maximum Total Missing Zero Blank Distinct
lldc 3 - 3 432.0 432 432 432 432 432 843 747 ( 88.6% ) 0 ( 0.0% ) 0 ( 0.0% ) 2 ( 0.2% )
Table 82. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Column Name Completeness Uniqueness Most Common Value Least Common Value
lldc
11.39%
0.24%
null 432

Data Distribution Top 20

Figure 68. Distribution of 20 most common values, from highest to lowest.

Continuous Data Distribution

Figure 69. Distribution of values in the column.

Outliers

Figure 70. Visualization of median, min, max, and outliers in the column.

Completeness

Figure 71. Visualization of completeness of the data in the column.

Uniqueness

Figure 72. Visualization of uniqueness of the data in the column.

Column: sids

Table 83. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Parameter Content
Column name sids
Description

M49 code of the aggregate of small island developing states (SIDS) in the M49 standard, namely subdivisions of subdivisions of continents, used for statistical processing purposes by the Statistics Division of the United Nations Secretariat.

Data type String
Descriptor Text [UID:0.0.TEXTA315]
Descriptor description

In computer programming, a string is traditionally a sequence of characters, either as a literal constant or as some kind of variable. The latter may allow its elements to be mutated and the length changed, or it may be fixed (after creation). A string is generally considered as a data type and is often implemented as an array data structure of bytes (or words) that stores a sequence of elements, typically characters, using some character encoding. String may also denote more general arrays or other sequence (or list) data types and structures.

Descriptor target IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.TEXTA315
Unit

n/a

Table 84. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Column Name Range Mean Minimum Q1 Median Q3 Maximum Total Missing Zero Blank Distinct
sids 3 - 3 722.0 722 722 722 722 722 843 684 ( 81.1% ) 0 ( 0.0% ) 0 ( 0.0% ) 2 ( 0.2% )
Table 85. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Column Name Completeness Uniqueness Most Common Value Least Common Value
sids
18.86%
0.24%
null 722

Data Distribution Top 20

Figure 73. Distribution of 20 most common values, from highest to lowest.

Continuous Data Distribution

Figure 74. Distribution of values in the column.

Outliers

Figure 75. Visualization of median, min, max, and outliers in the column.

Completeness

Figure 76. Visualization of completeness of the data in the column.

Uniqueness

Figure 77. Visualization of uniqueness of the data in the column.

Column: area

Table 86. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Parameter Content
Column name area
Description
Data type String
Descriptor unsd:m49Area [UID:0.0.MAREA519]
Descriptor description

Name of the countries or areas or geographic regions in the M49 standard used for statistical processing purposes by the Statistics Division of the United Nations Secretariat.

Descriptor target IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.MAREA519
Unit

n/a

Table 87. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Column Name Range Mean Minimum Q1 Median Q3 Maximum Total Missing Zero Blank Distinct
area 4 - 52 n/a Asia n/a n/a n/a United Kingd… 843 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 662 ( 78.5% )
Table 88. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Column Name Completeness Uniqueness Most Common Value Least Common Value
area
100.00%
78.53%
Burundi Algeria

Completeness

Figure 78. Visualization of completeness of the data in the column.

Uniqueness

Figure 79. Visualization of uniqueness of the data in the column.

Column: m49code

Table 89. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Parameter Content
Column name m49code
Description
Data type String
Descriptor unsd:m49Code [UID:0.0.MCODE356]
Descriptor description

Three-digit numerical code of of the countries or areas or geographic regions in the M49 standard used for statistical processing purposes by the Statistics Division of the United Nations Secretariat.

Descriptor target IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.MCODE356
Unit

n/a

Table 90. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Column Name Range Mean Minimum Q1 Median Q3 Maximum Total Missing Zero Blank Distinct
m49code 3 - 3 396.5 1 155 392 626 894 843 0 ( 0.0% ) 0 ( 0.0% ) 0 ( 0.0% ) 281 ( 33.3% )
Table 91. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Column Name Completeness Uniqueness Most Common Value Least Common Value
m49code
100.00%
33.33%
012 012

Continuous Data Distribution

Figure 80. Distribution of values in the column.

Outliers

Figure 81. Visualization of median, min, max, and outliers in the column.

Completeness

Figure 82. Visualization of completeness of the data in the column.

Uniqueness

Figure 83. Visualization of uniqueness of the data in the column.

Column: isoalpha2

Table 92. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Parameter Content
Column name isoalpha2
Description
Data type String
Descriptor iso-3166:alpha-2CountryCode [UID:0.0.LPHCN4]
Descriptor description

A two-letter code that represents a country name, recommended as the general purpose code.

Descriptor target IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.LPHCN4
Unit

n/a

Table 93. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Column Name Range Mean Minimum Q1 Median Q3 Maximum Total Missing Zero Blank Distinct
isoalpha2 0 - 2 n/a n/a n/a n/a ZW 843 99 ( 11.7% ) 0 ( 0.0% ) 0 ( 0.0% ) 249 ( 29.5% )
Table 94. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Column Name Completeness Uniqueness Most Common Value Least Common Value
isoalpha2
88.26%
29.54%
null DZ

Completeness

Figure 84. Visualization of completeness of the data in the column.

Uniqueness

Figure 85. Visualization of uniqueness of the data in the column.

Column: isoalpha3

Table 95. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Parameter Content
Column name isoalpha3
Description
Data type String
Descriptor iso-3166:alpha-3CountryCode [UID:0.0.LPHCN5]
Descriptor description

A three-letter code that represents a country name, which is usually more closely related to the country name.

Descriptor target IRI https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.LPHCN5
Unit

n/a

Table 96. Structural analysis of the table. Column name (Name); concise description of the column (Description); data type in which values are stored (Data type); EUPH-Descriptor (Descriptor); unit in which the values are provided (Unit).
Column Name Range Mean Minimum Q1 Median Q3 Maximum Total Missing Zero Blank Distinct
isoalpha3 0 - 3 n/a n/a n/a n/a ZWE 843 99 ( 11.7% ) 0 ( 0.0% ) 0 ( 0.0% ) 249 ( 29.5% )
Table 97. Quality analysis of the table. Column name (Name); completeness of the column (Completeness); uniqueness of the column (Uniqueness); most common value in the column (Most Common Value); least common value in the column (Least Common Value).
Column Name Completeness Uniqueness Most Common Value Least Common Value
isoalpha3
88.26%
29.54%
null DZA

Completeness

Figure 86. Visualization of completeness of the data in the column.

Uniqueness

Figure 87. Visualization of uniqueness of the data in the column.