| Unique identifier: | LNGGE20.0.0 |
| Title: | Language |
| Long title: | Partial content of ISO 639 containing information on languages |
| Status: | Quality Validated |
| Current Version: | v. 1.0 |
| Published: | 2023-03-23 |
| Reviewed by: | |
| Citation proposal: | EU Pollinator Hub 2023 Report of dataset Language, v. 1.0 [LNGGE20.0.0]. EU Pollinator Hub. [2026-02-24] app.pollinatorhub.eu |
Data Quality
Under evaluation
Document history
Release
Version v. 1.0 released on 2023-03-23.
Revision
| No | Date | Description | Reason |
|---|---|---|---|
| 1 | 2023-03-23 00:03:00 | Initial release. | n/a |
Abbreviations
No abbreviations.
Executive summary
Data overview:
Data value:
Data description:
Data application:
Unresolved issues:
Introduction
Material and methods
Data acquisition
| No | File | Type | D | M | Arrival | Data points | File size |
|---|---|---|---|---|---|---|---|
| 1 | euph_000020_language_table_iso639_1.csv | CSV - Comma separated values | Yes | No | 2023-09-01 10:09:27 | 736 | n/a |
| 2 | euph_000020_language_table_iso639_2.csv | CSV - Comma separated values | Yes | No | 2023-09-01 10:09:27 | 2,435 | n/a |
| 3 | euph_000020_language_table_iso639_3.csv | CSV - Comma separated values | Yes | No | 2023-09-01 10:09:29 | 63,328 | n/a |
| 4 | euph_000020_language_table_iso639_3_macrolanguages.csv | CSV - Comma separated values | Yes | No | 2023-09-01 10:09:41 | 1,362 | n/a |
| 5 | euph_000020_language_table_iso639_3_names.csv | CSV - Comma separated values | Yes | No | 2023-09-01 10:09:41 | 33,148 | n/a |
| 6 | euph_000020_language_table_iso639_3_retirements.csv | CSV - Comma separated values | Yes | No | 2023-09-01 10:09:49 | 2,316 | n/a |
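The Data points figures appear to equal the number of rows multiplied by the number of columns of each table, using the row totals and column lists given in the table sections of this report. The following is a minimal Python sketch of that check; the figures are copied from this report, while the dictionary layout and function names are illustrative assumptions.

```python
# (rows, columns, reported data points) per table, copied from this report.
TABLES = {
    "iso639_1": (184, 4, 736),
    "iso639_2": (487, 5, 2435),
    "iso639_3": (7916, 8, 63328),
    "iso639_3_macrolanguages": (454, 3, 1362),
    "iso639_3_names": (8287, 4, 33148),
    "iso639_3_retirements": (386, 6, 2316),
}


def data_points(rows: int, columns: int) -> int:
    """Number of cells in a table with the given shape."""
    return rows * columns


def check_tables(tables):
    """Return names of tables whose reported count differs from rows * columns."""
    return [
        name
        for name, (rows, columns, reported) in tables.items()
        if data_points(rows, columns) != reported
    ]


print(check_tables(TABLES))  # an empty list means every count matches
```

If the assumption holds, the list is empty for all six files.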
Data preparation
Data validation
Data analysis
Data description
Dataset
| No | Table | Description |
|---|---|---|
| 1 | Languages | This table contains 2-letter code representations and names of languages obtained from table iso639_2 and EU Regulation No 1 from… |
| 2 | ISO 639:2 | Data of this table was obtained from the maintenance agency of ISO 639-2:1998 (Codes for the representation of names of… |
| 3 | ISO 639:3 | Data of this table was obtained from the maintenance agency of ISO 639-3:2007 (codes for the representation of names of… |
| 4 | ISO 639:3 macrolanguages | Data of this table was obtained from the maintenance agency of ISO 639-3:2007 (codes for the representation of names of… |
| 5 | ISO 639:3 names | Data of this table was obtained from the maintenance agency of ISO 639-3:2007 (codes for the representation of names of… |
| 6 | ISO 639:3 retirements | Data of this table was obtained from the maintenance agency of ISO 639-3:2007 (codes for the representation of names of… |
| Parameter | Content |
|---|---|
| Unique identifier | LNGGE20.0.0 |
| Title | Language |
| Long title | Partial content of ISO 639 containing information on languages |
| Target IRI | https://app.pollinatorhub.eu/dataset-discovery/LNGGE20.0.0 |
| Licence | EU Pollinator Hub |
| DOI | n/a |
| Created | 2023-01-26 |
| Published | 2023-03-23 |
| Contact | n/a |
| Keywords | n/a |
| Data collection years | n/a |
| Regions, the data was collected in | n/a |
| Abstract | This dataset contains information on languages spoken worldwide and contained in ISO 639-3:2007 (codes for the representation of names of languages — Part 3: Alpha-3 code for comprehensive coverage of languages) as well as official languages in the European Union, as defined in Regulation No 1 from 1958, as amended, determining the languages to be used by the European Economic Community. |
| Parameter | Content |
|---|---|
| Name | EU Pollinator Hub |
| Url | |
| Acronym | EUPH |
| IRI | https://app.pollinatorhub.eu/data-providers/euph |
| Address | |
| Country | Belgium |
| Contact | https://www.linkedin.com/company/beelife-european-beekeeping-coordination/ pollinatorhub.eu |
| Description | The EU Pollinator Hub (EUPH) is a data hub related to pollinators, which is provided by the European Food Safety Authority (EFSA). |
Tables
Languages
| Parameter | Content |
|---|---|
| Unique identifier | LNGGE20.LNGGS45.0 |
| Name | Languages |
| Target IRI | https://app.pollinatorhub.eu/dataset-discovery/parts/LNGGE20.LNGGS45.0 |
| Table Type | File |
| Licence | EU Pollinator Hub |
| Description | This table contains 2-letter code representations and names of languages obtained from table iso639_2 and EU Regulation No 1 from 1958 regulating their status in the EU. |
Metadata
| Column Name | Column Description | Datatype | Descriptor | Unit |
|---|---|---|---|---|
| iso639_1_alpha2 | | String | Text [0.0.TEXTA315] | n/a |
| name_en | | String | Text [0.0.TEXTA315] | n/a |
| euofficiallog | | Integer number | Integer [0.0.NTGER313] | n/a |
| euprocedurallog | | Integer number | Integer [0.0.NTGER313] | n/a |
Metadata of individual tables can be found in Annex 1.
Descriptive measures
| Column Name | Range | Mean | Minimum | Q1 | Median | Q3 | Maximum | Total | Missing | Zero | Blank | Distinct |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| iso639_1_alpha2 | 2 - 2 | n/a | aa | n/a | n/a | n/a | zu | 184 | 0 ( 0.0% ) | 0 ( 0.0% ) | 0 ( 0.0% ) | 184 ( 100.0% ) |
| name_en | 3 - 80 | n/a | Ewe | n/a | n/a | n/a | Church Slavi… | 184 | 0 ( 0.0% ) | 0 ( 0.0% ) | 0 ( 0.0% ) | 184 ( 100.0% ) |
| euofficiallog | 0 - 1 | n/a | n/a | n/a | n/a | 1 | 184 | 0 ( 0.0% ) | 0 ( 0.0% ) | 0 ( 0.0% ) | 2 ( 1.1% ) |
| euprocedurallog | 0 - 1 | n/a | n/a | n/a | n/a | 1 | 184 | 0 ( 0.0% ) | 0 ( 0.0% ) | 0 ( 0.0% ) | 2 ( 1.1% ) |
Quality measures
| Column Name | Completeness | Uniqueness | Most Common Value | Least Common Value |
|---|---|---|---|---|
| iso639_1_alpha2 | 100.00% | 100.00% | aa | aa |
| name_en | 100.00% | 100.00% | Afar | Afar |
| euofficiallog | 100.00% | 1.09% | 0 | 1 |
| euprocedurallog | 100.00% | 1.09% | 0 | 1 |
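The Uniqueness values in this report are consistent with the number of distinct values divided by the total number of values, expressed as a percentage (for example 2 distinct values in 184 rows gives 1.09%). The sketch below works under that assumption; the function names are illustrative and not part of the EU Pollinator Hub tooling.

```python
def completeness(values):
    """Share of values that are not None, in percent."""
    return 100.0 * sum(v is not None for v in values) / len(values)


def uniqueness(values):
    """Share of distinct values among all values, in percent."""
    return 100.0 * len(set(values)) / len(values)


# A 0/1 flag column with 184 rows has two distinct values.
# The split between 1s and 0s is illustrative, not taken from the dataset.
flags = [1] * 24 + [0] * 160

print(round(uniqueness(flags), 2))    # 1.09
print(round(completeness(flags), 2))  # 100.0
```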
Changes made to preparatory file
Changes made to data
Unresolved issues
ISO 639:2
| Parameter | Content |
|---|---|
| Unique identifier | LNGGE20.ISOAB46.0 |
| Name | ISO 639:2 |
| Target IRI | https://app.pollinatorhub.eu/dataset-discovery/parts/LNGGE20.ISOAB46.0 |
| Table Type | File |
| Licence | EU Pollinator Hub |
| Description | Data of this table was obtained from the maintenance agency of ISO 639-2:1998 (Codes for the representation of names of languages — Part 2: Alpha-3 code), The Library of Congress (LOC), which is an agency of the legislative branch of the U.S. government. It contains the content of ISO 639-2:1998. |
Metadata
| Column Name | Column Description | Datatype | Descriptor | Unit |
|---|---|---|---|---|
| iso639_2_alpha3b | | String | ISO 639-2 3-Alpha Identifier for Bibliographic [LNGGE20.ISOAB46.SLPHD102] | n/a |
| iso639_2_alpha3t | | String | ISO 639-2 3-Alpha Identifier for Terminology [LNGGE20.ISOAB46.SLPHD103] | n/a |
| iso639_1_alpha2 | | String | ISO 639-1 2-Alpha Identifier [LNGGE20.ISOAB46.SLPHD104] | n/a |
| name_en | | String | Name [LNGGE20.ISOAB46.NAMEA105] | n/a |
| name_fr | | String | French Name [LNGGE20.ISOAB46.FRNCH106] | n/a |
Metadata of individual tables can be found in Annex 1.
Descriptive measures
| Column Name | Range | Mean | Minimum | Q1 | Median | Q3 | Maximum | Total | Missing | Zero | Blank | Distinct |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| iso639_2_alpha3b | 3 - 3 | n/a | aar | n/a | n/a | n/a | zza | 487 | 0 ( 0.0% ) | 0 ( 0.0% ) | 0 ( 0.0% ) | 487 ( 100.0% ) |
| iso639_2_alpha3t | 3 - 3 | n/a | aar | n/a | n/a | n/a | zza | 487 | 0 ( 0.0% ) | 0 ( 0.0% ) | 0 ( 0.0% ) | 487 ( 100.0% ) |
| iso639_1_alpha2 | 2 - 4 | n/a | aa | n/a | n/a | n/a | NULL | 487 | 0 ( 0.0% ) | 0 ( 0.0% ) | 0 ( 0.0% ) | 185 ( 38.0% ) |
| name_en | 2 - 80 | n/a | Ga | n/a | n/a | n/a | Church Slavi… | 487 | 0 ( 0.0% ) | 0 ( 0.0% ) | 0 ( 0.0% ) | 487 ( 100.0% ) |
| name_fr | 2 - 62 | n/a | ga | n/a | n/a | n/a | slavon d'égl… | 487 | 0 ( 0.0% ) | 0 ( 0.0% ) | 0 ( 0.0% ) | 487 ( 100.0% ) |
Quality measures
| Column Name | Completeness | Uniqueness | Most Common Value | Least Common Value |
|---|---|---|---|---|
| iso639_2_alpha3b | 100.00% | 100.00% | aar | aar |
| iso639_2_alpha3t | 100.00% | 100.00% | aar | aar |
| iso639_1_alpha2 | 100.00% | 37.99% | NULL | aa |
| name_en | 100.00% | 100.00% | Afar | Afar |
| name_fr | 100.00% | 100.00% | afar | afar |
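For iso639_1_alpha2 the report lists a range of 2 - 4, a maximum of NULL and no missing values, which suggests that absent codes are stored as the literal string NULL rather than as empty cells. The sketch below shows how a consumer might normalise this before computing completeness. The two sample rows, the assumed header row and the helper name `read_rows` are illustrative assumptions and do not describe the actual file layout.

```python
import csv
import io

# Tiny stand-in for the ISO 639:2 CSV file. The header row is assumed to
# match the column names listed in the metadata table.
SAMPLE = """iso639_2_alpha3b,iso639_2_alpha3t,iso639_1_alpha2,name_en,name_fr
aar,aar,aa,Afar,afar
ace,ace,NULL,Achinese,aceh
"""


def read_rows(text, null_token="NULL"):
    """Parse CSV text and map the literal null token to None."""
    reader = csv.DictReader(io.StringIO(text))
    return [
        {key: (None if value == null_token else value) for key, value in row.items()}
        for row in reader
    ]


rows = read_rows(SAMPLE)
with_code = [r for r in rows if r["iso639_1_alpha2"] is not None]
print(len(rows), len(with_code))  # 2 1
```

With the placeholder mapped to `None`, a completeness measure would reflect the share of languages that actually have a 2-letter code.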
Changes made to preparatory file
Changes made to data
Unresolved issues
ISO 639:3
| Parameter | Content |
|---|---|
| Unique identifier | LNGGE20.ISOAB47.0 |
| Name | ISO 639:3 |
| Target IRI | https://app.pollinatorhub.eu/dataset-discovery/parts/LNGGE20.ISOAB47.0 |
| Table Type | File |
| Licence | EU Pollinator Hub |
| Description | Data of this table was obtained from the maintenance agency of ISO 639-3:2007 (codes for the representation of names of languages — Part 3: Alpha-3 code for comprehensive coverage of languages), SIL International, a Christian faith-based global not-for-profit organisation involved in projects on literacy, education, linguistic research and language tools. It contains the content of ISO 639-3:2007. |
Metadata
| Column Name | Column Description | Datatype | Descriptor | Unit |
|---|---|---|---|---|
| iso639_3_alpha | | String | iso-639:alpha-3LanguageCode [0.0.LPHLN107] | n/a |
| iso639_2_alpha3b | | String | ISO 639-2 3-Alpha Identifier for Bibliographic [LNGGE20.ISOAB47.SLPHD108] | n/a |
| iso639_2_alpha3t | | String | ISO 639-2 3-Alpha Identifier for Terminology [LNGGE20.ISOAB47.SLPHD109] | n/a |
| iso639_1_alpha2 | | String | iso-639:alpha-2LanguageCode [0.0.LPHLN110] | n/a |
| scope | | String | Scope [LNGGE20.ISOAB47.SCOPE111] | n/a |
| type | | String | Type [LNGGE20.ISOAB47.TYPEA112] | n/a |
| ref_name | | String | Reference name [LNGGE20.ISOAB47.RFRNC113] | n/a |
| comment | | String | Comment [LNGGE20.ISOAB47.CMMNT114] | n/a |
Metadata of individual tables can be found in Annex 1.
Descriptive measures
| Column Name | Range | Mean | Minimum | Q1 | Median | Q3 | Maximum | Total | Missing | Zero | Blank | Distinct |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| iso639_3_alpha | 3 - 3 | n/a | aaa | n/a | n/a | n/a | zzj | 7,916 | 0 ( 0.0% ) | 0 ( 0.0% ) | 0 ( 0.0% ) | 7,916 ( 100.0% ) |
| iso639_2_alpha3b | 3 - 4 | n/a | aar | n/a | n/a | n/a | NULL | 7,916 | 0 ( 0.0% ) | 0 ( 0.0% ) | 0 ( 0.0% ) | 421 ( 5.3% ) |
| iso639_2_alpha3t | 3 - 4 | n/a | aar | n/a | n/a | n/a | NULL | 7,916 | 0 ( 0.0% ) | 0 ( 0.0% ) | 0 ( 0.0% ) | 421 ( 5.3% ) |
| iso639_1_alpha2 | 2 - 4 | n/a | aa | n/a | n/a | n/a | NULL | 7,916 | 0 ( 0.0% ) | 0 ( 0.0% ) | 0 ( 0.0% ) | 185 ( 2.3% ) |
| scope | 1 - 1 | n/a | I | n/a | n/a | n/a | S | 7,916 | 0 ( 0.0% ) | 0 ( 0.0% ) | 0 ( 0.0% ) | 3 ( 0.0% ) |
| type | 1 - 1 | n/a | A | n/a | n/a | n/a | S | 7,916 | 0 ( 0.0% ) | 0 ( 0.0% ) | 0 ( 0.0% ) | 6 ( 0.1% ) |
| ref_name | 1 - 58 | n/a | E | n/a | n/a | n/a | Interlingua… | 7,916 | 0 ( 0.0% ) | 0 ( 0.0% ) | 0 ( 0.0% ) | 7,916 ( 100.0% ) |
| comment | 4 - 42 | n/a | NULL | n/a | n/a | n/a | Code element… | 7,916 | 0 ( 0.0% ) | 0 ( 0.0% ) | 0 ( 0.0% ) | 2 ( 0.0% ) |
Quality measures
| Column Name | Completeness | Uniqueness | Most Common Value | Least Common Value |
|---|---|---|---|---|
| iso639_3_alpha | 100.00% | 100.00% | aaa | aaa |
| iso639_2_alpha3b | 100.00% | 5.32% | NULL | aar |
| iso639_2_alpha3t | 100.00% | 5.32% | NULL | aar |
| iso639_1_alpha2 | 100.00% | 2.34% | NULL | aa |
| scope | 100.00% | 0.04% | I | S |
| type | 100.00% | 0.08% | L | S |
| ref_name | 100.00% | 100.00% | Ghotuo | Ghotuo |
| comment | 100.00% | 0.03% | NULL | Code element for 639-1 has been deprecated |
Changes made to preparatory file
Changes made to data
Unresolved issues
ISO 639:3 macrolanguages
| Parameter | Content |
|---|---|
| Unique identifier | LNGGE20.SMCRL48.0 |
| Name | ISO 639:3 macrolanguages |
| Target IRI | https://app.pollinatorhub.eu/dataset-discovery/parts/LNGGE20.SMCRL48.0 |
| Table Type | File |
| Licence | EU Pollinator Hub |
| Description | Data of this table was obtained from the maintenance agency of ISO 639-3:2007 (codes for the representation of names of languages — Part 3: Alpha-3 code for comprehensive coverage of languages), SIL International, a Christian faith-based global not-for-profit organisation involved in projects on literacy, education, linguistic research and language tools. It contains the complete set of mappings from macrolanguages to the individual languages that comprise them. |
Metadata
| Column Name | Column Description | Datatype | Descriptor | Unit |
|---|---|---|---|---|
| m_id | | String | Macrolanguage Identifier [LNGGE20.SMCRL48.MCRLN115] | n/a |
| i_id | | String | Language Identifier [LNGGE20.SMCRL48.LNGGD116] | n/a |
| i_status | | String | Status Code [LNGGE20.SMCRL48.STTSC117] | n/a |
Metadata of individual tables can be found in Annex 1.
Descriptive measures
| Column Name | Range | Mean | Minimum | Q1 | Median | Q3 | Maximum | Total | Missing | Zero | Blank | Distinct |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| m_id | 3 - 3 | n/a | aka | n/a | n/a | n/a | zza | 454 | 0 ( 0.0% ) | 0 ( 0.0% ) | 0 ( 0.0% ) | 62 ( 13.7% ) |
| i_id | 3 - 3 | n/a | aae | n/a | n/a | n/a | zzj | 454 | 0 ( 0.0% ) | 0 ( 0.0% ) | 0 ( 0.0% ) | 454 ( 100.0% ) |
| i_status | 1 - 1 | n/a | A | n/a | n/a | n/a | R | 454 | 0 ( 0.0% ) | 0 ( 0.0% ) | 0 ( 0.0% ) | 2 ( 0.4% ) |
Quality measures
| Column Name | Completeness | Uniqueness | Most Common Value | Least Common Value |
|---|---|---|---|---|
| m_id | 100.00% | 13.66% | zap | syr |
| i_id | 100.00% | 100.00% | aae | aae |
| i_status | 100.00% | 0.44% | A | R |
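Because each row maps one individual language (i_id) to its macrolanguage (m_id), the table can be turned into a lookup from macrolanguage to member languages. The sketch below assumes that the i_status value A marks an active mapping and R a retired one. The four sample rows and the function name are illustrative and are not an extract of the table.

```python
from collections import defaultdict

# Illustrative rows in the (m_id, i_id, i_status) layout of this table.
ROWS = [
    ("ara", "arb", "A"),
    ("ara", "arz", "A"),
    ("zho", "cmn", "A"),
    ("zho", "yue", "A"),
]


def members_by_macrolanguage(rows, active_only=True):
    """Group individual language codes under their macrolanguage code."""
    groups = defaultdict(list)
    for m_id, i_id, i_status in rows:
        if active_only and i_status != "A":
            continue  # skip mappings flagged as retired
        groups[m_id].append(i_id)
    return dict(groups)


print(members_by_macrolanguage(ROWS))
# {'ara': ['arb', 'arz'], 'zho': ['cmn', 'yue']}
```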
Changes made to preparatory file
Changes made to data
Unresolved issues
ISO 639:3 names
| Parameter | Content |
|---|---|
| Unique identifier | LNGGE20.SNMES49.0 |
| Name | ISO 639:3 names |
| Target IRI | https://app.pollinatorhub.eu/dataset-discovery/parts/LNGGE20.SNMES49.0 |
| Table Type | File |
| Licence | EU Pollinator Hub |
| Description | Data of this table was obtained from the maintenance agency of ISO 639-3:2007 (codes for the representation of names of languages — Part 3: Alpha-3 code for comprehensive coverage of languages), SIL International, a Christian faith-based global not-for-profit organisation involved in projects on literacy, education, linguistic research and language tools. It contains language names, primarily in English forms or variant anglicized spellings of indigenous names. |
Metadata
| Column Name | Column Description | Datatype | Descriptor | Unit |
|---|---|---|---|---|
| id | | String | Identifier [LNGGE20.SNMES49.DNTFR118] | n/a |
| iso639_3_alpha | | String | ISO 639-2 3-Alpha Identifier [LNGGE20.SNMES49.SLPHD119] | n/a |
| print_name | | String | Print Name [LNGGE20.SNMES49.PRNTN120] | n/a |
| inverted_name | | String | Inverted Name [LNGGE20.SNMES49.NVRTD121] | n/a |
Metadata of individual tables can be found in Annex 1.
Descriptive measures
| Column Name | Range | Mean | Minimum | Q1 | Median | Q3 | Maximum | Total | Missing | Zero | Blank | Distinct |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| id | 1 - 4 | 4,144.0 | 1 | 2,072 | 4,144 | 6,216 | 8,287 | 8,287 | 0 ( 0.0% ) | 0 ( 0.0% ) | 0 ( 0.0% ) | 8,287 ( 100.0% ) |
| iso639_3_alpha | 3 - 3 | n/a | aaa | n/a | n/a | n/a | zzj | 8,287 | 0 ( 0.0% ) | 0 ( 0.0% ) | 0 ( 0.0% ) | 7,916 ( 95.5% ) |
| print_name | 1 - 58 | n/a | E | n/a | n/a | n/a | Interlingua… | 8,287 | 0 ( 0.0% ) | 0 ( 0.0% ) | 0 ( 0.0% ) | 8,286 ( 100.0% ) |
| inverted_name | 1 - 58 | n/a | E | n/a | n/a | n/a | Interlingua… | 8,287 | 0 ( 0.0% ) | 0 ( 0.0% ) | 0 ( 0.0% ) | 8,286 ( 100.0% ) |
Quality measures
| Column Name | Completeness | Uniqueness | Most Common Value | Least Common Value |
|---|---|---|---|---|
| id | 100.00% | 100.00% | 1 | 1 |
| iso639_3_alpha | 100.00% | 95.52% | zza | aaa |
| print_name | 100.00% | 99.99% | Wyandot | Ghotuo |
| inverted_name | 100.00% | 99.99% | Wyandot | Ghotuo |
Changes made to preparatory file
Changes made to data
Unresolved issues
ISO 639:3 retirements
| Parameter | Content |
|---|---|
| Unique identifier | LNGGE20.SRTRM50.0 |
| Name | ISO 639:3 retirements |
| Target IRI | https://app.pollinatorhub.eu/dataset-discovery/parts/LNGGE20.SRTRM50.0 |
| Table Type | File |
| Licence | EU Pollinator Hub |
| Description | Data of this table was obtained from the maintenance agency of ISO 639-3:2007 (codes for the representation of names of languages — Part 3: Alpha-3 code for comprehensive coverage of languages), SIL International, a Christian faith-based global not-for-profit organisation involved in projects on literacy, education, linguistic research and language tools. It contains a complete listing of the code elements that have been deprecated with instructions on how to update existing data. |
Metadata
| Column Name | Column Description | Datatype | Descriptor | Unit |
|---|---|---|---|---|
| id | | String | ISO 639-2 3-Alpha Identifier [LNGGE20.SRTRM50.SLPHD122] | n/a |
| ref_name | | String | iso-639:languageName [0.0.LNGGN123] | n/a |
| ret_reason | | String | Retirement Reason [LNGGE20.SRTRM50.RTRMN124] | n/a |
| change_to | | String | Change To [LNGGE20.SRTRM50.CHNGT125] | n/a |
| ret_remedy | | String | Retirement Remedy [LNGGE20.SRTRM50.RTRMN126] | n/a |
| effective | | Date | Effective Date [LNGGE20.SRTRM50.FFCTV127] | n/a |
Metadata of individual tables can be found in Annex 1.
Descriptive measures
| Column Name | Range | Mean | Minimum | Q1 | Median | Q3 | Maximum | Total | Missing | Zero | Blank | Distinct |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| id | 3 - 3 | n/a | aam | n/a | n/a | n/a | zua | 386 | 0 ( 0.0% ) | 0 ( 0.0% ) | 0 ( 0.0% ) | 386 ( 100.0% ) |
| ref_name | 3 - 36 | n/a | Ahe | n/a | n/a | n/a | Borna (Democ… | 386 | 0 ( 0.0% ) | 0 ( 0.0% ) | 0 ( 0.0% ) | 386 ( 100.0% ) |
| ret_reason | 1 - 1 | n/a | C | n/a | n/a | n/a | S | 386 | 0 ( 0.0% ) | 0 ( 0.0% ) | 0 ( 0.0% ) | 5 ( 1.3% ) |
| change_to | 3 - 4 | n/a | aas | n/a | n/a | n/a | NULL | 386 | 0 ( 0.0% ) | 0 ( 0.0% ) | 0 ( 0.0% ) | 151 ( 39.1% ) |
| ret_remedy | 4 - 253 | n/a | NULL | n/a | n/a | n/a | Split into [… | 386 | 0 ( 0.0% ) | 0 ( 0.0% ) | 0 ( 0.0% ) | 105 ( 27.2% ) |
| effective | 10 - 10 | 2,012.0 | 2007-02-01 | 2,008 | 2,012 | 2,016 | 2023-01-20 | 386 | 0 ( 0.0% ) | 0 ( 0.0% ) | 0 ( 0.0% ) | 28 ( 7.3% ) |
Quality measures
| Column Name | Completeness | Uniqueness | Most Common Value | Least Common Value |
|---|---|---|---|---|
| id | 100.00% | 100.00% | aam | aam |
| ref_name | 100.00% | 100.00% | Aramanik | Aramanik |
| ret_reason | 100.00% | 1.30% | M | C |
| change_to | 100.00% | 39.12% | NULL | aas |
| ret_remedy | 100.00% | 27.20% | NULL | Split into Pahanan Agta [apf] and Paranan [prf] (new identifier) |
| effective | 100.00% | 7.25% | 2008-01-14 | 2007-02-01 |
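Since the table is described as listing deprecated code elements together with instructions on how to update existing data, one plausible use is to follow the change_to column until a code that is no longer retired is reached. The sketch below works under that assumption. The retirement rows use made-up codes, the function name `resolve` is hypothetical, and ret_reason values are carried along without being interpreted.

```python
# Hypothetical retirement rows: id -> (ret_reason, change_to).
# The codes are made up; change_to is None where the table holds NULL.
RETIREMENTS = {
    "qaa": ("M", "qab"),
    "qab": ("C", "qac"),
    "qad": ("S", None),
}


def resolve(code, retirements, max_hops=10):
    """Follow change_to links until reaching a code that is not retired.

    Returns None when a retired code has no single replacement (for
    example a split), so the caller has to consult ret_remedy instead.
    """
    for _ in range(max_hops):
        if code not in retirements:
            return code
        _, change_to = retirements[code]
        if change_to is None:
            return None
        code = change_to
    raise ValueError("change_to chain too long or cyclic")


print(resolve("qaa", RETIREMENTS))  # qac
print(resolve("qad", RETIREMENTS))  # None
print(resolve("eng", RETIREMENTS))  # eng
```

The hop limit is a defensive choice so that an unexpected cycle in the data raises an error instead of looping forever.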
Changes made to preparatory file
Changes made to data
Unresolved issues
References
There are no sources in the current document.
Annex 1: Table column reports
Table: Languages
Column: iso639_1_alpha2
| Parameter | Content |
|---|---|
| Column name | iso639_1_alpha2 |
| Description | |
| Data type | String |
| Descriptor | Text [UID:0.0.TEXTA315] |
| Descriptor description | |
| Descriptor target IRI | https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.TEXTA315 |
| Unit | n/a |
| Column Name | Range | Mean | Minimum | Q1 | Median | Q3 | Maximum | Total | Missing | Zero | Blank | Distinct |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| iso639_1_alpha2 | 2 - 2 | n/a | aa | n/a | n/a | n/a | zu | 184 | 0 ( 0.0% ) | 0 ( 0.0% ) | 0 ( 0.0% ) | 184 ( 100.0% ) |
| Column Name | Completeness | Uniqueness | Most Common Value | Least Common Value |
|---|---|---|---|---|
| iso639_1_alpha2 | 100.00% | 100.00% | aa | aa |
Completeness
Uniqueness
Column: name_en
| Parameter | Content |
|---|---|
| Column name | name_en |
| Description | |
| Data type | String |
| Descriptor | Text [UID:0.0.TEXTA315] |
| Descriptor description | |
| Descriptor target IRI | https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.TEXTA315 |
| Unit | n/a |
| Column Name | Range | Mean | Minimum | Q1 | Median | Q3 | Maximum | Total | Missing | Zero | Blank | Distinct |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| name_en | 3 - 80 | n/a | Ewe | n/a | n/a | n/a | Church Slavi… | 184 | 0 ( 0.0% ) | 0 ( 0.0% ) | 0 ( 0.0% ) | 184 ( 100.0% ) |
| Column Name | Completeness | Uniqueness | Most Common Value | Least Common Value |
|---|---|---|---|---|
| name_en | 100.00% | 100.00% | Afar | Afar |
Completeness
Uniqueness
Column: euofficiallog
| Parameter | Content |
|---|---|
| Column name | euofficiallog |
| Description | |
| Data type | Integer number |
| Descriptor | Integer [UID:0.0.NTGER313] |
| Descriptor description | |
| Descriptor target IRI | https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.NTGER313 |
| Unit | n/a |
| Column Name | Range | Mean | Minimum | Q1 | Median | Q3 | Maximum | Total | Missing | Zero | Blank | Distinct |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| euofficiallog | 0 - 1 | n/a | n/a | n/a | n/a | 1 | 184 | 0 ( 0.0% ) | 0 ( 0.0% ) | 0 ( 0.0% ) | 2 ( 1.1% ) |
| Column Name | Completeness | Uniqueness | Most Common Value | Least Common Value |
|---|---|---|---|---|
| euofficiallog | 100.00% | 1.09% | 0 | 1 |
Data Distribution Top 20
Completeness
Uniqueness
Column: euprocedurallog
| Parameter | Content |
|---|---|
| Column name | euprocedurallog |
| Description | |
| Data type | Integer number |
| Descriptor | Integer [UID:0.0.NTGER313] |
| Descriptor description | |
| Descriptor target IRI | https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.NTGER313 |
| Unit | n/a |
| Column Name | Range | Mean | Minimum | Q1 | Median | Q3 | Maximum | Total | Missing | Zero | Blank | Distinct |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| euprocedurallog | 0 - 1 | n/a | n/a | n/a | n/a | 1 | 184 | 0 ( 0.0% ) | 0 ( 0.0% ) | 0 ( 0.0% ) | 2 ( 1.1% ) |
| Column Name | Completeness | Uniqueness | Most Common Value | Least Common Value |
|---|---|---|---|---|
| euprocedurallog | 100.00% | 1.09% | 0 | 1 |
Data Distribution Top 20
Completeness
Uniqueness
Table: ISO 639:2
Column: iso639_2_alpha3b
| Parameter | Content |
|---|---|
| Column name | iso639_2_alpha3b |
| Description | |
| Data type | String |
| Descriptor | ISO 639-2 3-Alpha Identifier for Bibliographic [UID:LNGGE20.ISOAB46.SLPHD102] |
| Descriptor description | ISO 639-2 three-letter alphabetic code for the representation of names of languages for bibliographic applications. |
| Descriptor target IRI | https://app.pollinatorhub.eu/vocabulary/descriptors/LNGGE20.ISOAB46.SLPHD102 |
| Unit | n/a |
| Column Name | Range | Mean | Minimum | Q1 | Median | Q3 | Maximum | Total | Missing | Zero | Blank | Distinct |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| iso639_2_alpha3b | 3 - 3 | n/a | aar | n/a | n/a | n/a | zza | 487 | 0 ( 0.0% ) | 0 ( 0.0% ) | 0 ( 0.0% ) | 487 ( 100.0% ) |
| Column Name | Completeness | Uniqueness | Most Common Value | Least Common Value |
|---|---|---|---|---|
| iso639_2_alpha3b | 100.00% | 100.00% | aar | aar |
Completeness
Uniqueness
Column: iso639_2_alpha3t
| Parameter | Content |
|---|---|
| Column name | iso639_2_alpha3t |
| Description | |
| Data type | String |
| Descriptor | ISO 639-2 3-Alpha Identifier for Terminology [UID:LNGGE20.ISOAB46.SLPHD103] |
| Descriptor description | ISO 639-2 three-letter alphabetic code for the representation of names of languages for terminology applications. |
| Descriptor target IRI | https://app.pollinatorhub.eu/vocabulary/descriptors/LNGGE20.ISOAB46.SLPHD103 |
| Unit | n/a |
| Column Name | Range | Mean | Minimum | Q1 | Median | Q3 | Maximum | Total | Missing | Zero | Blank | Distinct |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| iso639_2_alpha3t | 3 - 3 | n/a | aar | n/a | n/a | n/a | zza | 487 | 0 ( 0.0% ) | 0 ( 0.0% ) | 0 ( 0.0% ) | 487 ( 100.0% ) |
| Column Name | Completeness | Uniqueness | Most Common Value | Least Common Value |
|---|---|---|---|---|
| iso639_2_alpha3t | 100.00% | 100.00% | aar | aar |
Completeness
Uniqueness
Column: iso639_1_alpha2
| Parameter | Content |
|---|---|
| Column name | iso639_1_alpha2 |
| Description | |
| Data type | String |
| Descriptor | ISO 639-1 2-Alpha Identifier [UID:LNGGE20.ISOAB46.SLPHD104] |
| Descriptor description | Equivalent 639-1 identifier, if there is one. |
| Descriptor target IRI | https://app.pollinatorhub.eu/vocabulary/descriptors/LNGGE20.ISOAB46.SLPHD104 |
| Unit | n/a |
| Column Name | Range | Mean | Minimum | Q1 | Median | Q3 | Maximum | Total | Missing | Zero | Blank | Distinct |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| iso639_1_alpha2 | 2 - 4 | n/a | aa | n/a | n/a | n/a | NULL | 487 | 0 ( 0.0% ) | 0 ( 0.0% ) | 0 ( 0.0% ) | 185 ( 38.0% ) |
| Column Name | Completeness | Uniqueness | Most Common Value | Least Common Value |
|---|---|---|---|---|
| iso639_1_alpha2 | 100.00% | 37.99% | NULL | aa |
Completeness
Uniqueness
Column: name_en
| Parameter | Content |
|---|---|
| Column name | name_en |
| Description | |
| Data type | String |
| Descriptor | Name [UID:LNGGE20.ISOAB46.NAMEA105] |
| Descriptor description | Name of the language in English and French from file iso639_2.csv. |
| Descriptor target IRI | https://app.pollinatorhub.eu/vocabulary/descriptors/LNGGE20.ISOAB46.NAMEA105 |
| Unit | n/a |
| Column Name | Range | Mean | Minimum | Q1 | Median | Q3 | Maximum | Total | Missing | Zero | Blank | Distinct |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| name_en | 2 - 80 | n/a | Ga | n/a | n/a | n/a | Church Slavi… | 487 | 0 ( 0.0% ) | 0 ( 0.0% ) | 0 ( 0.0% ) | 487 ( 100.0% ) |
| Column Name | Completeness | Uniqueness | Most Common Value | Least Common Value |
|---|---|---|---|---|
| name_en | 100.00% | 100.00% | Afar | Afar |
Completeness
Uniqueness
Column: name_fr
| Parameter | Content |
|---|---|
| Column name | name_fr |
| Description | |
| Data type | String |
| Descriptor | French Name [UID:LNGGE20.ISOAB46.FRNCH106] |
| Descriptor description | Name of the language in French from file iso639_2.csv. |
| Descriptor target IRI | https://app.pollinatorhub.eu/vocabulary/descriptors/LNGGE20.ISOAB46.FRNCH106 |
| Unit | n/a |
| Column Name | Range | Mean | Minimum | Q1 | Median | Q3 | Maximum | Total | Missing | Zero | Blank | Distinct |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| name_fr | 2 - 62 | n/a | ga | n/a | n/a | n/a | slavon d'égl… | 487 | 0 ( 0.0% ) | 0 ( 0.0% ) | 0 ( 0.0% ) | 487 ( 100.0% ) |
| Column Name | Completeness | Uniqueness | Most Common Value | Least Common Value |
|---|---|---|---|---|
| name_fr | 100.00% | 100.00% | afar | afar |
Completeness
Uniqueness
Table: ISO 639:3
Column: iso639_3_alpha
| Parameter | Content |
|---|---|
| Column name | iso639_3_alpha |
| Description | |
| Data type | String |
| Descriptor | iso-639:alpha-3LanguageCode [UID:0.0.LPHLN107] |
| Descriptor description | |
| Descriptor target IRI | https://app.pollinatorhub.eu/vocabulary/descriptors/LNGGE20.ISOAB47.SLPHD107 |
| Unit | n/a |
| Column Name | Range | Mean | Minimum | Q1 | Median | Q3 | Maximum | Total | Missing | Zero | Blank | Distinct |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| iso639_3_alpha | 3 - 3 | n/a | aaa | n/a | n/a | n/a | zzj | 7,916 | 0 ( 0.0% ) | 0 ( 0.0% ) | 0 ( 0.0% ) | 7,916 ( 100.0% ) |
| Column Name | Completeness | Uniqueness | Most Common Value | Least Common Value |
|---|---|---|---|---|
| iso639_3_alpha | 100.00% | 100.00% | aaa | aaa |
Completeness
Uniqueness
Column: iso639_2_alpha3b
| Parameter | Content |
|---|---|
| Column name | iso639_2_alpha3b |
| Description | |
| Data type | String |
| Descriptor | ISO 639-2 3-Alpha Identifier for Bibliographic [UID:LNGGE20.ISOAB47.SLPHD108] |
| Descriptor description | Equivalent 639-2 identifier of the bibliographic applications code set, if there is one. |
| Descriptor target IRI | https://app.pollinatorhub.eu/vocabulary/descriptors/LNGGE20.ISOAB47.SLPHD108 |
| Unit | n/a |
| Column Name | Range | Mean | Minimum | Q1 | Median | Q3 | Maximum | Total | Missing | Zero | Blank | Distinct |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| iso639_2_alpha3b | 3 - 4 | n/a | aar | n/a | n/a | n/a | NULL | 7,916 | 0 ( 0.0% ) | 0 ( 0.0% ) | 0 ( 0.0% ) | 421 ( 5.3% ) |
| Column Name | Completeness | Uniqueness | Most Common Value | Least Common Value |
|---|---|---|---|---|
| iso639_2_alpha3b | 100.00% | 5.32% | NULL | aar |
Completeness
Uniqueness
Column: iso639_2_alpha3t
| Parameter | Content |
|---|---|
| Column name | iso639_2_alpha3t |
| Description | |
| Data type | String |
| Descriptor | ISO 639-2 3-Alpha Identifier for Terminology [UID:LNGGE20.ISOAB47.SLPHD109] |
| Descriptor description | Equivalent 639-2 identifier of the terminology applications code set, if there is one. |
| Descriptor target IRI | https://app.pollinatorhub.eu/vocabulary/descriptors/LNGGE20.ISOAB47.SLPHD109 |
| Unit | n/a |
| Column Name | Range | Mean | Minimum | Q1 | Median | Q3 | Maximum | Total | Missing | Zero | Blank | Distinct |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| iso639_2_alpha3t | 3 - 4 | n/a | aar | n/a | n/a | n/a | NULL | 7,916 | 0 ( 0.0% ) | 0 ( 0.0% ) | 0 ( 0.0% ) | 421 ( 5.3% ) |
| Column Name | Completeness | Uniqueness | Most Common Value | Least Common Value |
|---|---|---|---|---|
| iso639_2_alpha3t | 100.00% | 5.32% | NULL | aar |
Completeness
Uniqueness
Column: iso639_1_alpha2
| Parameter | Content |
|---|---|
| Column name | iso639_1_alpha2 |
| Description | |
| Data type | String |
| Descriptor | iso-639:alpha-2LanguageCode [UID:0.0.LPHLN110] |
| Descriptor description | |
| Descriptor target IRI | https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.LPHLN110 |
| Unit | n/a |
| Column Name | Range | Mean | Minimum | Q1 | Median | Q3 | Maximum | Total | Missing | Zero | Blank | Distinct |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| iso639_1_alpha2 | 2 - 4 | n/a | aa | n/a | n/a | n/a | NULL | 7,916 | 0 ( 0.0% ) | 0 ( 0.0% ) | 0 ( 0.0% ) | 185 ( 2.3% ) |
| Column Name | Completeness | Uniqueness | Most Common Value | Least Common Value |
|---|---|---|---|---|
| iso639_1_alpha2 | 100.00% | 2.34% | NULL | aa |
Completeness
Uniqueness
Column: scope
| Parameter | Content |
|---|---|
| Column name | scope |
| Description | |
| Data type | String |
| Descriptor | Scope [UID:LNGGE20.ISOAB47.SCOPE111] |
| Descriptor description | One of the following terms: I (Individual), M (Macrolanguage), S (Special). |
| Descriptor target IRI | https://app.pollinatorhub.eu/vocabulary/descriptors/LNGGE20.ISOAB47.SCOPE111 |
| Unit | n/a |
| Column Name | Range | Mean | Minimum | Q1 | Median | Q3 | Maximum | Total | Missing | Zero | Blank | Distinct |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| scope | 1 - 1 | n/a | I | n/a | n/a | n/a | S | 7,916 | 0 ( 0.0% ) | 0 ( 0.0% ) | 0 ( 0.0% ) | 3 ( 0.0% ) |
| Column Name | Completeness | Uniqueness | Most Common Value | Least Common Value |
|---|---|---|---|---|
| scope | 100.00% | 0.04% | I | S |
Data Distribution Top 20
Completeness
Uniqueness
Column: type
| Parameter | Content |
|---|---|
| Column name | type |
| Description | |
| Data type | String |
| Descriptor | Type [UID:LNGGE20.ISOAB47.TYPEA112] |
| Descriptor description | One of the following terms: A (Ancient), C (Constructed), E (Extinct), H (Historical), L (Living), S (Special). |
| Descriptor target IRI | https://app.pollinatorhub.eu/vocabulary/descriptors/LNGGE20.ISOAB47.TYPEA112 |
| Unit | n/a |
| Column Name | Range | Mean | Minimum | Q1 | Median | Q3 | Maximum | Total | Missing | Zero | Blank | Distinct |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| type | 1 - 1 | n/a | A | n/a | n/a | n/a | S | 7,916 | 0 ( 0.0% ) | 0 ( 0.0% ) | 0 ( 0.0% ) | 6 ( 0.1% ) |
| Column Name | Completeness | Uniqueness | Most Common Value | Least Common Value |
|---|---|---|---|---|
| type | 100.00% | 0.08% | L | S |
Data Distribution Top 20
Completeness
Uniqueness
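The `scope` and `type` columns hold single-letter codes whose meanings are listed in their descriptor descriptions. A minimal sketch of decoding them is shown here; the dictionaries copy the terms from those descriptions, while the names `SCOPE_LABELS`, `TYPE_LABELS` and `describe` are hypothetical helpers and not part of the dataset.

```python
# Code-to-label maps taken from the descriptor descriptions of scope and type.
SCOPE_LABELS = {"I": "Individual", "M": "Macrolanguage", "S": "Special"}
TYPE_LABELS = {
    "A": "Ancient",
    "C": "Constructed",
    "E": "Extinct",
    "H": "Historical",
    "L": "Living",
    "S": "Special",
}


def describe(scope, type_code):
    """Return a readable label such as 'Individual / Living'.

    Raises ValueError for a code that is not in the documented term lists.
    """
    if scope not in SCOPE_LABELS:
        raise ValueError(f"unknown scope code: {scope!r}")
    if type_code not in TYPE_LABELS:
        raise ValueError(f"unknown type code: {type_code!r}")
    return f"{SCOPE_LABELS[scope]} / {TYPE_LABELS[type_code]}"


# The most common values reported for the two columns are I and L.
most_common = describe("I", "L")
```

Rejecting unknown codes rather than passing them through makes it easier to notice if a later release of the table adds a term.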
Column: ref_name
| Parameter | Content |
|---|---|
| Column name | ref_name |
| Description | |
| Data type | String |
| Descriptor | Reference name [UID:LNGGE20.ISOAB47.RFRNC113] |
| Descriptor description | Name of the language in English from file iso639_2.csv. |
| Descriptor target IRI | https://app.pollinatorhub.eu/vocabulary/descriptors/LNGGE20.ISOAB47.RFRNC113 |
| Unit | n/a |
| Column Name | Range | Mean | Minimum | Q1 | Median | Q3 | Maximum | Total | Missing | Zero | Blank | Distinct |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| ref_name | 1 - 58 | n/a | E | n/a | n/a | n/a | Interlingua… | 7,916 | 0 ( 0.0% ) | 0 ( 0.0% ) | 0 ( 0.0% ) | 7,916 ( 100.0% ) |
| Column Name | Completeness | Uniqueness | Most Common Value | Least Common Value |
|---|---|---|---|---|
| ref_name | 100.00% | 100.00% | Ghotuo | Ghotuo |
Completeness
Uniqueness
Column: comment
| Parameter | Content |
|---|---|
| Column name | comment |
| Description | |
| Data type | String |
| Descriptor | Comment [UID:LNGGE20.ISOAB47.CMMNT114] |
| Descriptor description | Comment relating to one or more of the columns. |
| Descriptor target IRI | https://app.pollinatorhub.eu/vocabulary/descriptors/LNGGE20.ISOAB47.CMMNT114 |
| Unit | n/a |
| Column Name | Range | Mean | Minimum | Q1 | Median | Q3 | Maximum | Total | Missing | Zero | Blank | Distinct |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| comment | 4 - 42 | n/a | NULL | n/a | n/a | n/a | Code element… | 7,916 | 0 ( 0.0% ) | 0 ( 0.0% ) | 0 ( 0.0% ) | 2 ( 0.0% ) |
| Column Name | Completeness | Uniqueness | Most Common Value | Least Common Value |
|---|---|---|---|---|
| comment | 100.00% | 0.03% | NULL | Code element for 639-1 has been deprecated |
Data Distribution Top 20
Completeness
Uniqueness
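Several columns of this table report `NULL` as their most common value while showing zero missing cells, and their length ranges extend to 4 characters. This suggests that absent values are stored as the literal string `NULL`. A minimal sketch of normalising that token while reading the CSV follows. It assumes comma-separated files with a header row; the sample rows are illustrative and use only a subset of the columns, and `read_rows` is a hypothetical helper.

```python
import csv
import io

# Illustrative rows with the column names used in this report.
SAMPLE = """iso639_2_alpha3b,iso639_2_alpha3t,iso639_1_alpha2,scope,type,ref_name,comment
aar,aar,aa,I,L,Afar,NULL
NULL,NULL,NULL,I,L,Ghotuo,NULL
"""


def read_rows(handle, null_token="NULL"):
    """Yield one dict per CSV row, replacing the literal null token with None."""
    for row in csv.DictReader(handle):
        yield {
            key: (None if value == null_token else value)
            for key, value in row.items()
        }


rows = list(read_rows(io.StringIO(SAMPLE)))

# Count rows without a two-letter code once the token is treated as missing.
missing_alpha2 = sum(1 for row in rows if row["iso639_1_alpha2"] is None)
```

After this step, completeness figures computed on the normalised rows would be lower than the 100.00% shown above for the affected columns.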
Table: ISO 639:3 macrolanguages
Column: m_id
| Parameter | Content |
|---|---|
| Column name | m_id |
| Description | |
| Data type | String |
| Descriptor | Macrolanguage Identifier [UID:LNGGE20.SMCRL48.MCRLN115] |
| Descriptor description | The identifier for a macrolanguage in ISO 639 3-alpha format. |
| Descriptor target IRI | https://app.pollinatorhub.eu/vocabulary/descriptors/LNGGE20.SMCRL48.MCRLN115 |
| Unit | n/a |
| Column Name | Range | Mean | Minimum | Q1 | Median | Q3 | Maximum | Total | Missing | Zero | Blank | Distinct |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| m_id | 3 - 3 | n/a | aka | n/a | n/a | n/a | zza | 454 | 0 ( 0.0% ) | 0 ( 0.0% ) | 0 ( 0.0% ) | 62 ( 13.7% ) |
| Column Name | Completeness | Uniqueness | Most Common Value | Least Common Value |
|---|---|---|---|---|
| m_id | 100.00% | 13.66% | zap | syr |
Completeness
Uniqueness
Column: i_id
| Parameter | Content |
|---|---|
| Column name | i_id |
| Description | |
| Data type | String |
| Descriptor | Language Identifier [UID:LNGGE20.SMCRL48.LNGGD116] |
| Descriptor description | The identifier for an individual language that is a member of the macrolanguage, in ISO 639-3 three-letter format. |
| Descriptor target IRI | https://app.pollinatorhub.eu/vocabulary/descriptors/LNGGE20.SMCRL48.LNGGD116 |
| Unit | n/a |
| Column Name | Range | Mean | Minimum | Q1 | Median | Q3 | Maximum | Total | Missing | Zero | Blank | Distinct |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| i_id | 3 - 3 | n/a | aae | n/a | n/a | n/a | zzj | 454 | 0 ( 0.0% ) | 0 ( 0.0% ) | 0 ( 0.0% ) | 454 ( 100.0% ) |
| Column Name | Completeness | Uniqueness | Most Common Value | Least Common Value |
|---|---|---|---|---|
| i_id | 100.00% | 100.00% | aae | aae |
Completeness
Uniqueness
Column: i_status
| Parameter | Content |
|---|---|
| Column name | i_status |
| Description | |
| Data type | String |
| Descriptor | Status Code [UID:LNGGE20.SMCRL48.STTSC117] |
| Descriptor description | One of the following terms indicating the status of the individual code element (column i_id): A (active), R (retired). |
| Descriptor target IRI | https://app.pollinatorhub.eu/vocabulary/descriptors/LNGGE20.SMCRL48.STTSC117 |
| Unit | n/a |
| Column Name | Range | Mean | Minimum | Q1 | Median | Q3 | Maximum | Total | Missing | Zero | Blank | Distinct |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| i_status | 1 - 1 | n/a | A | n/a | n/a | n/a | R | 454 | 0 ( 0.0% ) | 0 ( 0.0% ) | 0 ( 0.0% ) | 2 ( 0.4% ) |
| Column Name | Completeness | Uniqueness | Most Common Value | Least Common Value |
|---|---|---|---|---|
| i_status | 100.00% | 0.44% | A | R |
Data Distribution Top 20
Completeness
Uniqueness
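The three columns of this table (`m_id`, `i_id`, `i_status`) describe a one-to-many mapping: 62 macrolanguage identifiers cover 454 member identifiers, and every `i_id` is distinct. A minimal sketch of turning such rows into lookups in both directions is given here. The sample rows are illustrative and not copied from the dataset (the last one uses deliberately invalid placeholder codes), and `active_members` is a hypothetical helper.

```python
from collections import defaultdict

# Illustrative rows in (m_id, i_id, i_status) order.
SAMPLE_ROWS = [
    ("zho", "cmn", "A"),
    ("zho", "yue", "A"),
    ("syr", "aii", "A"),
    ("m00", "i00", "R"),  # hypothetical retired member, placeholder codes
]


def active_members(rows):
    """Group active (status A) member codes under their macrolanguage code."""
    members = defaultdict(list)
    for m_id, i_id, i_status in rows:
        if i_status == "A":
            members[m_id].append(i_id)
    return dict(members)


members_by_macro = active_members(SAMPLE_ROWS)

# i_id is reported as 100% unique, so the reverse lookup can be a plain dict.
macro_of = {i_id: m_id for m_id, i_id, _status in SAMPLE_ROWS}
```

Whether retired members should be dropped or kept depends on the use case; the filter on `i_status` is one possible choice.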
Table: ISO 639:3 names
Column: id
| Parameter | Content |
|---|---|
| Column name | id |
| Description | |
| Data type | String |
| Descriptor | Identifier [UID:LNGGE20.SNMES49.DNTFR118] |
| Descriptor description | Identifier of the record. |
| Descriptor target IRI | https://app.pollinatorhub.eu/vocabulary/descriptors/LNGGE20.SNMES49.DNTFR118 |
| Unit | n/a |
| Column Name | Range | Mean | Minimum | Q1 | Median | Q3 | Maximum | Total | Missing | Zero | Blank | Distinct |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| id | 1 - 4 | 4,144.0 | 1 | 2,072 | 4,144 | 6,216 | 8,287 | 8,287 | 0 ( 0.0% ) | 0 ( 0.0% ) | 0 ( 0.0% ) | 8,287 ( 100.0% ) |
| Column Name | Completeness | Uniqueness | Most Common Value | Least Common Value |
|---|---|---|---|---|
| id | 100.00% | 100.00% | 1 | 1 |
Continuous Data Distribution
Outliers
Completeness
Uniqueness
Column: iso639_3_alpha
| Parameter | Content |
|---|---|
| Column name | iso639_3_alpha |
| Description | |
| Data type | String |
| Descriptor | ISO 639-2 3-Alpha Identifier [UID:LNGGE20.SNMES49.SLPHD119] |
| Descriptor description | The three-letter 639-3 identifier. |
| Descriptor target IRI | https://app.pollinatorhub.eu/vocabulary/descriptors/LNGGE20.SNMES49.SLPHD119 |
| Unit | n/a |
| Column Name | Range | Mean | Minimum | Q1 | Median | Q3 | Maximum | Total | Missing | Zero | Blank | Distinct |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| iso639_3_alpha | 3 - 3 | n/a | aaa | n/a | n/a | n/a | zzj | 8,287 | 0 ( 0.0% ) | 0 ( 0.0% ) | 0 ( 0.0% ) | 7,916 ( 95.5% ) |
| Column Name | Completeness | Uniqueness | Most Common Value | Least Common Value |
|---|---|---|---|---|
| iso639_3_alpha | 100.00% | 95.52% | zza | aaa |
Completeness
Uniqueness
Column: print_name
| Parameter | Content |
|---|---|
| Column name | print_name |
| Description | |
| Data type | String |
| Descriptor | Print Name [UID:LNGGE20.SNMES49.PRNTN120] |
| Descriptor description | One of the names associated with this identifier. |
| Descriptor target IRI | https://app.pollinatorhub.eu/vocabulary/descriptors/LNGGE20.SNMES49.PRNTN120 |
| Unit | n/a |
| Column Name | Range | Mean | Minimum | Q1 | Median | Q3 | Maximum | Total | Missing | Zero | Blank | Distinct |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| print_name | 1 - 58 | n/a | E | n/a | n/a | n/a | Interlingua… | 8,287 | 0 ( 0.0% ) | 0 ( 0.0% ) | 0 ( 0.0% ) | 8,286 ( 100.0% ) |
| Column Name | Completeness | Uniqueness | Most Common Value | Least Common Value |
|---|---|---|---|---|
| print_name | 100.00% | 99.99% | Wyandot | Ghotuo |
Completeness
Uniqueness
Column: inverted_name
| Parameter | Content |
|---|---|
| Column name | inverted_name |
| Description | |
| Data type | String |
| Descriptor | Inverted Name [UID:LNGGE20.SNMES49.NVRTD121] |
| Descriptor description | The inverted form of the Print Name. |
| Descriptor target IRI | https://app.pollinatorhub.eu/vocabulary/descriptors/LNGGE20.SNMES49.NVRTD121 |
| Unit | n/a |
| Column Name | Range | Mean | Minimum | Q1 | Median | Q3 | Maximum | Total | Missing | Zero | Blank | Distinct |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| inverted_name | 1 - 58 | n/a | E | n/a | n/a | n/a | Interlingua… | 8,287 | 0 ( 0.0% ) | 0 ( 0.0% ) | 0 ( 0.0% ) | 8,286 ( 100.0% ) |
| Column Name | Completeness | Uniqueness | Most Common Value | Least Common Value |
|---|---|---|---|---|
| inverted_name | 100.00% | 99.99% | Wyandot | Ghotuo |
Completeness
Uniqueness
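The names table has 8,287 rows for 7,916 distinct identifiers, so some identifiers carry more than one name, and `print_name` has 8,286 distinct values, so at least one name occurs twice. A minimal sketch of a name-to-identifier index that tolerates both cases is shown below. The sample rows are illustrative and not copied from the dataset, and `build_name_index` is a hypothetical helper.

```python
# Illustrative rows in (id, iso639_3_alpha, print_name, inverted_name) order.
SAMPLE_NAMES = [
    (1, "zai", "Isthmus Zapotec", "Zapotec, Isthmus"),
    (2, "zza", "Zaza", "Zaza"),
    (3, "zza", "Dimili", "Dimili"),
]


def build_name_index(rows):
    """Map each case-folded print or inverted name to the set of codes using it."""
    index = {}
    for _record_id, code, print_name, inverted_name in rows:
        for name in (print_name, inverted_name):
            index.setdefault(name.casefold(), set()).add(code)
    return index


name_index = build_name_index(SAMPLE_NAMES)

# Both name forms resolve to the same identifier.
by_print = name_index["isthmus zapotec"]
by_inverted = name_index["zapotec, isthmus"]
```

Storing a set per name, rather than a single code, keeps the index correct even when a name is shared by more than one record.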
Table: ISO 639:3 retirements
Column: id
| Parameter | Content |
|---|---|
| Column name | id |
| Description | |
| Data type | String |
| Descriptor | ISO 639-2 3-Alpha Identifier [UID:LNGGE20.SRTRM50.SLPHD122] |
| Descriptor description | The three-letter 639-3 identifier. |
| Descriptor target IRI | https://app.pollinatorhub.eu/vocabulary/descriptors/LNGGE20.SRTRM50.SLPHD122 |
| Unit | n/a |
| Column Name | Range | Mean | Minimum | Q1 | Median | Q3 | Maximum | Total | Missing | Zero | Blank | Distinct |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| id | 3 - 3 | n/a | aam | n/a | n/a | n/a | zua | 386 | 0 ( 0.0% ) | 0 ( 0.0% ) | 0 ( 0.0% ) | 386 ( 100.0% ) |
| Column Name | Completeness | Uniqueness | Most Common Value | Least Common Value |
|---|---|---|---|---|
| id | 100.00% | 100.00% | aam | aam |
Completeness
Uniqueness
Column: ref_name
| Parameter | Content |
|---|---|
| Column name | ref_name |
| Description | |
| Data type | String |
| Descriptor | iso-639:languageName [UID:0.0.LNGGN123] |
| Descriptor description | |
| Descriptor target IRI | https://app.pollinatorhub.eu/vocabulary/descriptors/0.0.LNGGN123 |
| Unit | n/a |
| Column Name | Range | Mean | Minimum | Q1 | Median | Q3 | Maximum | Total | Missing | Zero | Blank | Distinct |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| ref_name | 3 - 36 | n/a | Ahe | n/a | n/a | n/a | Borna (Democ… | 386 | 0 ( 0.0% ) | 0 ( 0.0% ) | 0 ( 0.0% ) | 386 ( 100.0% ) |
| Column Name | Completeness | Uniqueness | Most Common Value | Least Common Value |
|---|---|---|---|---|
| ref_name | 100.00% | 100.00% | Aramanik | Aramanik |
Completeness
Uniqueness
Column: ret_reason
| Parameter | Content |
|---|---|
| Column name | ret_reason |
| Description | |
| Data type | String |
| Descriptor | Retirement Reason [UID:LNGGE20.SRTRM50.RTRMN124] |
| Descriptor description | One of the following terms, containing the code for retirement: C (change), D (duplicate), N (non-existent), S (split), M (merge). |
| Descriptor target IRI | https://app.pollinatorhub.eu/vocabulary/descriptors/LNGGE20.SRTRM50.RTRMN124 |
| Unit | n/a |
| Column Name | Range | Mean | Minimum | Q1 | Median | Q3 | Maximum | Total | Missing | Zero | Blank | Distinct |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| ret_reason | 1 - 1 | n/a | C | n/a | n/a | n/a | S | 386 | 0 ( 0.0% ) | 0 ( 0.0% ) | 0 ( 0.0% ) | 5 ( 1.3% ) |
| Column Name | Completeness | Uniqueness | Most Common Value | Least Common Value |
|---|---|---|---|---|
| ret_reason | 100.00% | 1.30% | M | C |
Data Distribution Top 20
Completeness
Uniqueness
Column: change_to
| Parameter | Content |
|---|---|
| Column name | change_to |
| Description | |
| Data type | String |
| Descriptor | Change To [UID:LNGGE20.SRTRM50.CHNGT125] |
| Descriptor description | For ret_reason values C, D, and M, the identifier to which all instances of this id should be changed. |
| Descriptor target IRI | https://app.pollinatorhub.eu/vocabulary/descriptors/LNGGE20.SRTRM50.CHNGT125 |
| Unit | n/a |
| Column Name | Range | Mean | Minimum | Q1 | Median | Q3 | Maximum | Total | Missing | Zero | Blank | Distinct |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| change_to | 3 - 4 | n/a | aas | n/a | n/a | n/a | NULL | 386 | 0 ( 0.0% ) | 0 ( 0.0% ) | 0 ( 0.0% ) | 151 ( 39.1% ) |
| Column Name | Completeness | Uniqueness | Most Common Value | Least Common Value |
|---|---|---|---|---|
| change_to | 100.00% | 39.12% | NULL | aas |
Completeness
Uniqueness
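The `ret_reason` and `change_to` columns together say how a retired identifier should be updated: for reasons C, D and M there is a single replacement, while S and N have none. A minimal sketch of resolving an identifier through such records follows. It assumes that a replacement may itself have been retired later, which the report does not confirm. The codes in the sample are placeholders that are deliberately not valid ISO 639-3 identifiers, and `resolve` is a hypothetical helper.

```python
# Illustrative records: id -> (ret_reason, change_to); None stands for "NULL".
RETIREMENTS = {
    "x01": ("C", "x02"),
    "x02": ("M", "x03"),
    "x04": ("S", None),
    "x05": ("D", "x06"),
    "x06": ("D", "x05"),  # artificial loop to exercise the guard
}


def resolve(code, retirements):
    """Follow change_to links for reasons C, D and M until a non-retired code.

    Returns None when the code was retired without a single replacement
    (for example reasons S or N) or when the records form a loop.
    """
    seen = set()
    while code in retirements:
        if code in seen:
            return None
        seen.add(code)
        reason, change_to = retirements[code]
        if reason not in {"C", "D", "M"} or change_to is None:
            return None
        code = change_to
    return code


current = resolve("x01", RETIREMENTS)
```

Split identifiers (reason S) are left unresolved on purpose, because choosing among the successors needs the instructions in `ret_remedy` and cannot be automated from `change_to` alone.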
Column: ret_remedy
| Parameter | Content |
|---|---|
| Column name | ret_remedy |
| Description | |
| Data type | String |
| Descriptor | Retirement Remedy [UID:LNGGE20.SRTRM50.RTRMN126] |
| Descriptor description | The instructions for updating an instance of the retired (split) identifier. |
| Descriptor target IRI | https://app.pollinatorhub.eu/vocabulary/descriptors/LNGGE20.SRTRM50.RTRMN126 |
| Unit | n/a |
| Column Name | Range | Mean | Minimum | Q1 | Median | Q3 | Maximum | Total | Missing | Zero | Blank | Distinct |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| ret_remedy | 4 - 253 | n/a | NULL | n/a | n/a | n/a | Split into [… | 386 | 0 ( 0.0% ) | 0 ( 0.0% ) | 0 ( 0.0% ) | 105 ( 27.2% ) |
| Column Name | Completeness | Uniqueness | Most Common Value | Least Common Value |
|---|---|---|---|---|
| ret_remedy | 100.00% | 27.20% | NULL | Split into Pahanan Agta [apf] and Paranan [prf] (new identifier) |
Completeness
Uniqueness
Column: effective
| Parameter | Content |
|---|---|
| Column name | effective |
| Description | |
| Data type | Date |
| Descriptor | Effective Date [UID:LNGGE20.SRTRM50.FFCTV127] |
| Descriptor description | The date the retirement became effective. |
| Descriptor target IRI | https://app.pollinatorhub.eu/vocabulary/descriptors/LNGGE20.SRTRM50.FFCTV127 |
| Unit | n/a |
| Column Name | Range | Mean | Minimum | Q1 | Median | Q3 | Maximum | Total | Missing | Zero | Blank | Distinct |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| effective | 10 - 10 | 2,012.0 | 2007-02-01 | 2,008 | 2,012 | 2,016 | 2023-01-20 | 386 | 0 ( 0.0% ) | 0 ( 0.0% ) | 0 ( 0.0% ) | 28 ( 7.3% ) |
| Column Name | Completeness | Uniqueness | Most Common Value | Least Common Value |
|---|---|---|---|---|
| effective | 100.00% | 7.25% | 2008-01-14 | 2007-02-01 |