Guideline Directory
| General guidelines | Specific guidelines |
| Workflow | Data collection | Data preparation |
1. Preparing Data for Storage on the EU Pollinator Hub
The EU Pollinator Hub (EUPH) has implemented a set of Standard Operating Procedures (SOP) and Work Instructions (WI), which are mandatory for individuals operating within the EUPH Data Governance Framework. SOP and WI are, however, publicly accessible and data providers are free use them as a guideline, unless they have their own policies. The aim of these documents is to assure a high level of data quality during data preparation, before they are uploaded to the EUPH.
-
SOP 003 DR instructions Instructions for creating a Dataset Report (DR) hosted on the EU Pollinator Hub
-
SOP 006 Data preparation Instructions for the preparation of datasets for integration into the EU Pollinator Hub.
- WI 002 Raw data preparation Preparation of files of raw data for import into the preparation and profiling tool.
- WI 003 pMA Setup Setting up of the software application phpMyAdmin for manual preparation of datasets.
- WI 004 pMA Data Import Import of files into the database administration tool phpMyAdmin.
-
SOP 007 Data profiling Instructions for profiling of datasets hosted on the EU Pollinator Hub.
-
SOP 008 Data cleansing Procedure for the cleaning of data to be integrated on the EU Pollinator Hub.
-
SOP 018 Data exploration Procedure for data exploration on data hosted on the EU Pollinator Hub.
-
SOP 019 Data visualisation Procedure for visualising data on the EU Pollinator Hub.
Categories
Type of quality systems implemented
- not specified
Type of studies targeted
- not specified
Type of organisations targeted
- not specified
2. Using EU Pollinator Hub in-house software for data processing
The EU Pollinator Hub (EUPH) has developed a set of Python scripts for data processing, which are publicly accessible and data providers are free use them. The aim of these documents is to assure a high level of data quality during data preparation, before they are uploaded to the EUPH.
-
MergeCsv.py Assesses all column names that are being used in a list of selected CSV files and merges the content of all files in the correct column of a single output file.
-
UnpivotToCsv.py Unpivots (melt) a CSV file
-
zpYoNMKS24NKVR9dfVtJEyJVykyancVHQ6Lz6SJATransposeCsv.py Transposes columns and roes of a CSV file
-
ParseDates.py Parses all dates in a column of a CSV file
Categories
Type of quality systems implemented
- not specified
Type of studies targeted
- not specified
Type of organisations targeted
- not specified