Specific guidelines - Data preparation

Guideline Directory

| General guidelines | Specific guidelines |

| Workflow | Data collection | Data preparation |

1. Preparing Data for Storage on the EU Pollinator Hub

The EU Pollinator Hub (EUPH) has implemented a set of Standard Operating Procedures (SOP) and Work Instructions (WI), which are mandatory for individuals operating within the EUPH Data Governance Framework. SOP and WI are, however, publicly accessible and data providers are free use them as a guideline, unless they have their own policies. The aim of these documents is to assure a high level of data quality during data preparation, before they are uploaded to the EUPH.

  • SOP 003 DR instructions Instructions for creating a Dataset Report (DR) hosted on the EU Pollinator Hub

  • SOP 006 Data preparation Instructions for the preparation of datasets for integration into the EU Pollinator Hub.

    • WI 002 Raw data preparation Preparation of files of raw data for import into the preparation and profiling tool.
    • WI 003 pMA Setup Setting up of the software application phpMyAdmin for manual preparation of datasets.
    • WI 004 pMA Data Import Import of files into the database administration tool phpMyAdmin.
  • SOP 007 Data profiling Instructions for profiling of datasets hosted on the EU Pollinator Hub.

    • WI 005 pMA Profiling Profiling of datasets using phpMyAdmin.
    • WI 023 Data Profiling Using the data profiling tool on the EU Pollinator Hub.
  • SOP 008 Data cleansing Procedure for the cleaning of data to be integrated on the EU Pollinator Hub.

  • SOP 018 Data exploration Procedure for data exploration on data hosted on the EU Pollinator Hub.

  • SOP 019 Data visualisation Procedure for visualising data on the EU Pollinator Hub.

Categories

Type of quality systems implemented

  • not specified

Type of studies targeted

  • not specified

Type of organisations targeted

  • not specified

2. Using EU Pollinator Hub in-house software for data processing

The EU Pollinator Hub (EUPH) has developed a set of Python scripts for data processing, which are publicly accessible and data providers are free use them. The aim of these documents is to assure a high level of data quality during data preparation, before they are uploaded to the EUPH.

Categories

Type of quality systems implemented

  • not specified

Type of studies targeted

  • not specified

Type of organisations targeted

  • not specified