BY-COVID - WP5 - Baseline Use Case: SARS-CoV-2 vaccine effectiveness

Data Quality Assessment (DQA)

Overview

This section provides an overview of the imported dataset. Dataset statistics, variable types, a missing data profile and potential alerts are shown below.

Discrete variables: 37
Continuous variables: 2
All-missing variables: 3
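The three counts above add up to the 42 variables listed under Variables (37 + 2 + 3). The report does not state how the profiling tool assigns a variable to each group, so the following is only a minimal sketch under an assumed rule: a column with no observed values is "all missing", a column whose observed values are all numeric is "continuous", and everything else (character, logical, Date) is "discrete". The function names and the toy values are hypothetical.

```python
from datetime import date

def classify_column(values):
    """Classify one column as 'all_missing', 'continuous' or 'discrete' (assumed rule)."""
    present = [v for v in values if v is not None]
    if not present:
        return "all_missing"
    # bool is a subclass of int in Python, so logical values are excluded explicitly
    if all(isinstance(v, (int, float)) and not isinstance(v, bool) for v in present):
        return "continuous"
    return "discrete"

def summarise_types(columns):
    """Count columns per group for a dict mapping column name -> list of values."""
    counts = {"discrete": 0, "continuous": 0, "all_missing": 0}
    for values in columns.values():
        counts[classify_column(values)] += 1
    return counts

# Toy data only; column names are taken from the report, values are made up.
toy_types = {
    "age_nm": [34, 71, None],
    "sex_cd": ["1", "2", "1"],
    "foreign_bl": [False, True, False],
    "dose_1_dt": [date(2021, 3, 1), None, date(2021, 4, 12)],
    "variant_cd": [None, None, None],
}
type_counts = summarise_types(toy_types)
```

Under this assumed rule, the integer variables age_nm and doses_nm would be the two continuous variables, and exitus_dt, essential_worker_bl and variant_cd the three all-missing ones, which matches the counts reported above.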


Constant: country_cd has constant value ESP
Constant: exitus_bl has constant value FALSE
Missing: exitus_dt has 10000 (100%) missing values
Missing: essential_worker_bl has 10000 (100%) missing values
Missing: confirmed_case_dt has 6659 (66.6%) missing values
Missing: previous_infection_dt has 9750 (97.5%) missing values
Missing: test_type_cd has 6686 (66.9%) missing values
Missing: variant_cd has 10000 (100%) missing values
Unique: person_id has all unique values (number of duplicate values: 0)
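The alerts above fall into three types: Constant, Missing and Unique. As an illustration of how such alerts could be derived, here is a minimal sketch that is not the tool's actual implementation. The function name `profile_alerts` and the `missing_threshold` of 0.5 are assumptions; the report does not say at which share of missing values a variable is flagged.

```python
def profile_alerts(columns, missing_threshold=0.5):
    """Return (variable, message, alert_type) tuples.

    `columns` maps a column name to a list of values, with None for missing.
    `missing_threshold` is a hypothetical cut-off, not taken from the report.
    """
    alerts = []
    for name, values in columns.items():
        n = len(values)
        present = [v for v in values if v is not None]
        n_missing = n - len(present)
        distinct = set(present)

        # Missing: share of missing values at or above the assumed threshold
        if n and n_missing / n >= missing_threshold:
            pct = round(100 * n_missing / n, 1)
            alerts.append((name, f"has {n_missing} ({pct:g}%) missing values", "Missing"))

        # Constant: exactly one distinct observed value
        if len(distinct) == 1:
            value = next(iter(distinct))
            alerts.append((name, f"has constant value {value}", "Constant"))

        # Unique: no missing values and no repeated values
        if n > 1 and n_missing == 0 and len(distinct) == n:
            alerts.append((name, "has all unique values", "Unique"))
    return alerts

# Toy data only; column names are taken from the report, values are made up.
toy = {
    "person_id": ["a", "b", "c", "d"],
    "country_cd": ["ESP"] * 4,
    "variant_cd": [None] * 4,
    "test_type_cd": ["PCR", None, None, "AG"],
}
alerts = profile_alerts(toy)
for variable, message, alert_type in alerts:
    print(f"{alert_type}: {variable} {message}")
```

An all-missing column such as variant_cd raises only a Missing alert here, because the Constant check requires at least one observed value.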

Variables

This section provides more detailed information per variable in the imported dataset.

person_id

Class of the variable: character

More than 100 distinct values

age_nm

Class of the variable: integer

More than 100 distinct values

sex_cd

Class of the variable: character

socecon_lvl_cd

Class of the variable: character

residence_area_cd

Class of the variable: character

country_cd

Class of the variable: character

foreign_bl

Class of the variable: logical

exitus_dt

Class of the variable: Date

exitus_bl

Class of the variable: logical

essential_worker_bl

Class of the variable: logical

institutionalized_bl

Class of the variable: logical

dose_1_brand_cd

Class of the variable: character

dose_1_dt

Class of the variable: Date

More than 100 distinct values

dose_2_brand_cd

Class of the variable: character

dose_2_dt

Class of the variable: Date

More than 100 distinct values

dose_3_brand_cd

Class of the variable: character

dose_3_dt

Class of the variable: Date

More than 100 distinct values

doses_nm

Class of the variable: integer

fully_vaccinated_dt

Class of the variable: Date

More than 100 distinct values

fully_vaccinated_bl

Class of the variable: logical

vaccination_schedule_cd

Class of the variable: character

confirmed_case_dt

Class of the variable: Date

More than 100 distinct values

confirmed_case_bl

Class of the variable: logical

previous_infection_dt

Class of the variable: Date

More than 100 distinct values

previous_infection_bl

Class of the variable: logical

test_type_cd

Class of the variable: character

variant_cd

Class of the variable: character

diabetes_bl

Class of the variable: logical

obesity_bl

Class of the variable: logical

heart_failure_bl

Class of the variable: logical

copd_bl

Class of the variable: logical

solid_tumor_without_metastasis_bl

Class of the variable: logical

chronic_kidney_disease_bl

Class of the variable: logical

sickle_cell_disease_bl

Class of the variable: logical

hypertension_bl

Class of the variable: logical

chronic_liver_disease_bl

Class of the variable: logical

blood_cancer_bl

Class of the variable: logical

transplanted_bl

Class of the variable: logical

hiv_infection_bl

Class of the variable: logical

primary_immunodeficiency_bl

Class of the variable: logical

immunosuppression_bl

Class of the variable: logical

pregnancy_bl

Class of the variable: logical