Using Statistical Techniques and Replication Samples for Missing Values Imputation with an Application on Metabolomics

Akram Yazdani; Azam Yazdani

Using Statistical Techniques and Replication Samples for Missing Values Imputation with an Application on Metabolomics

Abstract

Akram Yazdani and Azam Yazdani

Background: Data preparation, such as missing values imputation and transformation, is the first step in any data analysis and requires crucial attention. We take advantage of availability of replication samples to identify the empirical distribution of missing values through utilization of statistical techniques. We apply these techniques to metabolomics data for imputation. Results: Using replication samples, we obtained the empirical distribution of missing values. After application of the techniques on metabolites, we observed that the rate of missing values is approximately distributed uniformly across metabolite range. Therefore, the missing values cannot be imputed with the lowest values. To have a realistic simulation, we designed a simulation study based on empirical distribution of missing values to find an optimal imputation approach. Our findings validated the optimal approach introduced previously for metabolomics. Conclusions: Our analysis utilized replication samples as a new approach to metabolite imputation and found empirical distribution of missing values, designed a simulation study close to reality, and compared different approaches for selecting an optimal imputation approach. The result of this study validated the optimal approach for metabolite imputation through a different data set and different approach, and the aim was to encourage researchers to pay more attention to metabolite imputation since imputing metabolomic missing values with lowest value is going to be a common approach, for example in genomic-metabolomic data analysis.

Avertissement: Ce résumé a été traduit à l'aide d'outils d'intelligence artificielle et n'a pas encore été examiné ni vérifié

Partagez cet article

Faits saillants de la revue

Indexé dans

Index Copernic
Google Scholar
Sherpa Roméo
Base de données des revues académiques
Ouvrir la porte J
JournalSeek de génamique
Clés académiques
JournalTOC
RechercheBible
Infrastructure nationale du savoir de Chine (CNKI)
Annuaire des périodiques d'Ulrich
Accès à la recherche mondiale en ligne sur l'agriculture (AGORA)
Bibliothèque de revues électroniques
Recherche de référence
Université Hamdard
EBSCO AZ
Répertoire d’indexation des résumés pour les revues
OCLC-WorldCat
Catalogue en ligne SWB
Bibliothèque virtuelle de biologie (vifabio)
Publons
Euro Pub

Journal de biométrie et biostatistique