Using Simple Statistics to Compare Genetic Sequences

Authors

  • Hossam Farag Abou-Shaara, Department of Plant Protection, Faculty of Agriculture, Damanhour University, Damanhour, Egypt

Keywords:

Parametric, non-parametric, significant, bioinformatics, phylogenetic

Abstract

The aim of this study was to compare genetic sequences using simple statistics. First, the sequences of four viruses were downloaded from the National Center for Biotechnology Information (NCBI). These sequences were then arranged in Excel sheets as numbers and subjected to statistical analysis using parametric and non-parametric tests. The results were compared with those of phylogenetic analysis and gene cluster analysis for the same viruses. The results of the statistical analysis, from ANOVA and the Kruskal-Wallis test, were similar to the phylogenetic relationships and shared gene clusters. Simple statistics, using either parametric or non-parametric tests, made it possible to obtain additional information from the sequences. The results of this study could help software developers and bioinformatics specialists develop simple analytical methods for extracting information from sequences.
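As a rough illustration of the workflow described in the abstract, the sketch below encodes nucleotide strings as numbers and computes a one-way ANOVA F statistic and a Kruskal-Wallis H statistic across them. It is a minimal sketch under stated assumptions, not the author's procedure: the numeric coding (A=1, C=2, G=3, T=4), the function names, and the toy sequences are all hypothetical, since the abstract does not specify the coding used and the study itself worked in Excel on sequences downloaded from NCBI.

```python
from collections import defaultdict

# Hypothetical numeric coding; the abstract does not state which mapping was used.
CODE = {"A": 1, "C": 2, "G": 3, "T": 4}


def encode(seq):
    """Convert a nucleotide string to numbers, skipping symbols not in CODE."""
    return [CODE[base] for base in seq.upper() if base in CODE]


def one_way_anova_f(groups):
    """Return the one-way ANOVA F statistic for a list of numeric groups.

    Raises ZeroDivisionError if there is no within-group variation.
    """
    n_total = sum(len(g) for g in groups)
    grand_mean = sum(sum(g) for g in groups) / n_total
    means = [sum(g) / len(g) for g in groups]
    ss_between = sum(len(g) * (m - grand_mean) ** 2 for g, m in zip(groups, means))
    ss_within = sum((x - m) ** 2 for g, m in zip(groups, means) for x in g)
    df_between = len(groups) - 1
    df_within = n_total - len(groups)
    return (ss_between / df_between) / (ss_within / df_within)


def kruskal_wallis_h(groups):
    """Return the tie-corrected Kruskal-Wallis H statistic.

    Raises ZeroDivisionError if every pooled value is identical.
    """
    pooled = sorted(x for g in groups for x in g)
    n = len(pooled)
    # Tied values share the mean of the ranks they occupy.
    positions = defaultdict(list)
    for rank, value in enumerate(pooled, start=1):
        positions[value].append(rank)
    avg_rank = {v: sum(r) / len(r) for v, r in positions.items()}
    rank_term = sum(sum(avg_rank[x] for x in g) ** 2 / len(g) for g in groups)
    h = 12.0 * rank_term / (n * (n + 1)) - 3 * (n + 1)
    ties = sum(len(r) ** 3 - len(r) for r in positions.values())
    return h / (1 - ties / (n ** 3 - n))


if __name__ == "__main__":
    # Made-up toy strings for illustration only, not real viral sequences.
    toy = {"seq1": "ACGTACGTAC", "seq2": "ACGTACGTTT", "seq3": "GGGTTTGGTT"}
    encoded = [encode(s) for s in toy.values()]
    print("ANOVA F:", one_way_anova_f(encoded))
    print("Kruskal-Wallis H:", kruskal_wallis_h(encoded))
```

Nucleotides are categorical, so any numeric coding imposes an arbitrary order on them, and both statistics depend on that choice. Turning F or H into a p-value additionally requires the F or chi-square distribution, which a statistics package would normally supply.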

Published

2020-06-30

How to Cite

Abou-Shaara, H. F. (2020). Using Simple Statistics to Compare Genetic Sequences. Thailand Statistician, 18(3), 373–380. Retrieved from https://ph02.tci-thaijo.org/index.php/thaistat/article/view/241288

Section

Articles