Patterns in Genomic Variation of SARS-CoV-2

Student

Won Joon Choi

Advisor

Daniel Boley

Abstract

COVID-19 has changed the way people live in every part of the world and has harmed many aspects of society, especially the global economy. The virus that causes it, severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), is a member of the Coronaviridae family\cite{10.1093/ve/vey035}. It is highly contagious in humans and animals and can cause symptoms such as fever, chills, cough, and sore throat; severe cases can be fatal. Because the virus spreads easily and mutates quickly into new variants, it is difficult for researchers to track it and prevent further harm.

SARS-CoV-2 consists of a single strand of RNA bound by protein, and coronaviruses have among the largest genomes of any RNA virus, about 30,000 nucleotides in length\cite{dormitzer}. Middle East respiratory syndrome coronavirus (MERS-CoV), SARS-CoV, porcine reproductive and respiratory syndrome virus (PRRSV), and others are related to SARS-CoV-2. The focus of this project is to find patterns among the genomes of the Coronaviridae family using a clustering algorithm. Such an analysis should give insight into the patterns shared by existing and newly emerging coronaviruses. Genomes of SARS-CoV-2 alone will also be analyzed to find patterns among the variants monitored by researchers.