CpG-creating Mutations are Costly in Many Human Viruses
Caudill V., Qin S., Winstead R., Kaur J., Tisthammer K., Pineda G., Carja O., Eggo R., Koelle K., Lythgoe K., Roy S., Allen N., Aviles M., Baker B., Bauer W., Bermudez S., Carlson C., Catalan F., Chemel AK., Evans D., Fiutek N., Fryer E., Goodfellow SM., Hecht M., Hopp K., Hopson D., Jaberi A., Kinney C., Lao D., Le A., Lo J., Lopez A., López A., Lorenzo F., Luu G., Mahoney A., Melton R., Nascimento GD., Pradhananga A., Rodrigues N., Shieh A., Sims J., Singh R., Sulaeman H., Thu R., Tran K., Tran L., Winters E., Wong A., Pennings P.
Abstract Mutations can occur throughout the virus genome and may be beneficial or deleterious. We are interested in mutations that yield a C next to a G, producing CpG sites. CpG sites are rare in eukaryotic and viral genomes. For the eukaryotes, it is thought that CpG sites are rare because they are prone to mutation when methylated. In viruses, we know less about why CpG sites are rare. A previous study in HIV suggested that CpG-creating transition mutations are more costly that similar non-CpG-creating mutations. To determine if this is the case in other viruses, we analyzed the allele frequencies of CpG-creating and non-CpG-creating mutations across various strains, subtypes, and genes of viruses using existing data obtained from Genbank, HIV Databases, and Virus Pathogen Resource. Our results suggest that CpG sites are costly for most viruses. By understanding the cost of CpG sites, we can obtain further insights into the evolution and adaptation of viruses.