MotivationEstimation of admixture proportions and principal component analysis (PCA) are fundamental tools in populations genetics. However, applying these methods to low- or mid-depth sequencing data without taking genotype uncertainty into account can introduce biases.ResultsHere we present fastNGSadmix, a tool to fast and reliably estimate admixture proportions and perform PCA from next generation sequencing data of a single individual. The analyses are based on genotype likelihoods of the input sample and a set of predefined reference populations. The method has high accuracy, even at low sequencing depth and corrects for the biases introduced by small reference populations.Availability and implementationThe admixture estimation method is implemented in C ++ and the PCA method is implemented in R. The code is freely available at http://www.popgen.dk/software/index.php/FastNGSadmix.Contactemil.jorsboe@bio.ku.dk.Supplementary informationSupplementary data are available at Bioinformatics online.
Journal article
2017-10-01T00:00:00+00:00
33
3148 - 3150
2
Department of Biology, The Bioinformatics Centre, University of Copenhagen, 2200 Copenhagen N, Denmark.
Humans, Probability, Genetics, Population, Genotype, Principal Component Analysis, Software, High-Throughput Nucleotide Sequencing