fastNGSadmix: admixture proportions and principal component analysis of a single NGS sample.

Jørsboe E., Hanghøj K., Albrechtsen A.

Motivation: Estimation of admixture proportions and principal component analysis (PCA) are fundamental tools in population genetics. However, applying these methods to low- or mid-depth sequencing data without taking genotype uncertainty into account can introduce biases.

Results: Here we present fastNGSadmix, a tool to quickly and reliably estimate admixture proportions and perform PCA from next-generation sequencing data of a single individual. The analyses are based on genotype likelihoods of the input sample and a set of predefined reference populations. The method has high accuracy, even at low sequencing depth, and corrects for the biases introduced by small reference populations.

Availability and implementation: The admixture estimation method is implemented in C++ and the PCA method is implemented in R. The code is freely available at http://www.popgen.dk/software/index.php/FastNGSadmix.

Contact: emil.jorsboe@bio.ku.dk

Supplementary information: Supplementary data are available at Bioinformatics online.
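To illustrate what estimating admixture proportions from genotype likelihoods and fixed reference populations can look like, here is a minimal Python sketch. It is not the authors' C++ implementation. It assumes a textbook EM update for a single sample whose per-site allele frequency is a mixture of known reference-population frequencies, with Hardy-Weinberg genotype priors. It omits the correction for small reference populations mentioned in the abstract. The names `hw_prior`, `estimate_admixture`, `gls` and `ref_freqs` are hypothetical.

```python
def hw_prior(h):
    """Hardy-Weinberg probabilities of carrying 0, 1 or 2 copies of an allele
    whose frequency is h."""
    return ((1.0 - h) ** 2, 2.0 * h * (1.0 - h), h ** 2)


def estimate_admixture(gls, ref_freqs, n_iter=200, eps=1e-6):
    """EM estimate of admixture proportions for one sample (illustrative only).

    gls:       gls[j] = (L0, L1, L2), genotype likelihoods at site j
               (at least one value per site must be positive)
    ref_freqs: ref_freqs[j][k] = allele frequency at site j in reference
               population k, treated as known and fixed (an assumption)
    """
    n_pops = len(ref_freqs[0])
    q = [1.0 / n_pops] * n_pops  # start from equal proportions
    for _ in range(n_iter):
        new_q = [0.0] * n_pops
        for gl, freqs in zip(gls, ref_freqs):
            # Keep frequencies away from 0 and 1 to avoid division by zero.
            f = [min(max(x, eps), 1.0 - eps) for x in freqs]
            # Allele frequency of the sample implied by the current q.
            h = sum(qk * fk for qk, fk in zip(q, f))
            # Posterior over genotypes: likelihood times Hardy-Weinberg prior.
            post = [l * p for l, p in zip(gl, hw_prior(h))]
            exp_g = (post[1] + 2.0 * post[2]) / sum(post)
            # Share the expected allele counts among the populations.
            for k in range(n_pops):
                new_q[k] += (exp_g * q[k] * f[k] / h
                             + (2.0 - exp_g) * q[k] * (1.0 - f[k]) / (1.0 - h))
        # Each site contributes two alleles, so divide by 2 * number of sites.
        q = [x / (2.0 * len(gls)) for x in new_q]
    return q


# Toy input: three sites, two hypothetical reference populations.
toy_gls = [(0.0, 0.0, 1.0), (0.0, 0.0, 1.0), (1.0, 0.0, 0.0)]
toy_ref = [(0.9, 0.1), (0.8, 0.2), (0.1, 0.9)]
print(estimate_admixture(toy_gls, toy_ref))
```

Working with the full triple of genotype likelihoods rather than a single called genotype is what lets low-depth sites contribute in proportion to how informative they are.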

DOI

10.1093/bioinformatics/btx474

Type

Journal article

Publication Date

2017-10-01

Pages

3148 - 3150

Addresses

Department of Biology, The Bioinformatics Centre, University of Copenhagen, 2200 Copenhagen N, Denmark.

Keywords

Humans; Probability; Genetics, Population; Genotype; Principal Component Analysis; Software; High-Throughput Nucleotide Sequencing
