fastNGSadmix: admixture proportions and principal component analysis of a single NGS sample.

Jørsboe E.; Hanghøj K.; Albrechtsen A.

fastNGSadmix: admixture proportions and principal component analysis of a single NGS sample.

Jørsboe E., Hanghøj K., Albrechtsen A.

MOTIVATION: Estimation of admixture proportions and principal component analysis (PCA) are fundamental tools in populations genetics. However, applying these methods to low- or mid-depth sequencing data without taking genotype uncertainty into account can introduce biases. RESULTS: Here we present fastNGSadmix, a tool to fast and reliably estimate admixture proportions and perform PCA from next generation sequencing data of a single individual. The analyses are based on genotype likelihoods of the input sample and a set of predefined reference populations. The method has high accuracy, even at low sequencing depth and corrects for the biases introduced by small reference populations. AVAILABILITY AND IMPLEMENTATION: The admixture estimation method is implemented in C ++ and the PCA method is implemented in R. The code is freely available at http://www.popgen.dk/software/index.php/FastNGSadmix. CONTACT: emil.jorsboe@bio.ku.dk. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Original publication

DOI

10.1093/bioinformatics/btx474

Type

Journal article

Journal

Bioinformatics

Publication Date

01/10/2017

Volume

Pages

3148 - 3150

Keywords

Genetics, Population, Genotype, High-Throughput Nucleotide Sequencing, Humans, Principal Component Analysis, Probability, Software

Cookies on this website

fastNGSadmix: admixture proportions and principal component analysis of a single NGS sample.

Jørsboe E., Hanghøj K., Albrechtsen A.

DOI

Type

Journal

Publication Date

Volume

Pages

Keywords