Cookies on this website

We use cookies to ensure that we give you the best experience on our website. If you click 'Accept all cookies' we'll assume that you are happy to receive all cookies and you won't see this message again. If you click 'Reject all non-essential cookies' only necessary cookies providing core functionality such as security, network management, and accessibility will be enabled. Click 'Find out more' for information on how to change your cookie settings.

MotivationEstimation of admixture proportions and principal component analysis (PCA) are fundamental tools in populations genetics. However, applying these methods to low- or mid-depth sequencing data without taking genotype uncertainty into account can introduce biases.ResultsHere we present fastNGSadmix, a tool to fast and reliably estimate admixture proportions and perform PCA from next generation sequencing data of a single individual. The analyses are based on genotype likelihoods of the input sample and a set of predefined reference populations. The method has high accuracy, even at low sequencing depth and corrects for the biases introduced by small reference populations.Availability and implementationThe admixture estimation method is implemented in C ++ and the PCA method is implemented in R. The code is freely available at http://www.popgen.dk/software/index.php/FastNGSadmix.Contactemil.jorsboe@bio.ku.dk.Supplementary informationSupplementary data are available at Bioinformatics online.

More information Original publication

DOI

10.1093/bioinformatics/btx474

Type

Journal article

Publication Date

2017-10-01T00:00:00+00:00

Volume

33

Pages

3148 - 3150

Total pages

2

Addresses

D, e, p, a, r, t, m, e, n, t, , o, f, , B, i, o, l, o, g, y, ,, , T, h, e, , B, i, o, i, n, f, o, r, m, a, t, i, c, s, , C, e, n, t, r, e, ,, , U, n, i, v, e, r, s, i, t, y, , o, f, , C, o, p, e, n, h, a, g, e, n, ,, , 2, 2, 0, 0, , C, o, p, e, n, h, a, g, e, n, , N, ,, , D, e, n, m, a, r, k, .

Keywords

Humans, Probability, Genetics, Population, Genotype, Principal Component Analysis, Software, High-Throughput Nucleotide Sequencing