Methods for haplotype construction and their applications

Posted on:2009-01-03

Degree:Ph.D

Type:Dissertation

University:University of California, Los Angeles

Candidate:Ayers, Kristin Lynn

Full Text:PDF

GTID:1444390005954125

Subject:Biostatistics

Abstract/Summary:

PDF Full Text Request

Haplotypes are frequently used in association testing and can improve the power to detect a disease locus. The EM algorithm is a widely used method for haplotype frequency estimation in short regions showing linkage disequilibrium. The optimal size of these regions, referred to as a block or window, has come into question when imputing maternal and paternal haplotypes. We propose two methods to improve haplotype imputation. Chapters 2 and 3 describe a dictionary model for haplotyping and its applications. According to the model, a haplotype is constructed by randomly concatenating haplotype segments from a given dictionary of haplotype segments. The dictionary model produces a parsimonious list of overlapping haplotype segments, which may parallel what remains from full length ancestral haplotypes after recombination and mutation have broken them into smaller fragments. Likelihood evaluations rely on forward and backward recurrences similar to the ones encountered in hidden Markov models. Parameter estimation is carried out with the EM algorithm.;These estimated haplotype segments in the dictionary may be used to haplotype (or phase) individuals and estimate missing genotypes using an MCMC method. The true pair of haplotypes corresponding to a person's multimarker genotype is reconstructed using a Markov chain that visits haplotype pairs according to their posterior probabilities. The dictionary model yields expected counts of conserved haplotype segments, which can be used as genetic predictors in association testing.;Chapter 4 proposes a diversity penalty for the frequently used EM algorithm for haplotype frequency estimation. The standard EM algorithm for haplotype frequency estimation can accommodate the penalty if one passes over to a more general MM (minorize-maximize) scheme for estimation. Our MM algorithm can improve haplotype frequency estimation, haplotyping, and missing data imputation by enforcing parsimony in estimation of haplotype frequencies. The penalty automatically and quickly discards potential haplotypes with low explanatory power. Our new MM algorithm converges in fewer iterations, dramatically reduces the computational complexity of each iteration, and eliminates marginal haplotypes from further consideration. Imposition of the diversity penalty shows large decreases in computation times compared to naive application of the EM algorithm with modest improvement in haplotyping and genotype imputation.

Keywords/Search Tags:

Haplotype, EM algorithm, Improve, Used

PDF Full Text Request

Related items

1	Of DNA Pools Methods Estimated Multilocus Haplotype Frequencies
2	Association Study Of GRIK4 Gene With Schizophrenia And Improvement Of EM Algorithm For Haplotype Inference
3	Application Of Cluster Analysis Algorithm In Thalassemia Disease
4	Haplotype-based Association Studies
5	Pharmacogenetic-guided Algorithm To Improve Warfarin Stable Dose In The Elderly Han-Chinese Population
6	Research Of The Iterative Algorithm To Improve The Quality Of Abdomen Images By DSCT Low Dose Scan For Patients With Different BMI
7	Initial Studies And Realization Based On SNPs Of Schizophrenia
8	Surfactant Protein A2 Haplotype Associated With Respiratory Distress Syndrome In Pretature Infant
9	Study Of Genetic Polymorphism Of Peroxisome Proliferator-activated Receptors And Haplotype On Body Mass Index And Waist Circumference
10	A Study On Vitro Cytotoxicity Mediated By NK Cells With Different KIR Haplotype Groups