A genome-wide tree- and forest-based association analysis of comorbidity of alcoholism and smoking. (17/6908)

Genetic mechanisms underlying alcoholism are complex. Understanding the etiology of alcohol dependence and its comorbid conditions such as smoking is important because of the significant health concerns. In this report, we describe a method based on classification trees and deterministic forests for association studies to perform a genome-wide joint association analysis of alcoholism and smoking. This approach is used to analyze the single-nucleotide polymorphism data from the Collaborative Study on the Genetics of Alcoholism in the Genetic Analysis Workshop 14. Our analysis reaffirmed the importance of sex difference in alcoholism. Our analysis also identified genes that were reported in other studies of alcoholism and identified new genes or single-nucleotide polymorphisms that can be useful candidates for future studies.  (+info)

Standard linkage and association methods identify the mechanism of four susceptibility genes for a simulated complex disease. (18/6908)

The simulated dataset of the Genetic Analysis Workshop 14 provided affection status and the presence or absence of 12 traits. It was determined that all affected individuals must have traits E, F and H (EFH phenotype) and they must also have either trait B (B subtype) or traits C, D, and G (CDG subtype). A genome screen was performed, and linkage peaks were identified on chromosomes 1, 3, 5, and 9 using microsatellite markers. Dense panels of single-nucleotide polymorphism (SNP) markers were ordered for each of the four linkage peaks. In each case, association analyses identified a single SNP that accounted for the linkage evidence. The SNP on chromosome 1 appeared to primarily influence the B subtype, while the SNPs on chromosomes 5 and 9 primarily influenced the CDG subtype. The chromosome 3 SNP had the strongest effect and influenced both subtypes, as well as the requisite EFH phenotype. Recognizing the two subtypes prior to linkage analysis was key to identifying these loci using only a single replicate. This highlights the need in real life situations for careful examination of the phenotypic data prior to genetic analysis.  (+info)

Multifactor-dimensionality reduction versus family-based association tests in detecting susceptibility loci in discordant sib-pair studies. (19/6908)

Complex diseases are generally thought to be under the influence of multiple, and possibly interacting, genes. Many association methods have been developed to identify susceptibility genes assuming a single-gene disease model, referred to as single-locus methods. Multilocus methods consider joint effects of multiple genes and environmental factors. One commonly used method for family-based association analysis is implemented in FBAT. The multifactor-dimensionality reduction method (MDR) is a multilocus method, which identifies multiple genetic loci associated with the occurrence of complex disease. Many studies of late onset complex diseases employ a discordant sib pairs design. We compared the FBAT and MDR in their ability to detect susceptibility loci using a discordant sib-pair dataset generated from the simulated data made available to participants in the Genetic Analysis Workshop 14. Using FBAT, we were able to identify the effect of one susceptibility locus. However, the finding was not statistically significant. We were not able to detect any of the interactions using this method. This is probably because the FBAT test is designed to find loci with major effects, not interactions. Using MDR, the best result we obtained identified two interactions. However, neither of these reached a level of statistical significance. This is mainly due to the heterogeneity of the disease trait and noise in the data.  (+info)

Identifying susceptibility genes by using joint tests of association and linkage and accounting for epistasis. (20/6908)

Simulated Genetic Analysis Workshop 14 data were analyzed by jointly testing linkage and association and by accounting for epistasis using a candidate gene approach. Our group was unblinded to the "answers." The 48 single-nucleotide polymorphisms (SNPs) within the six disease loci were analyzed in addition to five SNPs from each of two non-disease-related loci. Affected sib-parent data was extracted from the first 10 replicates for populations Aipotu, Kaarangar, and Danacaa, and analyzed separately for each replicate. We developed a likelihood for testing association and/or linkage using data from affected sib pairs and their parents. Identical-by-descent (IBD) allele sharing between sibs was explicitly modeled using a conditional logistic regression approach and incorporating a covariate that represents expected IBD allele sharing given the genotypes of the sibs and their parents. Interactions were accounted for by performing likelihood ratio tests in stages determined by the highest order interaction term in the model. In the first stage, main effects were tested independently, and in subsequent stages, multilocus effects were tested conditional on significant marginal effects. A reduction in the number of tests performed was achieved by prescreening gene combinations with a goodness-of-fit chi square statistic that depended on mating-type frequencies. SNP-specific joint effects of linkage and association were identified for loci D1, D2, D3, and D4 in multiple replicates. The strongest effect was for SNP B03T3056, which had a median p-value of 1.98 x 10(-34). No two- or three-locus effects were found in more than one replicate.  (+info)

The effect of missing data on linkage disequilibrium mapping and haplotype association analysis in the GAW14 simulated datasets. (21/6908)

We used our newly developed linkage disequilibrium (LD) plotting software, JLIN, to plot linkage disequilibrium between pairs of single-nucleotide polymorphisms (SNPs) for three chromosomes of the Genetic Analysis Workshop 14 Aipotu simulated population to assess the effect of missing data on LD calculations. Our haplotype analysis program, SIMHAP, was used to assess the effect of missing data on haplotype-phenotype association. Genotype data was removed at random, at levels of 1%, 5%, and 10%, and the LD calculations and haplotype association results for these levels of missingness were compared to those for the complete dataset. It was concluded that ignoring individuals with missing data substantially affects the number of regions of LD detected which, in turn, could affect tagging SNPs chosen to generate haplotypes.  (+info)

Linkage analysis of alcoholism-related electrophysiological phenotypes: genome scans with microsatellites compared to single-nucleotide polymorphisms. (22/6908)

P300 amplitude is an electrophysiological quantitative trait that is correlated with both alcoholism and smoking status. Using the Collaborative Study on the Genetics of Alcoholism data, we performed model-free linkage analysis to investigate the relationship between alcoholism, P300 amplitude, and habitual smoking. We also analyzed the effect of parent-of-origin on alcoholism, and utilized both microsatellites (MS) markers and single-nucleotide polymorphisms (SNPs). We found significant evidence of linkage for alcoholism to chromosome 10; inclusion of P300 amplitude as a covariate provided additional evidence of linkage to chromosome 12. This same region on chromosome 12 showed some evidence for a parent-of-origin effect. We found evidence of linkage for the P300 phenotype to chromosome 7 in non-smokers, and to chromosome 17 in alcoholics. The effects of alcoholism and habitual smoking on P300 amplitude appear to have separate genetic determinants. Overall, there were few differences between MS and SNP genome scans. The use of covariates and parent-of-origin effects allowed detection of linkage not seen otherwise.  (+info)

A genome scan for parent-of-origin linkage effects in alcoholism. (23/6908)

BACKGROUND: Alcoholism is a complex disease in which genomic imprinting may play an important role in its susceptibility. OBJECTIVE: To conduct a genome-wide search for loci that may have strong parent-of-origin linkage effects in alcoholism; to compare the linkage results between the microsatellites and the two single-nucleotide polymorphism (SNP) platforms. METHODS: Nonparametric linkage analyses were performed using ALLEGRO with the three sets of markers provided by the Genetic Analysis Workshop 14 for the Collaborative Study on the Genetics of Alcoholism Problem 1 data. Both sex-averaged and sex-specific genetic maps were used. We also provided a valid statistical test to determine whether the parental allele sharing differed significantly. RESULTS: Significant maternal linkage effects (paternal imprinting) were observed on chromosome 12 using either the microsatellite markers or the two SNP panels. The two SNP sets did not improve the linkage signals compared to the results from the microsatellite markers on chromosome 12. Possible paternal linkage effects (maternal imprinting) on chromosome 7 and maternal linkage effects (paternal imprinting) on chromosome 10 were found using the two SNP panels. CONCLUSION: For diseases which may have parent-of-origin effects, linkage analysis looking at parental sharing separately may reduce locus heterogeneity and increase the ability to identify that which can not be identified with usual linkage analysis.  (+info)

A genome-wide linkage analysis of alcoholism on microsatellite and single-nucleotide polymorphism data, using alcohol dependence phenotypes and electroencephalogram measures. (24/6908)

The Collaborative Study on the Genetics of Alcoholism (COGA) is a large-scale family study designed to identify genes that affect the risk for alcoholism and alcohol-related phenotypes. We performed genome-wide linkage analyses on the COGA data made available to participants in the Genetic Analysis Workshop 14 (GAW 14). The dataset comprised 1,350 participants from 143 families. The samples were analyzed on three technologies: microsatellites spaced at 10 cM, Affymetrix GeneChip Human Mapping 10 K Array (HMA10K) and Illumina SNP-based Linkage III Panel. We used ALDX1 and ALDX2, the COGA definitions of alcohol dependence, as well as electrophysiological measures TTTH1 and ECB21 to detect alcoholism susceptibility loci. Many chromosomal regions were found to be significant for each of the phenotypes at a p-value of 0.05. The most significant region for ALDX1 is on chromosome 7, with a maximum LOD score of 2.25 for Affymetrix SNPs, 1.97 for Illumina SNPs, and 1.72 for microsatellites. The same regions on chromosome 7 (96-106 cM) and 10 (149-176 cM) were found to be significant for both ALDX1 and ALDX2. A region on chromosome 7 (112-153 cM) and a region on chromosome 6 (169-185 cM) were identified as the most significant regions for TTTH1 and ECB21, respectively. We also performed linkage analysis on denser maps of markers by combining the SNPs datasets from Affymetrix and Illumina. Adding the microsatellite data to the combined SNP dataset improved the results only marginally. The results indicated that SNPs outperform microsatellites with the densest marker sets performing the best.  (+info)