Survey of transcripts in the adult Drosophila brain. (57/10766)

BACKGROUND: Classic methods of identifying genes involved in neural function include the laborious process of behavioral screening of mutagenized flies and then rescreening candidate lines for pleiotropic effects due to developmental defects. To accelerate the molecular analysis of brain function in Drosophila we constructed a cDNA library exclusively from adult brains. Our goal was to begin to develop a catalog of transcripts expressed in the brain. These transcripts are expected to contain a higher proportion of clones that are involved in neuronal function. RESULTS: The library contains approximately 6.75 million independent clones. From our initial characterization of 271 randomly chosen clones, we expect that approximately 11% of the clones in this library will identify transcribed sequences not found in expressed sequence tag databases. Furthermore, 15% of these 271 clones are not among the 13,601 predicted Drosophila genes. CONCLUSIONS: Our analysis of this unique Drosophila brain library suggests that the number of genes may be underestimated in this organism. This work complements the Drosophila genome project by providing information that facilitates more complete annotation of the genomic sequence. This library should be a useful resource that will help in determining how basic brain functions operate at the molecular level.  (+info)

The cohesin complex: sequence homologies, interaction networks and shared motifs. (58/10766)

BACKGROUND: Cohesin is a macromolecular complex that links sister chromatids together at the metaphase plate during mitosis. The links are formed during DNA replication and destroyed during the metaphase-to-anaphase transition. In budding yeast, the 14S cohesin complex comprises at least two classes of SMC (structural maintenance of chromosomes) proteins - Smc1 and Smc3 - and two SCC (sister-chromatid cohesion) proteins - Scc1 and Scc3. The exact function of these proteins is unknown. RESULTS: Searches of protein sequence databases have revealed new homologs of cohesin proteins. In mouse, Mmip1 (Mad member interacting protein 1) and Smc3 share 99% sequence identity and are products of the same gene. A phylogenetic tree of SMC homologs reveals five families: Smc1, Smc2, Smc3, Smc4 and an ancestral family that includes the sequences from the Archaea and Eubacteria. This ancestral family also includes sequences from eukaryotes. A cohesion interaction network, comprising 17 proteins, has been constructed using two proteomic databases. Genes encoding six proteins in the cohesion network share a common upstream region that includes the MluI cell-cycle box (MCB) element. Pairs of the proteins in this network share common sequence motifs that could represent common structural features such as binding sites. Scc2 shares a motif with Chk1 (kinase checkpoint protein), that comprises part of the serine/threonine protein kinase motif, including the active-site residue. CONCLUSIONS: We have combined genomic and proteomic data into a comprehensive network of information to reach a better understanding of the function of the cohesin complex. We have identified new SMC homologs, created a new SMC phylogeny and identified shared DNA and protein motifs. The potential for Scc2 to function as a kinase - a hypothesis that needs to be verified experimentally - could provide further evidence for the regulation of sister-chromatid cohesion by phosphorylation mechanisms, which are currently poorly understood.  (+info)

Brassica genomics: a complement to, and early beneficiary of, the Arabidopsis sequence. (59/10766)

Those studying the genus Brassica will be among the early beneficiaries of the now-completed Arabidopsis sequence. The remarkable morphological diversity of Brassica species and their relatives offers valuable opportunities to advance our knowledge of plant growth and development, and our understanding of rapid phenotypic evolution.  (+info)

A comparative genomics approach to prediction of new members of regulons. (60/10766)

Identifying the complete transcriptional regulatory network for an organism is a major challenge. For each regulatory protein, we want to know all the genes it regulates, that is, its regulon. Examples of known binding sites can be used to estimate the binding specificity of the protein and to predict other binding sites. However, binding site predictions can be unreliable because determining the true specificity of the protein is difficult because of the considerable variability of binding sites. Because regulatory systems tend to be conserved through evolution, we can use comparisons between species to increase the reliability of binding site predictions. In this article, an approach is presented to evaluate the computational predictions of regulatory sites. We combine the prediction of transcription units having orthologous genes with the prediction of transcription factor binding sites based on probabilistic models. We augment the sets of genes in Escherichia coli that are expected to be regulated by two transcription factors, the cAMP receptor protein and the fumarate and nitrate reduction regulatory protein, through a comparison with the Haemophilus influenzae genome. At the same time, we learned more about the regulatory networks of H. influenzae, a species with much less experimental knowledge than E. coli. By studying orthologous genes subject to regulation by the same transcription factor, we also gained understanding of the evolution of the entire regulatory systems.  (+info)

09/15: Comparative genomics of a conserved chromosomal region associated with a complex human phenotype. (61/10766)

Three genes that encode related immunoglobulin superfamily molecules have recently been mapped to human chromosome 15 in the region q22.3-q23 and to the syntenic region on mouse chromosome 9. These genes presumably derived from gene duplications, and they are highly similar to Deleted in Colorectal Cancer (DCC), which functions as an axon guidance molecule during development of the nervous system. To find out whether additional genes of this class were present in a chromosomal cluster, we produced a comparative physical map within the region of synteny between mouse chromosome 9 and human chromosome 15. This interval overlaps the critical region for the fourth genetic locus for Bardet-Biedl syndrome (BBS4) in humans. Bardet-Biedl syndrome (OMIM 600374) is characterized by poly/syn/brachydactyly, retinal degeneration, hypogonadism, mental retardation, obesity, diabetes, and kidney abnormalities. A detailed map of this locus will help to identify candidate genes for this disorder.  (+info)

Methylation matters. (62/10766)

DNA methylation is not just for basic scientists any more. There is a growing awareness in the medical field that having the correct pattern of genomic methylation is essential for healthy cells and organs. If methylation patterns are not properly established or maintained, disorders as diverse as mental retardation, immune deficiency, and sporadic or inherited cancers may follow. Through inappropriate silencing of growth regulating genes and simultaneous destabilisation of whole chromosomes, methylation defects help create a chaotic state from which cancer cells evolve. Methylation defects are present in cells before the onset of obvious malignancy and therefore cannot be explained simply as a consequence of a deregulated cancer cell. Researchers are now able to detect with exquisite sensitivity the cells harbouring methylation defects, sometimes months or years before the time when cancer is clinically detectable. Furthermore, aberrant methylation of specific genes has been directly linked with the tumour response to chemotherapy and patient survival. Advances in our ability to observe the methylation status of the entire cancer cell genome have led us to the unmistakable conclusion that methylation abnormalities are far more prevalent than expected. This methylomics approach permits the integration of an ever growing repertoire of methylation defects with the genetic alterations catalogued from tumours over the past two decades. Here we discuss the current knowledge of DNA methylation in normal cells and disease states, and how this relates directly to our current understanding of the mechanisms by which tumours arise.  (+info)

Comparative genomics of lactococcal phages: insight from the complete genome sequence of Lactococcus lactis phage BK5-T. (63/10766)

Lactococcus lactis phage BK5-T and Streptococcus thermophilus phage Sfi21, two cos-site temperate Siphoviridae with 40-kb genomes, share an identical genome organization, sequence similarity at the amino acid level over about half of their genomes, and nucleotide sequence identity of 60% over the DNA packaging and head morphogenesis modules. Siphoviridae with similarly organized genomes and substantial protein sequence similarity were identified in several genera of low-GC-content Gram-positive bacteria. These phages demonstrated a gradient of relatedness ranging from nucleotide sequence similarity to protein sequence similarity to gene map similarity over the DNA packaging and head morphogenesis modules. Interestingly, the degree of relatedness was correlated with the evolutionary distance separating their bacterial hosts. These observations suggest elements of vertical evolution in phages. The structural genes from BK5-T shared no sequence relationships with corresponding genes/proteins from lactococcal phages belonging to distinct lactococcal phage species, including phage sk1 (phage species 936) that showed a closely related gene map. Despite a clearly distinct genome organization, lactococcal phages sk1 and c2 showed nine sequence-related proteins. Over the early gene cluster phage BK5-T shared nine regions of high nucleotide sequence similarity, covering at most two adjacent genes, with lactococcal phage r1t (phage species P335). Over the structural genes, the closest relatives of phage r1t were not lactococcal phages belonging to other phage species, but Siphoviridae from Mycobacteria (high-GC-content Gram-positive bacteria). Evidence for recent horizontal gene transfer between distinct phage species was obtained for dairy phages, but these transfers were limited to phages infecting the same bacterial host species.  (+info)

The basic helix-loop-helix protein family: comparative genomics and phylogenetic analysis. (64/10766)

The basic Helix-Loop-Helix (bHLH) proteins are transcription factors that play important roles during the development of various metazoans including fly, nematode, and vertebrates. They are also involved in human diseases, particularly in cancerogenesis. We made an extensive search for bHLH sequences in the completely sequenced genomes of Caenorhabditis elegans and of Drosophila melanogaster. We found 35 and 56 different genes, respectively, which may represent the complete set of bHLH of these organisms. A phylogenetic analysis of these genes, together with a large number (>350) of bHLH from other sources, led us to define 44 orthologous families among which 36 include bHLH from animals only, and two have representatives in both yeasts and animals. In addition, we identified two bHLH motifs present only in yeast, and four that are present only in plants; however, the latter number is certainly an underestimate. Most animal families (35/38) comprise fly, nematode, and vertebrate genes, suggesting that their common ancestor, which lived in pre-Cambrian times (600 million years ago) already owned as many as 35 different bHLH genes.  (+info)