Optical mapping of Plasmodium falciparum chromosome 2.
Detailed restriction maps of microbial genomes are a valuable resource in genome sequencing studies but are toilsome to construct by contig construction of maps derived from cloned DNA. Analysis of genomic DNA enables large stretches of the genome to be mapped and circumvents library construction and associated cloning artifacts. We used pulsed-field gel electrophoresis purified Plasmodium falciparum chromosome 2 DNA as the starting material for optical mapping, a system for making ordered restriction maps from ensembles of individual DNA molecules. DNA molecules were bound to derivatized glass surfaces, cleaved with NheI or BamHI, and imaged by digital fluorescence microscopy. Large pieces of the chromosome containing ordered DNA restriction fragments were mapped. Maps were assembled from 50 molecules producing an average contig depth of 15 molecules and high-resolution restriction maps covering the entire chromosome. Chromosome 2 was found to be 976 kb by optical mapping with NheI, and 946 kb with BamHI, which compares closely to the published size of 947 kb from large-scale sequencing. The maps were used to further verify assemblies from the plasmid library used for sequencing. Maps generated in silico from the sequence data were compared to the optical mapping data, and good correspondence was found. Such high-resolution restriction maps may become an indispensable resource for large-scale genome sequencing projects. (+info)
Prediction of the coding sequences of unidentified human genes. XII. The complete sequences of 100 new cDNA clones from brain which code for large proteins in vitro.
In this paper, we report the sequences of 100 cDNA clones newly determined from a set of size-fractionated human brain cDNA libraries and predict the coding sequences of the corresponding genes, named KIAA0819 to KIAA0918. These cDNA clones were selected on the basis of their coding potentials of large proteins (50 kDa and more) by using in vitro transcription/translation assays. The sequence data showed that the average sizes of the inserts and corresponding open reading frames are 4.4 kb and 2.5 kb (831 amino acid residues), respectively. Homology and motif/domain searches against the public databases indicated that the predicted coding sequences of 83 genes were similar to those of known genes, 59% of which (49 genes) were categorized as coding for proteins functionally related to cell signaling/communication, cell structure/motility and nucleic acid management. The chromosomal locations and the expression profiles of all the genes were also examined. For 54 clones including brain-specific ones, the mRNA levels were further examined among 8 brain regions (amygdala, corpus callosum, cerebellum, caudate nucleus, hippocampus, substantia nigra, subthalamic nucleus, and thalamus), spinal cord, and fetal brain. (+info)
Structural analysis of Arabidopsis thaliana chromosome 5. VIII. Sequence features of the regions of 1,081,958 bp covered by seventeen physically assigned P1 and TAC clones.
A total of 17 Pl and TAC clones each representing an assigned region of chromosome 5 were isolated from P1 and TAC genomic libraries of Arabidopsis thaliana Columbia, and their nucleotide sequences were determined. The length of the clones sequenced in this study summed up to 1,081,958 bp. As we have previously reported the sequence of 9,072,622 bp by analysis of 125 P1 and TAC clones, the total length of the sequences of chromosome 5 determined so far is now 10,154,580 bp. The sequences were subjected to similarity search against protein and EST databases and analysis with computer programs for gene modeling. As a consequence, a total of 253 potential protein-coding genes with known or predicted functions were identified. The positions of exons which do not show apparent similarity to known genes were also assigned using computer programs for exon prediction. The average density of the genes identified in this study was 1 gene per 4277 bp. Introns were observed in 74% of the potential protein genes, and the average number per gene and the average length of the introns were 4.3 and 168 bp, respectively. The sequence data and gene information are available on the World Wide Web database KAOS (Kazusa Arabidopsis data Opening Site) at http://www.kazusa.or.jp/arabi/. (+info)
Molecular characterization of the genes pcaG and pcaH, encoding protocatechuate 3,4-dioxygenase, which are essential for vanillin catabolism in Pseudomonas sp. strain HR199.
Pseudomonas sp. strain HR199 is able to utilize eugenol (4-allyl-2-methoxyphenol), vanillin (4-hydroxy-3-methoxybenzaldehyde), or protocatechuate as the sole carbon source for growth. Mutants of this strain which were impaired in the catabolism of vanillin but retained the ability to utilize eugenol or protocatechuate were obtained after nitrosoguanidine mutagenesis. One mutant (SK6169) was used as recipient of a Pseudomonas sp. strain HR199 genomic library in cosmid pVK100, and phenotypic complementation was achieved with a 5.8-kbp EcoRI fragment (E58). The amino acid sequences deduced from two corresponding open reading frames (ORF) identified on E58 revealed high degrees of homology to pcaG and pcaH, encoding the two subunits of protocatechuate 3,4-dioxygenase. Three additional ORF most probably encoded a 4-hydroxybenzoate 3-hydroxylase (PobA) and two putative regulatory proteins, which exhibited homology to PcaQ of Agrobacterium tumefaciens and PobR of Pseudomonas aeruginosa, respectively. Since mutant SK6169 was also complemented by a subfragment of E58 that harbored only pcaH, this mutant was most probably lacking a functional beta subunit of the protocatechuate 3, 4-dioxygenase. Since this mutant was still able to grow on protocatechuate and lacked protocatechuate 4,5-dioxygenase and protocatechuate 2,3-dioxygenase, the degradation had to be catalyzed by different enzymes. Two other mutants (SK6184 and SK6190), which were also impaired in the catabolism of vanillin, were not complemented by fragment E58. Since these mutants accumulated 3-carboxy muconolactone during cultivation on eugenol, they most probably exhibited a defect in a step of the catabolic pathway following the ortho cleavage. Moreover, in these mutants cyclization of 3-carboxymuconic acid seems to occur by a syn absolute stereochemical course, which is normally only observed for cis, cis-muconate lactonization in pseudomonads. In conclusion, vanillin is degraded through the ortho-cleavage pathway in Pseudomonas sp. strain HR199 whereas protocatechuate could also be metabolized via a different pathway in the mutants. (+info)
Der(22) syndrome and velo-cardio-facial syndrome/DiGeorge syndrome share a 1.5-Mb region of overlap on chromosome 22q11.
Derivative 22 (der) syndrome is a rare disorder associated with multiple congenital anomalies, including profound mental retardation, preauricular skin tags or pits, and conotruncal heart defects. It can occur in offspring of carriers of the constitutional t(11;22)(q23;q11) translocation, owing to a 3:1 meiotic malsegregation event resulting in partial trisomy of chromosomes 11 and 22. The trisomic region on chromosome 22 overlaps the region hemizygously deleted in another congenital anomaly disorder, velo-cardio-facial syndrome/DiGeorge syndrome (VCFS/DGS). Most patients with VCFS/DGS have a similar 3-Mb deletion, whereas some have a nested distal deletion endpoint resulting in a 1.5-Mb deletion, and a few rare patients have unique deletions. To define the interval on 22q11 containing the t(11;22) breakpoint, haplotype analysis and FISH mapping were performed for five patients with der(22) syndrome. Analysis of all the patients was consistent with 3:1 meiotic malsegregation in the t(11;22) carrier parent. FISH-mapping studies showed that the t(11;22) breakpoint occurred in the same interval as the 1.5-Mb distal deletion breakpoint for VCFS. The deletion breakpoint of one VCFS patient with an unbalanced t(18;22) translocation also occurred in the same region. Hamster-human somatic hybrid cell lines from a patient with der(22) syndrome and a patient with VCFS showed that the breakpoints occurred in an interval containing low-copy repeats, distal to RANBP1 and proximal to ZNF74. The presence of low-copy repetitive sequences may confer susceptibility to chromosome rearrangements. A 1.5-Mb region of overlap on 22q11 in both syndromes suggests the presence of dosage-dependent genes in this interval. (+info)
Cloning of mnuA, a membrane nuclease gene of Mycoplasma pulmonis, and analysis of its expression in Escherichia coli.
Membrane nucleases of mycoplasmas are believed to play important roles in growth and pathogenesis, although no clear evidence for their importance has yet been obtained. As a first step in defining the function of this unusual membrane activity, studies were undertaken to clone and analyze one of the membrane nuclease genes from Mycoplasma pulmonis. A novel screening strategy was used to identify a recombinant lambda phage expressing nuclease activity, and its cloned fragment was analyzed. Transposon mutagenesis was used to identify an open reading frame of 1,410 bp, which coded for nuclease activity in Escherichia coli. This gene coded for a 470-amino-acid polypeptide of 53,739 Da and was designated mnuA (for "membrane nuclease"). The MnuA protein contained a prolipoprotein signal peptidase II recognition sequence along with an extensive hydrophobic region near the amino terminus, suggesting that the protein may be lipid modified or that it is anchored in the membrane by this membrane-spanning region. Antisera raised against two MnuA peptide sequences identified an M. pulmonis membrane protein of approximately 42 kDa by immunoblotting, which corresponded to a trypsin-sensitive nucleolytic band of the same size. Maxicell experiments with E. coli confirmed that mnuA coded for a nuclease of unknown specificity. Hybridization studies showed that mnuA sequences are found in few Mycoplasma species, suggesting that mycoplasma membrane nucleases display significant sequence variation within the genus Mycoplasma. (+info)
Characterization of the recD gene of Neisseria gonorrhoeae MS11 and the effect of recD inactivation on pilin variation and DNA transformation.
Pilin antigenic variation in Neisseria gonorrhoeae may result following intrachromosomal recombination between homologous pil genes. Despite extensive study, recA is the only previously characterized gene known to be involved in this process. In this study, the gonococcal recD gene, encoding one subunit of the putative RecBCD holoenzyme, was characterized and its role in pilin variation assessed. The complete recD gene of N. gonorrhoeae MS11 was cloned and its nucleotide sequence determined. The gonococcal recD gene complemented a defined Escherichia coli recD mutant, based on plaque formation of bacteriophage lambda and the restoration of ATP-dependent nuclease activity. Inactivation of the gonococcal recD gene had no measurable effect on cell viability or survival following UV exposure, but did decrease the frequency of DNA transformation approximately threefold. The frequency at which non-parental pilin phenotypes were spawned was 12-fold greater in MS11 recD mutants compared with the parental MS11 rec+ strain. Similar results were obtained using recD mutants that were not competent for DNA transformation. Complementation of the MS11 recD mutant with a wild-type recD gene copy restored the frequency of pilin phenotypic variation to approximately wild-type levels. The nucleotide changes at pilE in the recD mutants were confined to the variable regions of the gene and were similar to changes previously attributed to gene conversion. (+info)
Cytochrome c550 is an essential component of the quinoprotein ethanol oxidation system in Pseudomonas aeruginosa: cloning and sequencing of the genes encoding cytochrome c550 and an adjacent acetaldehyde dehydrogenase.
Pseudomonas aeruginosa ATCC 17933 grown aerobically on ethanol produces a soluble cytochrome c550 together with a quinoprotein ethanol dehydrogenase. A 3.2 kb genomic DNA fragment containing the gene encoding cytochrome c550 was cloned and sequenced. Two other complete and two truncated ORFs were also identified. A truncated ORF encoding the quinoprotein ethanol dehydrogenase (exaA) was found upstream of the cytochrome c550 gene (exaB) and in reverse orientation. An ORF encoding a NAD(+)-dependent acetaldehyde dehydrogenase (exaC) was located downstream of the cytochrome c550 gene and in the same orientation. Another ORF showed similarity to the pqqA gene and a truncated ORF similarity to the pqqB gene, both involved in the biosynthesis of the prosthetic group PQQ. The organization of these genes was found to be different from the well-studied methanol oxidation system in methylotrophic bacteria. The deduced amino acid sequence of cytochrome c550 from P. aeruginosa showed some similarity to cytochrome c6 of the alga Chlamydomonas reinhardtii and the haem domain of quinohaemoprotein alcohol dehydrogenases of acetic acid bacteria, but no similarity to the soluble cytochrome cL of the quinoprotein methanol oxidation system of methylotrophs could be detected. A mutant of P. aeruginosa with an interrupted cytochrome c550 gene was unable to grow on ethanol, which proves that cytochrome c550 is an essential component of the ethanol oxidation system in this organism. (+info)