Complete genome sequence of enterohemorrhagic Escherichia coli O157:H7 and genomic comparison with a laboratory strain K-12. (65/1209)

Escherichia coli O157:H7 is a major food-borne infectious pathogen that causes diarrhea, hemorrhagic colitis, and hemolytic uremic syndrome. Here we report the complete chromosome sequence of an O157:H7 strain isolated from the Sakai outbreak, and the results of genomic comparison with a benign laboratory strain, K-12 MG1655. The chromosome is 5.5 Mb in size, 859 Kb larger than that of K-12. We identified a 4.1-Mb sequence highly conserved between the two strains, which may represent the fundamental backbone of the E. coli chromosome. The remaining 1.4-Mb sequence comprises of O157:H7-specific sequences, most of which are horizontally transferred foreign DNAs. The predominant roles of bacteriophages in the emergence of O157:H7 is evident by the presence of 24 prophages and prophage-like elements that occupy more than half of the O157:H7-specific sequences. The O157:H7 chromosome encodes 1632 proteins and 20 tRNAs that are not present in K-12. Among these, at least 131 proteins are assumed to have virulence-related functions. Genome-wide codon usage analysis suggested that the O157:H7-specific tRNAs are involved in the efficient expression of the strain-specific genes. A complete set of the genes specific to O157:H7 presented here sheds new insight into the pathogenicity and the physiology of O157:H7, and will open a way to fully understand the molecular mechanisms underlying the O157:H7 infection.  (+info)

Predicted highly expressed and putative alien genes of Deinococcus radiodurans and implications for resistance to ionizing radiation damage. (66/1209)

Predicted highly expressed (PHX) and putative alien genes determined by codon usages are characterized in the genome of Deinococcus radiodurans (strain R1). Deinococcus radiodurans (DEIRA) can survive very high doses of ionizing radiation that are lethal to virtually all other organisms. It has been argued that DEIRA is endowed with enhanced repair systems that provide protection and stability. However, predicted expression levels of DNA repair proteins with the exception of RecA tend to be low and do not distinguish DEIRA from other prokaryotes. In this paper, the capability of DEIRA to resist extreme doses of ionizing and UV radiation is attributed to an unusually high number of PHX chaperone/degradation, protease, and detoxification genes. Explicitly, compared with all current complete prokaryotic genomes, DEIRA contains the greatest number of PHX detoxification and protease proteins. Other sources of environmental protection against severe conditions of UV radiation, desiccation, and thermal effects for DEIRA are the several S-layer (surface structure) PHX proteins. The top PHX gene of DEIRA is the multifunctional tricarboxylic acid (TCA) gene aconitase, which, apart from its role in respiration, also alerts the cell to oxidative damage.  (+info)

Expanding the genetic code of Escherichia coli. (67/1209)

A unique transfer RNA (tRNA)/aminoacyl-tRNA synthetase pair has been generated that expands the number of genetically encoded amino acids in Escherichia coli. When introduced into E. coli, this pair leads to the in vivo incorporation of the synthetic amino acid O-methyl-l-tyrosine into protein in response to an amber nonsense codon. The fidelity of translation is greater than 99%, as determined by analysis of dihydrofolate reductase containing the unnatural amino acid. This approach should provide a general method for increasing the genetic repertoire of living cells to include a variety of amino acids with novel structural, chemical, and physical properties not found in the common 20 amino acids.  (+info)

Enlarging the amino acid set of Escherichia coli by infiltration of the valine coding pathway. (68/1209)

Aminoacyl transfer RNA (tRNA) synthetases establish the rules of the genetic code by catalyzing the aminoacylation of tRNAs. For some synthetases, accuracy depends critically on an editing function at a site distinct from the aminoacylation site. Mutants of Escherichia coli that incorrectly charge tRNA(Val) with cysteine were selected after random mutagenesis of the whole chromosome. All mutations obtained were located in the editing site of valyl-tRNA synthetase. More than 20% of the valine in cellular proteins from such an editing mutant organism could be replaced with the noncanonical aminobutyrate, sterically similar to cysteine. Thus, the editing function may have played a central role in restricting the genetic code to 20 amino acids. Disabling this editing function offers a powerful approach for diversifying the chemical composition of proteins and for emulating evolutionary stages of ambiguous translation.  (+info)

The complete genome sequence of the murine respiratory pathogen Mycoplasma pulmonis. (69/1209)

Mycoplasma pulmonis is a wall-less eubacterium belonging to the Mollicutes (trivial name, mycoplasmas) and responsible for murine respiratory diseases. The genome of strain UAB CTIP is composed of a single circular 963 879 bp chromosome with a G + C content of 26.6 mol%, i.e. the lowest reported among bacteria, Ureaplasma urealyticum apart. This genome contains 782 putative coding sequences (CDSs) covering 91.4% of its length and a function could be assigned to 486 CDSs whilst 92 matched the gene sequences of hypothetical proteins, leaving 204 CDSs without significant database match. The genome contains a single set of rRNA genes and only 29 tRNAs genes. The replication origin oriC was localized by sequence analysis and by using the G + C skew method. Sequence polymorphisms within stretches of repeated nucleotides generate phase-variable protein antigens whilst a recombinase gene is likely to catalyse the site-specific DNA inversions in major M.pulmonis surface antigens. Furthermore, a hemolysin, secreted nucleases and a glyco-protease are predicted virulence factors. Surprisingly, several of the genes previously reported to be essential for a self-replicating minimal cell are missing in the M.pulmonis genome although this one is larger than the other mycoplasma genomes fully sequenced until now.  (+info)

Sequence analysis of cloned cDNA encoding part of an immunoglobulin heavy chain. (70/1209)

The recombinant plasmid pH21-1 consists of mouse-derived complementary DNA (cDNA) in the E. coli plasmid pMB9. The mouse insertion has been completely sequenced, and encodes the CH3 domain and half the CH2 domain of the immunoglobulin gamma1 heavy chain. The predicted amino acid sequence differs at several positions from that previously published for this protein. The pattern of codon usage resembles that in some other eukaryotic messenger RNAs. A computer program has been used to predict the optimum secondary structure for the mRNA encoding the CH3 domain and the inter-domain junction.  (+info)

Why genetic information processing could have a quantum basis. (71/1209)

Living organisms are not just random collections of organic molecules. There is continuous information processing going on in the apparent bouncing around of molecules of life. Optimization criteria in this information processing can be searched for using the laws of physics. Quantum dynamics can explain why living organisms have 4 nucleotide bases and 20 amino acids, as optimal solutions of the molecular assembly process. Experiments should be able to tell whether evolution indeed took advantage of quantum dynamics or not.  (+info)

Analysis of codon:anticodon interactions within the ribosome provides new insights into codon reading and the genetic code structure. (72/1209)

Although the decoding rules have been largely elucidated, the physical-chemical reasons for the "correctness" of codon:anticodon duplexes have never been clear. In this work, on the basis of the available data, we propose that the correct codon:anticodon duplexes are those whose formation and interaction with the ribosomal decoding center are not accompanied by uncompensated losses of hydrogen and ionic bonds. Other factors such as proofreading, base-base stacking and aminoacyl-tRNA concentration contribute to the efficiency and accuracy of aminoacyl-tRNA selection, and certainly these factors are important; but we suggest that analyses of hydrogen and ionic bonding alone provides a robust first-order approximation of decoding accuracy. Thus our model can simplify predictions about decoding accuracy and error. The model can be refined with data, but is already powerful enough to explain all of the available data on decoding accuracy. Here we predict which duplexes should be considered correct, which duplexes are responsible for virtually all misreading, and we suggest an evolutionary scheme that gave rise to the mixed boxes of the genetic code.  (+info)