Sequence Analysis, DNA
A multistage process that includes cloning, physical mapping, subcloning, determination of the DNA SEQUENCE, and information analysis.
Molecular Sequence Data
Descriptions of specific amino acid, carbohydrate, or nucleotide sequences which have appeared in the published literature and/or are deposited in and maintained by databanks such as GENBANK, European Molecular Biology Laboratory (EMBL), National Biomedical Research Foundation (NBRF), or other sequence repositories.
Amino Acid Sequence
The insertion of recombinant DNA molecules from prokaryotic and/or eukaryotic sources into a replicating vehicle, such as a plasmid or virus vector, and the introduction of the resultant hybrid molecules into recipient cells without altering the viability of those cells.
RNA, Ribosomal, 16S
Constituent of the 30S subunit of prokaryotic ribosomes, which contains about 1600 nucleotides of rRNA and 21 proteins. 16S rRNA is involved in initiation of polypeptide synthesis.
Deoxyribonucleic acid that makes up the genetic material of bacteria.
Sequence Homology, Nucleic Acid
The sequential correspondence of nucleotides in one nucleic acid molecule with those of another nucleic acid molecule. Sequence homology is an indication of the genetic relatedness of different organisms and gene function.
Sequence Homology, Amino Acid
The arrangement of two or more amino acid or base sequences from an organism or organisms in such a way as to align areas of the sequences sharing common properties. The degree of relatedness or homology between the sequences is predicted computationally or statistically based on weights assigned to the elements aligned between the sequences. This in turn can serve as a potential indicator of the genetic relatedness between the organisms.
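The weighting idea in the definition above can be illustrated with a minimal sketch that scores a pair of sequences that have already been aligned. The weights (match +1, mismatch -1, gap -2) and the function names `alignment_score` and `percent_identity` are illustrative assumptions, not values or names taken from the definition.

```python
def alignment_score(a, b, match=1, mismatch=-1, gap=-2):
    """Score two pre-aligned, equal-length sequences ('-' marks a gap)."""
    if len(a) != len(b):
        raise ValueError("aligned sequences must have equal length")
    score = 0
    for x, y in zip(a, b):
        if x == "-" or y == "-":
            score += gap          # penalty for a column containing a gap
        elif x == y:
            score += match        # identical residues
        else:
            score += mismatch     # differing residues
    return score


def percent_identity(a, b):
    """Percentage of aligned columns with identical residues (gaps never match)."""
    if len(a) != len(b) or not a:
        raise ValueError("aligned sequences must be non-empty and equal length")
    same = sum(1 for x, y in zip(a, b) if x == y and x != "-")
    return 100.0 * same / len(a)
```

A higher score or identity suggests, but does not prove, a closer relationship; real alignment tools use substitution matrices and more elaborate gap models.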
The functional hereditary units of BACTERIA.
Polymerase Chain Reaction
In vitro method for producing large amounts of specific DNA or RNA fragments of defined length and sequence from small amounts of template, using short oligonucleotide flanking sequences (primers). The essential steps include thermal denaturation of the double-stranded target molecules, annealing of the primers to their complementary sequences, and extension of the annealed primers by enzymatic synthesis with DNA polymerase. The reaction is efficient, specific, and extremely sensitive. Uses for the reaction include disease diagnosis, detection of difficult-to-isolate pathogens, mutation analysis, genetic testing, DNA sequencing, and analyzing evolutionary relationships.
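Because each cycle of denaturation, annealing, and extension can at most double the target, the yield is often approximated with a simple exponential model. The sketch below assumes that idealized model; the function name `pcr_copies` and the `efficiency` parameter are hypothetical and do not come from the definition.

```python
def pcr_copies(initial_copies, cycles, efficiency=1.0):
    """Idealized copy number after `cycles` rounds of PCR.

    `efficiency` is the assumed fraction of templates duplicated in each
    cycle (1.0 means perfect doubling). Real reactions plateau as
    reagents are consumed, which this model ignores.
    """
    if cycles < 0:
        raise ValueError("cycles must be non-negative")
    if not 0.0 <= efficiency <= 1.0:
        raise ValueError("efficiency must be between 0 and 1")
    return initial_copies * (1.0 + efficiency) ** cycles
```

Under these assumptions a single template molecule gives 2**30 (roughly a billion) copies after 30 perfect cycles.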
Open Reading Frames
Nucleic Acid Hybridization
Widely used technique which exploits the ability of complementary sequences in single-stranded DNAs or RNAs to pair with each other to form a double helix. Hybridization can take place between two complementary DNA sequences, between a single-stranded DNA and a complementary RNA, or between two RNA sequences. The technique is used to detect and isolate specific sequences, measure homology, or define other characteristics of one or both strands. (Kendrew, Encyclopedia of Molecular Biology, 1994, p503)
Sequence Analysis, Protein
A process that includes the determination of AMINO ACID SEQUENCE of a protein (or peptide, oligopeptide or peptide fragment) and the information analysis of the sequence.
A deoxyribonucleotide polymer that is the primary genetic material of all cells. Eukaryotic and prokaryotic organisms normally contain DNA in a double-stranded state, yet several important biological processes transiently involve single-stranded regions. DNA, which consists of a polysugar-phosphate backbone possessing projections of purines (adenine and guanine) and pyrimidines (thymine and cytosine), forms a double helix that is held together by hydrogen bonds between these purines and pyrimidines (adenine to thymine and guanine to cytosine).
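The pairing rule stated above (adenine with thymine, guanine with cytosine) is enough to derive one strand of a double helix from the other. The following is a minimal sketch; the names `COMPLEMENT` and `reverse_complement` are illustrative.

```python
# Watson-Crick pairing as described in the definition: A-T and G-C.
COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def reverse_complement(dna):
    """Return the complementary strand, written 5' to 3'.

    The two strands are antiparallel, so the complement is read in reverse.
    Raises KeyError for characters other than A, T, G, C.
    """
    return "".join(COMPLEMENT[base] for base in reversed(dna.upper()))
```

For example, `reverse_complement("ATGC")` gives `"GCAT"`, and a palindromic site such as `"GAATTC"` is its own reverse complement.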
Any detectable and heritable change in the genetic material that causes a change in the GENOTYPE and which is transmitted to daughter cells and to succeeding generations.
The restriction of a characteristic behavior, anatomical structure, or physical system, such as immune response, metabolic response, or gene or gene variant, to the members of one species. It refers to that property which differentiates one species from another, but it is also used for phylogenetic levels higher or lower than the species.
Bacterial Typing Techniques
Procedures for identifying types and strains of bacteria. The most frequently employed typing systems are BACTERIOPHAGE TYPING and SEROTYPING as well as bacteriocin typing and biotyping.
A species of gram-negative, facultatively anaerobic, rod-shaped bacteria (GRAM-NEGATIVE FACULTATIVELY ANAEROBIC RODS) commonly found in the lower part of the intestine of warm-blooded animals. It is usually nonpathogenic, but some strains are known to produce DIARRHEA and pyogenic infections. Pathogenic strains (virotypes) are classified by their specific pathogenic mechanisms such as toxins (ENTEROTOXIGENIC ESCHERICHIA COLI), etc.
Extrachromosomal, usually CIRCULAR DNA molecules that are self-replicating and transferable from one organism to another. They are found in a variety of bacterial, archaeal, fungal, algal, and plant species. They are used in GENETIC ENGINEERING as CLONING VECTORS.
A set of genes descended by duplication and variation from some ancestral gene. Such genes may be clustered together on the same chromosome or dispersed on different chromosomes. Examples of multigene families include those that encode the hemoglobins, immunoglobulins, histocompatibility antigens, actins, tubulins, keratins, collagens, heat shock proteins, salivary glue proteins, chorion proteins, cuticle proteins, yolk proteins, and phaseolins, as well as histones, ribosomal RNA, and transfer RNA genes. The latter three are examples of reiterated genes, where hundreds of identical genes are present in a tandem array. (King & Stanfield, A Dictionary of Genetics, 4th ed)
DNA, Ribosomal Spacer
Ribonucleic acid in bacteria having regulatory and catalytic roles as well as involvement in protein synthesis.
RNA sequences that serve as templates for protein synthesis. Bacterial mRNAs are generally primary transcripts in that they do not require post-transcriptional processing. Eukaryotic mRNA is synthesized in the nucleus and must be exported to the cytoplasm for translation. Most eukaryotic mRNAs have a sequence of polyadenylic acid at the 3' end, referred to as the poly(A) tail. The function of this tail is not known for certain, but it may play a role in the export of mature mRNA from the nucleus as well as in helping stabilize some mRNA molecules by retarding their degradation in the cytoplasm.
Any method used for determining the location of and relative distances between genes on a chromosome.
A method (first developed by E.M. Southern) for detection of DNA that has been electrophoretically separated and immobilized by blotting on nitrocellulose or other type of paper or nylon membrane followed by hybridization with labeled NUCLEIC ACID PROBES.
The presence of bacteria, viruses, and fungi in the soil. This term is not restricted to pathogenic organisms.
Repetitive Sequences, Nucleic Acid
Sequences of DNA or RNA that occur in multiple copies. There are several types: INTERSPERSED REPETITIVE SEQUENCES are copies of transposable elements (DNA TRANSPOSABLE ELEMENTS or RETROELEMENTS) dispersed throughout the genome. TERMINAL REPEAT SEQUENCES flank both ends of another sequence, for example, the long terminal repeats (LTRs) on RETROVIRUSES. Variations may be direct repeats, those occurring in the same direction, or inverted repeats, those opposite to each other in direction. TANDEM REPEAT SEQUENCES are copies which lie adjacent to each other, direct or inverted (INVERTED REPEAT SEQUENCES).
DNA Restriction Enzymes
Enzymes that are part of the restriction-modification systems. They catalyze the endonucleolytic cleavage of DNA sequences which lack the species-specific methylation pattern in the host cell's DNA. Cleavage yields random or specific double-stranded fragments with terminal 5'-phosphates. The function of restriction enzymes is to destroy any foreign DNA that invades the host cell. Most have been studied in bacterial systems, but a few have been found in eukaryotic organisms. They are also used as tools for the systematic dissection and mapping of chromosomes, in the determination of base sequences of DNAs, and have made it possible to splice and recombine genes from one organism into the genome of another. EC 3.1.21.
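Sequence-specific cleavage can be sketched as a string operation on one strand of a linear molecule. The example assumes the well-known EcoRI recognition site GAATTC, cut after the first G; the function name `digest` and its parameters are hypothetical, and methylation, the second strand, and circular DNA are all ignored.

```python
def digest(dna, site="GAATTC", cut_offset=1):
    """Cut a linear DNA string at every occurrence of `site`.

    `cut_offset` is the position within the site where the strand is cut
    (1 for EcoRI, which cuts between G and AATTC). Returns the fragments
    in order; joining them reproduces the input.
    """
    fragments = []
    start = 0
    pos = dna.find(site)
    while pos != -1:
        cut = pos + cut_offset
        fragments.append(dna[start:cut])
        start = cut
        pos = dna.find(site, pos + 1)
    fragments.append(dna[start:])   # remainder after the last cut
    return fragments
```

Comparing the lengths of such fragments between samples is the idea behind restriction mapping.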
Sequence Analysis, RNA
A multistage process that includes cloning, physical mapping, subcloning, sequencing, and information analysis of an RNA SEQUENCE.
DNA Transposable Elements
Discrete segments of DNA which can excise and reintegrate to another site in the genome. Most are inactive, i.e., have not been found to exist outside the integrated state. DNA transposable elements include bacterial IS (insertion sequence) elements, Tn elements, the maize controlling elements Ac and Ds, Drosophila P, gypsy, and pogo elements, the human Tigger elements and the Tc and mariner elements which are found throughout the animal kingdom.
A set of statistical methods used to group variables or observations into strongly inter-related subgroups. In epidemiology, it may be used to analyze a closely grouped series of events or cases of disease or other health-related phenomenon with well-defined distribution patterns in relation to time or place or both.
Electrophoresis, Polyacrylamide Gel
The genetic constitution of the individual, comprising the ALLELES present at each GENETIC LOCUS.
Genetic Complementation Test
Detection of RNA that has been electrophoretically separated and immobilized by blotting on nitrocellulose or other type of paper or nylon membrane followed by hybridization with labeled NUCLEIC ACID PROBES.
Polymorphism, Restriction Fragment Length
Mutagenesis where the mutation is caused by the introduction of foreign DNA sequences into a gene or extragenic sequence. This may occur spontaneously in vivo or be experimentally induced in vivo or in vitro. Proviral DNA insertions into or adjacent to a cellular proto-oncogene can interrupt GENETIC TRANSLATION of the coding sequences or interfere with recognition of regulatory elements and cause unregulated expression of the proto-oncogene resulting in tumor formation.
Promoter Regions, Genetic
A set of three nucleotides in a protein coding sequence that specifies individual amino acids or a termination signal (CODON, TERMINATOR). Most codons are universal, but some organisms do not produce the transfer RNAs (RNA, TRANSFER) complementary to all codons. These codons are referred to as unassigned codons (CODONS, NONSENSE).
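Reading a coding sequence three nucleotides at a time can be sketched as follows. The table is deliberately partial (a handful of standard-code codons chosen for illustration), and the names `CODON_TABLE` and `translate` are assumptions of this sketch.

```python
# Deliberately partial excerpt of the standard genetic code (illustration only).
CODON_TABLE = {
    "AUG": "Met",   # also the usual start codon
    "UUU": "Phe",
    "GGC": "Gly",
    "AAA": "Lys",
    "UAA": "Stop",
    "UAG": "Stop",
    "UGA": "Stop",
}


def translate(mrna):
    """Translate an mRNA string codon by codon until a terminator codon.

    Codons missing from the partial table are reported as '???'.
    """
    peptide = []
    for i in range(0, len(mrna) - 2, 3):
        residue = CODON_TABLE.get(mrna[i:i + 3], "???")
        if residue == "Stop":
            break                    # termination signal ends translation
        peptide.append(residue)
    return peptide
```

Here `translate("AUGUUUGGCUAAAAA")` stops at UAA and returns `["Met", "Phe", "Gly"]`; the trailing AAA is never read.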
Chromatography, High Pressure Liquid
Proteins prepared by recombinant DNA technology.
Synthetic or natural oligonucleotides used in hybridization studies in order to identify and study specific nucleic acid fragments, e.g., DNA segments near or within a specific gene locus or gene. The probe hybridizes with a specific mRNA, if present. Conventional techniques used for testing for the hybridization product include dot blot assays, Southern blot assays, and DNA:RNA hybrid-specific antibody tests. Conventional labels for the probe include the radioisotope labels 32P and 125I and the chemical label biotin.
The phenotypic manifestation of a gene or genes by the processes of GENETIC TRANSCRIPTION and GENETIC TRANSLATION.
An order of gram-positive, primarily aerobic BACTERIA that tend to form branching filaments.
Multilocus Sequence Typing
Direct nucleotide sequencing of gene fragments from multiple housekeeping genes for the purpose of phylogenetic analysis, organism identification, and typing of species, strain, serovar, or other distinguishable phylogenetic level.
Production of new arrangements of DNA by various mechanisms such as assortment and segregation, CROSSING OVER; GENE CONVERSION; GENETIC TRANSFORMATION; GENETIC CONJUGATION; GENETIC TRANSDUCTION; or mixed infection of viruses.
Organic compounds that generally contain an amino (-NH2) and a carboxyl (-COOH) group. Twenty alpha-amino acids are the subunits which are polymerized to form proteins.
Gene Expression Regulation, Bacterial
Any of the processes by which cytoplasmic or intercellular factors influence the differential control of gene action in bacteria.
A theoretical representative nucleotide or amino acid sequence in which each nucleotide or amino acid is the one which occurs most frequently at that site in the different sequences which occur in nature. The phrase also refers to an actual sequence which approximates the theoretical consensus. A known CONSERVED SEQUENCE set is represented by a consensus sequence. Commonly observed supersecondary protein structures (AMINO ACID MOTIFS) are often formed by conserved sequences.
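The "most frequent residue at each site" rule can be sketched directly, assuming the input sequences are already aligned and of equal length. The function name `consensus` is illustrative; ties are resolved in favor of the residue seen first, which is an arbitrary choice of this sketch.

```python
from collections import Counter


def consensus(sequences):
    """Return the most frequent residue at each position of aligned sequences."""
    if not sequences:
        raise ValueError("at least one sequence is required")
    length = len(sequences[0])
    if any(len(s) != length for s in sequences):
        raise ValueError("sequences must be aligned to equal length")
    # zip(*sequences) yields one column (site) of the alignment at a time.
    return "".join(
        Counter(column).most_common(1)[0][0] for column in zip(*sequences)
    )
```

For instance, `consensus(["TATAAT", "TATGAT", "TACAAT", "TATAAT"])` returns `"TATAAT"`.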
Reverse Transcriptase Polymerase Chain Reaction
A variation of the PCR technique in which cDNA is made from RNA via reverse transcription. The resultant cDNA is then amplified using standard PCR protocols.
A polynucleotide consisting essentially of chains with a repeating backbone of phosphate and ribose units to which nitrogenous bases are attached. RNA is unique among biological macromolecules in that it can encode genetic information, serve as an abundant structural component of cells, and also possesses catalytic activity. (Rieger et al., Glossary of Genetics: Classical and Molecular, 5th ed)
Nucleic Acid Conformation
The presence of bacteria, viruses, and fungi in water. This term is not restricted to pathogenic organisms.
Linear POLYPEPTIDES that are synthesized on RIBOSOMES and may be further modified, crosslinked, cleaved, or assembled into complex proteins with several subunits. The specific sequence of AMINO ACIDS determines the shape the polypeptide will take, during PROTEIN FOLDING, and the function of the protein.
A mass of organic or inorganic solid fragmented material, or the solid fragment itself, that comes from the weathering of rock and is carried by, suspended in, or dropped by air, water, or ice. It refers also to a mass that is accumulated by any other natural agent and that forms in layers on the earth's surface, such as sand, gravel, silt, mud, fill, or loess. (McGraw-Hill Dictionary of Scientific and Technical Terms, 4th ed, p1689)
The degree of pathogenicity within a group or species of microorganisms or viruses as indicated by case fatality rates and/or the ability of the organism to invade the tissues of the host. The pathogenic capacity of an organism is determined by its VIRULENCE FACTORS.
The characteristic 3-dimensional shape of a protein, including the secondary, supersecondary (motifs), tertiary (domains) and quaternary structure of the peptide chain. PROTEIN STRUCTURE, QUATERNARY describes the conformation assumed by multimeric proteins (aggregates of more than one polypeptide chain).
RNA, Ribosomal, 23S
One of the three domains of life (the others being Eukarya and ARCHAEA), also called Eubacteria. They are unicellular prokaryotic microorganisms which generally possess rigid cell walls, multiply by cell division, and exhibit three principal forms: round or coccal, rodlike or bacillary, and spiral or spirochetal. Bacteria can be classified by their response to OXYGEN: aerobic, anaerobic, or facultatively anaerobic; by the mode by which they obtain their energy: chemotrophy (via chemical reaction) or PHOTOTROPHY (via light reaction); for chemotrophs by their source of chemical energy: CHEMOLITHOTROPHY (from inorganic compounds) or chemoorganotrophy (from organic compounds); and by their source for CARBON; NITROGEN; etc.; HETEROTROPHY (from organic sources) or AUTOTROPHY (from CARBON DIOXIDE). They can also be classified by whether or not they stain (based on the structure of their CELL WALLS) with CRYSTAL VIOLET dye: gram-negative or gram-positive.
The regular and simultaneous occurrence in a single interbreeding population of two or more discontinuous genotypes. The concept includes differences in genotypes ranging in size from a single nucleotide site (POLYMORPHISM, SINGLE NUCLEOTIDE) to large nucleotide sequences visible at a chromosomal level.
Protein Structure, Tertiary
The level of protein structure in which combinations of secondary protein structures (alpha helices, beta sheets, loop regions, and motifs) pack together to form folded shapes called domains. Disulfide bridges between cysteines in two different parts of the polypeptide chain along with other interactions between the chains play a role in the formation and stabilization of tertiary structure. Small proteins usually consist of only one domain but larger proteins may contain a number of domains connected by segments of polypeptide chain which lack regular secondary structure.
A genus of bacteria that form a nonfragmented aerial mycelium. Many species have been identified with some being pathogenic. This genus is responsible for producing a majority of the ANTI-BACTERIAL AGENTS of practical value.
RNA, Ribosomal, 5.8S
Multicellular, eukaryotic life forms of kingdom Plantae (sensu lato), comprising the VIRIDIPLANTAE; RHODOPHYTA; and GLAUCOPHYTA; all of which acquired chloroplasts by direct endosymbiosis of CYANOBACTERIA. They are characterized by a mainly photosynthetic mode of nutrition; essentially unlimited growth at localized regions of cell divisions (MERISTEMS); cellulose within cells providing rigidity; the absence of organs of locomotion; absence of nervous and sensory systems; and an alternation of haploid and diploid generations.
Former kingdom, located on the Korean Peninsula between the Sea of Japan and the Yellow Sea on the east coast of Asia. In 1948, the kingdom ceased and two independent countries were formed, divided by the 38th parallel.
Process of generating a genetic MUTATION. It may occur spontaneously or be induced by MUTAGENS.
Transport proteins that carry specific substances in the blood or across cell membranes.
A group of the proteobacteria comprised of facultatively anaerobic and fermentative gram-negative bacteria.
Any of various animals that constitute the family Suidae and comprise stout-bodied, short-legged omnivorous mammals with thick skin, usually covered with coarse bristles, a rather long mobile snout, and small tail. Included are the genera Babyrousa, Phacochoerus (wart hogs), and Sus, the latter containing the domestic pig (see SUS SCROFA).
The class of all enzymes catalyzing oxidoreduction reactions. The substrate that is oxidized is regarded as a hydrogen donor. The systematic name is based on donor:acceptor oxidoreductase. The recommended name will be dehydrogenase, wherever this is possible; as an alternative, reductase can be used. Oxidase is only used in cases where O2 is the acceptor. (Enzyme Nomenclature, 1992, p9)
Any normal or abnormal coloring matter in PLANTS; ANIMALS or micro-organisms.
A field of biology concerned with the development of techniques for the collection and manipulation of biological data, and the use of such data to make biological discoveries or predictions. This field encompasses all computational methods and theories for solving biological problems including manipulation of models and datasets.
Deoxyribonucleic acid that makes up the genetic material of plants.
Gene Expression Regulation
The most abundant form of RNA. Together with proteins, it forms the ribosomes, playing a structural role and also a role in ribosomal binding of mRNA and tRNAs. Individual chains are conventionally designated by their sedimentation coefficients. In eukaryotes, four large chains exist, synthesized in the nucleolus and constituting about 50% of the ribosome. (Dorland, 28th ed)
Bacterial Outer Membrane Proteins
Proteins isolated from the outer membrane of Gram-negative bacteria.
Endogenous substances, usually proteins, which are effective in the initiation, stimulation, or termination of the genetic transcription process.
Diseases of plants.
The property of objects that determines the direction of heat flow when they are placed in direct thermal contact. Temperature reflects the energy of microscopic motions (vibrational and translational) of an object's constituent particles.
The sequential location of genes on a chromosome.
Analysis of PEPTIDES that are generated from the digestion or fragmentation of a protein or mixture of PROTEINS, by ELECTROPHORESIS; CHROMATOGRAPHY; or MASS SPECTROMETRY. The resulting peptide fingerprints are analyzed for a variety of purposes including the identification of the proteins in a sample, GENETIC POLYMORPHISMS, patterns of gene expression, and patterns diagnostic for diseases.
RNA, Ribosomal, 18S
Species- or subspecies-specific DNA (including COMPLEMENTARY DNA; conserved genes, whole chromosomes, or whole genomes) used in hybridization studies in order to identify microorganisms, to measure DNA-DNA homologies, to group subspecies, etc. The DNA probe hybridizes with a specific mRNA, if present. Conventional techniques used for testing for the hybridization product include dot blot assays, Southern blot assays, and DNA:RNA hybrid-specific antibody tests. Conventional labels for the DNA probe include the radioisotope labels 32P and 125I and the chemical label biotin. The use of DNA probes provides a specific, sensitive, rapid, and inexpensive replacement for cell culture techniques for diagnosing infections.
Members of the class of compounds composed of AMINO ACIDS joined together by peptide bonds between adjacent amino acids into linear, branched or cyclical structures. OLIGOPEPTIDES are composed of approximately 2-12 amino acids. Polypeptides are composed of approximately 13 or more amino acids. PROTEINS are linear polypeptides that are normally synthesized on RIBOSOMES.
Proteins that form the CAPSID of VIRUSES.
A technique for identifying individuals of a species that is based on the uniqueness of their DNA sequence. Uniqueness is determined by identifying which combination of allelic variations occurs in the individual at a statistically relevant number of different loci. In forensic studies, RESTRICTION FRAGMENT LENGTH POLYMORPHISM of multiple, highly polymorphic VNTR LOCI or MICROSATELLITE REPEAT loci is analyzed. The number of loci used for the profile depends on the ALLELE FREQUENCY in the population.
A process whereby multiple RNA transcripts are generated from a single gene. Alternative splicing involves the splicing together of other possible sets of EXONS during the processing of some, but not all, transcripts of the gene. Thus a particular exon may be connected to any one of several alternative exons to form a mature RNA. The alternative forms of mature MESSENGER RNA produce PROTEIN ISOFORMS in which one part of the isoforms is common while the other parts are different.
Recombinant Fusion Proteins
Polymorphism, Single-Stranded Conformational
Variation in a population's DNA sequence that is detected by determining alterations in the conformation of denatured DNA fragments. Denatured DNA fragments are allowed to renature under conditions that prevent the formation of double-stranded DNA and allow secondary structure to form in single-stranded fragments. These fragments are then run through polyacrylamide gels to detect variations in secondary structure, which are manifested as alterations in migration through the gels.
Escherichia coli Proteins
Proteins obtained from ESCHERICHIA COLI.
Viral Structural Proteins
Viral proteins that are components of the mature assembled VIRUS PARTICLES. They may include nucleocapsid core proteins (gag proteins), enzymes packaged within the virus particle (pol proteins), and membrane components (env proteins). These do not include the proteins encoded in the VIRAL GENOME that are produced in infected cells but which are not packaged in the mature virus particle, i.e., the so-called non-structural proteins (VIRAL NONSTRUCTURAL PROTEINS).
A genus of gram-negative, anaerobic, nonsporeforming, nonmotile rods. Organisms of this genus had originally been classified as members of the BACTEROIDES genus but overwhelming biochemical and chemical findings in 1990 indicated the need to separate them from other Bacteroides species, and hence, this new genus was established.
Protein Structure, Secondary
The level of protein structure in which regular hydrogen-bond interactions within contiguous stretches of polypeptide chain give rise to alpha helices, beta strands (which align to form beta sheets) or other types of coils. This is the first folding level of protein conformation.
The three possible sequences of CODONS by which GENETIC TRANSLATION may occur from one nucleotide sequence. A segment of mRNA 5'AUCCGA3' could be translated as 5'AUC.. or 5'UCC.. or 5'CCG.., depending on the location of the START CODON.
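The three frames in the example above can be enumerated with a short sketch. The function name `reading_frames` is illustrative; only complete codons are returned, and the three frames of the complementary strand are not considered.

```python
def reading_frames(seq):
    """Return the complete codons read from each of the three forward frames."""
    frames = []
    for offset in range(3):
        # Step through the sequence three bases at a time from this offset,
        # stopping before any incomplete trailing codon.
        codons = [seq[i:i + 3] for i in range(offset, len(seq) - 2, 3)]
        frames.append(codons)
    return frames
```

Applied to the segment from the definition, `reading_frames("AUCCGA")` gives `[["AUC", "CGA"], ["UCC"], ["CCG"]]`.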
A type of mutation in which the number of NUCLEOTIDES deleted from or inserted into a protein coding sequence is not divisible by three, thereby causing an alteration in the READING FRAMES of the entire coding sequence downstream of the mutation. These mutations may be induced by certain types of MUTAGENS or may occur spontaneously.
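The divisibility rule and its downstream effect can be sketched in a few lines. The function names `is_frameshift` and `codons`, and the short example sequence in the usage note, are hypothetical illustrations.

```python
def is_frameshift(indel_length):
    """True if an insertion or deletion of this many nucleotides shifts the frame."""
    if indel_length <= 0:
        raise ValueError("indel length must be positive")
    return indel_length % 3 != 0


def codons(seq):
    """Split a coding sequence into complete codons, starting at position 0."""
    return [seq[i:i + 3] for i in range(0, len(seq) - 2, 3)]
```

For the made-up sequence `"AUGGCUGAAUGA"`, `codons` gives AUG, GCU, GAA, UGA. Deleting the single G at index 3 leaves `"AUGCUGAAUGA"`, which reads AUG, CUG, AAU: every codon after the deletion changes, and the original terminator is no longer read in frame.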
Regulatory Sequences, Nucleic Acid
Nucleic acid sequences involved in regulating the expression of genes.
Chromatography, Ion Exchange
The application of molecular biology to the answering of epidemiological questions. The examination of patterns of changes in DNA to implicate particular carcinogens and the use of molecular markers to predict which individuals are at highest risk for a disease are common examples.
The uptake of naked or purified DNA by CELLS, usually meaning the process as it occurs in eukaryotic cells. It is analogous to bacterial transformation (TRANSFORMATION, BACTERIAL) and both are routinely employed in GENE TRANSFER TECHNIQUES.
Genes bearing close resemblance to known genes at different loci, but rendered non-functional by additions or deletions in structure that prevent normal transcription or translation. When lacking introns and containing a poly-A segment near the downstream end (as a result of reverse copying from processed nuclear RNA into double-stranded DNA), they are called processed genes.
A genus of asporogenous bacteria that is widely distributed in nature. Its organisms appear as straight to slightly curved rods and are known to be human and animal parasites and pathogens.
The functional hereditary units of PLANTS.
Any member of the group of ENDOPEPTIDASES containing at the active site a serine residue involved in catalysis.
Chromosomes, Artificial, Bacterial
DNA constructs that contain, at a minimum, a REPLICATION ORIGIN for successful replication, propagation, and maintenance as an extra chromosome in bacteria. In addition, they can carry large amounts (about 200 kilobases) of other sequence for a variety of bioengineering purposes.
Amino Acid Motifs
Compounds and molecular complexes that consist of very large numbers of atoms and are generally over 500 kDa in size. In biological systems macromolecular substances usually can be visualized using ELECTRON MICROSCOPY and are distinguished from ORGANELLES by the lack of a membrane structure.
Structures within the nucleus of bacterial cells consisting of or containing DNA, which carry genetic information essential to the cell.
Protein Sorting Signals
A genus of gram-positive, microaerophilic, rod-shaped bacteria occurring widely in nature. Its species are also part of the normal flora of the mouth, intestinal tract, and vagina of many mammals, including humans. Pathogenicity from this genus is rare.
Amino Acid Substitution
The naturally occurring or experimentally induced replacement of one or more AMINO ACIDS in a protein with another. If a functionally equivalent amino acid is substituted, the protein may retain wild-type activity. Substitution may also diminish, enhance, or eliminate protein function. Experimentally induced substitution is often used to study enzyme activities and binding site properties.