A set of statistical methods used to group variables or observations into strongly inter-related subgroups. In epidemiology, it may be used to analyze a closely grouped series of events or cases of disease or other health-related phenomenon with well-defined distribution patterns in relation to time or place or both.
A set of genes descended by duplication and variation from some ancestral gene. Such genes may be clustered together on the same chromosome or dispersed on different chromosomes. Examples of multigene families include those that encode the hemoglobins, immunoglobulins, histocompatibility antigens, actins, tubulins, keratins, collagens, heat shock proteins, salivary glue proteins, chorion proteins, cuticle proteins, yolk proteins, and phaseolins, as well as histones, ribosomal RNA, and transfer RNA genes. The latter three are examples of reiterated genes, where hundreds of identical genes are present in a tandem array. (King & Stanfield, A Dictionary of Genetics, 4th ed)
Gene Expression Profiling
Oligonucleotide Array Sequence Analysis
Hybridization of a nucleic acid sample to a very large set of OLIGONUCLEOTIDE PROBES, which have been attached individually in columns and rows to a solid support, to determine a BASE SEQUENCE, or to detect variations in a gene sequence, GENE EXPRESSION, or for GENE MAPPING.
Principal Component Analysis
Molecular Sequence Data
Descriptions of specific amino acid, carbohydrate, or nucleotide sequences which have appeared in the published literature and/or are deposited in and maintained by databanks such as GENBANK, European Molecular Biology Laboratory (EMBL), National Biomedical Research Foundation (NBRF), or other sequence repositories.
Random Amplified Polymorphic DNA Technique
Technique that utilizes low-stringency polymerase chain reaction (PCR) amplification with single primers of arbitrary sequence to generate strain-specific arrays of anonymous DNA fragments. RAPD technique may be used to determine taxonomic identity, assess kinship relationships, analyze mixed genome samples, and create specific probes.
A technique for identifying individuals of a species that is based on the uniqueness of their DNA sequence. Uniqueness is determined by identifying which combination of allelic variations occur in the individual at a statistically relevant number of different loci. In forensic studies, RESTRICTION FRAGMENT LENGTH POLYMORPHISM of multiple, highly polymorphic VNTR LOCI or MICROSATELLITE REPEAT loci are analyzed. The number of loci used for the profile depends on the ALLELE FREQUENCY in the population.
Sequence Analysis, DNA
Deoxyribonucleic acid that makes up the genetic material of bacteria.
Bacterial Typing Techniques
Procedures for identifying types and strains of bacteria. The most frequently employed typing systems are BACTERIOPHAGE TYPING and SEROTYPING as well as bacteriocin typing and biotyping.
A primary headache disorder that is characterized by severe, strictly unilateral PAIN which is orbital, supraorbital, temporal or in any combination of these sites, lasting 15-180 min. occurring 1 to 8 times a day. The attacks are associated with one or more of the following, all of which are ipsilateral: conjunctival injection, lacrimation, nasal congestion, rhinorrhea, facial SWEATING, eyelid EDEMA, and miosis. (International Classification of Headache Disorders, 2nd ed. Cephalalgia 2004: suppl 1)
The genetic constitution of the individual, comprising the ALLELES present at each GENETIC LOCUS.
The outward appearance of the individual. It is the product of interactions between genes, and between the GENOTYPE and the environment.
Reproducibility of Results
The statistical reproducibility of measurements (often in a clinical context), including the testing of instrumentation or techniques to obtain reproducible results. The concept includes reproducibility of physiological measurements, which may be used to develop rules to assess probability or prognosis, or response to a stimulus; reproducibility of occurrence of a condition; and reproducibility of experimental results.
Amino Acid Sequence
The order of amino acids as they occur in a polypeptide chain. This is referred to as the primary structure of proteins. It is of fundamental importance in determining PROTEIN CONFORMATION.
Amplified Fragment Length Polymorphism Analysis
Polymerase Chain Reaction
In vitro method for producing large amounts of specific DNA or RNA fragments of defined length and sequence from small amounts of short oligonucleotide flanking sequences (primers). The essential steps include thermal denaturation of the double-stranded target molecules, annealing of the primers to their complementary sequences, and extension of the annealed primers by enzymatic synthesis with DNA polymerase. The reaction is efficient, specific, and extremely sensitive. Uses for the reaction include disease diagnosis, detection of difficult-to-isolate pathogens, mutation analysis, genetic testing, DNA sequencing, and analyzing evolutionary relationships.
The functional hereditary units of BACTERIA.
The restriction of a characteristic behavior, anatomical structure or physical system, such as immune response; metabolic response, or gene or gene variant to the members of one species. It refers to that property which differentiates one species from another but it is also used for phylogenetic levels higher or lower than the species.
RNA, Ribosomal, 16S
Constituent of 30S subunit prokaryotic ribosomes containing 1600 nucleotides and 21 proteins. 16S rRNA is involved in initiation of polypeptide synthesis.
Data Interpretation, Statistical
Polymorphism, Restriction Fragment Length
Variation occurring within a species in the presence or length of DNA fragment generated by a specific endonuclease at a specific site in the genome. Such variations are generated by mutations that create or abolish recognition sites for these enzymes or change the length of the fragment.
A field of biology concerned with the development of techniques for the collection and manipulation of biological data, and the use of such data to make biological discoveries or predictions. This field encompasses all computational methods and theories for solving biological problems including manipulation of models and datasets.
The arrangement of two or more amino acid or base sequences from an organism or organisms in such a way as to align areas of the sequences sharing common properties. The degree of relatedness or homology between the sequences is predicted computationally or statistically based on weights assigned to the elements aligned between the sequences. This in turn can serve as a potential indicator of the genetic relatedness between the organisms.
Pattern Recognition, Automated
Deep grooves or clefts in the surface of teeth equivalent to class 1 cavities in Black's classification of dental caries.
Factor Analysis, Statistical
A set of statistical methods for analyzing the correlations among several variables in order to estimate the number of fundamental dimensions that underlie the observed data and to describe and measure those dimensions. It is used frequently in the development of scoring systems for rating scales and questionnaires.
Analysis of Variance
Sequence Homology, Amino Acid
The degree of similarity between sequences of amino acids. This information is useful for the analyzing genetic relatedness of proteins and species.
Electrophoresis, Gel, Pulsed-Field
Gel electrophoresis in which the direction of the electric field is changed periodically. This technique is similar to other electrophoretic methods normally used to separate double-stranded DNA molecules ranging in size up to tens of thousands of base-pairs. However, by alternating the electric field direction one is able to separate DNA molecules up to several million base-pairs in length.
The process of cumulative change at the level of DNA; RNA; and PROTEINS, over successive generations.
Expressed Sequence Tags
Partial cDNA (DNA, COMPLEMENTARY) sequences that are unique to the cDNAs from which they were derived.
Electrophoresis, Starch Gel
Geographic variety, population, or race, within a species, that is genetically adapted to a particular habitat. An ecotype typically exhibits phenotypic differences but is capable of interbreeding with other ecotypes.
The regular and simultaneous occurrence in a single interbreeding population of two or more discontinuous genotypes. The concept includes differences in genotypes ranging in size from a single nucleotide site (POLYMORPHISM, SINGLE NUCLEOTIDE) to large nucleotide sequences visible at a chromosomal level.
Statistics as Topic
The systematic surveying, mapping, charting, and description of specific geographical sites, with reference to the physical features that were presumed to influence health and disease. Medical topography should be differentiated from EPIDEMIOLOGY in that the former emphasizes geography whereas the latter emphasizes disease outbreaks.
Tandem arrays of moderately repetitive, short (10-60 bases) DNA sequences which are found dispersed throughout the GENOME, at the ends of chromosomes (TELOMERES), and clustered near telomeres. Their degree of repetition is two to several hundred at each locus. Loci number in the thousands but each locus shows a distinctive repeat unit.
One of the three domains of life (the others being Eukarya and ARCHAEA), also called Eubacteria. They are unicellular prokaryotic microorganisms which generally possess rigid cell walls, multiply by cell division, and exhibit three principal forms: round or coccal, rodlike or bacillary, and spiral or spirochetal. Bacteria can be classified by their response to OXYGEN: aerobic, anaerobic, or facultatively anaerobic; by the mode by which they obtain their energy: chemotrophy (via chemical reaction) or PHOTOTROPHY (via light reaction); for chemotrophs by their source of chemical energy: CHEMOLITHOTROPHY (from inorganic compounds) or chemoorganotrophy (from organic compounds); and by their source for CARBON; NITROGEN; etc.; HETEROTROPHY (from organic sources) or AUTOTROPHY (from CARBON DIOXIDE). They can also be classified by whether or not they stain (based on the structure of their CELL WALLS) with CRYSTAL VIOLET dye: gram-negative or gram-positive.
A theorem in probability theory named for Thomas Bayes (1702-1761). In epidemiology, it is used to obtain the probability of disease in a group of people with some characteristic on the basis of the overall rate of that disease and of the likelihood of that characteristic in healthy and diseased individuals. The most familiar application is in clinical decision analysis where it is used for estimating the probability of a particular diagnosis given the appearance of some symptoms or test result.
A species of gram-negative, facultatively anaerobic, rod-shaped bacteria (GRAM-NEGATIVE FACULTATIVELY ANAEROBIC RODS) commonly found in the lower part of the intestine of warm-blooded animals. It is usually nonpathogenic, but some strains are known to produce DIARRHEA and pyogenic infections. Pathogenic strains (virotypes) are classified by their specific pathogenic mechanisms such as toxins (ENTEROTOXIGENIC ESCHERICHIA COLI), etc.
Gene Expression Regulation
Geographic Information Systems
Gene Expression Regulation, Neoplastic
Neoplasms, Plasma Cell
Process of determining and distinguishing species of bacteria or viruses based on antigens they share.
Reverse Transcriptase Polymerase Chain Reaction
Any detectable and heritable change in the genetic material that causes a change in the GENOTYPE and which is transmitted to daughter cells and to succeeding generations.
Protein Array Analysis
A set of techniques used when variation in several variables has to be studied simultaneously. In statistics, multivariate analysis is interpreted as any analytic method that allows simultaneous study of two or more dependent variables.
Ribonucleic acid in bacteria having regulatory and catalytic roles as well as involvement in protein synthesis.
Nucleic Acid Hybridization
Widely used technique which exploits the ability of complementary sequences in single-stranded DNAs or RNAs to pair with each other to form a double helix. Hybridization can take place between two complimentary DNA sequences, between a single-stranded DNA and a complementary RNA, or between two RNA sequences. The technique is used to detect and isolate specific sequences, measure homology, or define other characteristics of one or both strands. (Kendrew, Encyclopedia of Molecular Biology, 1994, p503)
The presence of bacteria, viruses, and fungi in the soil. This term is not restricted to pathogenic organisms.
Using MOLECULAR BIOLOGY techniques, such as DNA SEQUENCE ANALYSIS; PULSED-FIELD GEL ELECTROPHORESIS; and DNA FINGERPRINTING, to identify, classify, and compare organisms and their subtypes.
A large collection of DNA fragments cloned (CLONING, MOLECULAR) from a given organism, tissue, organ, or cell type. It may contain complete genomic sequences (GENOMIC LIBRARY) or complementary DNA sequences, the latter being formed from messenger RNA and lacking intron sequences.
Chorda Tympani Nerve
A branch of the facial (7th cranial) nerve which passes through the middle ear and continues through the petrotympanic fissure. The chorda tympani nerve carries taste sensation from the anterior two-thirds of the tongue and conveys parasympathetic efferents to the salivary glands.
Extensive collections, reputedly complete, of facts and data garnered from material of a specialized subject area and made available for analysis and application. The collection can be automated by various contemporary methods for retrieval. The concept should be differentiated from DATABASES, BIBLIOGRAPHIC which is restricted to collections of bibliographic references.
The pattern of GENE EXPRESSION at the level of genetic transcription in a specific organism or under specific circumstances in specific cells.
RNA sequences that serve as templates for protein synthesis. Bacterial mRNAs are generally primary transcripts in that they do not require post-transcriptional processing. Eukaryotic mRNA is synthesized in the nucleus and must be exported to the cytoplasm for translation. Most eukaryotic mRNAs have a sequence of polyadenylic acid at the 3' end, referred to as the poly(A) tail. The function of this tail is not known for certain, but it may play a role in the export of mature mRNA from the nucleus as well as in helping stabilize some mRNA molecules by retarding their degradation in the cytoplasm.
Studies in which subsets of a defined population are identified. These groups may or may not be exposed to factors hypothesized to influence the probability of the occurrence of a particular disease or other outcome. Cohorts are defined populations which, as a whole, are followed in an attempt to determine distinguishing subgroup characteristics.
DNA, Ribosomal Spacer
The discipline studying genetic composition of populations and effects of factors such as GENETIC SELECTION, population size, MUTATION, migration, and GENETIC DRIFT on the frequencies of various GENOTYPES and PHENOTYPES using a variety of GENETIC TECHNIQUES.
Severity of Illness Index
Spectroscopy, Fourier Transform Infrared
Tumor Markers, Biological
Molecular products metabolized and secreted by neoplastic tissue and characterized biochemically in cells or body fluids. They are indicators of tumor stage and grade as well as useful for monitoring responses to treatment and predicting recurrence. Many chemical groups are represented including hormones, antigens, amino and nucleic acids, enzymes, polyamines, and specific cell membrane proteins and lipids.
Sequence Homology, Nucleic Acid
Gene Expression Regulation, Bacterial
Any of the processes by which cytoplasmic or intercellular factors influence the differential control of gene action in bacteria.
Sensitivity and Specificity
Binary classification measures to assess test results. Sensitivity or recall rate is the proportion of true positives. Specificity is the probability of correctly determining the absence of a condition. (From Last, Dictionary of Epidemiology, 2d ed)
Protein Structure, Secondary
The level of protein structure in which regular hydrogen-bond interactions within contiguous stretches of polypeptide chain give rise to alpha helices, beta strands (which align to form beta sheets) or other types of coils. This is the first folding level of protein conformation.
Scales, questionnaires, tests, and other methods used to assess pain severity and duration in patients or experimental animals to aid in diagnosis, therapy, and physiological studies.
A genus of gram-negative, aerobic, rod-shaped bacteria widely distributed in nature. Some species are pathogenic for humans, animals, and plants.
Sequence Analysis, Protein
A process that includes the determination of AMINO ACID SEQUENCE of a protein (or peptide, oligopeptide or peptide fragment) and the information analysis of the sequence.
The characteristic 3-dimensional shape of a protein, including the secondary, supersecondary (motifs), tertiary (domains) and quaternary structure of the peptide chain. PROTEIN STRUCTURE, QUATERNARY describes the conformation assumed by multimeric proteins (aggregates of more than one polypeptide chain).
Electron Spin Resonance Spectroscopy
A technique applicable to the wide variety of substances which exhibit paramagnetism because of the magnetic moments of unpaired electrons. The spectra are useful for detection and identification, for determination of electron structure, for study of interactions between molecules, and for measurement of nuclear spins and moments. (From McGraw-Hill Encyclopedia of Science and Technology, 7th edition) Electron nuclear double resonance (ENDOR) spectroscopy is a variant of the technique which can give enhanced resolution. Electron spin resonance analysis can now be used in vivo, including imaging applications such as MAGNETIC RESONANCE IMAGING.
Image Processing, Computer-Assisted
Linear POLYPEPTIDES that are synthesized on RIBOSOMES and may be further modified, crosslinked, cleaved, or assembled into complex proteins with several subunits. The specific sequence of AMINO ACIDS determines the shape the polypeptide will take, during PROTEIN FOLDING, and the function of the protein.
Iron-containing proteins that transfer electrons, usually at a low potential, to flavoproteins; the iron is not present as in heme. (McGraw-Hill Dictionary of Scientific and Technical Terms, 5th ed)
A common nonarticular rheumatic syndrome characterized by myalgia and multiple points of focal muscle tenderness to palpation (trigger points). Muscle pain is typically aggravated by inactivity or exposure to cold. This condition is often associated with general symptoms, such as sleep disturbances, fatigue, stiffness, HEADACHES, and occasionally DEPRESSION. There is significant overlap between fibromyalgia and the chronic fatigue syndrome (FATIGUE SYNDROME, CHRONIC). Fibromyalgia may arise as a primary or secondary disease process. It is most frequent in females aged 20 to 50 years. (From Adams et al., Principles of Neurology, 6th ed, p1494-95)