Linear POLYPEPTIDES that are synthesized on RIBOSOMES and may be further modified, crosslinked, cleaved, or assembled into complex proteins with several subunits. The specific sequence of AMINO ACIDS determines the shape the polypeptide will take, during PROTEIN FOLDING, and the function of the protein.
Protein Structure, Secondary
The level of protein structure in which regular hydrogen-bond interactions within contiguous stretches of polypeptide chain give rise to alpha helices, beta strands (which align to form beta sheets) or other types of coils. This is the first folding level of protein conformation.
The characteristic 3-dimensional shape of a protein, including the secondary, supersecondary (motifs), tertiary (domains) and quaternary structure of the peptide chain. PROTEIN STRUCTURE, QUATERNARY describes the conformation assumed by multimeric proteins (aggregates of more than one polypeptide chain).
Sequence Analysis, Protein
A process that includes the determination of AMINO ACID SEQUENCE of a protein (or peptide, oligopeptide or peptide fragment) and the information analysis of the sequence.
Structural Homology, Protein
Protein Structure, Tertiary
The level of protein structure in which combinations of secondary protein structures (alpha helices, beta sheets, loop regions, and motifs) pack together to form folded shapes called domains. Disulfide bridges between cysteines in two different parts of the polypeptide chain along with other interactions between the chains play a role in the formation and stabilization of tertiary structure. Small proteins usually consist of only one domain but larger proteins may contain a number of domains connected by segments of polypeptide chain which lack regular secondary structure.
A field of biology concerned with the development of techniques for the collection and manipulation of biological data, and the use of such data to make biological discoveries or predictions. This field encompasses all computational methods and theories for solving biological problems including manipulation of models and datasets.
The arrangement of two or more amino acid or base sequences from an organism or organisms in such a way as to align areas of the sequences sharing common properties. The degree of relatedness or homology between the sequences is predicted computationally or statistically based on weights assigned to the elements aligned between the sequences. This in turn can serve as a potential indicator of the genetic relatedness between the organisms.
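As a minimal sketch of how weights assigned to aligned elements yield an overall alignment score, the following computes a global alignment score by dynamic programming (in the style of Needleman-Wunsch). The function name and the scoring weights (match +1, mismatch -1, gap -1) are illustrative assumptions and are not part of the definition above.

```python
def global_alignment_score(a: str, b: str, match: int = 1,
                           mismatch: int = -1, gap: int = -1) -> int:
    """Return the best global alignment score of sequences a and b.

    The weights are hypothetical defaults chosen for illustration only.
    """
    # prev[j] = best score for aligning the processed prefix of a with b[:j].
    prev = [j * gap for j in range(len(b) + 1)]
    for i in range(1, len(a) + 1):
        curr = [i * gap]  # aligning a[:i] against an empty prefix of b
        for j in range(1, len(b) + 1):
            pair = match if a[i - 1] == b[j - 1] else mismatch
            diag = prev[j - 1] + pair   # align a[i-1] with b[j-1]
            up = prev[j] + gap          # gap in b
            left = curr[j - 1] + gap    # gap in a
            curr.append(max(diag, up, left))
        prev = curr
    return prev[-1]
```

A higher score indicates more aligned positions with common properties; real tools typically use substitution matrices and more elaborate gap penalties instead of these fixed weights.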
Amino Acid Sequence
Molecular Sequence Data
Descriptions of specific amino acid, carbohydrate, or nucleotide sequences which have appeared in the published literature and/or are deposited in and maintained by databanks such as GENBANK, European Molecular Biology Laboratory (EMBL), National Biomedical Research Foundation (NBRF), or other sequence repositories.
Nuclear Magnetic Resonance, Biomolecular
NMR spectroscopy on small- to medium-size biological macromolecules. This is often used for structural investigation of proteins and nucleic acids, and often involves more than one isotope.
The study of crystal structure using X-RAY DIFFRACTION techniques. (McGraw-Hill Dictionary of Scientific and Technical Terms, 4th ed)
A rigorously mathematical analysis of energy relationships (heat, work, temperature, and equilibrium). It describes systems whose states are determined by thermal parameters, such as temperature, in addition to mechanical and electromagnetic parameters. (From Hawley's Condensed Chemical Dictionary, 12th ed)
Organic compounds that generally contain an amino (-NH2) and a carboxyl (-COOH) group. Twenty alpha-amino acids are the subunits which are polymerized to form proteins.
Sequence Homology, Amino Acid
Molecular Dynamics Simulation
Liquids that dissolve other substances (solutes), generally solids, without any change in chemical composition, for example, water containing sugar. (Grant & Hackh's Chemical Dictionary, 5th ed)
Hydrophobic and Hydrophilic Interactions
The thermodynamic interaction between a substance and WATER.
Magnetic Resonance Spectroscopy
The location of the atoms, groups or ions relative to one another in a molecule, as well as the number, type and location of covalent bonds.
Protein Structure, Quaternary
The characteristic 3-dimensional shape and arrangement of multimeric proteins (aggregates of more than one polypeptide chain).
Extensive collections, reputedly complete, of facts and data garnered from material of a specialized subject area and made available for analysis and application. The collection can be automated by various contemporary methods for retrieval. The concept should be differentiated from DATABASES, BIBLIOGRAPHIC which is restricted to collections of bibliographic references.
A basic enzyme that is present in saliva, tears, egg white, and many animal fluids. It functions as an antibacterial agent. The enzyme catalyzes the hydrolysis of 1,4-beta-linkages between N-acetylmuramic acid and N-acetyl-D-glucosamine residues in peptidoglycan and between N-acetyl-D-glucosamine residues in chitodextrin. EC 3.2.1.17.
A molecule that binds to another molecule, used especially to refer to a small molecule that binds specifically to a larger molecule, e.g., an antigen binding to an antibody, a hormone or neurotransmitter binding to a receptor, or a substrate or allosteric effector binding to an enzyme. Ligands are also molecules that donate or accept a pair of electrons to form a coordinate covalent bond with the central metal atom of a coordination complex. (From Dorland, 27th ed)
Theory and development of COMPUTER SYSTEMS which perform tasks that normally require human intelligence. Such tasks may include speech recognition, LEARNING; VISUAL PERCEPTION; MATHEMATICAL COMPUTING; reasoning, PROBLEM SOLVING, DECISION-MAKING, and translation of language.
Information Storage and Retrieval
Protein Interaction Mapping
Methods for determining interaction between PROTEINS.
Database Management Systems
Procedures by which protein structure and function are changed or created in vitro by altering existing or synthesizing new structural genes that direct the synthesis of proteins with sought-after properties. Such procedures may include the design of MOLECULAR MODELS of proteins using COMPUTER GRAPHICS or other molecular modeling techniques; site-specific mutagenesis (MUTAGENESIS, SITE-SPECIFIC) of existing genes; and DIRECTED MOLECULAR EVOLUTION techniques to create new genes.
Amino Acid Motifs
A conjugated protein which is the oxygen-transporting pigment of muscle. It is made up of one globin polypeptide chain and one heme group.
Protein Interaction Domains and Motifs
A clear, odorless, tasteless liquid that is essential for most animal and plant life and is an excellent solvent for many substances. The chemical formula is hydrogen oxide (H2O). (McGraw-Hill Dictionary of Scientific and Technical Terms, 4th ed)
The scattering of x-rays by matter, especially crystals, with accompanying variation in intensity due to interference effects. Analysis of the crystal structure of materials is performed by passing x-rays through them and registering the diffraction image of the rays (CRYSTALLOGRAPHY, X-RAY). (From McGraw-Hill Dictionary of Scientific and Technical Terms, 4th ed)
Monte Carlo Method
In statistics, a technique for numerically approximating the solution of a mathematical problem by studying the distribution of some random variable, often generated by a computer. The name alludes to the randomness characteristic of the games of chance played at the gambling casinos in Monte Carlo. (From Random House Unabridged Dictionary, 2d ed, 1993)
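A minimal sketch of the idea, assuming the classic textbook problem of estimating pi (chosen here only for illustration): random points are drawn in the unit square, and the fraction landing inside the quarter circle approximates pi/4. The function name and seed parameter are hypothetical.

```python
import random


def estimate_pi(n_samples: int, seed: int = 0) -> float:
    """Approximate pi from the distribution of random points in a unit square."""
    rng = random.Random(seed)  # seeded so that runs are repeatable
    inside = 0
    for _ in range(n_samples):
        x, y = rng.random(), rng.random()
        if x * x + y * y <= 1.0:  # point falls inside the quarter circle
            inside += 1
    return 4.0 * inside / n_samples
```

The estimate tightens as `n_samples` grows, with an error that shrinks roughly in proportion to the inverse square root of the sample count.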
The formation of crystalline substances from solutions or melts. (McGraw-Hill Dictionary of Scientific and Technical Terms, 4th ed)
Pattern Recognition, Automated
Members of the class of compounds composed of AMINO ACIDS joined together by peptide bonds between adjacent amino acids into linear, branched or cyclical structures. OLIGOPEPTIDES are composed of approximately 2-12 amino acids. Polypeptides are composed of approximately 13 or more amino acids. PROTEINS are linear polypeptides that are normally synthesized on RIBOSOMES.
A set of statistical methods used to group variables or observations into strongly inter-related subgroups. In epidemiology, it may be used to analyze a closely grouped series of events or cases of disease or other health-related phenomenon with well-defined distribution patterns in relation to time or place or both.
The property of objects that determines the direction of heat flow when they are placed in direct thermal contact. Temperature is a measure of the energy of the microscopic motions (vibrational and translational) of an object's constituent particles.
Neural Networks (Computer)
A computer architecture, implementable in either hardware or software, modeled after biological neural networks. Like the biological system in which the processing capability is a result of the interconnection strengths between arrays of nonlinear processing nodes, computerized neural networks, often called perceptrons or multilayer connectionist models, consist of neuron-like units. A homogeneous group of units makes up a layer. These networks are good at pattern recognition. They are adaptive, performing tasks by example, and thus are better for decision-making than are linear learning machines or cluster analysis. They do not require explicit programming.
A species of gram-negative, facultatively anaerobic, rod-shaped bacteria (GRAM-NEGATIVE FACULTATIVELY ANAEROBIC RODS) commonly found in the lower part of the intestine of warm-blooded animals. It is usually nonpathogenic, but some strains are known to produce DIARRHEA and pyogenic infections. Pathogenic strains (virotypes) are classified by their specific pathogenic mechanisms such as toxins (ENTEROTOXIGENIC ESCHERICHIA COLI), etc.
The measure of that part of the heat or energy of a system which is not available to perform work. Entropy increases in all natural (spontaneous and irreversible) processes. (From Dorland, 28th ed)
Reproducibility of Results
The statistical reproducibility of measurements (often in a clinical context), including the testing of instrumentation or techniques to obtain reproducible results. The concept includes reproducibility of physiological measurements, which may be used to develop rules to assess probability or prognosis, or response to a stimulus; reproducibility of occurrence of a condition; and reproducibility of experimental results.
Amino Acid Substitution
The naturally occurring or experimentally induced replacement of one or more AMINO ACIDS in a protein with another. If a functionally equivalent amino acid is substituted, the protein may retain wild-type activity. Substitution may also diminish, enhance, or eliminate protein function. Experimentally induced substitution is often used to study enzyme activities and binding site properties.
The extent to which an enzyme retains its structural conformation or its activity when subjected to storage, isolation, and purification or various other physical or chemical manipulations, including proteolytic enzymes and heat.
The first chemical element in the periodic table. It has the atomic symbol H, atomic number 1, and atomic weight [1.00784; 1.00811]. It exists, under normal conditions, as a colorless, odorless, tasteless, diatomic gas. Hydrogen ions are PROTONS. Besides the common H1 isotope, hydrogen exists as the stable isotope DEUTERIUM and the unstable, radioactive isotope TRITIUM.
Genetically engineered MUTAGENESIS at a specific site in the DNA molecule that introduces a base substitution, or an insertion or deletion.
Deuterium Exchange Measurement
A research technique to measure solvent exposed regions of molecules that is used to provide insight about PROTEIN CONFORMATION.
A stochastic process such that the conditional probability distribution for a state at any future instant, given the present state, is unaffected by any additional knowledge of the past history of the system.
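The defining property can be sketched as a simulation in which the next state is drawn using only the current state. The transition table format (a dict mapping each state to its successor probabilities), the function name, and the example state labels are assumptions made for illustration.

```python
import random


def simulate_chain(transitions, start, steps, rng=None):
    """Simulate a discrete-time chain; each step depends only on the current state.

    transitions: dict mapping state -> {next_state: probability} (hypothetical format).
    """
    rng = rng or random.Random()
    state = start
    path = [state]
    for _ in range(steps):
        next_states = list(transitions[state])
        weights = [transitions[state][s] for s in next_states]
        # Only `state` is consulted here; the earlier entries of `path` are never used.
        state = rng.choices(next_states, weights=weights, k=1)[0]
        path.append(state)
    return path


# Hypothetical two-state example.
example = {
    "folded": {"folded": 0.9, "unfolded": 0.1},
    "unfolded": {"folded": 0.5, "unfolded": 0.5},
}
trajectory = simulate_chain(example, "unfolded", 10, random.Random(0))
```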
An essential amino acid that is necessary for normal growth in infants and for NITROGEN balance in adults. It is a precursor of INDOLE ALKALOIDS in plants. It is a precursor of SEROTONIN (hence its use as an antidepressant and sleep aid). It can be a precursor to NIACIN, albeit inefficiently, in mammals.
Stable elementary particles having the smallest known negative charge, present in all elements; also called negatrons. Positively charged electrons are called positrons. The numbers, energies and arrangement of electrons around atomic nuclei determine the chemical identities of elements. Beams of electrons are called CATHODE RAYS.
Molecular Sequence Annotation
The addition of descriptive information about the function or structure of a molecular sequence to its MOLECULAR SEQUENCE DATA record.
Virulent bacteriophage and type species of the genus T4-like phages, in the family MYOVIRIDAE. It infects E. coli and is the best known of the T-even phages. Its virion contains linear double-stranded DNA, terminally redundant and circularly permuted.
Escherichia coli Proteins
Proteins obtained from ESCHERICHIA COLI.
Scattering, Small Angle
Scattering of a beam of electromagnetic or acoustic RADIATION, or particles, at small angles by particles or cavities whose dimensions are many times as large as the wavelength of the radiation or the de Broglie wavelength of the scattered particles. Also known as low angle scattering. (McGraw-Hill Dictionary of Scientific and Technical Terms, 6th ed) Small angle scattering (SAS) techniques, including small angle neutron (SANS), X-ray (SAXS), and light (SALS, or just LS) scattering, are used to characterize objects on a nanoscale.
Proteins prepared by recombinant DNA technology.
The molecular designing of drugs for specific purposes (such as DNA-binding, enzyme inhibition, anti-cancer efficacy, etc.) based on knowledge of molecular properties such as activity of functional groups, molecular geometry, and electronic structure, and also on information cataloged on analogous molecules. Drug design is generally computer-assisted molecular modeling and does not include pharmacokinetics, dosage analysis, or drug administration analysis.
Spectroscopy, Fourier Transform Infrared
Data Interpretation, Statistical
The insertion of recombinant DNA molecules from prokaryotic and/or eukaryotic sources into a replicating vehicle, such as a plasmid or virus vector, and the introduction of the resultant hybrid molecules into recipient cells without altering the viability of those cells.
Rhodopsins found in the PURPLE MEMBRANE of halophilic archaea such as HALOBACTERIUM HALOBIUM. Bacteriorhodopsins function as energy transducers, converting light energy into electrochemical energy via PROTON PUMPS.
Macrocyclic polyethers with the repeating unit of (-CH2-CH2-O)n where n is greater than 2 and some oxygens may be replaced by nitrogen, sulfur or phosphorus. These compounds are useful for coordinating CATIONS. The nomenclature uses a prefix to indicate the size of the ring and a suffix for the number of heteroatoms.
The protein complement of an organism coded for by its genome.
Repetitive Sequences, Amino Acid
A sequential pattern of amino acids occurring more than once in the same protein sequence.
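One simple way to detect such patterns, sketched under the assumption that a repeat is an exact fixed-length substring (k-mer) occurring more than once, is to count all overlapping k-mers. The function name and the example sequence are hypothetical.

```python
from collections import Counter


def repeated_motifs(sequence: str, k: int) -> dict:
    """Return each length-k substring that occurs more than once, with its count."""
    counts = Counter(sequence[i:i + k] for i in range(len(sequence) - k + 1))
    return {motif: n for motif, n in counts.items() if n > 1}


# Hypothetical collagen-like fragment with a repeating triplet.
repeats = repeated_motifs("GPPGPPGAA", 3)
```

Exact matching misses degenerate repeats, which in practice are usually found with profile or alignment-based methods.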
The protein components of a number of complexes, such as enzymes (APOENZYMES), ferritin (APOFERRITINS), or lipoproteins (APOLIPOPROTEINS).
Specifications and instructions applied to the software.
Databases as Topic
A deoxyribonucleotide polymer that is the primary genetic material of all cells. Eukaryotic and prokaryotic organisms normally contain DNA in a double-stranded state, yet several important biological processes transiently involve single-stranded regions. DNA, which consists of a polysugar-phosphate backbone possessing projections of purines (adenine and guanine) and pyrimidines (thymine and cytosine), forms a double helix that is held together by hydrogen bonds between these purines and pyrimidines (adenine to thymine and guanine to cytosine).
Organic compounds containing the -CO-NH2 radical. Amides are derived from acids by replacement of -OH by -NH2 or from ammonia by the replacement of H by an acyl group. (From Grant & Hackh's Chemical Dictionary, 5th ed)
Analysis based on the mathematical function first formulated by Jean-Baptiste-Joseph Fourier in 1807. The function, known as the Fourier transform, describes the sinusoidal pattern of any fluctuating pattern in the physical world in terms of its amplitude and its phase. It has broad applications in biomedicine, e.g., analysis of the x-ray crystallography data pivotal in identifying the double helical nature of DNA and in analysis of other molecules, including viruses, and the modified back-projection algorithm universally used in computerized tomography imaging, etc. (From Segen, The Dictionary of Modern Medicine, 1992)
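The amplitude and phase description can be illustrated with a direct discrete Fourier transform of a sampled signal. This is a minimal, unoptimized sketch (quadratic in the signal length); the function names are assumptions, and production code would normally use a fast Fourier transform library instead.

```python
import cmath
import math


def dft(signal):
    """Discrete Fourier transform computed directly from its definition."""
    n = len(signal)
    return [
        sum(x * cmath.exp(-2j * cmath.pi * k * t / n) for t, x in enumerate(signal))
        for k in range(n)
    ]


def amplitude_and_phase(signal):
    """Return (amplitude, phase) for every frequency component of the signal."""
    return [(abs(c), cmath.phase(c)) for c in dft(signal)]


# One full cosine cycle sampled at 8 points.
samples = [math.cos(2 * math.pi * t / 8) for t in range(8)]
components = amplitude_and_phase(samples)
```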
The act of testing the software for compliance with a standard.
Physical motion, i.e., a change in position of a body or subject as a result of an external force. It is distinguished from MOVEMENT, a process resulting from biological activity.
An atom or group of atoms that have a positive or negative electric charge due to a gain (negative charge) or loss (positive charge) of one or more electrons. Atoms with a positive charge are known as CATIONS; those with a negative charge are ANIONS.
Presence of warmth or heat or a temperature notably higher than an accustomed norm.
The transfer of energy of a given form among different scales of motion. (From McGraw-Hill Dictionary of Scientific and Technical Terms, 6th ed). It includes the transfer of kinetic energy and the transfer of chemical energy. The transfer of chemical energy from one molecule to another depends on the proximity of the molecules, so it is often used in techniques to measure distance, such as FORSTER RESONANCE ENERGY TRANSFER.
Proteins that have one or more tightly bound metal ions forming part of their structure. (Dorland, 28th ed)
Spectrum Analysis, Raman
National Institute of General Medical Sciences (U.S.)
Component of the NATIONAL INSTITUTES OF HEALTH. It conducts and supports basic biomedical research that is not targeted to specific diseases and funds studies on genes, proteins, and cells, as well as on fundamental processes like communication within and between cells and metabolism. It was established in 1962.
A highly conserved 76-amino acid peptide universally found in eukaryotic cells that functions as a marker for intracellular PROTEIN TRANSPORT and degradation. Ubiquitin becomes activated through a series of complicated steps and forms an isopeptide bond to lysine residues of specific proteins within the cell. These "ubiquitinated" proteins can be recognized and degraded by proteasomes or be transported to specific compartments within the cell.
Deuterium
The stable isotope of hydrogen. It has one neutron and one proton in the nucleus.
A set of genes descended by duplication and variation from some ancestral gene. Such genes may be clustered together on the same chromosome or dispersed on different chromosomes. Examples of multigene families include those that encode the hemoglobins, immunoglobulins, histocompatibility antigens, actins, tubulins, keratins, collagens, heat shock proteins, salivary glue proteins, chorion proteins, cuticle proteins, yolk proteins, and phaseolins, as well as histones, ribosomal RNA, and transfer RNA genes. The latter three are examples of reiterated genes, where hundreds of identical genes are present in a tandem array. (King & Stanfield, A Dictionary of Genetics, 4th ed)
Databases, Nucleic Acid
The genetic complement of an organism, including all of its GENES, as represented in its DNA, or in some cases, its RNA.
Particles consisting of aggregates of molecules held loosely together by secondary bonds. The surface of micelles is usually composed of amphipathic compounds that are oriented in a way that minimizes the energy of interaction between the micelle and its environment. Liquids that contain large numbers of suspended micelles are referred to as EMULSIONS.
A method for determining points of contact between interacting proteins or binding sites of proteins to nucleic acids. Protein footprinting utilizes a protein cutting reagent or protease. Protein cleavage is inhibited where the proteins, or nucleic acids and protein, contact each other. After completion of the cutting reaction, the remaining peptide fragments are analyzed by electrophoresis.
Sequence Analysis, DNA
A multistage process that includes cloning, physical mapping, subcloning, determination of the DNA SEQUENCE, and information analysis.
A chemical reaction in which an electron is transferred from one molecule to another. The electron-donating molecule is the reducing agent or reductant; the electron-accepting molecule is the oxidizing agent or oxidant. Reducing and oxidizing agents function as conjugate reductant-oxidant pairs or redox pairs (Lehninger, Principles of Biochemistry, 1982, p471).
A set of three nucleotides in a protein coding sequence that specifies individual amino acids or a termination signal (CODON, TERMINATOR). Most codons are universal, but some organisms do not produce the transfer RNAs (RNA, TRANSFER) complementary to all codons. These codons are referred to as unassigned codons (CODONS, NONSENSE).
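How triplets map to amino acids or a termination signal can be sketched with a lookup table. The example below assumes the standard genetic code (bases ordered T, C, A, G, with `*` marking a terminator codon) and a DNA coding-strand input read in frame from its first base; organisms with variant codes would need a different table. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, one letter per codon in TCAG order; '*' = terminator.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence codon by codon, stopping at a terminator."""
    protein = []
    # Step through complete triplets only; a trailing partial codon is ignored.
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)
```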
Electron Spin Resonance Spectroscopy
A technique applicable to the wide variety of substances which exhibit paramagnetism because of the magnetic moments of unpaired electrons. The spectra are useful for detection and identification, for determination of electron structure, for study of interactions between molecules, and for measurement of nuclear spins and moments. (From McGraw-Hill Encyclopedia of Science and Technology, 7th edition) Electron nuclear double resonance (ENDOR) spectroscopy is a variant of the technique which can give enhanced resolution. Electron spin resonance analysis can now be used in vivo, including imaging applications such as MAGNETIC RESONANCE IMAGING.
Electropositive chemical elements characterized by ductility, malleability, luster, and conductance of heat and electricity. They can replace the hydrogen of an acid and form bases with hydroxyl radicals. (Grant & Hackh's Chemical Dictionary, 5th ed)
Protein Processing, Post-Translational
Any of various enzymatically catalyzed post-translational modifications of PEPTIDES or PROTEINS in the cell of origin. These modifications include carboxylation; HYDROXYLATION; ACETYLATION; PHOSPHORYLATION; METHYLATION; GLYCOSYLATION; ubiquitination; oxidation; proteolysis; and crosslinking and result in changes in molecular weight and electrophoretic mobility.
Molecular Docking Simulation
A computer simulation technique that is used to model the interaction between two molecules. Typically the docking simulation measures the interactions of a small molecule or ligand with a part of a larger molecule such as a protein.
Electrophoresis, Polyacrylamide Gel
Electrically neutral elementary particles found in all atomic nuclei except light hydrogen; the mass is approximately equal to that of the proton and electron combined and they are unstable when isolated from the nucleus, undergoing beta decay. Slow, thermal, epithermal, and fast neutrons refer to the energy levels with which the neutrons are ejected from heavier nuclei during their decay.
A compound formed in the liver from ammonia produced by the deamination of amino acids. It is the principal end product of protein catabolism and constitutes about one half of the total urinary solids.
Compounds and molecular complexes that consist of very large numbers of atoms and are generally over 500 kDa in size. In biological systems macromolecular substances usually can be visualized using ELECTRON MICROSCOPY and are distinguished from ORGANELLES by the lack of a membrane structure.