Protein Interaction Domains and Motifs
Protein Interaction Mapping
Methods for determining interaction between PROTEINS.
Protein Structure, Tertiary
The level of protein structure in which combinations of secondary protein structures (alpha helices, beta sheets, loop regions, and motifs) pack together to form folded shapes called domains. Disulfide bridges between cysteines in two different parts of the polypeptide chain along with other interactions between the chains play a role in the formation and stabilization of tertiary structure. Small proteins usually consist of only one domain but larger proteins may contain a number of domains connected by segments of polypeptide chain which lack regular secondary structure.
Molecular Sequence Data
Descriptions of specific amino acid, carbohydrate, or nucleotide sequences which have appeared in the published literature and/or are deposited in and maintained by databanks such as GENBANK, European Molecular Biology Laboratory (EMBL), National Biomedical Research Foundation (NBRF), or other sequence repositories.
Amino Acid Sequence
Two-Hybrid System Techniques
Screening techniques first developed in yeast to identify genes encoding interacting proteins. Variations are used to evaluate interplay between proteins and other molecules. Two-hybrid techniques refer to analysis for protein-protein interactions, one-hybrid for DNA-protein interactions, three-hybrid interactions for RNA-protein interactions or ligand-based interactions. Reverse n-hybrid techniques refer to analysis for mutations or other small molecules that dissociate known interactions.
Amino Acid Motifs
src Homology Domains
Regions of AMINO ACID SEQUENCE similarity in the SRC-FAMILY TYROSINE KINASES that fold into specific functional tertiary structures. The SH1 domain is a CATALYTIC DOMAIN. SH2 and SH3 domains are protein interaction domains. SH2 usually binds PHOSPHOTYROSINE-containing proteins and SH3 interacts with CYTOSKELETAL PROTEINS.
Sequence Homology, Amino Acid
The degree of similarity between sequences of amino acids. This information is useful for the analyzing genetic relatedness of proteins and species.
Protein Interaction Maps
Graphs representing sets of measurable, non-covalent physical contacts with specific PROTEINS in living organisms or in cells.
Adaptor Proteins, Signal Transducing
A broad category of carrier proteins that play a role in SIGNAL TRANSDUCTION. They generally contain several modular domains, each of which having its own binding activity, and act by forming complexes with other intracellular-signaling molecules. Signal-transducing adaptor proteins lack enzyme activity, however their activity can be modulated by other signal-transducing enzymes
Proteins found in the nucleus of a cell. Do not confuse with NUCLEOPROTEINS which are proteins conjugated with nucleic acids, that are not necessarily present in the nucleus.
Linear POLYPEPTIDES that are synthesized on RIBOSOMES and may be further modified, crosslinked, cleaved, or assembled into complex proteins with several subunits. The specific sequence of AMINO ACIDS determines the shape the polypeptide will take, during PROTEIN FOLDING, and the function of the protein.
Endogenous substances, usually proteins, which are effective in the initiation, stimulation, or termination of the genetic transcription process.
Transport proteins that carry specific substances in the blood or across cell membranes.
The intracellular transfer of information (biological activation/inhibition) through a signal pathway. In each signal transduction system, an activation/inhibition signal from a biologically active molecule (hormone, neurotransmitter) is mediated via the coupling of a receptor/enzyme to a second messenger system or to an ion channel. Signal transduction plays an important role in activating cellular functions, cell differentiation, and cell proliferation. Examples of signal transduction systems are the GAMMA-AMINOBUTYRIC ACID-postsynaptic receptor-calcium ion channel system, the receptor-mediated T-cell activation pathway, and the receptor-mediated activation of phospholipases. Those coupled to membrane depolarization or intracellular release of calcium include the receptor-mediated activation of cytotoxic functions in granulocytes and the synaptic potentiation of protein kinase activation. Some signal transduction pathways may be part of larger signal transduction pathways; for example, protein kinase activation is part of the platelet activation signal pathway.
Recombinant Fusion Proteins
Recombinant proteins produced by the GENETIC TRANSLATION of fused genes formed by the combination of NUCLEIC ACID REGULATORY SEQUENCES of one or more genes with the protein coding sequences of one or more genes.
The arrangement of two or more amino acid or base sequences from an organism or organisms in such a way as to align areas of the sequences sharing common properties. The degree of relatedness or homology between the sequences is predicted computationally or statistically based on weights assigned to the elements aligned between the sequences. This in turn can serve as a potential indicator of the genetic relatedness between the organisms.
The characteristic 3-dimensional shape of a protein, including the secondary, supersecondary (motifs), tertiary (domains) and quaternary structure of the peptide chain. PROTEIN STRUCTURE, QUATERNARY describes the conformation assumed by multimeric proteins (aggregates of more than one polypeptide chain).
Protein interaction domains of about 70-90 amino acid residues, named after a common structure found in PSD-95, Discs Large, and Zona Occludens 1 proteins. PDZ domains are involved in the recruitment and interaction of proteins, and aid the formation of protein scaffolds and signaling networks. This is achieved by sequence-specific binding between a PDZ domain in one protein and a PDZ motif in another protein.
Proteins prepared by recombinant DNA technology.
Protein Structure, Secondary
The level of protein structure in which regular hydrogen-bond interactions within contiguous stretches of polypeptide chain give rise to alpha helices, beta strands (which align to form beta sheets) or other types of coils. This is the first folding level of protein conformation.
The uptake of naked or purified DNA by CELLS, usually meaning the process as it occurs in eukaryotic cells. It is analogous to bacterial transformation (TRANSFORMATION, BACTERIAL) and both are routinely employed in GENE TRANSFER TECHNIQUES.
A field of biology concerned with the development of techniques for the collection and manipulation of biological data, and the use of such data to make biological discoveries or predictions. This field encompasses all computational methods and theories for solving biological problems including manipulation of models and datasets.
The insertion of recombinant DNA molecules from prokaryotic and/or eukaryotic sources into a replicating vehicle, such as a plasmid or virus vector, and the introduction of the resultant hybrid molecules into recipient cells without altering the viability of those cells.
The first continuously cultured human malignant CELL LINE, derived from the cervical carcinoma of Henrietta Lacks. These cells are used for VIRUS CULTIVATION and antitumor drug screening assays.
Saccharomyces cerevisiae Proteins
Macromolecular complexes formed from the association of defined protein subunits.
A deoxyribonucleotide polymer that is the primary genetic material of all cells. Eukaryotic and prokaryotic organisms normally contain DNA in a double-stranded state, yet several important biological processes transiently involve single-stranded regions. DNA, which consists of a polysugar-phosphate backbone possessing projections of purines (adenine and guanine) and pyrimidines (thymine and cytosine), forms a double helix that is held together by hydrogen bonds between these purines and pyrimidines (adenine to thymine and guanine to cytosine).
Process of generating a genetic MUTATION. It may occur spontaneously or be induced by MUTAGENS.
Members of the class of compounds composed of AMINO ACIDS joined together by peptide bonds between adjacent amino acids into linear, branched or cyclical structures. OLIGOPEPTIDES are composed of approximately 2-12 amino acids. Polypeptides are composed of approximately 13 or more amino acids. PROTEINS are linear polypeptides that are normally synthesized on RIBOSOMES.
The process of moving proteins from one cellular compartment (including extracellular) to another by various sorting and transport mechanisms such as gated transport, protein translocation, and vesicular transport.
A species of gram-negative, facultatively anaerobic, rod-shaped bacteria (GRAM-NEGATIVE FACULTATIVELY ANAEROBIC RODS) commonly found in the lower part of the intestine of warm-blooded animals. It is usually nonpathogenic, but some strains are known to produce DIARRHEA and pyogenic infections. Pathogenic strains (virotypes) are classified by their specific pathogenic mechanisms such as toxins (ENTEROTOXIGENIC ESCHERICHIA COLI), etc.
Within a eukaryotic cell, a membrane-limited body which contains chromosomes and one or more nucleoli (CELL NUCLEOLUS). The nuclear membrane consists of a double unit-type membrane which is perforated by a number of pores; the outermost membrane is continuous with the ENDOPLASMIC RETICULUM. A cell may contain more than one nucleus. (From Singleton & Sainsbury, Dictionary of Microbiology and Molecular Biology, 2d ed)
Amino Acid Substitution
The naturally occurring or experimentally induced replacement of one or more AMINO ACIDS in a protein with another. If a functionally equivalent amino acid is substituted, the protein may retain wild-type activity. Substitution may also diminish, enhance, or eliminate protein function. Experimentally induced substitution is often used to study enzyme activities and binding site properties.
CELL LINES derived from the CV-1 cell line by transformation with a replication origin defective mutant of SV40 VIRUS, which codes for wild type large T antigen (ANTIGENS, POLYOMAVIRUS TRANSFORMING). They are used for transfection and cloning. (The CV-1 cell line was derived from the kidney of an adult male African green monkey (CERCOPITHECUS AETHIOPS).)
Extrachromosomal, usually CIRCULAR DNA molecules that are self-replicating and transferable from one organism to another. They are found in a variety of bacterial, archaeal, fungal, algal, and plant species. They are used in GENETIC ENGINEERING as CLONING VECTORS.
A molecule that binds to another molecule, used especially to refer to a small molecule that binds specifically to a larger molecule, e.g., an antigen binding to an antibody, a hormone or neurotransmitter binding to a receptor, or a substrate or allosteric effector binding to an enzyme. Ligands are also molecules that donate or accept a pair of electrons to form a coordinate covalent bond with the central metal atom of a coordination complex. (From Dorland, 27th ed)
Promoter Regions, Genetic
DNA sequences which are recognized (directly or indirectly) and bound by a DNA-dependent RNA polymerase during the initiation of transcription. Highly conserved sequences within the promoter include the Pribnow box in bacteria and the TATA BOX in eukaryotes.
The part of a cell that contains the CYTOSOL and small structures excluding the CELL NUCLEUS; MITOCHONDRIA; and large VACUOLES. (Glick, Glossary of Biochemistry and Molecular Biology, 1990)
Sequence Analysis, Protein
A process that includes the determination of AMINO ACID SEQUENCE of a protein (or peptide, oligopeptide or peptide fragment) and the information analysis of the sequence.
The first DNA-binding protein motif to be recognized. Helix-turn-helix motifs were originally identified in bacterial proteins but have since been found in hundreds of DNA-BINDING PROTEINS from both eukaryotes and prokaryotes. They are constructed from two alpha helices connected by a short extended chain of amino acids, which constitute the "turn." The two helices are held at a fixed angle, primarily through interactions between the two helices. (From Alberts et al., Molecular Biology of the Cell, 3d ed, p408-9)
Gene Expression Regulation
Surface Plasmon Resonance
A biosensing technique in which biomolecules capable of binding to specific analytes or ligands are first immobilized on one side of a metallic film. Light is then focused on the opposite side of the film to excite the surface plasmons, that is, the oscillations of free electrons propagating along the film's surface. The refractive index of light reflecting off this surface is measured. When the immobilized biomolecules are bound by their ligands, an alteration in surface plasmons on the opposite side of the film is created which is directly proportional to the change in bound, or adsorbed, mass. Binding is measured by changes in the refractive index. The technique is used to study biomolecular interactions, such as antigen-antibody binding.
Nuclear Magnetic Resonance, Biomolecular
NMR spectroscopy on small- to medium-size biological macromolecules. This is often used for structural investigation of proteins and nucleic acids, and often involves more than one isotope.
Different forms of a protein that may be produced from different GENES, or from the same gene by ALTERNATIVE SPLICING.
Compounds and molecular complexes that consist of very large numbers of atoms and are generally over 500 kDa in size. In biological systems macromolecular substances usually can be visualized using ELECTRON MICROSCOPY and are distinguished from ORGANELLES by the lack of a membrane structure.
A theoretical representative nucleotide or amino acid sequence in which each nucleotide or amino acid is the one which occurs most frequently at that site in the different sequences which occur in nature. The phrase also refers to an actual sequence which approximates the theoretical consensus. A known CONSERVED SEQUENCE set is represented by a consensus sequence. Commonly observed supersecondary protein structures (AMINO ACID MOTIFS) are often formed by conserved sequences.
The lipid- and protein-containing, selectively permeable membrane that surrounds the cytoplasm in prokaryotic and eukaryotic cells.
A cell line generated from human embryonic kidney cells that were transformed with human adenovirus type 5.
Motifs in DNA- and RNA-binding proteins whose amino acids are folded into a single structural unit around a zinc atom. In the classic zinc finger, one zinc atom is bound to two cysteines and two histidines. In between the cysteines and histidines are 12 residues which form a DNA binding fingertip. By variations in the composition of the sequences in the fingertip and the number and spacing of tandem repeats of the motif, zinc fingers can form a large number of different sequence specific binding sites.
Protein Structure, Quaternary
The characteristic 3-dimensional shape and arrangement of multimeric proteins (aggregates of more than one polypeptide chain).
Intracellular Signaling Peptides and Proteins
Proteins and peptides that are involved in SIGNAL TRANSDUCTION within the cell. Included here are peptides and proteins that regulate the activity of TRANSCRIPTION FACTORS and cellular processes in response to signals from CELL SURFACE RECEPTORS. Intracellular signaling peptide and proteins may be part of an enzymatic signaling cascade or act through binding to and modifying the action of other signaling factors.
Green Fluorescent Proteins
RNA sequences that serve as templates for protein synthesis. Bacterial mRNAs are generally primary transcripts in that they do not require post-transcriptional processing. Eukaryotic mRNA is synthesized in the nucleus and must be exported to the cytoplasm for translation. Most eukaryotic mRNAs have a sequence of polyadenylic acid at the 3' end, referred to as the poly(A) tail. The function of this tail is not known for certain, but it may play a role in the export of mature mRNA from the nucleus as well as in helping stabilize some mRNA molecules by retarding their degradation in the cytoplasm.
Cell Cycle Proteins
Proteins that control the CELL DIVISION CYCLE. This family of proteins includes a wide variety of classes, including CYCLIN-DEPENDENT KINASES, mitogen-activated kinases, CYCLINS, and PHOSPHOPROTEIN PHOSPHATASES as well as their putative substrates such as chromatin-associated proteins, CYTOSKELETAL PROTEINS, and TRANSCRIPTION FACTORS.
Catalyzes the ATP-dependent PHOSPHORYLATION of GMP to generate GDP and ADP.
Genes whose expression is easily detectable and therefore used to study promoter activity at many positions in a target genome. In recombinant DNA technology, these genes may be attached to a promoter region of interest.
A process whereby multiple RNA transcripts are generated from a single gene. Alternative splicing involves the splicing together of other possible sets of EXONS during the processing of some, but not all, transcripts of the gene. Thus a particular exon may be connected to any one of several alternative exons to form a mature RNA. The alternative forms of mature MESSENGER RNA produce PROTEIN ISOFORMS in which one part of the isoforms is common while the other parts are different.
A member of the p300-CBP transcription factor family that was initially identified as a binding partner for CAMP RESPONSE ELEMENT-BINDING PROTEIN. Mutations in CREB-binding protein are associated with RUBINSTEIN-TAYBI SYNDROME.
Nuclear Localization Signals
Short, predominantly basic amino acid sequences identified as nuclear import signals for some proteins. These sequences are believed to interact with specific receptors at the NUCLEAR PORE.
Major constituent of the cytoskeleton found in the cytoplasm of eukaryotic cells. They form a flexible framework for the cell, provide attachment points for organelles and formed bodies, and make communication between parts of the cell possible.
LIM Domain Proteins
A large class of structurally-related proteins that contain one or more LIM zinc finger domains. Many of the proteins in this class are involved in intracellular signaling processes and mediate their effects via LIM domain protein-protein interactions. The name LIM is derived from the first three proteins in which the motif was found: LIN-11, Isl1 and Mec-3.
Fluorescence Resonance Energy Transfer
A type of FLUORESCENCE SPECTROSCOPY using two FLUORESCENT DYES with overlapping emission and absorption spectra, which is used to indicate proximity of labeled molecules. This technique is useful for studying interactions of molecules and PROTEIN FOLDING.
Reagents with two reactive groups, usually at opposite ends of the molecule, that are capable of reacting with and thereby forming bridges between side chains of amino acids in proteins; the locations of naturally reactive areas within proteins can thereby be identified; may also be used for other macromolecules, like glycoproteins, nucleic acids, or other.
Electrophoresis, Polyacrylamide Gel
A diverse class of enzymes that interact with UBIQUITIN-CONJUGATING ENZYMES and ubiquitination-specific protein substrates. Each member of this enzyme group has its own distinct specificity for a substrate and ubiquitin-conjugating enzyme. Ubiquitin-protein ligases exist as both monomeric proteins multiprotein complexes.
CELL LINE derived from the ovary of the Chinese hamster, Cricetulus griseus (CRICETULUS). The species is a favorite for cytogenetic studies because of its small chromosome number. The cell line has provided model systems for the study of genetic alterations in cultured mammalian cells.
DNA-binding motifs formed from two alpha-helixes which intertwine for about eight turns into a coiled coil and then bifurcate to form Y shaped structures. Leucines occurring in heptad repeats end up on the same sides of the helixes and are adjacent to each other in the stem of the Y (the "zipper" region). The DNA-binding residues are located in the bifurcated region of the Y.
Nucleic Acid Conformation
A mutation in which a codon is mutated to one directing the incorporation of a different amino acid. This substitution may result in an inactive or unstable product. (From A Dictionary of Genetics, King & Stansfield, 5th ed)
Active Transport, Cell Nucleus
Cell lines whose original growing procedure consisted being transferred (T) every 3 days and plated at 300,000 cells per plate (J Cell Biol 17:299-313, 1963). Lines have been developed using several different strains of mice. Tissues are usually fibroblasts derived from mouse embryos but other types and sources have been developed as well. The 3T3 lines are valuable in vitro host systems for oncogenic virus transformation studies, since 3T3 cells possess a high sensitivity to CONTACT INHIBITION.
Nuclear Receptor Co-Repressor 1
A nuclear protein that regulates the expression of genes involved in a diverse array of processes related to metabolism and reproduction. The protein contains three nuclear receptor interaction domains and three repressor domains and is closely-related in structure to NUCLEAR RECEPTOR CO-REPRESSOR 2.
A polynucleotide consisting essentially of chains with a repeating backbone of phosphate and ribose units to which nitrogenous bases are attached. RNA is unique among biological macromolecules in that it can encode genetic information, serve as an abundant structural component of cells, and also possesses catalytic activity. (Rieger et al., Glossary of Genetics: Classical and Molecular, 5th ed)
Receptors, Cytoplasmic and Nuclear
Intracellular receptors that can be found in the cytoplasm or in the nucleus. They bind to extracellular signaling molecules that migrate through or are transported across the CELL MEMBRANE. Many members of this class of receptors occur in the cytoplasm and are transported to the CELL NUCLEUS upon ligand-binding where they signal via DNA-binding and transcription regulation. Also included in this category are receptors found on INTRACELLULAR MEMBRANES that act via mechanisms similar to CELL SURFACE RECEPTORS.
Caenorhabditis elegans Proteins
Gene Expression Regulation, Developmental
Escherichia coli Proteins
Proteins obtained from ESCHERICHIA COLI.
Repetitive Sequences, Amino Acid
A rigorously mathematical analysis of energy relationships (heat, work, temperature, and equilibrium). It describes systems whose states are determined by thermal parameters, such as temperature, in addition to mechanical and electromagnetic parameters. (From Hawley's Condensed Chemical Dictionary, 12th ed)
Nuclear Receptor Coactivator 2
Tumor Cells, Cultured
Cells grown in vitro from neoplastic tissue. If they can be established as a TUMOR CELL LINE, they can be propagated in cell culture indefinitely.
Gene Regulatory Networks
Interacting DNA-encoded regulatory subsystems in the GENOME that coordinate input from activator and repressor TRANSCRIPTION FACTORS during development, cell differentiation, or in response to environmental cues. The networks function to ultimately specify expression of particular sets of GENES for specific conditions, times, or locations.
Analysis of PEPTIDES that are generated from the digestion or fragmentation of a protein or mixture of PROTEINS, by ELECTROPHORESIS; CHROMATOGRAPHY; or MASS SPECTROMETRY. The resulting peptide fingerprints are analyzed for a variety of purposes including the identification of the proteins in a sample, GENETIC POLYMORPHISMS, patterns of gene expression, and patterns diagnostic for diseases.
A set of statistical methods used to group variables or observations into strongly inter-related subgroups. In epidemiology, it may be used to analyze a closely grouped series of events or cases of disease or other health-related phenomenon with well-defined distribution patterns in relation to time or place or both.
Structural Homology, Protein
A large family of signal-transducing adaptor proteins present in wide variety of eukaryotes. They are PHOSPHOSERINE and PHOSPHOTHREONINE binding proteins involved in important cellular processes including SIGNAL TRANSDUCTION; CELL CYCLE control; APOPTOSIS; and cellular stress responses. 14-3-3 proteins function by interacting with other signal-transducing proteins and effecting changes in their enzymatic activity and subcellular localization. The name 14-3-3 derives from numerical designations used in the original fractionation patterns of the proteins.
Nuclear Receptor Coactivator 1
Regulatory proteins that act as molecular switches. They control a wide range of biological processes including: receptor signaling, intracellular signal transduction pathways, and protein synthesis. Their activity is regulated by factors that control their ability to bind to and hydrolyze GTP to GDP. EC 3.6.1.-.
Cell Line, Tumor
A cell line derived from cultured tumor cells.
The restriction of a characteristic behavior, anatomical structure or physical system, such as immune response; metabolic response, or gene or gene variant to the members of one species. It refers to that property which differentiates one species from another but it is also used for phylogenetic levels higher or lower than the species.
Polymerase Chain Reaction
In vitro method for producing large amounts of specific DNA or RNA fragments of defined length and sequence from small amounts of short oligonucleotide flanking sequences (primers). The essential steps include thermal denaturation of the double-stranded target molecules, annealing of the primers to their complementary sequences, and extension of the annealed primers by enzymatic synthesis with DNA polymerase. The reaction is efficient, specific, and extremely sensitive. Uses for the reaction include disease diagnosis, detection of difficult-to-isolate pathogens, mutation analysis, genetic testing, DNA sequencing, and analyzing evolutionary relationships.
Components of a cell produced by various separation techniques which, though they disrupt the delicate anatomy of a cell, preserve the structure and physiology of its functioning constituents for biochemical and ultrastructural analysis. (From Alberts et al., Molecular Biology of the Cell, 2d ed, p163)
Proteins whose abnormal expression (gain or loss) are associated with the development, growth, or progression of NEOPLASMS. Some neoplasm proteins are tumor antigens (ANTIGENS, NEOPLASM), i.e. they induce an immune reaction to their tumor. Many neoplasm proteins have been characterized and are used as tumor markers (BIOMARKERS, TUMOR) when they are detectable in cells and body fluids as monitors for the presence or growth of tumors. Abnormal expression of ONCOGENE PROTEINS is involved in neoplastic transformation, whereas the loss of expression of TUMOR SUPPRESSOR PROTEINS is involved with the loss of growth control and progression of the neoplasm.
Proteins that catalyze the unwinding of duplex DNA during replication by binding cooperatively to single-stranded regions of DNA or to short regions of duplex DNA that are undergoing transient opening. In addition DNA helicases are DNA-dependent ATPases that harness the free energy of ATP hydrolysis to translocate DNA strands.
Adaptor Proteins, Vesicular Transport
A class of proteins involved in the transport of molecules via TRANSPORT VESICLES. They perform functions such as binding to the cell membrane, capturing cargo molecules and promoting the assembly of CLATHRIN. The majority of adaptor proteins exist as multi-subunit complexes, however monomeric varieties have also been found.
Sequence Analysis, DNA
A multistage process that includes cloning, physical mapping, subcloning, determination of the DNA SEQUENCE, and information analysis.
The interaction of two or more substrates or ligands with the same binding site. The displacement of one by the other is used in quantitative and selective affinity measurements.
Protein Processing, Post-Translational
Any of various enzymatically catalyzed post-translational modifications of PEPTIDES or PROTEINS in the cell of origin. These modifications include carboxylation; HYDROXYLATION; ACETYLATION; PHOSPHORYLATION; METHYLATION; GLYCOSYLATION; ubiquitination; oxidation; proteolysis; and crosslinking and result in changes in molecular weight and electrophoretic motility.
Magnetic Resonance Spectroscopy
The commonest and widest ranging species of the clawed "frog" (Xenopus) in Africa. This species is used extensively in research. There is now a significant population in California derived from escaped laboratory animals.
A gene silencing phenomenon whereby specific dsRNAs (RNA, DOUBLE-STRANDED) trigger the degradation of homologous mRNA (RNA, MESSENGER). The specific dsRNAs are processed into SMALL INTERFERING RNA (siRNA) which serves as a guide for cleavage of the homologous mRNA in the RNA-INDUCED SILENCING COMPLEX. DNA METHYLATION may also be triggered during this process.