TATA-Box Binding Protein
A general transcription factor that plays a major role in the activation of eukaryotic genes transcribed by RNA POLYMERASES. It binds specifically to the TATA BOX promoter element, which lies close to the position of transcription initiation in RNA transcribed by RNA POLYMERASE II. Although considered a principal component of TRANSCRIPTION FACTOR TFIID it also takes part in general transcription factor complexes involved in RNA POLYMERASE I and RNA POLYMERASE III transcription.
TATA Box
A conserved A-T rich sequence which is contained in promoters for RNA polymerase II. The segment is seven base pairs long and the nucleotides most commonly found are TATAAAA.
Transcription Factor TFIID
The major sequence-specific DNA-binding component involved in the activation of transcription of RNA POLYMERASE II. It was originally described as a complex of TATA-BOX BINDING PROTEIN and TATA-BINDING PROTEIN ASSOCIATED FACTORS. It is now know that TATA BOX BINDING PROTEIN-LIKE PROTEINS may take the place of TATA-box binding protein in the complex.
Transcription Factor TFIIA
An RNA POLYMERASE II specific transcription factor. It may play a role in transcriptional activation of gene expression by interacting with the TATA-BOX BINDING PROTEIN component of TRANSCRIPTION FACTOR TFIID.
Transcription Factors
Endogenous substances, usually proteins, which are effective in the initiation, stimulation, or termination of the genetic transcription process.
TATA-Binding Protein Associated Factors
Factors that associate with TATA-BOX BINDING PROTEIN. Many of them are components of TRANSCRIPTION FACTOR TFIID
Promoter Regions, Genetic
DNA sequences which are recognized (directly or indirectly) and bound by a DNA-dependent RNA polymerase during the initiation of transcription. Highly conserved sequences within the promoter include the Pribnow box in bacteria and the TATA BOX in eukaryotes.
DNA-Binding Proteins
Transcription Factor TFIIB
An RNA POLYMERASE II specific transcription factor. It plays a role in assembly of the pol II transcriptional preinitiation complex and has been implicated as a target of gene-specific transcriptional activators.
Molecular Sequence Data
Descriptions of specific amino acid, carbohydrate, or nucleotide sequences which have appeared in the published literature and/or are deposited in and maintained by databanks such as GENBANK, European Molecular Biology Laboratory (EMBL), National Biomedical Research Foundation (NBRF), or other sequence repositories.
Transcription, Genetic
Protein Binding
The process in which substances, either endogenous or exogenous, bind to proteins, peptides, enzymes, protein precursors, or allied compounds. Specific protein-binding measures are often used as assays in diagnostic assessments.
Base Sequence
Amino Acid Sequence
The order of amino acids as they occur in a polypeptide chain. This is referred to as the primary structure of proteins. It is of fundamental importance in determining PROTEIN CONFORMATION.
Binding Sites
Transcriptional Activation
Processes that stimulate the GENETIC TRANSCRIPTION of a gene or set of genes.
TATA Box Binding Protein-Like Proteins
A class of proteins related in structure and function to TATA-BOX BINDING PROTEIN that can take the place of TATA-BOX BINDING PROTEIN in the transcription initiation complex. They are found in most multicellular organisms and may be involved in tissue-specific promoter regulation. They bind to DNA and interact with TATA-BINDING PROTEIN ASSOCIATED FACTORS, however they may lack specificity for the TATA-BOX.
Transcription Factors, TFII
The so-called general transcription factors that bind to RNA POLYMERASE II and that are required to initiate transcription. They include TFIIA; TFIIB; TFIID; TFIIE; TFIIF; TFIIH; TFII-I; and TFIIJ. In vivo they apparently bind in an ordered multi-step process and/or may form a large preinitiation complex called RNA polymerase II holoenzyme.
Saccharomyces cerevisiae
Carrier Proteins
Transport proteins that carry specific substances in the blood or across cell membranes.
Nuclear Proteins
Proteins found in the nucleus of a cell. Do not confuse with NUCLEOPROTEINS which are proteins conjugated with nucleic acids, that are not necessarily present in the nucleus.
Mutation
Cloning, Molecular
The insertion of recombinant DNA molecules from prokaryotic and/or eukaryotic sources into a replicating vehicle, such as a plasmid or virus vector, and the introduction of the resultant hybrid molecules into recipient cells without altering the viability of those cells.
Sequence Homology, Amino Acid
The degree of similarity between sequences of amino acids. This information is useful for the analyzing genetic relatedness of proteins and species.
CCAAT-Binding Factor
A heterotrimeric DNA-binding protein that binds to CCAAT motifs in the promoters of eukaryotic genes. It is composed of three subunits: A, B and C.
Gene Expression Regulation
DNA
A deoxyribonucleotide polymer that is the primary genetic material of all cells. Eukaryotic and prokaryotic organisms normally contain DNA in a double-stranded state, yet several important biological processes transiently involve single-stranded regions. DNA, which consists of a polysugar-phosphate backbone possessing projections of purines (adenine and guanine) and pyrimidines (thymine and cytosine), forms a double helix that is held together by hydrogen bonds between these purines and pyrimidines (adenine to thymine and guanine to cytosine).
Sp1 Transcription Factor
Promoter-specific RNA polymerase II transcription factor that binds to the GC box, one of the upstream promoter elements, in mammalian cells. The binding of Sp1 is necessary for the initiation of transcription in the promoters of a variety of cellular and viral GENES.
Protein Structure, Tertiary
The level of protein structure in which combinations of secondary protein structures (alpha helices, beta sheets, loop regions, and motifs) pack together to form folded shapes called domains. Disulfide bridges between cysteines in two different parts of the polypeptide chain along with other interactions between the chains play a role in the formation and stabilization of tertiary structure. Small proteins usually consist of only one domain but larger proteins may contain a number of domains connected by segments of polypeptide chain which lack regular secondary structure.
Poly(A)-Binding Proteins
Proteins that bind to the 3' polyadenylated region of MRNA. When complexed with RNA the proteins serve an array of functions such as stabilizing the 3' end of RNA, promoting poly(A) synthesis and stimulating mRNA translation.
Transfection
The uptake of naked or purified DNA by CELLS, usually meaning the process as it occurs in eukaryotic cells. It is analogous to bacterial transformation (TRANSFORMATION, BACTERIAL) and both are routinely employed in GENE TRANSFER TECHNIQUES.
Membrane Proteins
CCAAT-Enhancer-Binding Proteins
A class of proteins that were originally identified by their ability to bind the DNA sequence CCAAT. The typical CCAAT-enhancer binding protein forms dimers and consists of an activation domain, a DNA-binding basic region, and a leucine-rich dimerization domain (LEUCINE ZIPPERS). CCAAT-BINDING FACTOR is structurally distinct type of CCAAT-enhancer binding protein consisting of a trimer of three different subunits.
Mutagenesis, Site-Directed
Genetically engineered MUTAGENESIS at a specific site in the DNA molecule that introduces a base substitution, or an insertion or deletion.
Trans-Activators
Proteins
Linear POLYPEPTIDES that are synthesized on RIBOSOMES and may be further modified, crosslinked, cleaved, or assembled into complex proteins with several subunits. The specific sequence of AMINO ACIDS determines the shape the polypeptide will take, during PROTEIN FOLDING, and the function of the protein.
Models, Molecular
RNA, Messenger
RNA sequences that serve as templates for protein synthesis. Bacterial mRNAs are generally primary transcripts in that they do not require post-transcriptional processing. Eukaryotic mRNA is synthesized in the nucleus and must be exported to the cytoplasm for translation. Most eukaryotic mRNAs have a sequence of polyadenylic acid at the 3' end, referred to as the poly(A) tail. The function of this tail is not known for certain, but it may play a role in the export of mature mRNA from the nucleus as well as in helping stabilize some mRNA molecules by retarding their degradation in the cytoplasm.
Regulatory Sequences, Nucleic Acid
Nucleic acid sequences involved in regulating the expression of genes.
HeLa Cells
The first continuously cultured human malignant CELL LINE, derived from the cervical carcinoma of Henrietta Lacks. These cells are used for VIRUS CULTIVATION and antitumor drug screening assays.
Sequence Alignment
The arrangement of two or more amino acid or base sequences from an organism or organisms in such a way as to align areas of the sequences sharing common properties. The degree of relatedness or homology between the sequences is predicted computationally or statistically based on weights assigned to the elements aligned between the sequences. This in turn can serve as a potential indicator of the genetic relatedness between the organisms.
Protein Transport
The process of moving proteins from one cellular compartment (including extracellular) to another by various sorting and transport mechanisms such as gated transport, protein translocation, and vesicular transport.
Restriction Mapping
Sequence Homology, Nucleic Acid
Signal Transduction
The intracellular transfer of information (biological activation/inhibition) through a signal pathway. In each signal transduction system, an activation/inhibition signal from a biologically active molecule (hormone, neurotransmitter) is mediated via the coupling of a receptor/enzyme to a second messenger system or to an ion channel. Signal transduction plays an important role in activating cellular functions, cell differentiation, and cell proliferation. Examples of signal transduction systems are the GAMMA-AMINOBUTYRIC ACID-postsynaptic receptor-calcium ion channel system, the receptor-mediated T-cell activation pathway, and the receptor-mediated activation of phospholipases. Those coupled to membrane depolarization or intracellular release of calcium include the receptor-mediated activation of cytotoxic functions in granulocytes and the synaptic potentiation of protein kinase activation. Some signal transduction pathways may be part of larger signal transduction pathways; for example, protein kinase activation is part of the platelet activation signal pathway.
Plasmids
Extrachromosomal, usually CIRCULAR DNA molecules that are self-replicating and transferable from one organism to another. They are found in a variety of bacterial, archaeal, fungal, algal, and plant species. They are used in GENETIC ENGINEERING as CLONING VECTORS.
Introns
Apoptosis
One of the mechanisms by which CELL DEATH occurs (compare with NECROSIS and AUTOPHAGOCYTOSIS). Apoptosis is the mechanism responsible for the physiological deletion of cells and appears to be intrinsically programmed. It is characterized by distinctive morphologic changes in the nucleus and cytoplasm, chromatin cleavage at regularly spaced sites, and the endonucleolytic cleavage of genomic DNA; (DNA FRAGMENTATION); at internucleosomal sites. This mode of cell death serves as a balance to mitosis in regulating the size of animal tissues and in mediating pathologic processes associated with tumor growth.
Recombinant Proteins
Proteins prepared by recombinant DNA technology.
Genes
Tacrolimus Binding Proteins
A family of immunophilin proteins that bind to the immunosuppressive drugs TACROLIMUS (also known as FK506) and SIROLIMUS. EC 5.2.1.-
Saccharomyces cerevisiae Proteins
Proteins obtained from the species SACCHAROMYCES CEREVISIAE. The function of specific proteins from this organism are the subject of intense scientific interest and have been used to derive basic understanding of the functioning similar proteins in higher eukaryotes.
DNA, Complementary
Recombinant Fusion Proteins
Deoxyribonuclease I
An enzyme capable of hydrolyzing highly polymerized DNA by splitting phosphodiester linkages, preferentially adjacent to a pyrimidine nucleotide. This catalyzes endonucleolytic cleavage of DNA yielding 5'-phosphodi- and oligonucleotide end-products. The enzyme has a preference for double-stranded DNA.
RNA Polymerase II
A DNA-dependent RNA polymerase present in bacterial, plant, and animal cells. It functions in the nucleoplasmic structure and transcribes DNA into RNA. It has different requirements for cations and salt than RNA polymerase I and is strongly inhibited by alpha-amanitin. EC 2.7.7.6.
RNA-Binding Proteins
Proteins that bind to RNA molecules. Included here are RIBONUCLEOPROTEINS and other proteins whose function is to bind specifically to RNA.
Cell Nucleus
Within a eukaryotic cell, a membrane-limited body which contains chromosomes and one or more nucleoli (CELL NUCLEOLUS). The nuclear membrane consists of a double unit-type membrane which is perforated by a number of pores; the outermost membrane is continuous with the ENDOPLASMIC RETICULUM. A cell may contain more than one nucleus. (From Singleton & Sainsbury, Dictionary of Microbiology and Molecular Biology, 2d ed)
Exons
The parts of a transcript of a split GENE remaining after the INTRONS are removed. They are spliced together to become a MESSENGER RNA or other functional RNA.
Poly(A)-Binding Protein I
A poly(A) binding protein that has a variety of functions such as mRNA stabilization and protection of RNA from nuclease activity. Although poly(A) binding protein I is considered a major cytoplasmic RNA-binding protein it is also found in the CELL NUCLEUS and may be involved in transport of mRNP particles.
Consensus Sequence
A theoretical representative nucleotide or amino acid sequence in which each nucleotide or amino acid is the one which occurs most frequently at that site in the different sequences which occur in nature. The phrase also refers to an actual sequence which approximates the theoretical consensus. A known CONSERVED SEQUENCE set is represented by a consensus sequence. Commonly observed supersecondary protein structures (AMINO ACID MOTIFS) are often formed by conserved sequences.
Oligodeoxyribonucleotides
Conserved Sequence
A sequence of amino acids in a polypeptide or of nucleotides in DNA or RNA that is similar across multiple species. A known set of conserved sequences is represented by a CONSENSUS SEQUENCE. AMINO ACID MOTIFS are often composed of conserved sequences.
Enhancer Elements, Genetic
Repressor Proteins
Proteins which maintain the transcriptional quiescence of specific GENES or OPERONS. Classical repressor proteins are DNA-binding proteins that are normally bound to the OPERATOR REGION of an operon, or the ENHANCER SEQUENCES of a gene until a signal occurs that causes their release.
Insulin-Like Growth Factor Binding Proteins
A family of soluble proteins that bind insulin-like growth factors and modulate their biological actions at the cellular level. (Int J Gynaecol Obstet 1992;39(1):3-9)
Plant Proteins
DNA Footprinting
A method for determining the sequence specificity of DNA-binding proteins. DNA footprinting utilizes a DNA damaging agent (either a chemical reagent or a nuclease) which cleaves DNA at every base pair. DNA cleavage is inhibited where the ligand binds to DNA. (from Rieger et al., Glossary of Genetics: Classical and Molecular, 5th ed)
DNA Primers
Short sequences (generally about 10 base pairs) of DNA that are complementary to sequences of messenger RNA and allow reverse transcriptases to start copying the adjacent sequences of mRNA. Primers are used extensively in genetic and molecular biology techniques.
Genes, Reporter
Transcription Initiation Site
The first nucleotide of a transcribed DNA sequence where RNA polymerase (DNA-DIRECTED RNA POLYMERASE) begins synthesizing the RNA transcript.
Chloramphenicol O-Acetyltransferase
An enzyme that catalyzes the acetylation of chloramphenicol to yield chloramphenicol 3-acetate. Since chloramphenicol 3-acetate does not bind to bacterial ribosomes and is not an inhibitor of peptidyltransferase, the enzyme is responsible for the naturally occurring chloramphenicol resistance in bacteria. The enzyme, for which variants are known, is found in both gram-negative and gram-positive bacteria. EC 2.3.1.28.
Gene Expression Regulation, Viral
Sequence Analysis, DNA
A multistage process that includes cloning, physical mapping, subcloning, determination of the DNA SEQUENCE, and information analysis.
Genomic Library
Gene Expression
The phenotypic manifestation of a gene or genes by the processes of GENETIC TRANSCRIPTION and GENETIC TRANSLATION.
Calcium-Binding Proteins
Escherichia coli
A species of gram-negative, facultatively anaerobic, rod-shaped bacteria (GRAM-NEGATIVE FACULTATIVELY ANAEROBIC RODS) commonly found in the lower part of the intestine of warm-blooded animals. It is usually nonpathogenic, but some strains are known to produce DIARRHEA and pyogenic infections. Pathogenic strains (virotypes) are classified by their specific pathogenic mechanisms such as toxins (ENTEROTOXIGENIC ESCHERICHIA COLI), etc.
Arabidopsis
A plant genus of the family BRASSICACEAE that contains ARABIDOPSIS PROTEINS and MADS DOMAIN PROTEINS. The species A. thaliana is used for experiments in classical plant genetics as well as molecular genetic studies in plant physiology, biochemistry, and development.
Histones
Fatty Acid-Binding Proteins
Cells, Cultured
Tumor Cells, Cultured
Cells grown in vitro from neoplastic tissue. If they can be established as a TUMOR CELL LINE, they can be propagated in cell culture indefinitely.
Arabidopsis Proteins
Proteins that originate from plants species belonging to the genus ARABIDOPSIS. The most intensely studied species of Arabidopsis, Arabidopsis thaliana, is commonly used in laboratory experiments.
Repetitive Sequences, Nucleic Acid
Sequences of DNA or RNA that occur in multiple copies. There are several types: INTERSPERSED REPETITIVE SEQUENCES are copies of transposable elements (DNA TRANSPOSABLE ELEMENTS or RETROELEMENTS) dispersed throughout the genome. TERMINAL REPEAT SEQUENCES flank both ends of another sequence, for example, the long terminal repeats (LTRs) on RETROVIRUSES. Variations may be direct repeats, those occurring in the same direction, or inverted repeats, those opposite to each other in direction. TANDEM REPEAT SEQUENCES are copies which lie adjacent to each other, direct or inverted (INVERTED REPEAT SEQUENCES).