TATA Box
A conserved A-T rich sequence which is contained in promoters for RNA polymerase II. The segment is seven base pairs long and the nucleotides most commonly found are TATAAAA.
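As an illustration of the definition above, the following minimal sketch scans a DNA string for exact copies of the seven-base-pair motif TATAAAA. The function name `find_tata_boxes` and the example promoter string are hypothetical, and real TATA boxes vary around this sequence, so an exact-match search is a simplification.

```python
def find_tata_boxes(sequence, motif="TATAAAA"):
    """Return the 0-based start position of every exact occurrence of motif."""
    sequence = sequence.upper()
    positions = []
    start = sequence.find(motif)
    while start != -1:
        positions.append(start)
        # Resume the search one base further along so no occurrence is skipped.
        start = sequence.find(motif, start + 1)
    return positions


# Made-up promoter fragment, used only to exercise the function.
promoter = "GGCCGCTATAAAAGGCGCGATCGTATAAAAGC"
print(find_tata_boxes(promoter))  # [6, 23]
```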
TATA-Box Binding Protein
A general transcription factor that plays a major role in the activation of eukaryotic genes transcribed by RNA POLYMERASES. It binds specifically to the TATA BOX promoter element, which lies close to the position of transcription initiation in RNA transcribed by RNA POLYMERASE II. Although considered a principal component of TRANSCRIPTION FACTOR TFIID it also takes part in general transcription factor complexes involved in RNA POLYMERASE I and RNA POLYMERASE III transcription.
Promoter Regions, Genetic
Base Sequence
Molecular Sequence Data
Descriptions of specific amino acid, carbohydrate, or nucleotide sequences which have appeared in the published literature and/or are deposited in and maintained by databanks such as GENBANK, European Molecular Biology Laboratory (EMBL), National Biomedical Research Foundation (NBRF), or other sequence repositories.
Transcription, Genetic
Transcription Factors
DNA-Binding Proteins
Regulatory Sequences, Nucleic Acid
Nucleic acid sequences involved in regulating the expression of genes.
Transcription Factor TFIID
The major sequence-specific DNA-binding component involved in the activation of transcription of RNA POLYMERASE II. It was originally described as a complex of TATA-BOX BINDING PROTEIN and TATA-BINDING PROTEIN ASSOCIATED FACTORS. It is now known that TATA BOX BINDING PROTEIN-LIKE PROTEINS may take the place of TATA-box binding protein in the complex.
Cloning, Molecular
The insertion of recombinant DNA molecules from prokaryotic and/or eukaryotic sources into a replicating vehicle, such as a plasmid or virus vector, and the introduction of the resultant hybrid molecules into recipient cells without altering the viability of those cells.
DNA
A deoxyribonucleotide polymer that is the primary genetic material of all cells. Eukaryotic and prokaryotic organisms normally contain DNA in a double-stranded state, yet several important biological processes transiently involve single-stranded regions. DNA, which consists of a polysugar-phosphate backbone possessing projections of purines (adenine and guanine) and pyrimidines (thymine and cytosine), forms a double helix that is held together by hydrogen bonds between these purines and pyrimidines (adenine to thymine and guanine to cytosine).
Sp1 Transcription Factor
Promoter-specific RNA polymerase II transcription factor that binds to the GC box, one of the upstream promoter elements, in mammalian cells. The binding of Sp1 is necessary for the initiation of transcription in the promoters of a variety of cellular and viral GENES.
Binding Sites
Restriction Mapping
Genes
RNA Polymerase II
A DNA-dependent RNA polymerase present in bacterial, plant, and animal cells. It functions in the nucleoplasmic structure and transcribes DNA into RNA. It has different requirements for cations and salt than RNA polymerase I and is strongly inhibited by alpha-amanitin. EC 2.7.7.6.
Gene Expression Regulation
Amino Acid Sequence
Transcription Factor TFIIA
An RNA POLYMERASE II specific transcription factor. It may play a role in transcriptional activation of gene expression by interacting with the TATA-BOX BINDING PROTEIN component of TRANSCRIPTION FACTOR TFIID.
Introns
Sequence Homology, Nucleic Acid
HeLa Cells
The first continuously cultured human malignant CELL LINE, derived from the cervical carcinoma of Henrietta Lacks. These cells are used for VIRUS CULTIVATION and antitumor drug screening assays.
Deoxyribonuclease I
An enzyme capable of hydrolyzing highly polymerized DNA by splitting phosphodiester linkages, preferentially adjacent to a pyrimidine nucleotide. This catalyzes endonucleolytic cleavage of DNA yielding 5'-phosphodi- and oligonucleotide end-products. The enzyme has a preference for double-stranded DNA.
TATA Box Binding Protein-Like Proteins
A class of proteins related in structure and function to TATA-BOX BINDING PROTEIN that can take the place of TATA-BOX BINDING PROTEIN in the transcription initiation complex. They are found in most multicellular organisms and may be involved in tissue-specific promoter regulation. They bind to DNA and interact with TATA-BINDING PROTEIN ASSOCIATED FACTORS; however, they may lack specificity for the TATA-BOX.
Transcription Initiation Site
The first nucleotide of a transcribed DNA sequence where RNA polymerase (DNA-DIRECTED RNA POLYMERASE) begins synthesizing the RNA transcript.
Exons
The parts of a transcript of a split GENE remaining after the INTRONS are removed. They are spliced together to become a MESSENGER RNA or other functional RNA.
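The relationship between exons and introns described above can be sketched as a string operation: given a transcript and the intron coordinates, the exons are what remains once the introns are cut out and the flanking pieces are joined. The function name `splice`, the half-open 0-based coordinate convention, and the toy transcript are assumptions made for illustration; this is not a model of the splicing machinery.

```python
def splice(transcript, introns):
    """Remove introns, given as (start, end) half-open 0-based intervals,
    and join the remaining exon pieces in order."""
    pieces = []
    position = 0
    for start, end in sorted(introns):
        if start < position or start >= end or end > len(transcript):
            raise ValueError("introns must be non-overlapping and inside the transcript")
        pieces.append(transcript[position:start])  # exon before this intron
        position = end                             # skip over the intron
    pieces.append(transcript[position:])           # final exon
    return "".join(pieces)


# Toy transcript: exons in upper case, introns in lower case.
pre_mrna = "AAAgtccagCCCgtttagGG"
print(splice(pre_mrna, [(3, 9), (12, 18)]))  # AAACCCGG
```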
DNA Footprinting
A method for determining the sequence specificity of DNA-binding proteins. DNA footprinting utilizes a DNA damaging agent (either a chemical reagent or a nuclease) which cleaves DNA at every base pair. DNA cleavage is inhibited where the ligand binds to DNA. (from Rieger et al., Glossary of Genetics: Classical and Molecular, 5th ed)
Plasmids
Extrachromosomal, usually CIRCULAR DNA molecules that are self-replicating and transferable from one organism to another. They are found in a variety of bacterial, archaeal, fungal, algal, and plant species. They are used in GENETIC ENGINEERING as CLONING VECTORS.
Transcriptional Activation
Processes that stimulate the GENETIC TRANSCRIPTION of a gene or set of genes.
Transfection
The uptake of naked or purified DNA by CELLS, usually meaning the process as it occurs in eukaryotic cells. It is analogous to bacterial transformation (TRANSFORMATION, BACTERIAL) and both are routinely employed in GENE TRANSFER TECHNIQUES.
Enhancer Elements, Genetic
Chloramphenicol O-Acetyltransferase
An enzyme that catalyzes the acetylation of chloramphenicol to yield chloramphenicol 3-acetate. Since chloramphenicol 3-acetate does not bind to bacterial ribosomes and is not an inhibitor of peptidyltransferase, the enzyme is responsible for the naturally occurring chloramphenicol resistance in bacteria. The enzyme, for which variants are known, is found in both gram-negative and gram-positive bacteria. EC 2.3.1.28.
Genomic Library
Mutation
Transcription Factor TFIIB
An RNA POLYMERASE II specific transcription factor. It plays a role in assembly of the pol II transcriptional preinitiation complex and has been implicated as a target of gene-specific transcriptional activators.
RNA, Messenger
RNA sequences that serve as templates for protein synthesis. Bacterial mRNAs are generally primary transcripts in that they do not require post-transcriptional processing. Eukaryotic mRNA is synthesized in the nucleus and must be exported to the cytoplasm for translation. Most eukaryotic mRNAs have a sequence of polyadenylic acid at the 3' end, referred to as the poly(A) tail. The function of this tail is not known for certain, but it may play a role in the export of mature mRNA from the nucleus as well as in helping stabilize some mRNA molecules by retarding their degradation in the cytoplasm.
Gene Expression Regulation, Viral
Protein Binding
Oligodeoxyribonucleotides
Saccharomyces cerevisiae
Consensus Sequence
A theoretical representative nucleotide or amino acid sequence in which each nucleotide or amino acid is the one which occurs most frequently at that site in the different sequences which occur in nature. The phrase also refers to an actual sequence which approximates the theoretical consensus. A known CONSERVED SEQUENCE set is represented by a consensus sequence. Commonly observed supersecondary protein structures (AMINO ACID MOTIFS) are often formed by conserved sequences.
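The definition above amounts to a column-wise majority vote over a set of aligned sequences. A minimal sketch follows, assuming the input sequences are already aligned and of equal length; the function name `consensus` and the alphabetical tie-breaking rule are choices made here for illustration and are not part of the definition.

```python
from collections import Counter


def consensus(aligned):
    """Return the most frequent symbol at each position of equal-length sequences.
    Ties are broken alphabetically (an arbitrary choice for this sketch)."""
    if not aligned:
        raise ValueError("at least one sequence is required")
    length = len(aligned[0])
    if any(len(seq) != length for seq in aligned):
        raise ValueError("sequences must be aligned to the same length")
    result = []
    for column in zip(*aligned):
        counts = Counter(column)
        top = max(counts.values())
        result.append(min(sym for sym, n in counts.items() if n == top))
    return "".join(result)


# Made-up TATA-like sites, used only to exercise the function.
sites = ["TATAAAA", "TATATAA", "TATAAAT", "TTTAAAA"]
print(consensus(sites))  # TATAAAA
```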
Single-Strand Specific DNA and RNA Endonucleases
Enzymes that catalyze the endonucleolytic cleavage of single-stranded regions of DNA or RNA molecules while leaving the double-stranded regions intact. They are particularly useful in the laboratory for producing "blunt-ended" DNA molecules from DNA with single-stranded ends and for sensitive GENETIC TECHNIQUES such as NUCLEASE PROTECTION ASSAYS that involve the detection of single-stranded DNA and RNA.
Genes, Regulator
Genes which regulate or circumscribe the activity of other genes; specifically, genes which code for PROTEINS or RNAs which have GENE EXPRESSION REGULATION functions.
5' Flanking Region
The region of DNA which borders the 5' end of a transcription unit and where a variety of regulatory sequences are located.
Repetitive Sequences, Nucleic Acid
Sequences of DNA or RNA that occur in multiple copies. There are several types: INTERSPERSED REPETITIVE SEQUENCES are copies of transposable elements (DNA TRANSPOSABLE ELEMENTS or RETROELEMENTS) dispersed throughout the genome. TERMINAL REPEAT SEQUENCES flank both ends of another sequence, for example, the long terminal repeats (LTRs) on RETROVIRUSES. Variations may be direct repeats, those occurring in the same direction, or inverted repeats, those opposite to each other in direction. TANDEM REPEAT SEQUENCES are copies which lie adjacent to each other, direct or inverted (INVERTED REPEAT SEQUENCES).
RNA, Small Nuclear
Short chains of RNA (100-300 nucleotides long) that are abundant in the nucleus and usually complexed with proteins in snRNPs (RIBONUCLEOPROTEINS, SMALL NUCLEAR). Many function in the processing of messenger RNA precursors. Others, the snoRNAs (RNA, SMALL NUCLEOLAR), are involved with the processing of ribosomal RNA precursors.
Genes, Reporter
Trans-Activators
Diffusible gene products that act on homologous or heterologous molecules of viral or cellular DNA to regulate the expression of proteins.
DNA Restriction Enzymes
Enzymes that are part of the restriction-modification systems. They catalyze the endonucleolytic cleavage of DNA sequences which lack the species-specific methylation pattern in the host cell's DNA. Cleavage yields random or specific double-stranded fragments with terminal 5'-phosphates. The function of restriction enzymes is to destroy any foreign DNA that invades the host cell. Most have been studied in bacterial systems, but a few have been found in eukaryotic organisms. They are also used as tools for the systematic dissection and mapping of chromosomes, in the determination of base sequences of DNAs, and have made it possible to splice and recombine genes from one organism into the genome of another. EC 3.1.21.
Nuclear Proteins
Mutagenesis, Site-Directed
Genetically engineered MUTAGENESIS at a specific site in the DNA molecule that introduces a base substitution, or an insertion or deletion.
Saccharomyces cerevisiae Proteins
Proteins obtained from the species SACCHAROMYCES CEREVISIAE. The functions of specific proteins from this organism are the subject of intense scientific interest and have been used to derive a basic understanding of the functioning of similar proteins in higher eukaryotes.
Nucleic Acid Conformation
Oligonucleotide Probes
Synthetic or natural oligonucleotides used in hybridization studies in order to identify and study specific nucleic acid fragments, e.g., DNA segments near or within a specific gene locus or gene. The probe hybridizes with a specific mRNA, if present. Conventional techniques used for testing for the hybridization product include dot blot assays, Southern blot assays, and DNA:RNA hybrid-specific antibody tests. Conventional labels for the probe include the radioisotope labels 32P and 125I and the chemical label biotin.
Transcription Factors, TFII
The so-called general transcription factors that bind to RNA POLYMERASE II and that are required to initiate transcription. They include TFIIA; TFIIB; TFIID; TFIIE; TFIIF; TFIIH; TFII-I; and TFIIJ. In vivo they apparently bind in an ordered multi-step process and/or may form a large preinitiation complex called RNA polymerase II holoenzyme.
Immediate-Early Proteins
Proteins that are coded by immediate-early genes, in the absence of de novo protein synthesis. The term was originally used exclusively for viral regulatory proteins that were synthesized just after viral integration into the host cell. It is also used to describe cellular proteins which are synthesized immediately after the resting cell is stimulated by extracellular signals.
Conserved Sequence
A sequence of amino acids in a polypeptide or of nucleotides in DNA or RNA that is similar across multiple species. A known set of conserved sequences is represented by a CONSENSUS SEQUENCE. AMINO ACID MOTIFS are often composed of conserved sequences.
Chromosome Mapping
Any method used for determining the location of and relative distances between genes on a chromosome.
Adenovirus Early Proteins
Proteins encoded by adenoviruses that are synthesized prior to, and in the absence of, viral DNA replication. The proteins are involved in both positive and negative regulation of expression in viral and cellular genes, and also affect the stability of viral mRNA. Some are also involved in oncogenic transformation.
Sequence Alignment
The arrangement of two or more amino acid or base sequences from an organism or organisms in such a way as to align areas of the sequences sharing common properties. The degree of relatedness or homology between the sequences is predicted computationally or statistically based on weights assigned to the elements aligned between the sequences. This in turn can serve as a potential indicator of the genetic relatedness between the organisms.
Gene Expression Regulation, Fungal
Transcription Factor TFIIIB
One of several general transcription factors that are specific for RNA POLYMERASE III. TFIIIB recruits and positions pol III over the initiation site and remains stably bound to the DNA through multiple rounds of re-initiation by RNA POLYMERASE III.
Recombinant Fusion Proteins
Repressor Proteins
Proteins which maintain the transcriptional quiescence of specific GENES or OPERONS. Classical repressor proteins are DNA-binding proteins that are normally bound to the OPERATOR REGION of an operon, or the ENHANCER SEQUENCES of a gene until a signal occurs that causes their release.
Luciferases
Enzymes that oxidize certain LUMINESCENT AGENTS to emit light (PHYSICAL LUMINESCENCE). The luciferases from different organisms have evolved differently and so have different structures and substrates.
Sequence Analysis, DNA
A multistage process that includes cloning, physical mapping, subcloning, determination of the DNA SEQUENCE, and information analysis.
Globins
Blotting, Southern
A method (first developed by E.M. Southern) for detection of DNA that has been electrophoretically separated and immobilized by blotting on nitrocellulose or other type of paper or nylon membrane followed by hybridization with labeled NUCLEIC ACID PROBES.
DNA Primers
Short sequences (generally about 10 base pairs) of DNA that are complementary to sequences of messenger RNA and allow reverse transcriptases to start copying the adjacent sequences of mRNA. Primers are used extensively in genetic and molecular biology techniques.
DNA, Complementary
Cell Nucleus
Within a eukaryotic cell, a membrane-limited body which contains chromosomes and one or more nucleoli (CELL NUCLEOLUS). The nuclear membrane consists of a double unit-type membrane which is perforated by a number of pores; the outermost membrane is continuous with the ENDOPLASMIC RETICULUM. A cell may contain more than one nucleus. (From Singleton & Sainsbury, Dictionary of Microbiology and Molecular Biology, 2d ed)
Recombinant Proteins
Proteins prepared by recombinant DNA technology.
Tumor Cells, Cultured
Cells grown in vitro from neoplastic tissue. If they can be established as a TUMOR CELL LINE, they can be propagated in cell culture indefinitely.
Chromatin
Sequence Homology, Amino Acid
RNA Polymerase III
A DNA-dependent RNA polymerase present in bacterial, plant, and animal cells. It functions in the nucleoplasmic structure where it transcribes DNA into RNA. It has specific requirements for cations and salt and has shown an intermediate sensitivity to alpha-amanitin in comparison to RNA polymerase I and II. EC 2.7.7.6.
Gene Expression Regulation, Enzymologic
Response Elements
Nucleotide sequences, usually upstream, which are recognized by specific regulatory transcription factors, thereby causing gene response to various regulatory agents. These elements may be found in both promoter and enhancer regions.
TATA-Binding Protein Associated Factors
Factors that associate with TATA-BOX BINDING PROTEIN. Many of them are components of TRANSCRIPTION FACTOR TFIID.
DNA, Recombinant
Sp3 Transcription Factor
A specificity protein transcription factor that regulates expression of a variety of genes including VASCULAR ENDOTHELIAL GROWTH FACTOR and CYCLIN-DEPENDENT KINASE INHIBITOR P27.
Upstream Stimulatory Factors
Ubiquitously expressed basic HELIX-LOOP-HELIX MOTIF transcription factors. They bind CANNTG sequences in the promoters of a variety of GENES involved in carbohydrate and lipid metabolism.
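The CANNTG pattern mentioned above, where N stands for any of the four bases, maps directly onto a regular expression. The sketch below is one way to list candidate sites in a DNA string; the name `find_e_boxes` and the example sequences are hypothetical, and a sequence match alone says nothing about whether a factor actually binds there.

```python
import re

# CA, any two bases, then TG. The lookahead lets overlapping sites be reported.
CANNTG = re.compile(r"(?=(CA[ACGT]{2}TG))")


def find_e_boxes(sequence):
    """Return (0-based start, matched site) for every CANNTG occurrence."""
    return [(m.start(), m.group(1)) for m in CANNTG.finditer(sequence.upper())]


print(find_e_boxes("GGCACGTGAA"))  # [(2, 'CACGTG')]
print(find_e_boxes("CACATGTG"))    # [(0, 'CACATG'), (2, 'CATGTG')]
```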
Mutagenesis
Process of generating a genetic MUTATION. It may occur spontaneously or be induced by MUTAGENS.
YY1 Transcription Factor
Adenoviruses, Human
Species of the genus MASTADENOVIRUS, causing a wide range of diseases in humans. Infections are mostly asymptomatic, but can be associated with diseases of the respiratory, ocular, and gastrointestinal systems. Serotypes (named with Arabic numbers) have been grouped into species designated Human adenovirus A-F.
Gene Expression
The phenotypic manifestation of a gene or genes by the processes of GENETIC TRANSCRIPTION and GENETIC TRANSLATION.
beta-Galactosidase
A group of enzymes that catalyzes the hydrolysis of terminal, non-reducing beta-D-galactose residues in beta-galactosides. Deficiency of beta-Galactosidase A1 may cause GANGLIOSIDOSIS, GM1.
HIV Long Terminal Repeat
Regulatory sequences important for viral replication that are located on each end of the HIV genome. The LTR includes the HIV ENHANCER, promoter, and other sequences. Specific regions in the LTR include the negative regulatory element (NRE), NF-kappa B binding sites, Sp1 binding sites, TATA BOX, and trans-acting responsive element (TAR). The binding of both cellular and viral proteins to these regions regulates HIV transcription.
Nucleosomes
Blotting, Northern
Models, Genetic
Drosophila
Templates, Genetic
Polymerase Chain Reaction
In vitro method for producing large amounts of specific DNA or RNA fragments of defined length and sequence from small amounts of short oligonucleotide flanking sequences (primers). The essential steps include thermal denaturation of the double-stranded target molecules, annealing of the primers to their complementary sequences, and extension of the annealed primers by enzymatic synthesis with DNA polymerase. The reaction is efficient, specific, and extremely sensitive. Uses for the reaction include disease diagnosis, detection of difficult-to-isolate pathogens, mutation analysis, genetic testing, DNA sequencing, and analyzing evolutionary relationships.
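The role of the flanking primers in the description above can be sketched as an in silico search: the forward primer matches the template strand directly, the reverse primer matches its reverse complement, and the product spans everything between the two sites inclusive. The names `reverse_complement` and `amplicon`, the exact-match requirement, and the toy template are assumptions for illustration; the sketch ignores annealing temperature, mismatches, and the reverse-strand orientation of the template.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna):
    """Return the reverse complement of a DNA string."""
    return dna.upper().translate(COMPLEMENT)[::-1]


def amplicon(template, forward_primer, reverse_primer):
    """Return the region bounded by the two primers, or None if either
    primer site is missing. Only exact matches are considered."""
    template = template.upper()
    forward = forward_primer.upper()
    start = template.find(forward)
    if start == -1:
        return None
    # The reverse primer anneals to the opposite strand, so on this strand
    # its site appears as the primer's reverse complement.
    reverse_site = reverse_complement(reverse_primer)
    end = template.find(reverse_site, start + len(forward))
    if end == -1:
        return None
    return template[start:end + len(reverse_site)]


# Toy template and primers, used only to exercise the function.
print(amplicon("GGATGCCCTTTAAACGTAGG", "ATGC", "TACG"))  # ATGCCCTTTAAACGTA
```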
HMGB1 Protein
A 24-kDa HMGB protein that binds to and distorts the minor groove of DNA.
Adenoviridae
Erythroid-Specific DNA-Binding Factors
A group of transcription factors that were originally described as being specific to ERYTHROID CELLS.
NFI Transcription Factors
Transcription factors that were originally identified as site-specific DNA-binding proteins essential for DNA REPLICATION by ADENOVIRUSES. They play important roles in MAMMARY GLAND function and development.
Nucleic Acid Hybridization
Widely used technique which exploits the ability of complementary sequences in single-stranded DNAs or RNAs to pair with each other to form a double helix. Hybridization can take place between two complementary DNA sequences, between a single-stranded DNA and a complementary RNA, or between two RNA sequences. The technique is used to detect and isolate specific sequences, measure homology, or define other characteristics of one or both strands. (Kendrew, Encyclopedia of Molecular Biology, 1994, p503)
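The pairing rule that hybridization relies on can be sketched for the DNA-to-DNA case: two single strands, both written 5' to 3', are fully complementary when one equals the reverse complement of the other. The names `reverse_complement` and `can_hybridize` are hypothetical, and the all-or-nothing test is a simplification, since real hybridization tolerates mismatches depending on conditions.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna):
    """Return the reverse complement of a DNA string written 5' to 3'."""
    return dna.upper().translate(COMPLEMENT)[::-1]


def can_hybridize(strand_a, strand_b):
    """True if the two strands are perfectly complementary over their full length."""
    return reverse_complement(strand_a) == strand_b.upper()


print(reverse_complement("TATAAAA"))         # TTTTATA
print(can_hybridize("TATAAAA", "TTTTATA"))   # True
print(can_hybridize("TATAAAA", "TATAAAA"))   # False
```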
Histones
Small chromosomal proteins (approx 12-20 kD) possessing an open, unfolded structure and attached to the DNA in cell nuclei by ionic linkages. Classification into the various types (designated histone I, histone II, etc.) is based on the relative amounts of arginine and lysine in each.