Genes that encode highly conserved TRANSCRIPTION FACTORS that control positional identity of cells (BODY PATTERNING) and MORPHOGENESIS throughout development. Their sequences contain a 180 nucleotide sequence designated the homeobox, so called because mutations of these genes often results in homeotic transformations, in which one body structure replaces another. The proteins encoded by homeobox genes are called HOMEODOMAIN PROTEINS.
Proteins encoded by homeobox genes (GENES, HOMEOBOX) that exhibit structural similarity to certain prokaryotic and eukaryotic DNA-binding proteins. Homeodomain proteins are involved in the control of gene expression during morphogenesis and development (GENE EXPRESSION REGULATION, DEVELOPMENTAL).
Antennapedia homeodomain protein is a homeobox protein involved in limb patterning in ARTHROPODS. Mutations in the gene for the antennapedia homeodomain protein are associated with the conversion of antenna to leg or leg to antenna DROSOPHILA.
Endogenous substances, usually proteins, which are effective in the initiation, stimulation, or termination of the genetic transcription process.
Organic compounds that generally contain an amino (-NH2) and a carboxyl (-COOH) group. Twenty alpha-amino acids are the subunits which are polymerized to form proteins.
The degree of similarity between sequences of amino acids. This information is useful for the analyzing genetic relatedness of proteins and species.
Descriptions of specific amino acid, carbohydrate, or nucleotide sequences which have appeared in the published literature and/or are deposited in and maintained by databanks such as GENBANK, European Molecular Biology Laboratory (EMBL), National Biomedical Research Foundation (NBRF), or other sequence repositories.
The sequence of PURINES and PYRIMIDINES in nucleic acids and polynucleotides. It is also called nucleotide sequence.
A subclass of LIM domain proteins that include an additional centrally-located homeodomain region that binds AT-rich sites on DNA. Many LIM-homeodomain proteins play a role as transcriptional regulators that direct cell fate.
The order of amino acids as they occur in a polypeptide chain. This is referred to as the primary structure of proteins. It is of fundamental importance in determining PROTEIN CONFORMATION.
Proteins which bind to DNA. The family includes proteins which bind to both double- and single-stranded DNA and also includes specific DNA binding proteins in serum which can be used as markers for malignant diseases.
Any of the processes by which nuclear, cytoplasmic, or intercellular factors influence the differential control of gene action during the developmental stages of an organism.
The parts of a macromolecule that directly participate in its specific combination with another molecule.
A family of transcription factors that control EMBRYONIC DEVELOPMENT within a variety of cell lineages. They are characterized by a highly conserved paired DNA-binding domain that was first identified in DROSOPHILA segmentation genes.
A deoxyribonucleotide polymer that is the primary genetic material of all cells. Eukaryotic and prokaryotic organisms normally contain DNA in a double-stranded state, yet several important biological processes transiently involve single-stranded regions. DNA, which consists of a polysugar-phosphate backbone possessing projections of purines (adenine and guanine) and pyrimidines (thymine and cytosine), forms a double helix that is held together by hydrogen bonds between these purines and pyrimidines (adenine to thymine and guanine to cytosine).
The insertion of recombinant DNA molecules from prokaryotic and/or eukaryotic sources into a replicating vehicle, such as a plasmid or virus vector, and the introduction of the resultant hybrid molecules into recipient cells without altering the viability of those cells.
Proteins which maintain the transcriptional quiescence of specific GENES or OPERONS. Classical repressor proteins are DNA-binding proteins that are normally bound to the OPERATOR REGION of an operon, or the ENHANCER SEQUENCES of a gene until a signal occurs that causes their release.
The naturally occurring or experimentally induced replacement of one or more AMINO ACIDS in a protein with another. If a functionally equivalent amino acid is substituted, the protein may retain wild-type activity. Substitution may also diminish, enhance, or eliminate protein function. Experimentally induced substitution is often used to study enzyme activities and binding site properties.
The process in which substances, either endogenous or exogenous, bind to proteins, peptides, enzymes, protein precursors, or allied compounds. Specific protein-binding measures are often used as assays in diagnostic assessments.
The level of protein structure in which combinations of secondary protein structures (alpha helices, beta sheets, loop regions, and motifs) pack together to form folded shapes called domains. Disulfide bridges between cysteines in two different parts of the polypeptide chain along with other interactions between the chains play a role in the formation and stabilization of tertiary structure. Small proteins usually consist of only one domain but larger proteins may contain a number of domains connected by segments of polypeptide chain which lack regular secondary structure.
Any detectable and heritable change in the genetic material that causes a change in the GENOTYPE and which is transmitted to daughter cells and to succeeding generations.
Proteins that originate from insect species belonging to the genus DROSOPHILA. The proteins from the most intensely studied species of Drosophila, DROSOPHILA MELANOGASTER, are the subject of much interest in the area of MORPHOGENESIS and development.
The arrangement of two or more amino acid or base sequences from an organism or organisms in such a way as to align areas of the sequences sharing common properties. The degree of relatedness or homology between the sequences is predicted computationally or statistically based on weights assigned to the elements aligned between the sequences. This in turn can serve as a potential indicator of the genetic relatedness between the organisms.
The biosynthesis of RNA carried out on a template of DNA. The biosynthesis of DNA from an RNA template is called REVERSE TRANSCRIPTION.
Recombinant proteins produced by the GENETIC TRANSLATION of fused genes formed by the combination of NUCLEIC ACID REGULATORY SEQUENCES of one or more genes with the protein coding sequences of one or more genes.
A genus of small, two-winged flies containing approximately 900 described species. These organisms are the most extensively studied of all genera from the standpoint of genetics and cytology.
Proteins found in the nucleus of a cell. Do not confuse with NUCLEOPROTEINS which are proteins conjugated with nucleic acids, that are not necessarily present in the nucleus.
A homeodomain protein that interacts with TATA-BOX BINDING PROTEIN. It represses GENETIC TRANSCRIPTION of target GENES and plays a critical role in ODONTOGENESIS.
DNA sequences which are recognized (directly or indirectly) and bound by a DNA-dependent RNA polymerase during the initiation of transcription. Highly conserved sequences within the promoter include the Pribnow box in bacteria and the TATA BOX in eukaryotes.
Diffusible gene products that act on homologous or heterologous molecules of viral or cellular DNA to regulate the expression of proteins.
A sequence of amino acids in a polypeptide or of nucleotides in DNA or RNA that is similar across multiple species. A known set of conserved sequences is represented by a CONSENSUS SEQUENCE. AMINO ACID MOTIFS are often composed of conserved sequences.
Commonly observed structural components of proteins formed by simple combinations of adjacent secondary structures. A commonly observed structure may be composed of a CONSERVED SEQUENCE which can be represented by a CONSENSUS SEQUENCE.
Single-stranded complementary DNA synthesized from an RNA template by the action of RNA-dependent DNA polymerase. cDNA (i.e., complementary DNA, not circular DNA, not C-DNA) is used in a variety of molecular cloning experiments as well as serving as a specific hybridization probe.
A technique that localizes specific nucleic acid sequences within intact chromosomes, eukaryotic cells, or bacterial cells through the use of specific nucleic acid-labeled probes.
A ubiquitously expressed octamer transcription factor that regulates GENETIC TRANSCRIPTION of SMALL NUCLEAR RNA; IMMUNOGLOBULIN GENES; and HISTONE H2B genes.
A family of VERTEBRATE homeodomain proteins that share homology with orthodenticle protein, Drosophila. They regulate GENETIC TRANSCRIPTION and play an important role in EMBRYONIC DEVELOPMENT of the BRAIN.
Genetically engineered MUTAGENESIS at a specific site in the DNA molecule that introduces a base substitution, or an insertion or deletion.
Processes that stimulate the GENETIC TRANSCRIPTION of a gene or set of genes.
Goosecoid protein is a homeodomain protein that was first identified in XENOPUS. It is found in the SPEMANN ORGANIZER of VERTEBRATES and plays an important role in neuronal CELL DIFFERENTIATION and ORGANOGENESIS.
Established cell cultures that have the potential to propagate indefinitely.
Short sequences (generally about 10 base pairs) of DNA that are complementary to sequences of messenger RNA and allow reverse transcriptases to start copying the adjacent sequences of mRNA. Primers are used extensively in genetic and molecular biology techniques.
Models used experimentally or theoretically to study molecular shape, electronic properties, or interactions; includes analogous molecules, computer-generated graphics, and mechanical structures.
'Nerve tissue proteins' are specialized proteins found within the nervous system's biological tissue, including neurofilaments, neuronal cytoskeletal proteins, and neural cell adhesion molecules, which facilitate structural support, intracellular communication, and synaptic connectivity essential for proper neurological function.
Amino acids that are not synthesized by the human body in amounts sufficient to carry out physiological functions. They are obtained from dietary foodstuffs.
'Eye proteins' are structural or functional proteins, such as crystallins, opsins, and collagens, located in various parts of the eye, including the cornea, lens, retina, and aqueous humor, that contribute to maintaining transparency, refractive power, phototransduction, and overall integrity of the visual system.
A POU domain factor that regulates expression of GROWTH HORMONE; PROLACTIN; and THYROTROPIN-BETA in the ANTERIOR PITUITARY GLAND.
Proteins prepared by recombinant DNA technology.
The sequential correspondence of nucleotides in one nucleic acid molecule with those of another nucleic acid molecule. Sequence homology is an indication of the genetic relatedness of different organisms and gene function.
The relationship between the chemical structure of a compound and its biological or pharmacological activity. Compounds are often classed together because they have structural characteristics in common including shape, size, stereochemical arrangement, and distribution of functional groups.
The developmental entity of a fertilized egg (ZYGOTE) in animal species other than MAMMALS. For chickens, use CHICK EMBRYO.
A theoretical representative nucleotide or amino acid sequence in which each nucleotide or amino acid is the one which occurs most frequently at that site in the different sequences which occur in nature. The phrase also refers to an actual sequence which approximates the theoretical consensus. A known CONSERVED SEQUENCE set is represented by a consensus sequence. Commonly observed supersecondary protein structures (AMINO ACID MOTIFS) are often formed by conserved sequences.
A cellular transcriptional coactivator that was originally identified by its requirement for the stable assembly IMMEDIATE-EARLY PROTEINS of the HERPES SIMPLEX VIRUS. It is a nuclear protein that is a transcriptional coactivator for a number of transcription factors including VP16 PROTEIN; GA-BINDING PROTEIN; EARLY GROWTH RESPONSE PROTEIN 2; and E2F4 TRANSCRIPTION FACTOR. It also interacts with and stabilizes HERPES SIMPLEX VIRUS PROTEIN VMW65 and helps regulate GENETIC TRANSCRIPTION of IMMEDIATE-EARLY GENES in HERPES SIMPLEX VIRUS.
A family of transcription factors characterized by the presence of a bipartite DNA-binding domain known as the POU domain. The POU domain contains two subdomains, a POU-specific domain and a POU-homeodomain. The POU domain was originally identified as a region of approximately 150 amino acids shared between the Pit-1, Oct-1, Oct-2, and Unc-86 transcription factors.
Any of the processes by which nuclear, cytoplasmic, or intercellular factors influence the differential control (induction or repression) of gene action at the level of transcription or translation.
The processes occurring in early development that direct morphogenesis. They specify the body plan ensuring that cells will proceed to differentiate, grow, and diversify in size and shape at the correct relative positions. Included are axial patterning, segmentation, compartment specification, limb position, organ boundary patterning, blood vessel patterning, etc.
The uptake of naked or purified DNA by CELLS, usually meaning the process as it occurs in eukaryotic cells. It is analogous to bacterial transformation (TRANSFORMATION, BACTERIAL) and both are routinely employed in GENE TRANSFER TECHNIQUES.
Cellular proteins and protein complexes that transport amino acids across biological membranes.
Screening techniques first developed in yeast to identify genes encoding interacting proteins. Variations are used to evaluate interplay between proteins and other molecules. Two-hybrid techniques refer to analysis for protein-protein interactions, one-hybrid for DNA-protein interactions, three-hybrid interactions for RNA-protein interactions or ligand-based interactions. Reverse n-hybrid techniques refer to analysis for mutations or other small molecules that dissociate known interactions.
RNA sequences that serve as templates for protein synthesis. Bacterial mRNAs are generally primary transcripts in that they do not require post-transcriptional processing. Eukaryotic mRNA is synthesized in the nucleus and must be exported to the cytoplasm for translation. Most eukaryotic mRNAs have a sequence of polyadenylic acid at the 3' end, referred to as the poly(A) tail. The function of this tail is not known for certain, but it may play a role in the export of mature mRNA from the nucleus as well as in helping stabilize some mRNA molecules by retarding their degradation in the cytoplasm.
A onecut transcription factor that regulates expression of GENES involved in EMBRYONIC DEVELOPMENT of the PANCREAS and LIVER.
The characteristic 3-dimensional shape of a protein, including the secondary, supersecondary (motifs), tertiary (domains) and quaternary structure of the peptide chain. PROTEIN STRUCTURE, QUATERNARY describes the conformation assumed by multimeric proteins (aggregates of more than one polypeptide chain).
The outward appearance of the individual. It is the product of interactions between genes, and between the GENOTYPE and the environment.
Proteins obtained from various species of Xenopus. Included here are proteins from the African clawed frog (XENOPUS LAEVIS). Many of these proteins have been the subject of scientific investigations in the area of MORPHOGENESIS and development.
Fushi tarazu transcription factors were originally identified in DROSOPHILA. They are found throughout ARTHROPODS and play important roles in segmentation and CENTRAL NERVOUS SYSTEM development.
Progressive restriction of the developmental potential and increasing specialization of function that leads to the formation of specialized cells, tissues, and organs.
The relationships of groups of organisms as reflected by their genetic makeup.
Cis-acting DNA sequences which can increase transcription of genes. Enhancers can usually function in either orientation and at various distances from a promoter.
A species of gram-negative, facultatively anaerobic, rod-shaped bacteria (GRAM-NEGATIVE FACULTATIVELY ANAEROBIC RODS) commonly found in the lower part of the intestine of warm-blooded animals. It is usually nonpathogenic, but some strains are known to produce DIARRHEA and pyogenic infections. Pathogenic strains (virotypes) are classified by their specific pathogenic mechanisms such as toxins (ENTEROTOXIGENIC ESCHERICHIA COLI), etc.
Hormones secreted by insects. They influence their growth and development. Also synthetic substances that act like insect hormones.
Partial proteins formed by partial hydrolysis of complete proteins or generated through PROTEIN ENGINEERING techniques.
A species of fruit fly much used in genetics because of the large size of its chromosomes.
Deletion of sequences of nucleic acids from the genetic material of an individual.