Genes that encode highly conserved TRANSCRIPTION FACTORS that control positional identity of cells (BODY PATTERNING) and MORPHOGENESIS throughout development. Their sequences contain a 180-nucleotide sequence designated the homeobox, so called because mutations of these genes often result in homeotic transformations, in which one body structure replaces another. The proteins encoded by homeobox genes are called HOMEODOMAIN PROTEINS.
Proteins encoded by homeobox genes (GENES, HOMEOBOX) that exhibit structural similarity to certain prokaryotic and eukaryotic DNA-binding proteins. Homeodomain proteins are involved in the control of gene expression during morphogenesis and development (GENE EXPRESSION REGULATION, DEVELOPMENTAL).
Antennapedia homeodomain protein is a homeobox protein involved in limb patterning in ARTHROPODS. Mutations in the gene for the antennapedia homeodomain protein are associated with the conversion of antenna to leg or leg to antenna in DROSOPHILA.
Endogenous substances, usually proteins, which are effective in the initiation, stimulation, or termination of the genetic transcription process.
A subclass of LIM domain proteins that include an additional centrally-located homeodomain region that binds AT-rich sites on DNA. Many LIM-homeodomain proteins play a role as transcriptional regulators that direct cell fate.
Descriptions of specific amino acid, carbohydrate, or nucleotide sequences which have appeared in the published literature and/or are deposited in and maintained by databanks such as GENBANK, European Molecular Biology Laboratory (EMBL), National Biomedical Research Foundation (NBRF), or other sequence repositories.
Any of the processes by which nuclear, cytoplasmic, or intercellular factors influence the differential control of gene action during the developmental stages of an organism.
Proteins which bind to DNA. The family includes proteins which bind to both double- and single-stranded DNA and also includes specific DNA binding proteins in serum which can be used as markers for malignant diseases.
A family of transcription factors that control EMBRYONIC DEVELOPMENT within a variety of cell lineages. They are characterized by a highly conserved paired DNA-binding domain that was first identified in DROSOPHILA segmentation genes.
The sequence of PURINES and PYRIMIDINES in nucleic acids and polynucleotides. It is also called nucleotide sequence.
The order of amino acids as they occur in a polypeptide chain. This is referred to as the primary structure of proteins. It is of fundamental importance in determining PROTEIN CONFORMATION.
Proteins which maintain the transcriptional quiescence of specific GENES or OPERONS. Classical repressor proteins are DNA-binding proteins that are normally bound to the OPERATOR REGION of an operon, or the ENHANCER SEQUENCES of a gene until a signal occurs that causes their release.
Proteins that originate from insect species belonging to the genus DROSOPHILA. The proteins from the most intensely studied species of Drosophila, DROSOPHILA MELANOGASTER, are the subject of much interest in the area of MORPHOGENESIS and development.
A deoxyribonucleotide polymer that is the primary genetic material of all cells. Eukaryotic and prokaryotic organisms normally contain DNA in a double-stranded state, yet several important biological processes transiently involve single-stranded regions. DNA, which consists of a polysugar-phosphate backbone possessing projections of purines (adenine and guanine) and pyrimidines (thymine and cytosine), forms a double helix that is held together by hydrogen bonds between these purines and pyrimidines (adenine to thymine and guanine to cytosine).
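The pairing rule in the preceding entry (adenine to thymine, guanine to cytosine) can be illustrated with a short Python sketch. The function names `complement` and `reverse_complement` are hypothetical, chosen only for this illustration, and the code assumes an unambiguous A/C/G/T alphabet.

```python
# Watson-Crick pairing as stated in the entry: A pairs with T, G pairs with C.
PAIRS = {"A": "T", "T": "A", "G": "C", "C": "G"}


def complement(strand: str) -> str:
    """Return the base-by-base complement of a DNA strand (same left-to-right order)."""
    return "".join(PAIRS[base] for base in strand.upper())


def reverse_complement(strand: str) -> str:
    """Return the partner strand written 5'->3' (complement, then reversed)."""
    return complement(strand)[::-1]


if __name__ == "__main__":
    print(reverse_complement("ATGC"))
```

Reversing after complementing reflects the antiparallel orientation of the two strands in the double helix.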
The parts of a macromolecule that directly participate in its specific combination with another molecule.
A homeodomain protein that interacts with TATA-BOX BINDING PROTEIN. It represses GENETIC TRANSCRIPTION of target GENES and plays a critical role in ODONTOGENESIS.
A genus of small, two-winged flies containing approximately 900 described species. These organisms are the most extensively studied of all genera from the standpoint of genetics and cytology.
Proteins found in the nucleus of a cell. Do not confuse with NUCLEOPROTEINS, which are proteins conjugated with nucleic acids and are not necessarily present in the nucleus.
Diffusible gene products that act on homologous or heterologous molecules of viral or cellular DNA to regulate the expression of proteins.
DNA sequences which are recognized (directly or indirectly) and bound by a DNA-dependent RNA polymerase during the initiation of transcription. Highly conserved sequences within the promoter include the Pribnow box in bacteria and the TATA BOX in eukaryotes.
The process in which substances, either endogenous or exogenous, bind to proteins, peptides, enzymes, protein precursors, or allied compounds. Specific protein-binding measures are often used as assays in diagnostic assessments.
The biosynthesis of RNA carried out on a template of DNA. The biosynthesis of DNA from an RNA template is called REVERSE TRANSCRIPTION.
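As a rough illustration of templated RNA synthesis, the sketch below derives an RNA sequence from a DNA template strand. It models only the base-pairing logic (uracil in RNA pairs where thymine would in DNA), not the enzymology; the name `transcribe` and the convention that the template is supplied 5'->3' are assumptions made for this example.

```python
# Template base -> RNA base it pairs with (uracil replaces thymine in RNA).
RNA_PAIRS = {"A": "U", "T": "A", "G": "C", "C": "G"}


def transcribe(template_5to3: str) -> str:
    """Return the RNA (written 5'->3') complementary to a DNA template given 5'->3'.

    The template is read in reverse because the new RNA strand is
    antiparallel to the strand it is copied from.
    """
    return "".join(RNA_PAIRS[base] for base in reversed(template_5to3.upper()))


if __name__ == "__main__":
    print(transcribe("GCAT"))
```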
The level of protein structure in which combinations of secondary protein structures (alpha helices, beta sheets, loop regions, and motifs) pack together to form folded shapes called domains. Disulfide bridges between cysteines in two different parts of the polypeptide chain along with other interactions between the chains play a role in the formation and stabilization of tertiary structure. Small proteins usually consist of only one domain but larger proteins may contain a number of domains connected by segments of polypeptide chain which lack regular secondary structure.
A ubiquitously expressed octamer transcription factor that regulates GENETIC TRANSCRIPTION of SMALL NUCLEAR RNA; IMMUNOGLOBULIN GENES; and HISTONE H2B genes.
A family of VERTEBRATE homeodomain proteins that share homology with orthodenticle protein, Drosophila. They regulate GENETIC TRANSCRIPTION and play an important role in EMBRYONIC DEVELOPMENT of the BRAIN.
Goosecoid protein is a homeodomain protein that was first identified in XENOPUS. It is found in the SPEMANN ORGANIZER of VERTEBRATES and plays an important role in neuronal CELL DIFFERENTIATION and ORGANOGENESIS.
A technique that localizes specific nucleic acid sequences within intact chromosomes, eukaryotic cells, or bacterial cells through the use of specific nucleic acid-labeled probes.
The degree of similarity between sequences of amino acids. This information is useful for analyzing the genetic relatedness of proteins and species.
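One simple way to quantify this is percent identity over two sequences that have already been aligned. The sketch below is a minimal illustration under stated assumptions: both inputs have equal length, `-` marks a gap, and gapped columns are skipped. The name `percent_identity` is hypothetical, and identity is only one possible measure; similarity scores based on substitution matrices are not covered here.

```python
def percent_identity(seq_a: str, seq_b: str) -> float:
    """Percent of compared columns in which two pre-aligned sequences agree.

    Assumption for this sketch: columns containing a gap ('-') in either
    sequence are excluded from the comparison.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("aligned sequences must have the same length")
    compared = 0
    matches = 0
    for a, b in zip(seq_a.upper(), seq_b.upper()):
        if a == "-" or b == "-":
            continue  # skip gapped columns
        compared += 1
        if a == b:
            matches += 1
    return 100.0 * matches / compared if compared else 0.0


if __name__ == "__main__":
    print(percent_identity("ACDE", "ACDF"))
```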
A POU domain factor that regulates expression of GROWTH HORMONE; PROLACTIN; and THYROTROPIN-BETA in the ANTERIOR PITUITARY GLAND.
Recombinant proteins produced by the GENETIC TRANSLATION of fused genes formed by the combination of NUCLEIC ACID REGULATORY SEQUENCES of one or more genes with the protein coding sequences of one or more genes.
Any detectable and heritable change in the genetic material that causes a change in the GENOTYPE and which is transmitted to daughter cells and to succeeding generations.
Processes that stimulate the GENETIC TRANSCRIPTION of a gene or set of genes.
'Eye proteins' are structural or functional proteins, such as crystallins, opsins, and collagens, located in various parts of the eye, including the cornea, lens, retina, and aqueous humor, that contribute to maintaining transparency, refractive power, phototransduction, and overall integrity of the visual system.
The developmental entity of a fertilized egg (ZYGOTE) in animal species other than MAMMALS. For chickens, use CHICK EMBRYO.
A cellular transcriptional coactivator that was originally identified by its requirement for the stable assembly of IMMEDIATE-EARLY PROTEINS of the HERPES SIMPLEX VIRUS. It is a nuclear protein that is a transcriptional coactivator for a number of transcription factors including VP16 PROTEIN; GA-BINDING PROTEIN; EARLY GROWTH RESPONSE PROTEIN 2; and E2F4 TRANSCRIPTION FACTOR. It also interacts with and stabilizes HERPES SIMPLEX VIRUS PROTEIN VMW65 and helps regulate GENETIC TRANSCRIPTION of IMMEDIATE-EARLY GENES in HERPES SIMPLEX VIRUS.
A family of transcription factors characterized by the presence of a bipartite DNA-binding domain known as the POU domain. The POU domain contains two subdomains, a POU-specific domain and a POU-homeodomain. The POU domain was originally identified as a region of approximately 150 amino acids shared between the Pit-1, Oct-1, Oct-2, and Unc-86 transcription factors.
'Nerve tissue proteins' are specialized proteins found within the nervous system's biological tissue, including neurofilaments, neuronal cytoskeletal proteins, and neural cell adhesion molecules, which facilitate structural support, intracellular communication, and synaptic connectivity essential for proper neurological function.
The processes occurring in early development that direct morphogenesis. They specify the body plan ensuring that cells will proceed to differentiate, grow, and diversify in size and shape at the correct relative positions. Included are axial patterning, segmentation, compartment specification, limb position, organ boundary patterning, blood vessel patterning, etc.
A sequence of amino acids in a polypeptide or of nucleotides in DNA or RNA that is similar across multiple species. A known set of conserved sequences is represented by a CONSENSUS SEQUENCE. AMINO ACID MOTIFS are often composed of conserved sequences.
A onecut transcription factor that regulates expression of GENES involved in EMBRYONIC DEVELOPMENT of the PANCREAS and LIVER.
Fushi tarazu transcription factors were originally identified in DROSOPHILA. They are found throughout ARTHROPODS and play important roles in segmentation and CENTRAL NERVOUS SYSTEM development.
Proteins obtained from various species of Xenopus. Included here are proteins from the African clawed frog (XENOPUS LAEVIS). Many of these proteins have been the subject of scientific investigations in the area of MORPHOGENESIS and development.
Screening techniques first developed in yeast to identify genes encoding interacting proteins. Variations are used to evaluate interplay between proteins and other molecules. Two-hybrid techniques refer to analysis for protein-protein interactions, one-hybrid for DNA-protein interactions, and three-hybrid for RNA-protein interactions or ligand-based interactions. Reverse n-hybrid techniques refer to analysis for mutations or other small molecules that dissociate known interactions.
A theoretical representative nucleotide or amino acid sequence in which each nucleotide or amino acid is the one which occurs most frequently at that site in the different sequences which occur in nature. The phrase also refers to an actual sequence which approximates the theoretical consensus. A known CONSERVED SEQUENCE set is represented by a consensus sequence. Commonly observed supersecondary protein structures (AMINO ACID MOTIFS) are often formed by conserved sequences.
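The "most frequent residue at each site" rule in this entry can be sketched in Python as follows. The function name `consensus` is hypothetical, the input is assumed to be a set of pre-aligned sequences of equal length, and the alphabetical tie-break is an arbitrary choice made here so that the result is deterministic.

```python
from collections import Counter


def consensus(sequences: list[str]) -> str:
    """Return the most frequent symbol at each position of equal-length sequences."""
    if not sequences:
        raise ValueError("at least one sequence is required")
    length = len(sequences[0])
    if any(len(seq) != length for seq in sequences):
        raise ValueError("sequences must be pre-aligned to the same length")
    result = []
    for column in zip(*sequences):  # iterate over alignment columns
        counts = Counter(column)
        top = max(counts.values())
        # Assumed tie-break: pick the alphabetically first of the most common symbols.
        result.append(min(sym for sym, n in counts.items() if n == top))
    return "".join(result)


if __name__ == "__main__":
    print(consensus(["TAATTA", "TAATGA", "TCATTA"]))
```

The example sequences are invented for illustration and are not taken from any database.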
Any of the processes by which nuclear, cytoplasmic, or intercellular factors influence the differential control (induction or repression) of gene action at the level of transcription or translation.
Cis-acting DNA sequences which can increase transcription of genes. Enhancers can usually function in either orientation and at various distances from a promoter.
Progressive restriction of the developmental potential and increasing specialization of function that leads to the formation of specialized cells, tissues, and organs.
The arrangement of two or more amino acid or base sequences from an organism or organisms in such a way as to align areas of the sequences sharing common properties. The degree of relatedness or homology between the sequences is predicted computationally or statistically based on weights assigned to the elements aligned between the sequences. This in turn can serve as a potential indicator of the genetic relatedness between the organisms.
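To make the idea of "weights assigned to the elements aligned" concrete, the sketch below computes a global alignment score for two sequences using the classic Needleman-Wunsch dynamic programming recurrence. The scoring weights (match +1, mismatch -1, gap -1) and the name `global_alignment_score` are assumptions chosen for illustration; practical tools typically use substitution matrices and more elaborate gap penalties.

```python
def global_alignment_score(a: str, b: str, match: int = 1,
                           mismatch: int = -1, gap: int = -1) -> int:
    """Best global alignment score of a and b (Needleman-Wunsch, linear gap cost)."""
    # prev[j] holds the best score for aligning the processed prefix of a with b[:j].
    prev = [j * gap for j in range(len(b) + 1)]
    for i in range(1, len(a) + 1):
        curr = [i * gap]  # aligning a[:i] against an empty prefix of b
        for j in range(1, len(b) + 1):
            diagonal = prev[j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            gap_in_b = prev[j] + gap      # a[i-1] aligned to a gap
            gap_in_a = curr[j - 1] + gap  # b[j-1] aligned to a gap
            curr.append(max(diagonal, gap_in_b, gap_in_a))
        prev = curr
    return prev[-1]


if __name__ == "__main__":
    print(global_alignment_score("ACGT", "AGT"))
```

Only the score is returned; recovering the alignment itself would require keeping the full table and tracing back through it.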