Genes that encode highly conserved TRANSCRIPTION FACTORS that control positional identity of cells (BODY PATTERNING) and MORPHOGENESIS throughout development. Their sequences contain a 180 nucleotide sequence designated the homeobox, so called because mutations of these genes often results in homeotic transformations, in which one body structure replaces another. The proteins encoded by homeobox genes are called HOMEODOMAIN PROTEINS.
Proteins encoded by homeobox genes (GENES, HOMEOBOX) that exhibit structural similarity to certain prokaryotic and eukaryotic DNA-binding proteins. Homeodomain proteins are involved in the control of gene expression during morphogenesis and development (GENE EXPRESSION REGULATION, DEVELOPMENTAL).
Antennapedia homeodomain protein is a homeobox protein involved in limb patterning in ARTHROPODS. Mutations in the gene for the antennapedia homeodomain protein are associated with the conversion of antenna to leg or leg to antenna DROSOPHILA.
Descriptions of specific amino acid, carbohydrate, or nucleotide sequences which have appeared in the published literature and/or are deposited in and maintained by databanks such as GENBANK, European Molecular Biology Laboratory (EMBL), National Biomedical Research Foundation (NBRF), or other sequence repositories.
Endogenous substances, usually proteins, which are effective in the initiation, stimulation, or termination of the genetic transcription process.
The sequence of PURINES and PYRIMIDINES in nucleic acids and polynucleotides. It is also called nucleotide sequence.
The order of amino acids as they occur in a polypeptide chain. This is referred to as the primary structure of proteins. It is of fundamental importance in determining PROTEIN CONFORMATION.
A subclass of LIM domain proteins that include an additional centrally-located homeodomain region that binds AT-rich sites on DNA. Many LIM-homeodomain proteins play a role as transcriptional regulators that direct cell fate.
Any of the processes by which nuclear, cytoplasmic, or intercellular factors influence the differential control of gene action during the developmental stages of an organism.
Proteins which bind to DNA. The family includes proteins which bind to both double- and single-stranded DNA and also includes specific DNA binding proteins in serum which can be used as markers for malignant diseases.
A deoxyribonucleotide polymer that is the primary genetic material of all cells. Eukaryotic and prokaryotic organisms normally contain DNA in a double-stranded state, yet several important biological processes transiently involve single-stranded regions. DNA, which consists of a polysugar-phosphate backbone possessing projections of purines (adenine and guanine) and pyrimidines (thymine and cytosine), forms a double helix that is held together by hydrogen bonds between these purines and pyrimidines (adenine to thymine and guanine to cytosine).
A family of transcription factors that control EMBRYONIC DEVELOPMENT within a variety of cell lineages. They are characterized by a highly conserved paired DNA-binding domain that was first identified in DROSOPHILA segmentation genes.
An increased tendency of the GENOME to acquire MUTATIONS when various processes involved in maintaining and replicating the genome are dysfunctional.
Proteins which maintain the transcriptional quiescence of specific GENES or OPERONS. Classical repressor proteins are DNA-binding proteins that are normally bound to the OPERATOR REGION of an operon, or the ENHANCER SEQUENCES of a gene until a signal occurs that causes their release.
The systematic study of the complete DNA sequences (GENOME) of organisms.
The biosynthesis of RNA carried out on a template of DNA. The biosynthesis of DNA from an RNA template is called REVERSE TRANSCRIPTION.
DNA sequences which are recognized (directly or indirectly) and bound by a DNA-dependent RNA polymerase during the initiation of transcription. Highly conserved sequences within the promoter include the Pribnow box in bacteria and the TATA BOX in eukaryotes.
The parts of a macromolecule that directly participate in its specific combination with another molecule.
A form of GENE LIBRARY containing the complete DNA sequences present in the genome of a given organism. It contrasts with a cDNA library which contains only sequences utilized in protein coding (lacking introns).
Proteins that originate from insect species belonging to the genus DROSOPHILA. The proteins from the most intensely studied species of Drosophila, DROSOPHILA MELANOGASTER, are the subject of much interest in the area of MORPHOGENESIS and development.
The insertion of recombinant DNA molecules from prokaryotic and/or eukaryotic sources into a replicating vehicle, such as a plasmid or virus vector, and the introduction of the resultant hybrid molecules into recipient cells without altering the viability of those cells.
A method for comparing two sets of chromosomal DNA by analyzing differences in the copy number and location of specific sequences. It is used to look for large sequence changes such as deletions, duplications, amplifications, or translocations.
Any detectable and heritable change in the genetic material that causes a change in the GENOTYPE and which is transmitted to daughter cells and to succeeding generations.
A genus of small, two-winged flies containing approximately 900 described species. These organisms are the most extensively studied of all genera from the standpoint of genetics and cytology.
Proteins found in the nucleus of a cell. Do not confuse with NUCLEOPROTEINS which are proteins conjugated with nucleic acids, that are not necessarily present in the nucleus.
A multistage process that includes cloning, physical mapping, subcloning, determination of the DNA SEQUENCE, and information analysis.
The degree of similarity between sequences of amino acids. This information is useful for the analyzing genetic relatedness of proteins and species.
A homeodomain protein that interacts with TATA-BOX BINDING PROTEIN. It represses GENETIC TRANSCRIPTION of target GENES and plays a critical role in ODONTOGENESIS.
The process in which substances, either endogenous or exogenous, bind to proteins, peptides, enzymes, protein precursors, or allied compounds. Specific protein-binding measures are often used as assays in diagnostic assessments.
The arrangement of two or more amino acid or base sequences from an organism or organisms in such a way as to align areas of the sequences sharing common properties. The degree of relatedness or homology between the sequences is predicted computationally or statistically based on weights assigned to the elements aligned between the sequences. This in turn can serve as a potential indicator of the genetic relatedness between the organisms.
Diffusible gene products that act on homologous or heterologous molecules of viral or cellular DNA to regulate the expression of proteins.
The relationships of groups of organisms as reflected by their genetic makeup.
The level of protein structure in which combinations of secondary protein structures (alpha helices, beta sheets, loop regions, and motifs) pack together to form folded shapes called domains. Disulfide bridges between cysteines in two different parts of the polypeptide chain along with other interactions between the chains play a role in the formation and stabilization of tertiary structure. Small proteins usually consist of only one domain but larger proteins may contain a number of domains connected by segments of polypeptide chain which lack regular secondary structure.
Distinct units in some bacterial, bacteriophage or plasmid GENOMES that are types of MOBILE GENETIC ELEMENTS. Encoded in them are a variety of fitness conferring genes, such as VIRULENCE FACTORS (in "pathogenicity islands or islets"), ANTIBIOTIC RESISTANCE genes, or genes required for SYMBIOSIS (in "symbiosis islands or islets"). They range in size from 10 - 500 kilobases, and their GC CONTENT and CODON usage differ from the rest of the genome. They typically contain an INTEGRASE gene, although in some cases this gene has been deleted resulting in "anchored genomic islands".
A sequence of amino acids in a polypeptide or of nucleotides in DNA or RNA that is similar across multiple species. A known set of conserved sequences is represented by a CONSENSUS SEQUENCE. AMINO ACID MOTIFS are often composed of conserved sequences.
Any method used for determining the location of and relative distances between genes on a chromosome.
The genetic complement of an organism, including all of its GENES, as represented in its DNA, or in some cases, its RNA.
A technique that localizes specific nucleic acid sequences within intact chromosomes, eukaryotic cells, or bacterial cells through the use of specific nucleic acid-labeled probes.
The outward appearance of the individual. It is the product of interactions between genes, and between the GENOTYPE and the environment.
A ubiquitously expressed octamer transcription factor that regulates GENETIC TRANSCRIPTION of SMALL NUCLEAR RNA; IMMUNOGLOBULIN GENES; and HISTONE H2B genes.
Recombinant proteins produced by the GENETIC TRANSLATION of fused genes formed by the combination of NUCLEIC ACID REGULATORY SEQUENCES of one or more genes with the protein coding sequences of one or more genes.
A family of VERTEBRATE homeodomain proteins that share homology with orthodenticle protein, Drosophila. They regulate GENETIC TRANSCRIPTION and play an important role in EMBRYONIC DEVELOPMENT of the BRAIN.
Short sequences (generally about 10 base pairs) of DNA that are complementary to sequences of messenger RNA and allow reverse transcriptases to start copying the adjacent sequences of mRNA. Primers are used extensively in genetic and molecular biology techniques.
Any of the processes by which nuclear, cytoplasmic, or intercellular factors influence the differential control (induction or repression) of gene action at the level of transcription or translation.
In vitro method for producing large amounts of specific DNA or RNA fragments of defined length and sequence from small amounts of short oligonucleotide flanking sequences (primers). The essential steps include thermal denaturation of the double-stranded target molecules, annealing of the primers to their complementary sequences, and extension of the annealed primers by enzymatic synthesis with DNA polymerase. The reaction is efficient, specific, and extremely sensitive. Uses for the reaction include disease diagnosis, detection of difficult-to-isolate pathogens, mutation analysis, genetic testing, DNA sequencing, and analyzing evolutionary relationships.
Goosecoid protein is a homeodomain protein that was first identified in XENOPUS. It is found in the SPEMANN ORGANIZER of VERTEBRATES and plays an important role in neuronal CELL DIFFERENTIATION and ORGANOGENESIS.
Processes that stimulate the GENETIC TRANSCRIPTION of a gene or set of genes.
'Eye proteins' are structural or functional proteins, such as crystallins, opsins, and collagens, located in various parts of the eye, including the cornea, lens, retina, and aqueous humor, that contribute to maintaining transparency, refractive power, phototransduction, and overall integrity of the visual system.
Single-stranded complementary DNA synthesized from an RNA template by the action of RNA-dependent DNA polymerase. cDNA (i.e., complementary DNA, not circular DNA, not C-DNA) is used in a variety of molecular cloning experiments as well as serving as a specific hybridization probe.
The developmental entity of a fertilized egg (ZYGOTE) in animal species other than MAMMALS. For chickens, use CHICK EMBRYO.
A POU domain factor that regulates expression of GROWTH HORMONE; PROLACTIN; and THYROTROPIN-BETA in the ANTERIOR PITUITARY GLAND.
The process of cumulative change at the level of DNA; RNA; and PROTEINS, over successive generations.
'Nerve tissue proteins' are specialized proteins found within the nervous system's biological tissue, including neurofilaments, neuronal cytoskeletal proteins, and neural cell adhesion molecules, which facilitate structural support, intracellular communication, and synaptic connectivity essential for proper neurological function.
The sequential correspondence of nucleotides in one nucleic acid molecule with those of another nucleic acid molecule. Sequence homology is an indication of the genetic relatedness of different organisms and gene function.
A set of genes descended by duplication and variation from some ancestral gene. Such genes may be clustered together on the same chromosome or dispersed on different chromosomes. Examples of multigene families include those that encode the hemoglobins, immunoglobulins, histocompatibility antigens, actins, tubulins, keratins, collagens, heat shock proteins, salivary glue proteins, chorion proteins, cuticle proteins, yolk proteins, and phaseolins, as well as histones, ribosomal RNA, and transfer RNA genes. The latter three are examples of reiterated genes, where hundreds of identical genes are present in a tandem array. (King & Stanfield, A Dictionary of Genetics, 4th ed)
A cellular transcriptional coactivator that was originally identified by its requirement for the stable assembly IMMEDIATE-EARLY PROTEINS of the HERPES SIMPLEX VIRUS. It is a nuclear protein that is a transcriptional coactivator for a number of transcription factors including VP16 PROTEIN; GA-BINDING PROTEIN; EARLY GROWTH RESPONSE PROTEIN 2; and E2F4 TRANSCRIPTION FACTOR. It also interacts with and stabilizes HERPES SIMPLEX VIRUS PROTEIN VMW65 and helps regulate GENETIC TRANSCRIPTION of IMMEDIATE-EARLY GENES in HERPES SIMPLEX VIRUS.
Established cell cultures that have the potential to propagate indefinitely.
The processes occurring in early development that direct morphogenesis. They specify the body plan ensuring that cells will proceed to differentiate, grow, and diversify in size and shape at the correct relative positions. Included are axial patterning, segmentation, compartment specification, limb position, organ boundary patterning, blood vessel patterning, etc.
A family of transcription factors characterized by the presence of a bipartite DNA-binding domain known as the POU domain. The POU domain contains two subdomains, a POU-specific domain and a POU-homeodomain. The POU domain was originally identified as a region of approximately 150 amino acids shared between the Pit-1, Oct-1, Oct-2, and Unc-86 transcription factors.
A theoretical representative nucleotide or amino acid sequence in which each nucleotide or amino acid is the one which occurs most frequently at that site in the different sequences which occur in nature. The phrase also refers to an actual sequence which approximates the theoretical consensus. A known CONSERVED SEQUENCE set is represented by a consensus sequence. Commonly observed supersecondary protein structures (AMINO ACID MOTIFS) are often formed by conserved sequences.
The variable phenotypic expression of a GENE depending on whether it is of paternal or maternal origin, which is a function of the DNA METHYLATION pattern. Imprinted regions are observed to be more methylated and less transcriptionally active. (Segen, Dictionary of Modern Medicine, 1992)
The complete genetic complement contained in the DNA of a set of CHROMOSOMES in a HUMAN. The length of the human genome is about 3 billion base pairs.
RNA sequences that serve as templates for protein synthesis. Bacterial mRNAs are generally primary transcripts in that they do not require post-transcriptional processing. Eukaryotic mRNA is synthesized in the nucleus and must be exported to the cytoplasm for translation. Most eukaryotic mRNAs have a sequence of polyadenylic acid at the 3' end, referred to as the poly(A) tail. The function of this tail is not known for certain, but it may play a role in the export of mature mRNA from the nucleus as well as in helping stabilize some mRNA molecules by retarding their degradation in the cytoplasm.
Theoretical representations that simulate the behavior or activity of genetic processes or phenomena. They include the use of mathematical equations, computers, and other electronic equipment.
Cis-acting DNA sequences which can increase transcription of genes. Enhancers can usually function in either orientation and at various distances from a promoter.
A species of fruit fly much used in genetics because of the large size of its chromosomes.
The parts of a transcript of a split GENE remaining after the INTRONS are removed. They are spliced together to become a MESSENGER RNA or other functional RNA.
The genetic complement of a BACTERIA as represented in its DNA.
A onecut transcription factor that regulates expression of GENES involved in EMBRYONIC DEVELOPMENT of the PANCREAS and LIVER.
Sequences of DNA in the genes that are located between the EXONS. They are transcribed along with the exons but are removed from the primary gene transcript by RNA SPLICING to leave mature RNA. Some introns code for separate genes.
Progressive restriction of the developmental potential and increasing specialization of function that leads to the formation of specialized cells, tissues, and organs.
Widely used technique which exploits the ability of complementary sequences in single-stranded DNAs or RNAs to pair with each other to form a double helix. Hybridization can take place between two complimentary DNA sequences, between a single-stranded DNA and a complementary RNA, or between two RNA sequences. The technique is used to detect and isolate specific sequences, measure homology, or define other characteristics of one or both strands. (Kendrew, Encyclopedia of Molecular Biology, 1994, p503)
Proteins obtained from various species of Xenopus. Included here are proteins from the African clawed frog (XENOPUS LAEVIS). Many of these proteins have been the subject of scientific investigations in the area of MORPHOGENESIS and development.
Screening techniques first developed in yeast to identify genes encoding interacting proteins. Variations are used to evaluate interplay between proteins and other molecules. Two-hybrid techniques refer to analysis for protein-protein interactions, one-hybrid for DNA-protein interactions, three-hybrid interactions for RNA-protein interactions or ligand-based interactions. Reverse n-hybrid techniques refer to analysis for mutations or other small molecules that dissociate known interactions.
Fushi tarazu transcription factors were originally identified in DROSOPHILA. They are found throughout ARTHROPODS and play important roles in segmentation and CENTRAL NERVOUS SYSTEM development.
The uptake of naked or purified DNA by CELLS, usually meaning the process as it occurs in eukaryotic cells. It is analogous to bacterial transformation (TRANSFORMATION, BACTERIAL) and both are routinely employed in GENE TRANSFER TECHNIQUES.
Hybridization of a nucleic acid sample to a very large set of OLIGONUCLEOTIDE PROBES, which have been attached individually in columns and rows to a solid support, to determine a BASE SEQUENCE, or to detect variations in a gene sequence, GENE EXPRESSION, or for GENE MAPPING.
The determination of the pattern of genes expressed at the level of GENETIC TRANSCRIPTION, under specific circumstances or in a specific cell.
Hormones secreted by insects. They influence their growth and development. Also synthetic substances that act like insect hormones.
A large collection of DNA fragments cloned (CLONING, MOLECULAR) from a given organism, tissue, organ, or cell type. It may contain complete genomic sequences (GENOMIC LIBRARY) or complementary DNA sequences, the latter being formed from messenger RNA and lacking intron sequences.
An exotic species of the family CYPRINIDAE, originally from Asia, that has been introduced in North America. They are used in embryological studies and to study the effects of certain chemicals on development.
Proteins obtained from the ZEBRAFISH. Many of the proteins in this species have been the subject of studies involving basic embryological development (EMBRYOLOGY).
Genes whose expression is easily detectable and therefore used to study promoter activity at many positions in a target genome. In recombinant DNA technology, these genes may be attached to a promoter region of interest.