A variety of simple repeat sequences that are distributed throughout the GENOME. They are characterized by a short repeat unit of 2-8 basepairs that is repeated up to 100 times. They are also known as short tandem repeats (STRs).
The most common of the microsatellite tandem repeats (MICROSATELLITE REPEATS) dispersed in the euchromatic arms of chromosomes. They consist of two nucleotides repeated in tandem; guanine and thymine, (GT)n, is the most frequently seen.
Highly repetitive DNA sequences found in HETEROCHROMATIN, mainly near centromeres. They are composed of simple sequences (very short) (see MINISATELLITE REPEATS) repeated in tandem many times to form large blocks of sequence. Additionally, following the accumulation of mutations, these blocks of repeats have been repeated in tandem themselves. The degree of repetition is on the order of 1000 to 10 million at each locus. Loci are few, usually one or two per chromosome. They were called satellites since in density gradients, they often sediment as distinct, satellite bands separate from the bulk of genomic DNA owing to a distinct BASE COMPOSITION.
Microsatellite repeats consisting of three nucleotides dispersed in the euchromatic arms of chromosomes.
Sequences of DNA or RNA that occur in multiple copies. There are several types: INTERSPERSED REPETITIVE SEQUENCES are copies of transposable elements (DNA TRANSPOSABLE ELEMENTS or RETROELEMENTS) dispersed throughout the genome. TERMINAL REPEAT SEQUENCES flank both ends of another sequence, for example, the long terminal repeats (LTRs) on RETROVIRUSES. Variations may be direct repeats, those occurring in the same direction, or inverted repeats, those opposite to each other in direction. TANDEM REPEAT SEQUENCES are copies which lie adjacent to each other, direct or inverted (INVERTED REPEAT SEQUENCES).
The sequence of PURINES and PYRIMIDINES in nucleic acids and polynucleotides. It is also called nucleotide sequence.
Variant forms of the same gene, occupying the same locus on homologous CHROMOSOMES, and governing the variants in production of the same gene product.
Descriptions of specific amino acid, carbohydrate, or nucleotide sequences which have appeared in the published literature and/or are deposited in and maintained by databanks such as GENBANK, European Molecular Biology Laboratory (EMBL), National Biomedical Research Foundation (NBRF), or other sequence repositories.
In vitro method for producing large amounts of specific DNA or RNA fragments of defined length and sequence from small amounts of short oligonucleotide flanking sequences (primers). The essential steps include thermal denaturation of the double-stranded target molecules, annealing of the primers to their complementary sequences, and extension of the annealed primers by enzymatic synthesis with DNA polymerase. The reaction is efficient, specific, and extremely sensitive. Uses for the reaction include disease diagnosis, detection of difficult-to-isolate pathogens, mutation analysis, genetic testing, DNA sequencing, and analyzing evolutionary relationships.
A phenotypically recognizable genetic trait which can be used to identify a genetic locus, a linkage group, or a recombination event.
The occurrence of highly polymorphic mono- and dinucleotide MICROSATELLITE REPEATS in somatic cells. It is a form of genome instability associated with defects in DNA MISMATCH REPAIR.
The regular and simultaneous occurrence in a single interbreeding population of two or more discontinuous genotypes. The concept includes differences in genotypes ranging in size from a single nucleotide site (POLYMORPHISM, SINGLE NUCLEOTIDE) to large nucleotide sequences visible at a chromosomal level.
Any method used for determining the location of and relative distances between genes on a chromosome.
Short sequences (generally about 10 base pairs) of DNA that are complementary to sequences of messenger RNA and allow reverse transcriptases to start copying the adjacent sequences of mRNA. Primers are used extensively in genetic and molecular biology techniques.
A multistage process that includes cloning, physical mapping, subcloning, determination of the DNA SEQUENCE, and information analysis.
The genetic complement of an organism, including all of its GENES, as represented in its DNA, or in some cases, its RNA.
A deoxyribonucleotide polymer that is the primary genetic material of all cells. Eukaryotic and prokaryotic organisms normally contain DNA in a double-stranded state, yet several important biological processes transiently involve single-stranded regions. DNA, which consists of a polysugar-phosphate backbone possessing projections of purines (adenine and guanine) and pyrimidines (thymine and cytosine), forms a double helix that is held together by hydrogen bonds between these purines and pyrimidines (adenine to thymine and guanine to cytosine).
The record of descent or ancestry, particularly of a particular condition or trait, indicating individual family members, their relationships, and their status with respect to the trait or condition.
Any detectable and heritable change in the genetic material that causes a change in the GENOTYPE and which is transmitted to daughter cells and to succeeding generations.
Genotypic differences observed among individuals in a population.
Specific regions that are mapped within a GENOME. Genetic loci are usually identified with a shorthand notation that indicates the chromosome number and the position of a specific band along the P or Q arm of the chromosome where they are found. For example the locus 6p21 is found within band 21 of the P-arm of CHROMOSOME 6. Many well known genetic loci are also known by common names that are associated with a genetic function or HEREDITARY DISEASE.
A single-stranded DNA-binding protein that is found in EUKARYOTIC CELLS. It is required for DNA REPLICATION; DNA REPAIR; and GENETIC RECOMBINATION.
Copies of DNA sequences which lie adjacent to each other in the same orientation (direct tandem repeats) or in the opposite direction to each other (INVERTED TANDEM REPEATS).
Proteins which bind to DNA. The family includes proteins which bind to both double- and single-stranded DNA and also includes specific DNA binding proteins in serum which can be used as markers for malignant diseases.
Deoxyribonucleic acid that makes up the genetic material of viruses.
Established cell cultures that have the potential to propagate indefinitely.
The genetic constitution of the individual, comprising the ALLELES present at each GENETIC LOCUS.
The loss of one allele at a specific locus, caused by a deletion mutation; or loss of a chromosome from a chromosome pair, resulting in abnormal HEMIZYGOSITY. It is detected when heterozygous markers for a locus appear monomorphic because one of the ALLELES was deleted.
Deoxyribonucleic acid that makes up the genetic material of plants.
The discipline studying genetic composition of populations and effects of factors such as GENETIC SELECTION, population size, MUTATION, migration, and GENETIC DRIFT on the frequencies of various GENOTYPES and PHENOTYPES using a variety of GENETIC TECHNIQUES.
Extrachromosomal, usually CIRCULAR DNA molecules that are self-replicating and transferable from one organism to another. They are found in a variety of bacterial, archaeal, fungal, algal, and plant species. They are used in GENETIC ENGINEERING as CLONING VECTORS.
Proteins found in any species of virus.
Ribonucleic acid that makes up the genetic material of viruses.
Tandem arrays of moderately repetitive, short (10-60 bases) DNA sequences which are found dispersed throughout the GENOME, at the ends of chromosomes (TELOMERES), and clustered near telomeres. Their degree of repetition is two to several hundred at each locus. Loci number in the thousands but each locus shows a distinctive repeat unit.
The temporal order in which the DNA of the GENOME is replicated.
The order of amino acids as they occur in a polypeptide chain. This is referred to as the primary structure of proteins. It is of fundamental importance in determining PROTEIN CONFORMATION.
An increased number of contiguous trinucleotide repeats in the DNA sequence from one generation to the next. The presence of these regions is associated with diseases such as FRAGILE X SYNDROME and MYOTONIC DYSTROPHY. Some CHROMOSOME FRAGILE SITES are composed of sequences where trinucleotide repeat expansion occurs.
Any DNA sequence capable of independent replication or a molecule that possesses a REPLICATION ORIGIN and which is therefore potentially capable of being replicated in a suitable cell. (Singleton & Sainsbury, Dictionary of Microbiology and Molecular Biology, 2d ed)
An individual having different alleles at one or more loci regarding a specific character.
MutS homolog 2 protein is found throughout eukaryotes and is a homolog of the MUTS DNA MISMATCH-BINDING PROTEIN. It plays an essential role in meiotic RECOMBINATION and DNA REPAIR of mismatched NUCLEOTIDES.
The co-inheritance of two or more non-allelic GENES due to their being located more or less closely on the same CHROMOSOME.
Proteins found in the nucleus of a cell. Do not confuse with NUCLEOPROTEINS which are proteins conjugated with nucleic acids, that are not necessarily present in the nucleus.
DNA present in neoplastic tissue.
The presence of an uncomplimentary base in double-stranded DNA caused by spontaneous deamination of cytosine or adenine, mismatching during homologous recombination, or errors in DNA replication. Multiple, sequential base pair mismatches lead to formation of heteroduplex DNA; (NUCLEIC ACID HETERODUPLEXES).
The reconstruction of a continuous two-stranded DNA molecule without mismatch from a molecule which contained damaged regions. The major repair mechanisms are excision repair, in which defective regions in one strand are excised and resynthesized using the complementary base pairing information in the intact strand; photoreactivation repair, in which the lethal and mutagenic effects of ultraviolet light are eliminated; and post-replication repair, in which the primary lesions are not repaired, but the gaps in one daughter duplex are filled in by incorporation of portions of the other (undamaged) daughter duplex. Excision repair and post-replication repair are sometimes referred to as "dark repair" because they do not require light.
Production of new arrangements of DNA by various mechanisms such as assortment and segregation, CROSSING OVER; GENE CONVERSION; GENETIC TRANSFORMATION; GENETIC CONJUGATION; GENETIC TRANSDUCTION; or mixed infection of viruses.
An increased tendency of the GENOME to acquire MUTATIONS when various processes involved in maintaining and replicating the genome are dysfunctional.
Theoretical representations that simulate the behavior or activity of genetic processes or phenomena. They include the use of mathematical equations, computers, and other electronic equipment.
An animal or plant species in danger of extinction. Causes can include human activity, changing climate, or change in predator/prey ratios.
Proteins that catalyze the unwinding of duplex DNA during replication by binding cooperatively to single-stranded regions of DNA or to short regions of duplex DNA that are undergoing transient opening. In addition DNA helicases are DNA-dependent ATPases that harness the free energy of ATP hydrolysis to translocate DNA strands.
A DNA-binding protein that consists of 5 polypeptides and plays an essential role in DNA REPLICATION in eukaryotes. It binds DNA PRIMER-template junctions and recruits PROLIFERATING CELL NUCLEAR ANTIGEN and DNA POLYMERASES to the site of DNA synthesis.
The complete genetic complement contained in a DNA or RNA molecule in a virus.
A DNA repair pathway involved in correction of errors introduced during DNA replication when an incorrect base, which cannot form hydrogen bonds with the corresponding base in the parent strand, is incorporated into the daughter strand. Excinucleases recognize the BASE PAIR MISMATCH and cause a segment of polynucleotide chain to be excised from the daughter strand, thereby removing the mismatched base. (from Oxford Dictionary of Biochemistry and Molecular Biology, 2001)
A species of gram-negative, facultatively anaerobic, rod-shaped bacteria (GRAM-NEGATIVE FACULTATIVELY ANAEROBIC RODS) commonly found in the lower part of the intestine of warm-blooded animals. It is usually nonpathogenic, but some strains are known to produce DIARRHEA and pyogenic infections. Pathogenic strains (virotypes) are classified by their specific pathogenic mechanisms such as toxins (ENTEROTOXIGENIC ESCHERICHIA COLI), etc.
The biosynthesis of RNA carried out on a template of DNA. The biosynthesis of DNA from an RNA template is called REVERSE TRANSCRIPTION.
The proportion of one particular in the total of all ALLELES for one genetic locus in a breeding POPULATION.
The relationships of groups of organisms as reflected by their genetic makeup.
The restriction of a characteristic behavior, anatomical structure or physical system, such as immune response; metabolic response, or gene or gene variant to the members of one species. It refers to that property which differentiates one species from another but it is also used for phylogenetic levels higher or lower than the species.
Phase of the CELL CYCLE following G1 and preceding G2 when the entire DNA content of the nucleus is replicated. It is achieved by bidirectional replication at multiple sites along each chromosome.
The genetic constitution of individuals with respect to one member of a pair of allelic genes, or sets of genes that are closely linked and tend to be inherited together such as those of the MAJOR HISTOCOMPATIBILITY COMPLEX.
Protein motif that contains a 33-amino acid long sequence that often occurs in tandem arrays. This repeating sequence of 33-amino acids was discovered in ANKYRIN where it is involved in interaction with the anion exchanger (ANION EXCHANGE PROTEIN 1, ERYTHROCYTE). Ankyrin repeats cooperatively fold into domains that mediate molecular recognition via protein-protein interactions.
Any of the processes by which cytoplasmic factors influence the differential control of gene action in viruses.
The type species of LENTIVIRUS and the etiologic agent of AIDS. It is characterized by its cytopathic effect and affinity for the T4-lymphocyte.
Tumors or cancer of the COLON or the RECTUM or both. Risk factors for colorectal cancer include chronic ULCERATIVE COLITIS; FAMILIAL POLYPOSIS COLI; exposure to ASBESTOS; and irradiation of the CERVIX UTERI.
The outward appearance of the individual. It is the product of interactions between genes, and between the GENOTYPE and the environment.
The first continuously cultured human malignant CELL LINE, derived from the cervical carcinoma of Henrietta Lacks. These cells are used for VIRUS CULTIVATION and antitumor drug screening assays.
A type of mutation in which a number of NUCLEOTIDES deleted from or inserted into a protein coding sequence is not divisible by three, thereby causing an alteration in the READING FRAMES of the entire coding sequence downstream of the mutation. These mutations may be induced by certain types of MUTAGENS or may occur spontaneously.
A sequential pattern of amino acids occurring more than once in the same protein sequence.
The change in gene frequency in a population due to migration of gametes or individuals (ANIMAL MIGRATION) across population barriers. In contrast, in GENETIC DRIFT the cause of gene frequency changes are not a result of population or gamete movement.
A group of autosomal-dominant inherited diseases in which COLON CANCER arises in discrete adenomas. Unlike FAMILIAL POLYPOSIS COLI with hundreds of polyps, hereditary nonpolyposis colorectal neoplasms occur much later, in the fourth and fifth decades. HNPCC has been associated with germline mutations in mismatch repair (MMR) genes. It has been subdivided into Lynch syndrome I or site-specific colonic cancer, and LYNCH SYNDROME II which includes extracolonic cancer.
The spatial arrangement of the atoms of a nucleic acid or polynucleotide that results in its characteristic 3-dimensional shape.
Copies of nucleic acid sequence that are arranged in opposing orientation. They may lie adjacent to each other (tandem) or be separated by some sequence that is not part of the repeat (hyphenated). They may be true palindromic repeats, i.e. read the same backwards as forward, or complementary which reads as the base complement in the opposite orientation. Complementary inverted repeats have the potential to form hairpin loop or stem-loop structures which results in cruciform structures (such as CRUCIFORM DNA) when the complementary inverted repeats occur in double stranded regions.
DNA-dependent DNA polymerases found in bacteria, animal and plant cells. During the replication process, these enzymes catalyze the addition of deoxyribonucleotide residues to the end of a DNA strand in the presence of DNA as template-primer. They also possess exonuclease activity and therefore function in DNA repair.
The insertion of recombinant DNA molecules from prokaryotic and/or eukaryotic sources into a replicating vehicle, such as a plasmid or virus vector, and the introduction of the resultant hybrid molecules into recipient cells without altering the viability of those cells.
Proteins that control the CELL DIVISION CYCLE. This family of proteins includes a wide variety of classes, including CYCLIN-DEPENDENT KINASES, mitogen-activated kinases, CYCLINS, and PHOSPHOPROTEIN PHOSPHATASES as well as their putative substrates such as chromatin-associated proteins, CYTOSKELETAL PROTEINS, and TRANSCRIPTION FACTORS.
Biochemical identification of mutational changes in a nucleotide sequence.
The process in which substances, either endogenous or exogenous, bind to proteins, peptides, enzymes, protein precursors, or allied compounds. Specific protein-binding measures are often used as assays in diagnostic assessments.
Deoxyribonucleic acid that makes up the genetic material of bacteria.
Deoxyribonucleic acid that makes up the genetic material of fungi.
A region of DNA that is highly polymorphic and is prone to strand breaks, rearrangements or other MUTATIONS because of the nature of its sequence. These regions often harbor palindromic, or repetitive sequences (REPETITIVE SEQUENCES, NUCLEIC ACID). Variability in stability of the DNA sequence is seen at CHROMOSOME FRAGILE SITES.
Deletion of sequences of nucleic acids from the genetic material of an individual.
DNA sequences which are recognized (directly or indirectly) and bound by a DNA-dependent RNA polymerase during the initiation of transcription. Highly conserved sequences within the promoter include the Pribnow box in bacteria and the TATA BOX in eukaryotes.
The process of cumulative change at the level of DNA; RNA; and PROTEINS, over successive generations.
The science dealing with the earth and its life, especially the description of land, sea, and air and the distribution of plant and animal life, including humanity and human industries with reference to the mutual relations of these elements. (From Webster, 3d ed)
A broad category of carrier proteins that play a role in SIGNAL TRANSDUCTION. They generally contain several modular domains, each of which having its own binding activity, and act by forming complexes with other intracellular-signaling molecules. Signal-transducing adaptor proteins lack enzyme activity, however their activity can be modulated by other signal-transducing enzymes
Double-stranded DNA of MITOCHONDRIA. In eukaryotes, the mitochondrial GENOME is circular and codes for ribosomal RNAs, transfer RNAs, and about 10 proteins.
Injuries to DNA that introduce deviations from its normal, intact structure and which may, if left unrepaired, result in a MUTATION or a block of DNA REPLICATION. These deviations may be caused by physical or chemical agents and occur by natural or unnatural, introduced circumstances. They include the introduction of illegitimate bases during replication or by deamination or other modification of bases; the loss of a base from the DNA backbone leaving an abasic site; single-strand breaks; double strand breaks; and intrastrand (PYRIMIDINE DIMERS) or interstrand crosslinking. Damage can often be repaired (DNA REPAIR). If the damage is extensive, it can induce APOPTOSIS.
A genetic rearrangement through loss of segments of DNA or RNA, bringing sequences which are normally separated into close proximity. This deletion may be detected using cytogenetic techniques and can also be inferred from the phenotype, indicating a deletion at one specific locus.
A species of the genus SACCHAROMYCES, family Saccharomycetaceae, order Saccharomycetales, known as "baker's" or "brewer's" yeast. The dried form is used as a dietary supplement.
The total relative probability, expressed on a logarithmic scale, that a linkage relationship exists among selected loci. Lod is an acronym for "logarithmic odds."
Within a eukaryotic cell, a membrane-limited body which contains chromosomes and one or more nucleoli (CELL NUCLEOLUS). The nuclear membrane consists of a double unit-type membrane which is perforated by a number of pores; the outermost membrane is continuous with the ENDOPLASMIC RETICULUM. A cell may contain more than one nucleus. (From Singleton & Sainsbury, Dictionary of Microbiology and Molecular Biology, 2d ed)
A single chain of deoxyribonucleotides that occurs in some bacteria and viruses. It usually exists as a covalently closed circle.
Proteins encoded by a VIRAL GENOME that are produced in the organisms they infect, but not packaged into the VIRUS PARTICLES. Some of these proteins may play roles within the infected cell during VIRUS REPLICATION or act in regulation of virus replication or VIRUS ASSEMBLY.
Transport proteins that carry specific substances in the blood or across cell membranes.
The functional hereditary units of VIRUSES.
Nonrandom association of linked genes. This is the tendency of the alleles of two separate but already linked loci to be found together more frequently than would be expected by chance alone.
An increase number of repeats of a genomic, tandemly repeated DNA sequence from one generation to the next.
The origin recognition complex is a multi-subunit DNA-binding protein that initiates DNA REPLICATION in eukaryotes.
Process of generating a genetic MUTATION. It may occur spontaneously or be induced by MUTAGENS.
The parts of a macromolecule that directly participate in its specific combination with another molecule.
Proteins found in any species of bacterium.
A specific pair of human chromosomes in group A (CHROMOSOMES, HUMAN, 1-3) of the human chromosome classification.
Structures within the nucleus of bacterial cells consisting of or containing DNA, which carry genetic information essential to the cell.
Agents used in the prophylaxis or therapy of VIRUS DISEASES. Some of the ways they may act include preventing viral replication by inhibiting viral DNA polymerase; binding to specific cell-surface receptors and inhibiting viral penetration or uncoating; inhibiting viral protein synthesis; or blocking late stages of virus assembly.
Proteins obtained from the species SACCHAROMYCES CEREVISIAE. The function of specific proteins from this organism are the subject of intense scientific interest and have been used to derive basic understanding of the functioning similar proteins in higher eukaryotes.
The arrangement of two or more amino acid or base sequences from an organism or organisms in such a way as to align areas of the sequences sharing common properties. The degree of relatedness or homology between the sequences is predicted computationally or statistically based on weights assigned to the elements aligned between the sequences. This in turn can serve as a potential indicator of the genetic relatedness between the organisms.
In a prokaryotic cell or in the nucleus of a eukaryotic cell, a structure consisting of or containing DNA which carries the genetic information essential to the cell. (From Singleton & Sainsbury, Dictionary of Microbiology and Molecular Biology, 2d ed)
Cells propagated in vitro in special media conducive to their growth. Cultured cells are used to study developmental, morphologic, metabolic, physiologic, and genetic processes, among others.
The complex series of phenomena, occurring between the end of one CELL DIVISION and the end of the next, by which cellular material is duplicated and then divided between two daughter cells. The cell cycle includes INTERPHASE, which includes G0 PHASE; G1 PHASE; S PHASE; and G2 PHASE, and CELL DIVISION PHASE.
An antineoplastic agent that inhibits DNA synthesis through the inhibition of ribonucleoside diphosphate reductase.
Regulatory sequences important for viral replication that are located on each end of the HIV genome. The LTR includes the HIV ENHANCER, promoter, and other sequences. Specific regions in the LTR include the negative regulatory element (NRE), NF-kappa B binding sites , Sp1 binding sites, TATA BOX, and trans-acting responsive element (TAR). The binding of both cellular and viral proteins to these regions regulates HIV transcription.
A latent susceptibility to disease at the genetic level, which may be activated under certain conditions.
Establishing the father relationship of a man and a child.
A CELL LINE derived from the kidney of the African green (vervet) monkey, (CERCOPITHECUS AETHIOPS) used primarily in virus replication studies and plaque assays.
Nuclear antigen with a role in DNA synthesis, DNA repair, and cell cycle progression. PCNA is required for the coordinated synthesis of both leading and lagging strands at the replication fork during DNA replication. PCNA expression correlates with the proliferation activity of several malignant and non-malignant cell types.
Addition of methyl groups to DNA. DNA methyltransferases (DNA methylases) perform this reaction using S-ADENOSYLMETHIONINE as the methyl group donor.
Use of restriction endonucleases to analyze and generate a physical map of genomes, genes, or other segments of DNA.
The material of CHROMOSOMES. It is a complex of DNA; HISTONES; and nonhistone proteins (CHROMOSOMAL PROTEINS, NON-HISTONE) found within the nucleus of a cell.