Overlapping of cloned or sequenced DNA to construct a continuous region of a gene, chromosome or genome.
Any method used for determining the location of and relative distances between genes on a chromosome.
Chromosomes in which fragments of exogenous DNA ranging in length up to several hundred kilobase pairs have been cloned into yeast through ligation to vector sequences. These artificial chromosomes are used extensively in molecular biology for the construction of comprehensive genomic libraries of higher organisms.
Short tracts of DNA sequence that are used as landmarks in GENOME mapping. In most instances, 200 to 500 base pairs of sequence define a Sequence Tagged Site (STS) that is operationally unique in the human genome (i.e., can be specifically detected by the polymerase chain reaction in the presence of all other genomic sequences). The overwhelming advantage of STSs over mapping landmarks defined in other ways is that the means of testing for the presence of a particular STS can be completely described as information in a database.
Mapping of the linear order of genes on a chromosome with units indicating their distances by using methods other than genetic recombination. These methods include nucleotide sequencing, overlapping deletions in polytene chromosomes, and electron micrography of heteroduplex DNA. (From King & Stansfield, A Dictionary of Genetics, 5th ed)
A technique with which an unknown region of a chromosome can be explored. It is generally used to isolate a locus of interest for which no probe is available but that is known to be linked to a gene which has been identified and cloned. A fragment containing a known gene is selected and used as a probe to identify other overlapping fragments which contain the same gene. The nucleotide sequences of these fragments can then be characterized. This process continues for the length of the chromosome.
DNA constructs that are composed of, at least, a REPLICATION ORIGIN, for successful replication, propagation to and maintenance as an extra chromosome in bacteria. In addition, they can carry large amounts (about 200 kilobases) of other sequence for a variety of bioengineering purposes.
A species of temperate bacteriophage in the genus P1-like viruses, family MYOVIRIDAE, which infects E. coli. It is the largest of the COLIPHAGES and consists of double-stranded DNA, terminally redundant, and circularly permuted.
Plasmids containing at least one cos (cohesive-end site) of PHAGE LAMBDA. They are used as cloning vehicles.
A phenotypically recognizable genetic trait which can be used to identify a genetic locus, a linkage group, or a recombination event.
Descriptions of specific amino acid, carbohydrate, or nucleotide sequences which have appeared in the published literature and/or are deposited in and maintained by databanks such as GENBANK, European Molecular Biology Laboratory (EMBL), National Biomedical Research Foundation (NBRF), or other sequence repositories.
A multistage process that includes cloning, physical mapping, subcloning, determination of the DNA SEQUENCE, and information analysis.
Partial cDNA (DNA, COMPLEMENTARY) sequences that are unique to the cDNAs from which they were derived.
A large collection of DNA fragments cloned (CLONING, MOLECULAR) from a given organism, tissue, organ, or cell type. It may contain complete genomic sequences (GENOMIC LIBRARY) or complementary DNA sequences, the latter being formed from messenger RNA and lacking intron sequences.
The sequence of PURINES and PYRIMIDINES in nucleic acids and polynucleotides. It is also called nucleotide sequence.
A method for ordering genetic loci along CHROMOSOMES. The method involves fusing irradiated donor cells with host cells from another species. Following cell fusion, fragments of DNA from the irradiated cells become integrated into the chromosomes of the host cells. Molecular probing of DNA obtained from the fused cells is used to determine if two or more genetic loci are located within the same fragment of donor cell DNA.
Use of restriction endonucleases to analyze and generate a physical map of genomes, genes, or other segments of DNA.
A form of GENE LIBRARY containing the complete DNA sequences present in the genome of a given organism. It contrasts with a cDNA library which contains only sequences utilized in protein coding (lacking introns).
The co-inheritance of two or more non-allelic GENES due to their being located more or less closely on the same CHROMOSOME.
Analysis of PEPTIDES that are generated from the digestion or fragmentation of a protein or mixture of PROTEINS, by ELECTROPHORESIS; CHROMATOGRAPHY; or MASS SPECTROMETRY. The resulting peptide fingerprints are analyzed for a variety of purposes including the identification of the proteins in a sample, GENETIC POLYMORPHISMS, patterns of gene expression, and patterns diagnostic for diseases.
Methods used for studying the interactions of antibodies with specific regions of protein antigens. Important applications of epitope mapping are found within the area of immunochemistry.
The insertion of recombinant DNA molecules from prokaryotic and/or eukaryotic sources into a replicating vehicle, such as a plasmid or virus vector, and the introduction of the resultant hybrid molecules into recipient cells without altering the viability of those cells.
The genetic complement of a plant (PLANTS) as represented in its DNA.
Imaging techniques used to colocalize sites of brain functions or physiological activity with brain structures.
Complex nucleoprotein structures which contain the genomic DNA and are part of the CELL NUCLEUS of PLANTS.
A type of IN SITU HYBRIDIZATION in which target sequences are stained with fluorescent dye so their location and size can be determined using fluorescence microscopy. This staining is sufficiently distinct that the hybridization signal can be seen both in metaphase spreads and in interphase nuclei.
Recording of regional electrophysiological information by analysis of surface potentials to give a complete picture of the effects of the currents from the heart on the body surface. It has been applied to the diagnosis of old inferior myocardial infarction, localization of the bypass pathway in Wolff-Parkinson-White syndrome, recognition of ventricular hypertrophy, estimation of the size of a myocardial infarct, and the effects of different interventions designed to reduce infarct size. The limiting factor at present is the complexity of the recording and analysis, which requires 100 or more electrodes, sophisticated instrumentation, and dedicated personnel. (Braunwald, Heart Disease, 4th ed)
Sequential operating programs and data which instruct the functioning of a digital computer.
The genetic complement of an organism, including all of its GENES, as represented in its DNA, or in some cases, its RNA.
A variety of simple repeat sequences that are distributed throughout the GENOME. They are characterized by a short repeat unit of 2-8 basepairs that is repeated up to 100 times. They are also known as short tandem repeats (STRs).
Structures within the nucleus of bacterial cells consisting of or containing DNA, which carry genetic information essential to the cell.
A specific pair of GROUP C CHROMOSOMES of the human chromosome classification.
Deoxyribonucleic acid that makes up the genetic material of plants.
The order of amino acids as they occur in a polypeptide chain. This is referred to as the primary structure of proteins. It is of fundamental importance in determining PROTEIN CONFORMATION.
A procedure consisting of a sequence of algebraic formulas and/or logical steps to calculate or determine a given task.
Recording the locations and measurements of electrical activity in the EPICARDIUM by placing electrodes on the surface of the heart to analyze the patterns of activation and to locate arrhythmogenic sites.
Single-stranded complementary DNA synthesized from an RNA template by the action of RNA-dependent DNA polymerase. cDNA (i.e., complementary DNA, not circular DNA, not C-DNA) is used in a variety of molecular cloning experiments as well as serving as a specific hybridization probe.
The presence of two or more genetic loci on the same chromosome. Extensions of this original definition refer to the similarity in content and organization between chromosomes, of different species for example.
A specific pair of GROUP C CHROMOSOMES of the human chromosome classification.
The systematic study of the complete DNA sequences (GENOME) of organisms.
A specific pair of human chromosomes in group A (CHROMOSOMES, HUMAN, 1-3) of the human chromosome classification.
The functional hereditary units of PLANTS.
The complete genetic complement contained in the DNA of a set of CHROMOSOMES in a HUMAN. The length of the human genome is about 3 billion base pairs.
The record of descent or ancestry, particularly of a particular condition or trait, indicating individual family members, their relationships, and their status with respect to the trait or condition.
A set of genes descended by duplication and variation from some ancestral gene. Such genes may be clustered together on the same chromosome or dispersed on different chromosomes. Examples of multigene families include those that encode the hemoglobins, immunoglobulins, histocompatibility antigens, actins, tubulins, keratins, collagens, heat shock proteins, salivary glue proteins, chorion proteins, cuticle proteins, yolk proteins, and phaseolins, as well as histones, ribosomal RNA, and transfer RNA genes. The latter three are examples of reiterated genes, where hundreds of identical genes are present in a tandem array. (King & Stanfield, A Dictionary of Genetics, 4th ed)
Techniques of nucleotide sequence analysis that increase the range, complexity, sensitivity, and accuracy of results by greatly increasing the scale of operations and thus the number of nucleotides, and the number of copies of each nucleotide sequenced. The sequencing may be done by analysis of the synthesis or ligation products, hybridization to preexisting sequences, etc.
Sequences of DNA or RNA that occur in multiple copies. There are several types: INTERSPERSED REPETITIVE SEQUENCES are copies of transposable elements (DNA TRANSPOSABLE ELEMENTS or RETROELEMENTS) dispersed throughout the genome. TERMINAL REPEAT SEQUENCES flank both ends of another sequence, for example, the long terminal repeats (LTRs) on RETROVIRUSES. Variations may be direct repeats, those occurring in the same direction, or inverted repeats, those opposite to each other in direction. TANDEM REPEAT SEQUENCES are copies which lie adjacent to each other, direct or inverted (INVERTED REPEAT SEQUENCES).
A technique for identifying individuals of a species that is based on the uniqueness of their DNA sequence. Uniqueness is determined by identifying which combination of allelic variations occur in the individual at a statistically relevant number of different loci. In forensic studies, RESTRICTION FRAGMENT LENGTH POLYMORPHISM of multiple, highly polymorphic VNTR LOCI or MICROSATELLITE REPEAT loci are analyzed. The number of loci used for the profile depends on the ALLELE FREQUENCY in the population.
A specific pair of GROUP E CHROMOSOMES of the human chromosome classification.
The arrangement of two or more amino acid or base sequences from an organism or organisms in such a way as to align areas of the sequences sharing common properties. The degree of relatedness or homology between the sequences is predicted computationally or statistically based on weights assigned to the elements aligned between the sequences. This in turn can serve as a potential indicator of the genetic relatedness between the organisms.
The genetic constitution of individuals with respect to one member of a pair of allelic genes, or sets of genes that are closely linked and tend to be inherited together such as those of the MAJOR HISTOCOMPATIBILITY COMPLEX.
A field of biology concerned with the development of techniques for the collection and manipulation of biological data, and the use of such data to make biological discoveries or predictions. This field encompasses all computational methods and theories for solving biological problems including manipulation of models and datasets.
Theoretical representations that simulate the behavior or activity of genetic processes or phenomena. They include the use of mathematical equations, computers, and other electronic equipment.
Species- or subspecies-specific DNA (including COMPLEMENTARY DNA; conserved genes, whole chromosomes, or whole genomes) used in hybridization studies in order to identify microorganisms, to measure DNA-DNA homologies, to group subspecies, etc. The DNA probe hybridizes with a specific mRNA, if present. Conventional techniques used for testing for the hybridization product include dot blot assays, Southern blot assays, and DNA:RNA hybrid-specific antibody tests. Conventional labels for the DNA probe include the radioisotope labels 32P and 125I and the chemical label biotin. The use of DNA probes provides a specific, sensitive, rapid, and inexpensive replacement for cell culture techniques for diagnosing infections.
The genetic complement of a BACTERIA as represented in its DNA.
Any cell, other than a ZYGOTE, that contains elements (such as NUCLEI and CYTOPLASM) from two or more different cells, usually produced by artificial CELL FUSION.
A specific pair of GROUP C CHROMOSOMES of the human chromosome classification.