The pattern of GENE EXPRESSION at the level of genetic transcription in a specific organism or under specific circumstances in specific cells.
Gene Expression Profiling
Molecular Sequence Annotation
The addition of descriptive information about the function or structure of a molecular sequence to its MOLECULAR SEQUENCE DATA record.
Sequence Analysis, RNA
A multistage process that includes cloning, physical mapping, subcloning, sequencing, and information analysis of an RNA SEQUENCE.
Oligonucleotide Array Sequence Analysis
Hybridization of a nucleic acid sample to a very large set of OLIGONUCLEOTIDE PROBES, which have been attached individually in columns and rows to a solid support, to determine a BASE SEQUENCE, or to detect variations in a gene sequence, GENE EXPRESSION, or for GENE MAPPING.
Expressed Sequence Tags
Partial cDNA (DNA, COMPLEMENTARY) sequences that are unique to the cDNAs from which they were derived.
High-Throughput Nucleotide Sequencing
Techniques of nucleotide sequence analysis that increase the range, complexity, sensitivity, and accuracy of results by greatly increasing the scale of operations and thus the number of nucleotides, and the number of copies of each nucleotide sequenced. The sequencing may be done by analysis of the synthesis or ligation products, hybridization to preexisting sequences, etc.
Gene Expression Regulation, Plant
A large collection of DNA fragments cloned (CLONING, MOLECULAR) from a given organism, tissue, organ, or cell type. It may contain complete genomic sequences (GENOMIC LIBRARY) or complementary DNA sequences, the latter being formed from messenger RNA and lacking intron sequences.
Sets of structured vocabularies used for describing and categorizing genes, and gene products by their molecular function, involvement in biological processes, and cellular location. These vocabularies and their associations to genes and gene products (Gene Ontology annotations) are generated and curated by the Gene Ontology Consortium.
A set of statistical methods used to group variables or observations into strongly inter-related subgroups. In epidemiology, it may be used to analyze a closely grouped series of events or cases of disease or other health-related phenomenon with well-defined distribution patterns in relation to time or place or both.
Sequence Analysis, DNA
RNA sequences that serve as templates for protein synthesis. Bacterial mRNAs are generally primary transcripts in that they do not require post-transcriptional processing. Eukaryotic mRNA is synthesized in the nucleus and must be exported to the cytoplasm for translation. Most eukaryotic mRNAs have a sequence of polyadenylic acid at the 3' end, referred to as the poly(A) tail. The function of this tail is not known for certain, but it may play a role in the export of mature mRNA from the nucleus as well as in helping stabilize some mRNA molecules by retarding their degradation in the cytoplasm.
Overlapping of cloned or sequenced DNA to construct a continuous region of a gene, chromosome or genome.
A field of biology concerned with the development of techniques for the collection and manipulation of biological data, and the use of such data to make biological discoveries or predictions. This field encompasses all computational methods and theories for solving biological problems including manipulation of models and datasets.
The systematic study of the complete DNA sequences (GENOME) of organisms.
The protein complement of an organism coded for by its genome.
Gene Regulatory Networks
Interacting DNA-encoded regulatory subsystems in the GENOME that coordinate input from activator and repressor TRANSCRIPTION FACTORS during development, cell differentiation, or in response to environmental cues. The networks function to ultimately specify expression of particular sets of GENES for specific conditions, times, or locations.
Reverse Transcriptase Polymerase Chain Reaction
A variation of the PCR technique in which cDNA is made from RNA via reverse transcription. The resultant cDNA is then amplified using standard PCR protocols.
Gene Expression Regulation
Gene Expression Regulation, Bacterial
The genetic complement of an organism, including all of its GENES, as represented in its DNA, or in some cases, its RNA.
A plant genus of the family BRASSICACEAE that contains ARABIDOPSIS PROTEINS and MADS DOMAIN PROTEINS. The species A. thaliana is used for experiments in classical plant genetics as well as molecular genetic studies in plant physiology, biochemistry, and development.
Metabolic Networks and Pathways
Molecular Sequence Data
Descriptions of specific amino acid, carbohydrate, or nucleotide sequences which have appeared in the published literature and/or are deposited in and maintained by databanks such as GENBANK, European Molecular Biology Laboratory (EMBL), National Biomedical Research Foundation (NBRF), or other sequence repositories.
Gene Expression Regulation, Developmental
Reproducibility of Results
The statistical reproducibility of measurements (often in a clinical context), including the testing of instrumentation or techniques to obtain reproducible results. The concept includes reproducibility of physiological measurements, which may be used to develop rules to assess probability or prognosis, or response to a stimulus; reproducibility of occurrence of a condition; and reproducibility of experimental results.
A process whereby multiple RNA transcripts are generated from a single gene. Alternative splicing involves the splicing together of other possible sets of EXONS during the processing of some, but not all, transcripts of the gene. Thus a particular exon may be connected to any one of several alternative exons to form a mature RNA. The alternative forms of mature MESSENGER RNA produce PROTEIN ISOFORMS in which one part of the isoforms is common while the other parts are different.
Single-stranded complementary DNA synthesized from an RNA template by the action of RNA-dependent DNA polymerase. cDNA (i.e., complementary DNA, not circular DNA, not C-DNA) is used in a variety of molecular cloning experiments as well as serving as a specific hybridization probe.
Expanded structures, usually green, of vascular plants, characteristically consisting of a bladelike expansion attached to a stem, and functioning as the principal organ of photosynthesis and transpiration. (American Heritage Dictionary, 2d ed)
Real-Time Polymerase Chain Reaction
Methods used for detecting the amplified DNA products from the polymerase chain reaction as they accumulate instead of at the end of the reaction.
A polynucleotide consisting essentially of chains with a repeating backbone of phosphate and ribose units to which nitrogenous bases are attached. RNA is unique among biological macromolecules in that it can encode genetic information, serve as an abundant structural component of cells, and also possesses catalytic activity. (Rieger et al., Glossary of Genetics: Classical and Molecular, 5th ed)
Proteins found in plants (flowers, herbs, shrubs, trees, etc.). The concept does not include proteins found in vegetables for which VEGETABLE PROTEINS is available.
The intracellular transfer of information (biological activation/inhibition) through a signal pathway. In each signal transduction system, an activation/inhibition signal from a biologically active molecule (hormone, neurotransmitter) is mediated via the coupling of a receptor/enzyme to a second messenger system or to an ion channel. Signal transduction plays an important role in activating cellular functions, cell differentiation, and cell proliferation. Examples of signal transduction systems are the GAMMA-AMINOBUTYRIC ACID-postsynaptic receptor-calcium ion channel system, the receptor-mediated T-cell activation pathway, and the receptor-mediated activation of phospholipases. Those coupled to membrane depolarization or intracellular release of calcium include the receptor-mediated activation of cytotoxic functions in granulocytes and the synaptic potentiation of protein kinase activation. Some signal transduction pathways may be part of larger signal transduction pathways; for example, protein kinase activation is part of the platelet activation signal pathway.
The different gene transcripts generated from a single gene by RNA EDITING or ALTERNATIVE SPLICING of RNA PRECURSORS.
Small double-stranded, non-protein coding RNAs, 21-25 nucleotides in length generated from single-stranded microRNA gene transcripts by the same RIBONUCLEASE III, Dicer, that produces small interfering RNAs (RNA, SMALL INTERFERING). They become part of the RNA-INDUCED SILENCING COMPLEX and repress the translation (TRANSLATION, GENETIC) of target RNA by binding to homologous 3'UTR region as an imperfect match. The small temporal RNAs (stRNAs), let-7 and lin-4, from C. elegans, are the first 2 miRNAs discovered, and are from a class of miRNAs involved in developmental timing.
The parts of the messenger RNA sequence that do not code for product, i.e. the 5' UNTRANSLATED REGIONS and 3' UNTRANSLATED REGIONS.
The complete genetic complement contained in the DNA of a set of CHROMOSOMES in a HUMAN. The length of the human genome is about 3 billion base pairs.
A variety of simple repeat sequences that are distributed throughout the GENOME. They are characterized by a short repeat unit of 2-8 basepairs that is repeated up to 100 times. They are also known as short tandem repeats (STRs).
The restriction of a characteristic behavior, anatomical structure or physical system, such as immune response; metabolic response, or gene or gene variant to the members of one species. It refers to that property which differentiates one species from another but it is also used for phylogenetic levels higher or lower than the species.
The process of cumulative change at the level of DNA; RNA; and PROTEINS, over successive generations.
A set of genes descended by duplication and variation from some ancestral gene. Such genes may be clustered together on the same chromosome or dispersed on different chromosomes. Examples of multigene families include those that encode the hemoglobins, immunoglobulins, histocompatibility antigens, actins, tubulins, keratins, collagens, heat shock proteins, salivary glue proteins, chorion proteins, cuticle proteins, yolk proteins, and phaseolins, as well as histones, ribosomal RNA, and transfer RNA genes. The latter three are examples of reiterated genes, where hundreds of identical genes are present in a tandem array. (King & Stanfield, A Dictionary of Genetics, 4th ed)
Gene Expression Regulation, Fungal
RNA molecules which hybridize to complementary sequences in either RNA or DNA altering the function of the latter. Endogenous antisense RNAs function as regulators of gene expression by a variety of mechanisms. Synthetic antisense RNAs are used to effect the functioning of specific genes for investigative or therapeutic purposes.
Principal Component Analysis
Databases, Nucleic Acid
Databases containing information about NUCLEIC ACIDS such as BASE SEQUENCE; SNPS; NUCLEIC ACID CONFORMATION; and other properties. Information about the DNA fragments kept in a GENE LIBRARY or GENOMIC LIBRARY is often maintained in DNA databases.
Open Reading Frames
Polymorphism, Single Nucleotide
Proteins that originate from plants species belonging to the genus ARABIDOPSIS. The most intensely studied species of Arabidopsis, Arabidopsis thaliana, is commonly used in laboratory experiments.
RNA, Small Untranslated
Short RNA, about 200 base pairs in length or shorter, that does not code for protein.
Laser Capture Microdissection
The systematic study of the complete complement of proteins (PROTEOME) of organisms.
A positive regulatory effect on physiological processes at the molecular, cellular, or systemic level. At the molecular level, the major regulatory sites include membrane receptors, genes (GENE EXPRESSION REGULATION), mRNAs (RNA, MESSENGER), and proteins.
The GENETIC RECOMBINATION of the parts of two or more GENES resulting in a gene with different or additional regulatory regions, or a new chimeric gene product. ONCOGENE FUSION includes an ONCOGENE as at least one of the fusion partners and such gene fusions are often detected in neoplastic cells and are transcribed into ONCOGENE FUSION PROTEINS. ARTIFICIAL GENE FUSION is carried out in vitro by RECOMBINANT DNA technology.
The arrangement of two or more amino acid or base sequences from an organism or organisms in such a way as to align areas of the sequences sharing common properties. The degree of relatedness or homology between the sequences is predicted computationally or statistically based on weights assigned to the elements aligned between the sequences. This in turn can serve as a potential indicator of the genetic relatedness between the organisms.
Fruiting Bodies, Fungal
Plant Growth Regulators
Transcription Initiation Site
Very young plant after GERMINATION of SEEDS.
Commonly observed BASE SEQUENCE or nucleotide structural components which can be represented by a CONSENSUS SEQUENCE or a SEQUENCE LOGO.
A negative regulatory effect on physiological processes at the molecular, cellular, or systemic level. At the molecular level, the major regulatory sites include membrane receptors, genes (GENE EXPRESSION REGULATION), mRNAs (RNA, MESSENGER), and proteins.
A process that changes the nucleotide sequence of mRNA from that of the DNA template encoding it. Some major classes of RNA editing are as follows: 1, the conversion of cytosine to uracil in mRNA; 2, the addition of variable number of guanines at pre-determined sites; and 3, the addition and deletion of uracils, templated by guide-RNAs (RNA, GUIDE).
RNA Splice Sites
Life Cycle Stages
Gene Expression Regulation, Neoplastic
The joining of RNA from two different genes. One type of trans-splicing is the "spliced leader" type (primarily found in protozoans such as trypanosomes and in lower invertebrates such as nematodes) which results in the addition of a capped, noncoding, spliced leader sequence to the 5' end of mRNAs. Another type of trans-splicing is the "discontinuous group II introns" type (found in plant/algal chloroplasts and plant mitochondria) which results in the joining of two independently transcribed coding sequences. Both are mechanistically similar to conventional nuclear pre-mRNA cis-splicing. Mammalian cells are also capable of trans-splicing.
Plants that can grow well in soils that have a high SALINITY.
The capacity of an organism to defend itself against pathological processes or the agents of those processes. This most often involves innate immunity whereby the organism responds to pathogens in a generic way. The term disease resistance is used most frequently when referring to plants.
Theoretical representations that simulate the behavior or activity of biological processes or diseases. For disease models in living animals, DISEASE MODELS, ANIMAL is available. Biological models include the use of mathematical equations, computers, and other electronic equipment.
Root Nodules, Plant
Knobbed structures formed from and attached to plant roots, especially of LEGUMES, which result from symbiotic infection by nitrogen fixing bacteria such as RHIZOBIUM or FRANKIA. Root nodules are structures related to MYCORRHIZAE formed by symbiotic associations with fungi.
Polymerase Chain Reaction
In vitro method for producing large amounts of specific DNA or RNA fragments of defined length and sequence from small amounts of short oligonucleotide flanking sequences (primers). The essential steps include thermal denaturation of the double-stranded target molecules, annealing of the primers to their complementary sequences, and extension of the annealed primers by enzymatic synthesis with DNA polymerase. The reaction is efficient, specific, and extremely sensitive. Uses for the reaction include disease diagnosis, detection of difficult-to-isolate pathogens, mutation analysis, genetic testing, DNA sequencing, and analyzing evolutionary relationships.
The extent to which an RNA molecule retains its structural integrity and resists degradation by RNASE, and base-catalyzed HYDROLYSIS, under changing in vivo or in vitro conditions.
Promoter Regions, Genetic
DNA sequences which are recognized (directly or indirectly) and bound by a DNA-dependent RNA polymerase during the initiation of transcription. Highly conserved sequences within the promoter include the Pribnow box in bacteria and the TATA BOX in eukaryotes.
Embryonic Stem Cells
Plants, Genetically Modified
PLANTS, or their progeny, whose GENOME has been altered by GENETIC ENGINEERING.
The fleshy or dry ripened ovary of a plant, enclosing the seed or seeds.
An element with the atomic symbol N, atomic number 7, and atomic weight [14.00643; 14.00728]. Nitrogen exists as a diatomic gas and makes up about 78% of the earth's atmosphere by volume. It is a constituent of proteins and nucleic acids and found in all living cells.
The Alu sequence family (named for the restriction endonuclease cleavage enzyme Alu I) is the most highly repeated interspersed repeat element in humans (over a million copies). It is derived from the 7SL RNA component of the SIGNAL RECOGNITION PARTICLE and contains an RNA polymerase III promoter. Transposition of this element into coding and regulatory regions of genes is responsible for many heritable diseases.
A plant species of the family SOLANACEAE, native of South America, widely cultivated for their edible, fleshy, usually red fruit.
RNA, Long Noncoding
A class of untranslated RNA molecules that are typically greater than 200 nucleotides in length and do not code for proteins. Members of this class have been found to play roles in transcriptional regulation, post-transcriptional processing, CHROMATIN REMODELING, and in the epigenetic control of chromatin.
A plant genus in the family PINACEAE, order Pinales, class Pinopsida, division Coniferophyta. They are evergreen trees mainly in temperate climates.
Specific regions that are mapped within a GENOME. Genetic loci are usually identified with a shorthand notation that indicates the chromosome number and the position of a specific band along the P or Q arm of the chromosome where they are found. For example the locus 6p21 is found within band 21 of the P-arm of CHROMOSOME 6. Many well known genetic loci are also known by common names that are associated with a genetic function or HEREDITARY DISEASE.
The addition of a tail of polyadenylic acid (POLY A) to the 3' end of mRNA (RNA, MESSENGER). Polyadenylation involves recognizing the processing site signal, (AAUAAA), and cleaving of the mRNA to create a 3' OH terminal end to which poly A polymerase (POLYNUCLEOTIDE ADENYLYLTRANSFERASE) adds 60-200 adenylate residues. The 3' end processing of some messenger RNAs, such as histone mRNA, is carried out by a different process that does not include the addition of poly A as described here.
Protein Array Analysis
A plant species of the genus CITRUS, family RUTACEAE that provides the familiar orange fruit which is also a source of orange oil.
A sequence of amino acids in a polypeptide or of nucleotides in DNA or RNA that is similar across multiple species. A known set of conserved sequences is represented by a CONSENSUS SEQUENCE. AMINO ACID MOTIFS are often composed of conserved sequences.
A plant genus in the family VITACEAE, order Rhamnales, subclass Rosidae. It is a woody vine cultivated worldwide. It is best known for grapes, the edible fruit and used to make WINE and raisins.
Amino Acid Sequence
Comparative Genomic Hybridization
A plant species of the family FABACEAE used to study GENETICS because it is DIPLOID, self fertile, has a small genome, and short generation time.
A plant genus of the family EUPHORBIACEAE, order Euphorbiales, subclass Rosidae. Commercial natural RUBBER is mainly obtained from Hevea brasiliensis but also from some other plants.
Any of the DNA in between gene-coding DNA, including untranslated regions, 5' and 3' flanking regions, INTRONS, non-functional pseudogenes, and non-functional repetitive sequences. This DNA may or may not encode regulatory functions.
A plant genus in the family PINACEAE, order Pinales, class Pinopsida, division Coniferophyta. They are coniferous evergreen trees with long, flat, spirally arranged needles that grow directly from the branch.
Flagellate EUKARYOTES, found mainly in the oceans. They are characterized by the presence of transverse and longitudinal flagella which propel the organisms in a rotating manner through the water. Dinoflagellida were formerly members of the class Phytomastigophorea under the old five kingdom paradigm.
Comprehensive, methodical analysis of complex biological systems by monitoring responses to perturbations of biological processes. Large scale, computerized collection and analysis of the data are used to develop and test models of biological systems.
Short sequences (generally about 10 base pairs) of DNA that are complementary to sequences of messenger RNA and allow reverse transcriptases to start copying the adjacent sequences of mRNA. Primers are used extensively in genetic and molecular biology techniques.
The outermost layer of a cell in most PLANTS; BACTERIA; FUNGI; and ALGAE. The cell wall is usually a rigid structure that lies external to the CELL MEMBRANE, and provides a protective barrier against physical or chemical agents.
The synthesis by organisms of organic chemical compounds, especially carbohydrates, from carbon dioxide using energy obtained from light rather than from the oxidation of chemical compounds. Photosynthesis comprises two separate processes: the light reactions and the dark reactions. In higher plants; GREEN ALGAE; and CYANOBACTERIA; NADPH and ATP formed by the light reactions drive the dark reactions which result in the fixation of carbon dioxide. (from Oxford Dictionary of Biochemistry and Molecular Biology, 2001)
The most abundant natural aromatic organic polymer found in all vascular plants. Lignin together with cellulose and hemicellulose are the major cell wall components of the fibers of all wood and grass species. Lignin is composed of coniferyl, p-coumaryl, and sinapyl alcohols in varying ratios in different plant species. (From Merck Index, 11th ed)
Sequence Homology, Nucleic Acid
Proteins synthesized by organisms belonging to the phylum ARTHROPODA. Included in this heading are proteins from the subdivisions ARACHNIDA; CRUSTACEA; and HORSESHOE CRABS. Note that a separate heading for INSECT PROTEINS is listed under this heading.
Paired respiratory organs of fishes and some amphibians that are analogous to lungs. They are richly supplied with blood vessels by which oxygen and carbon dioxide are exchanged directly with the environment.
Inorganic compounds that include a positively charged tetrahedral nitrogen (ammonium ion) as part of their structure. This class of compounds includes a broad variety of simple ammonium salts and derivatives.