Data Mining
Information Storage and Retrieval
PubMed
A bibliographic database that includes MEDLINE as its primary subset. It is produced by the National Center for Biotechnology Information (NCBI), part of the NATIONAL LIBRARY OF MEDICINE. PubMed, which is searchable through NLM's Web site, also includes access to additional citations to selected life sciences journals not in MEDLINE, and links to other resources such as the full text of articles at participating publishers' Web sites, NCBI's molecular biology databases, and PubMed Central.
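As a rough illustration of programmatic searching, the sketch below builds a query URL for NCBI's E-utilities `esearch` endpoint. The entry above does not mention E-utilities, so the endpoint choice, the helper name `build_pubmed_search_url`, and the example query are assumptions for illustration. The code only constructs the URL and makes no network request.

```python
from urllib.parse import urlencode

# Assumed endpoint: NCBI E-utilities "esearch" service.
ESEARCH_BASE = "https://eutils.ncbi.nlm.nih.gov/entrez/eutils/esearch.fcgi"


def build_pubmed_search_url(term, max_results=5):
    """Return an esearch URL for a PubMed query (hypothetical helper)."""
    params = {
        "db": "pubmed",         # search the PubMed database
        "term": term,           # query string, URL-encoded by urlencode
        "retmax": max_results,  # maximum number of record IDs to return
        "retmode": "json",      # ask for a JSON response
    }
    return ESEARCH_BASE + "?" + urlencode(params)


url = build_pubmed_search_url("data mining[MeSH Terms]")
print(url)
```

Fetching the URL and parsing the response is left out on purpose; a real client would also need to respect NCBI's usage guidelines.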
Computational Biology
A field of biology concerned with the development of techniques for the collection and manipulation of biological data, and the use of such data to make biological discoveries or predictions. This field encompasses all computational methods and theories for solving biological problems including manipulation of models and datasets.
Natural Language Processing
Algorithms
Software
Database Management Systems
Databases, Factual
Extensive collections, reputedly complete, of facts and data garnered from material of a specialized subject area and made available for analysis and application. The collection can be automated by various contemporary methods for retrieval. The concept should be differentiated from DATABASES, BIBLIOGRAPHIC, which is restricted to collections of bibliographic references.
MEDLINE
Internet
User-Computer Interface
Artificial Intelligence
Abstracting and Indexing as Topic
Databases, Protein
Databases containing information about PROTEINS such as AMINO ACID SEQUENCE; PROTEIN CONFORMATION; and other properties.
Workflow
Vocabulary, Controlled
A specified list of terms with a fixed and unalterable meaning, and from which a selection is made when CATALOGING; ABSTRACTING AND INDEXING; or searching BOOKS; JOURNALS AS TOPIC; and other documents. The control is intended to avoid the scattering of related subjects under different headings (SUBJECT HEADINGS). The list may be altered or extended only by the publisher or issuing agency. (From Harrod's Librarians' Glossary, 7th ed, p163)
Genomics
The systematic study of the complete DNA sequences (GENOME) of organisms.
Pattern Recognition, Automated
Terminology as Topic
Expressed Sequence Tags
Gene Expression Profiling
The determination of the pattern of genes expressed at the level of GENETIC TRANSCRIPTION, under specific circumstances or in a specific cell.
Databases, Bibliographic
Extensive collections, reputedly complete, of references and citations to books, articles, publications, etc., generally on a single subject or specialized subject area. Databases can operate through automated files, libraries, or computer disks. The concept should be differentiated from DATABASES, FACTUAL, which is used for collections of data and facts apart from bibliographic references to them.
Uranium
Pneumoconiosis
A diffuse parenchymal lung disease caused by inhalation of dust and by the tissue reaction to its presence. These inorganic, organic, particulate, or vaporized materials are usually inhaled by workers in their occupational environment, leading to the various forms (ASBESTOSIS; BYSSINOSIS; and others). Similar air pollution can also have deleterious effects on the general population.
Computer Graphics
Cluster Analysis
A set of statistical methods used to group variables or observations into strongly inter-related subgroups. In epidemiology, it may be used to analyze a closely grouped series of events or cases of disease or other health-related phenomenon with well-defined distribution patterns in relation to time or place or both.
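One simple way to picture the time-based grouping described above is to split a sorted series of event days wherever the gap between consecutive events exceeds a cutoff. In one dimension this is equivalent to single-linkage clustering cut at that distance. The function name `group_by_gap`, the cutoff, and the onset days are hypothetical; this is a sketch, not a recommended epidemiological method.

```python
def group_by_gap(days, max_gap):
    """Group event days into clusters separated by gaps larger than max_gap."""
    clusters = []
    for day in sorted(days):
        # Extend the current cluster if this event is close enough to the last one.
        if clusters and day - clusters[-1][-1] <= max_gap:
            clusters[-1].append(day)
        else:
            clusters.append([day])
    return clusters


# Hypothetical onset days for six cases of a disease.
onset_days = [10, 1, 3, 2, 30, 11]
print(group_by_gap(onset_days, max_gap=2))
```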
Decision Trees
Drug Repositioning
Oligonucleotide Array Sequence Analysis
Gold
Systems Integration
Coal
Documentation
Databases, Nucleic Acid
Knowledge Bases
Soil Pollutants
Substances which pollute the soil. Use for soil pollutants in general or for which there is no specific heading.
Dictionaries as Topic
Publications
Protein Interaction Mapping
Methods for determining interaction between PROTEINS.
Satellite Imagery
Decision Support Systems, Management
Molecular Sequence Annotation
The addition of descriptive information about the function or structure of a molecular sequence to its MOLECULAR SEQUENCE DATA record.
Occupational Exposure
Databases as Topic
Sequence Analysis, Protein
A process that includes the determination of AMINO ACID SEQUENCE of a protein (or peptide, oligopeptide or peptide fragment) and the information analysis of the sequence.
Radon
Silicosis
A form of pneumoconiosis resulting from inhalation of dust containing the crystalline form of SILICON DIOXIDE, usually in the form of quartz. Amorphous silica is relatively nontoxic.
Accidents, Occupational
Proteins
Linear POLYPEPTIDES that are synthesized on RIBOSOMES and may be further modified, crosslinked, cleaved, or assembled into complex proteins with several subunits. The specific sequence of AMINO ACIDS determines the shape the polypeptide will take, during PROTEIN FOLDING, and the function of the protein.
Asbestos, Amphibole
Multigene Family
A set of genes descended by duplication and variation from some ancestral gene. Such genes may be clustered together on the same chromosome or dispersed on different chromosomes. Examples of multigene families include those that encode the hemoglobins, immunoglobulins, histocompatibility antigens, actins, tubulins, keratins, collagens, heat shock proteins, salivary glue proteins, chorion proteins, cuticle proteins, yolk proteins, and phaseolins, as well as histones, ribosomal RNA, and transfer RNA genes. The latter three are examples of reiterated genes, where hundreds of identical genes are present in a tandem array. (King & Stanfield, A Dictionary of Genetics, 4th ed)
Environmental Monitoring
Industrial Waste
Sequence Analysis, DNA
Search Engine
Software used to locate data or information stored in machine-readable form locally or at a distance such as an INTERNET site.
Sequence Alignment
The arrangement of two or more amino acid or base sequences from an organism or organisms in such a way as to align areas of the sequences sharing common properties. The degree of relatedness or homology between the sequences is predicted computationally or statistically based on weights assigned to the elements aligned between the sequences. This in turn can serve as a potential indicator of the genetic relatedness between the organisms.
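The weight-based scoring mentioned above can be sketched with a global alignment score computed by dynamic programming, in the style of the Needleman-Wunsch algorithm. The match, mismatch, and gap weights below are illustrative assumptions, not values taken from this entry, and the function returns only the score, not the alignment itself.

```python
def global_alignment_score(a, b, match=1, mismatch=-1, gap=-2):
    """Return the best global alignment score of sequences a and b."""
    rows, cols = len(a) + 1, len(b) + 1
    score = [[0] * cols for _ in range(rows)]

    # Aligning a prefix against an empty sequence costs one gap per element.
    for i in range(1, rows):
        score[i][0] = i * gap
    for j in range(1, cols):
        score[0][j] = j * gap

    for i in range(1, rows):
        for j in range(1, cols):
            pair = match if a[i - 1] == b[j - 1] else mismatch
            score[i][j] = max(
                score[i - 1][j - 1] + pair,  # align a[i-1] with b[j-1]
                score[i - 1][j] + gap,       # gap in b
                score[i][j - 1] + gap,       # gap in a
            )
    return score[-1][-1]


print(global_alignment_score("ACGT", "AGT"))
```

A higher score under the chosen weights suggests greater similarity between the two sequences; real tools use substitution matrices and more elaborate gap penalties.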
Gene Regulatory Networks
Interacting DNA-encoded regulatory subsystems in the GENOME that coordinate input from activator and repressor TRANSCRIPTION FACTORS during development, cell differentiation, or in response to environmental cues. The networks function to ultimately specify expression of particular sets of GENES for specific conditions, times, or locations.
Data Interpretation, Statistical
Metals, Heavy
Systems Biology
Adverse Drug Reaction Reporting Systems
Appalachian Region
A geographical area of the United States with no definite boundaries but comprising northeastern Alabama, northwestern Georgia, northwestern South Carolina, western North Carolina, eastern Kentucky, eastern Tennessee, western Virginia, West Virginia, western Maryland, southwestern Pennsylvania, southern Ohio, and southern New York.
Protein Interaction Maps
Graphs representing sets of measurable, non-covalent physical contacts with specific PROTEINS in living organisms or in cells.
Asbestosis
A form of pneumoconiosis caused by inhalation of asbestos fibers which elicit potent inflammatory responses in the parenchyma of the lung. The disease is characterized by interstitial fibrosis of the lung, varying from scattered sites to extensive scarring of the alveolar interstitium.
Polygonaceae
Hazardous Waste
Medical Records Systems, Computerized
Reproducibility of Results
The statistical reproducibility of measurements (often in a clinical context), including the testing of instrumentation or techniques to obtain reproducible results. The concept includes reproducibility of physiological measurements, which may be used to develop rules to assess probability or prognosis, or response to a stimulus; reproducibility of occurrence of a condition; and reproducibility of experimental results.
National Institute for Occupational Safety and Health (U.S.)
An institute of the CENTERS FOR DISEASE CONTROL AND PREVENTION which is responsible for assuring safe and healthful working conditions and for developing standards of safety and health. Research activities are carried out pertinent to these goals.
Unified Medical Language System
A research and development program initiated by the NATIONAL LIBRARY OF MEDICINE to build knowledge sources for the purpose of aiding the development of systems that help health professionals retrieve and integrate biomedical information. The knowledge sources can be used to link disparate information systems to overcome retrieval problems caused by differences in terminology and the scattering of relevant information across many databases. The three knowledge sources are the Metathesaurus, the Semantic Network, and the Specialist Lexicon.
Diamond
Biology
Neosartorya
Toxicogenetics
The study of existing genetic knowledge, and the generation of new genetic data, to understand and thus avoid DRUG TOXICITY and adverse effects from toxic substances from the environment.
Molecular Sequence Data
Descriptions of specific amino acid, carbohydrate, or nucleotide sequences which have appeared in the published literature and/or are deposited in and maintained by databanks such as GENBANK, European Molecular Biology Laboratory (EMBL), National Biomedical Research Foundation (NBRF), or other sequence repositories.
Proteome
The protein complement of an organism coded for by its genome.
Thorium
A radioactive element of the actinide series of metals. It has the atomic symbol Th, atomic number 90, and atomic weight 232.04. It is used as fuel in nuclear reactors to produce fissionable uranium isotopes. Because of its radiopacity, various thorium compounds are used to facilitate visualization in roentgenography.
Genome
The genetic complement of an organism, including all of its GENES, as represented in its DNA, or in some cases, its RNA.
Neural Networks (Computer)
A computer architecture, implementable in either hardware or software, modeled after biological neural networks. Like the biological system in which the processing capability is a result of the interconnection strengths between arrays of nonlinear processing nodes, computerized neural networks, often called perceptrons or multilayer connectionist models, consist of neuron-like units. A homogeneous group of units makes up a layer. These networks are good at pattern recognition. They are adaptive, performing tasks by example, and thus are better for decision-making than are linear learning machines or cluster analysis. They do not require explicit programming.
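The idea of learning a task from examples rather than explicit programming can be illustrated with a single neuron-like unit trained by the classic perceptron rule. This is a minimal sketch: the function names, learning rate, and the choice of logical AND as the task are assumptions for illustration, and a single unit is far simpler than the multilayer networks described above.

```python
def predict(weights, bias, inputs):
    """Fire (1) if the weighted sum of inputs exceeds zero, else 0."""
    total = sum(w * x for w, x in zip(weights, inputs)) + bias
    return 1 if total > 0 else 0


def train_perceptron(samples, epochs=20, learning_rate=1.0):
    """Adjust weights from labeled examples using the perceptron rule."""
    weights, bias = [0.0, 0.0], 0.0
    for _ in range(epochs):
        for inputs, target in samples:
            error = target - predict(weights, bias, inputs)
            # Move the weights toward the target only when the unit is wrong.
            weights = [w + learning_rate * error * x
                       for w, x in zip(weights, inputs)]
            bias += learning_rate * error
    return weights, bias


# Training examples for logical AND: ((input1, input2), expected output).
and_samples = [((0, 0), 0), ((0, 1), 0), ((1, 0), 0), ((1, 1), 1)]
weights, bias = train_perceptron(and_samples)
print([predict(weights, bias, inputs) for inputs, _ in and_samples])
```

The unit is never told the rule for AND; it arrives at suitable connection strengths purely from the four examples.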
Models, Statistical
Support Vector Machines
A set of related supervised machine learning methods that analyze data and recognize patterns, and that are used for classification and regression analysis.
Environmental Pollution
Genes
Bayes Theorem
A theorem in probability theory named for Thomas Bayes (1702-1761). In epidemiology, it is used to obtain the probability of disease in a group of people with some characteristic on the basis of the overall rate of that disease and of the likelihood of that characteristic in healthy and diseased individuals. The most familiar application is in clinical decision analysis where it is used for estimating the probability of a particular diagnosis given the appearance of some symptoms or test result.
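The clinical application described above can be worked through numerically. The sketch below computes the probability of disease given a positive test from the overall disease rate and the test's behavior in diseased and healthy individuals. The function name and the figures (1% prevalence, 90% sensitivity, 95% specificity) are hypothetical.

```python
def posterior_probability(prevalence, sensitivity, specificity):
    """Return P(disease | positive test) using Bayes' theorem."""
    # P(positive and diseased)
    true_positive = sensitivity * prevalence
    # P(positive and healthy)
    false_positive = (1.0 - specificity) * (1.0 - prevalence)
    return true_positive / (true_positive + false_positive)


# Hypothetical test characteristics, for illustration only.
p = posterior_probability(prevalence=0.01, sensitivity=0.90, specificity=0.95)
print(round(p, 3))  # 0.154
```

Even with a fairly accurate test, a low overall disease rate keeps the probability of disease after a positive result modest, because false positives among the many healthy individuals outnumber true positives.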
Mercury
A silver metallic element that exists as a liquid at room temperature. It has the atomic symbol Hg (from hydrargyrum, liquid silver), atomic number 80, and atomic weight 200.59. Mercury is used in many industrial applications and its salts have been employed therapeutically as purgatives, antisyphilitics, disinfectants, and astringents. It can be absorbed through the skin and mucous membranes which leads to MERCURY POISONING. Because of its toxicity, the clinical use of mercury and mercurials is diminishing.
Silicon Dioxide
Computer Simulation
Chromosome Mapping
Any method used for determining the location of and relative distances between genes on a chromosome.
Environmental Exposure
Software Design
Specifications and instructions applied to the software.
Talc
Hospital Administrators
Pharmacovigilance
Genome, Human
The complete genetic complement contained in the DNA of a set of CHROMOSOMES in a HUMAN. The length of the human genome is about 3 billion base pairs.
Water Pollutants, Chemical
Databases, Pharmaceutical
Databases devoted to knowledge about PHARMACEUTICAL PRODUCTS.
Medical Informatics
Hypermedia
Computerized compilations of information units (text, sound, graphics, and/or video) interconnected by logical nonlinear linkages that enable users to follow optimal paths through the material and also the systems used to create and display this information. (From Thesaurus of ERIC Descriptors, 1994)
Knowledge
Institutional Practice
Oil and Gas Fields
Water Pollutants, Radioactive
Phenotype
The outward appearance of the individual. It is the product of interactions between genes, and between the GENOTYPE and the environment.
Amino Acid Sequence
The order of amino acids as they occur in a polypeptide chain. This is referred to as the primary structure of proteins. It is of fundamental importance in determining PROTEIN CONFORMATION.