Design and implementation of a national clinical trials registry. (33/1253)

The authors have developed a Web-based system that provides summary information about clinical trials being conducted throughout the United States. The first version of the system, publicly available in February 2000, contains more than 4,000 records representing primarily trials sponsored by the National Institutes of Health. The impetus for this system has come from the Food and Drug Administration (FDA) Modernization Act of 1997, which mandated a registry of both federally and privately funded clinical trials "of experimental treatments for serious or life-threatening diseases or conditions." The system design and implementation have been guided by several principles. First, all stages of system development were guided by the needs of the primary intended audience, patients and other members of the public. Second, broad agreement on a common set of data elements was obtained. Third, the system was designed in a modular and extensible way, and search methods that take extensive advantage of the National Library of Medicine's Unified Medical Language System (UMLS) were developed. Finally, since this will be a long-term effort involving many individuals and organizations, the project is being implemented in several phases.  (+info)

Darwin v. 2.0: an interpreted computer language for the biosciences. (34/1253)

MOTIVATION: We announce the availability of the second release of Darwin v. 2.0, an interpreted computer language especially tailored to researchers in the biosciences. The system is a general tool applicable to a wide range of problems. RESULTS: This second release improves Darwin version 1.6 in several ways: it now contains (1) a larger set of libraries touching most of the classical problems from computational biology (pairwise alignment, all versus all alignments, tree construction, multiple sequence alignment), (2) an expanded set of general purpose algorithms (search algorithms for discrete problems, matrix decomposition routines, complex/long integer arithmetic operations), (3) an improved language with a cleaner syntax, (4) better on-line help, and (5) a number of fixes to user-reported bugs. AVAILABILITY: Darwin is made available for most operating systems free of char ge from the Computational Biochemistry Research Group (CBRG), reachable at http://chrg.inf.ethz.ch. CONTACT: [email protected]  (+info)

Taxonomic markup language: applying XML to systematic data. (35/1253)

SUMMARY: An XML document type definition (DTD) is provided for the description of taxonomic relationships between organisms. Two XSL style-sheets for the graphical presentation of simple taxonomic trees are also provided. AVAILABILITY: The DTD and stylesheets along with sample files are available at http://www.albany.edu/gilmr/pubxml/.  (+info)

GEM: a proposal for a more comprehensive guideline document model using XML. (36/1253)

OBJECTIVE: To develop a guideline document model that includes a sufficiently broad set of concepts to be useful throughout the guideline life cycle. DESIGN: Current guideline document models are limited in that they reflect the specific orientation of the stakeholder who created them; thus, developers and disseminators often provide few constructs for conceptualizing recommendations, while implementers de-emphasize concepts related to establishing guideline validity. The authors developed the Guideline Elements Model (GEM) using XML to better represent the heterogeneous knowledge contained in practice guidelines. Core constructs were derived from the Institute of Medicine's Guideline Appraisal Instrument, the National Guideline Clearinghouse, and the augmented decision table guideline representation. These were supplemented by additional concepts from a literature review. RESULTS: The GEM hierarchy includes more than 100 elements. Major concepts relate to a guideline's identity, developer, purpose, intended audience, method of development, target population, knowledge components, testing, and review plan. Knowledge components in guideline documents include recommendations (which in turn comprise conditionals and imperatives), definitions, and algorithms. CONCLUSION: GEM is more comprehensive than existing models and is expressively adequate to represent the heterogeneous information contained in guidelines. Use of XML contributes to a flexible, comprehensible, shareable, and reusable knowledge representation that is both readable by human beings and processible by computers.  (+info)

Object-oriented parsing of biological databases with Python. (37/1253)

MOTIVATION: While database activities in the biological area are increasing rapidly, rather little is done in the area of parsing them in a simple and object-oriented way. RESULTS: We present here an elegant, simple yet powerful way of parsing biological flat-file databases. We have taken EMBL, SWISSPROT and GENBANK as examples. EMBL and SWISS-PROT do not differ much in the format structure. GENBANK has a very different format structure than EMBL and SWISS-PROT. Extracting the desired fields in an entry (for example a sub-sequence with an associated feature) for later analysis is a constant need in the biological sequence-analysis community: this is illustrated with tools to make new splice-site databases. The interface to the parser is abstract in the sense that the access to all the databases is independent from their different formats, since parsing instructions are hidden.  (+info)

CADMIUM II: acquisition and representation of radiological knowledge for computerized decision support in mammography. (38/1253)

CADMIUM II is a system for the interpretation of mammograms. A novel aspect of the system is that it combines symbolic reasoning with image processing, in contrast with most other approaches, which use only image processing and rely on artificial neural networks (ANNs) to classify mammograms. A problem of ANNs is that the advice they give cannot be traced back to communicable diagnostic inferences. Our approach is to provide advice based on explicit knowledge about the diagnostic process. To this end, we have conducted a knowledge elicitation study which looked at the descriptors used by expert radiologists when making diagnostic decisions about mammograms. The analysis of the radiologists' reports yielded a set of salient diagnostic features. These were used to inform the advice provided by the symbolic decision making component of CADMIUM II.  (+info)

Mapping XML documents into databases: a Data-Driven Framework for bioinformatic data interchange. (39/1253)

The Data-Driven Framework (DDF) described here addresses two major problems for healthcare Electronic Data Interchange, data formats and software development costs. The use of a standard XML Document Type Definition (DTD) allows robust representation in any application area and leverages industry-standard tools and development directions. The DDF allows reduced software development and maintenance costs since all data-entry and database tools are generated from the DTD. The DTD can change and the tools can be regenerated. The case-study below uses the DDF for reporting cell assays to determine the roles of factors influencing cellular gene expression and regulation.  (+info)

Remote access to anatomical information: an integration between semantic knowledge and visual data. (40/1253)

A novel internet-based application is presented which provides access to anatomy knowledge through symbolic modality expressed by keywords taken from controlled or non-controlled terminology. The system is based on a database where anatomical concepts have been organized into a hierarchical framework. Along with term queries that allow retrieving concepts containing or exactly matching the used keyword, the system also provides semantic access to anatomical information. Queries can be setup, which retrieve concepts relying to a particular meaning and sharing a particular relationship. Moreover, the application has the capability to refine the search of the terms by querying the UMLS knowledge server. Anatomical image data have been integrated by using Visible Human Dataset. A set of these images has been indexed according to our anatomical classification and is used inside the application. The system has been implemented through Java client-server technology and works within standard Internet browsers.  (+info)