**A computational screen for methylation guide snoRNAs in yeast.**

Small nucleolar RNAs (snoRNAs) are required for ribose 2'-O-methylation of eukaryotic ribosomal RNA. Many of the genes for this snoRNA family have remained unidentified in Saccharomyces cerevisiae, despite the availability of a complete genome sequence. Probabilistic modeling methods akin to those used in speech recognition and computational linguistics were used to computationally screen the yeast genome and identify 22 methylation guide snoRNAs, snR50 to snR71. Gene disruptions and other experimental characterization confirmed their methylation guide function. In total, 51 of the 55 ribose methylated sites in yeast ribosomal RNA were assigned to 41 different guide snoRNAs.

**Influence of sampling on estimates of clustering and recent transmission of Mycobacterium tuberculosis derived from DNA fingerprinting techniques.**

The availability of DNA fingerprinting techniques for Mycobacterium tuberculosis has led to attempts to estimate the extent of recent transmission in populations, using the assumption that groups of tuberculosis patients with identical isolates ("clusters") are likely to reflect recently acquired infections. It is never possible to include all cases of tuberculosis in a given population in a study, and the proportion of isolates found to be clustered will depend on the completeness of the sampling. Using stochastic simulation models based on real and hypothetical populations, the authors demonstrate the influence of incomplete sampling on the estimates of clustering obtained. The results show that as the sampling fraction increases, the proportion of isolates identified as clustered also increases and the variance of the estimated proportion clustered decreases. Cluster size is also important: the underestimation of clustering for any given sampling fraction is greater, and the variability in the results obtained is larger, for populations with small clusters than for those with the same number of individuals arranged in large clusters. A considerable amount of caution should be used in interpreting the results of studies on clustering of M. tuberculosis isolates, particularly when sampling fractions are small.
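
The sampling effect described in this abstract can be illustrated with a minimal simulation sketch. This is not the authors' model: the population layouts (`small`, `large`), the independent per-case sampling rule, and the function names are all assumptions chosen for illustration.

```python
import random
from collections import Counter


def proportion_clustered(cluster_sizes, sampling_fraction, rng):
    """Sample each case independently and return the share of sampled
    isolates that match at least one other sampled isolate."""
    # One label per case; cases in the same true cluster share a label.
    labels = [cid for cid, size in enumerate(cluster_sizes) for _ in range(size)]
    sampled = [lab for lab in labels if rng.random() < sampling_fraction]
    if not sampled:
        return 0.0
    counts = Counter(sampled)
    # An isolate counts as clustered only if a matching isolate was also sampled.
    clustered = sum(n for n in counts.values() if n >= 2)
    return clustered / len(sampled)


def mean_proportion(cluster_sizes, sampling_fraction, runs=500, seed=0):
    """Average the observed proportion clustered over repeated samples."""
    rng = random.Random(seed)
    total = sum(
        proportion_clustered(cluster_sizes, sampling_fraction, rng)
        for _ in range(runs)
    )
    return total / runs


# Hypothetical populations: 200 cases each, every case truly clustered.
small = [2] * 100   # clusters of 2
large = [20] * 10   # clusters of 20

if __name__ == "__main__":
    for fraction in (0.2, 0.5, 1.0):
        print(fraction,
              round(mean_proportion(small, fraction), 3),
              round(mean_proportion(large, fraction), 3))
```

In both hypothetical populations every case is truly clustered, so any observed proportion below 1 is underestimation caused by sampling alone, and it is more severe for the small-cluster layout.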

**Capture-recapture models including covariate effects.**

Capture-recapture methods are used to estimate the incidence of a disease, using a multiple-source registry. Usually, log-linear methods are used to estimate population size, assuming that not all sources of notification are dependent. Where there are categorical covariates, a stratified analysis can be performed. The multinomial logit model has occasionally been used. In this paper, the authors compare log-linear and logit models with and without covariates, and use simulated data to compare estimates from different models. The crude estimate of population size is biased when the sources are not independent. Analyses adjusting for covariates produce less biased estimates. In the absence of covariates, or where all covariates are categorical, the log-linear model and the logit model are equivalent. The log-linear model cannot include continuous variables. To minimize potential bias in estimating incidence, covariates should be included in the design and analysis of multiple-source disease registries.
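
The bias of a crude estimate, and its reduction by stratifying on a categorical covariate, can be sketched with a simple two-source example. This is an illustration only: it uses Chapman's two-source estimator rather than the log-linear or logit models compared in the paper, and the strata names and capture probabilities are hypothetical.

```python
import random


def chapman(n1, n2, m):
    """Chapman's two-source estimate of population size.

    n1, n2: cases notified by source 1 and source 2; m: cases in both.
    """
    return (n1 + 1) * (n2 + 1) / (m + 1) - 1


def simulate(n_per_stratum=5000, seed=1):
    """Return (crude, stratified) estimates for a simulated registry."""
    rng = random.Random(seed)
    # Hypothetical capture probabilities (source 1, source 2) per stratum.
    # Within a stratum the sources are independent; pooled, they are not.
    probs = {"mild": (0.2, 0.2), "severe": (0.8, 0.8)}
    counts = {stratum: [0, 0, 0] for stratum in probs}  # n1, n2, both
    for stratum, (p1, p2) in probs.items():
        for _ in range(n_per_stratum):
            in_1 = rng.random() < p1
            in_2 = rng.random() < p2
            counts[stratum][0] += in_1
            counts[stratum][1] += in_2
            counts[stratum][2] += in_1 and in_2
    pooled = [sum(c[i] for c in counts.values()) for i in range(3)]
    crude = chapman(*pooled)
    stratified = sum(chapman(*c) for c in counts.values())
    return crude, stratified


if __name__ == "__main__":
    crude, stratified = simulate()
    print(f"true size 10000, crude {crude:.0f}, stratified {stratified:.0f}")
```

Because severity raises the chance of notification in both sources, the pooled sources are positively dependent and the crude estimate falls well below the true size, while the stratified sum stays close to it.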

**Sequence specificity, statistical potentials, and three-dimensional structure prediction with self-correcting distance geometry calculations of beta-sheet formation in proteins.**

A statistical analysis of a representative data set of 169 known protein structures was used to analyze the specificity of residue interactions between spatially neighboring strands in beta-sheets. Pairwise potentials were derived from the frequency of residue pairs in nearest, second nearest, and third nearest contacts across neighboring beta-strands compared to the expected frequency of residue pairs in a random model. A pseudo-energy function based on these statistical pairwise potentials recognized native beta-sheets among possible alternative pairings. The native pairing was found within the three lowest energies in 73% of the cases in the training data set and in 63% of beta-sheets in a test data set of 67 proteins, which were not part of the training set. The energy function was also used to detect tripeptides, which occur frequently in beta-sheets of native proteins. The majority of native partners of tripeptides were distributed in a low energy range. Self-correcting distance geometry (SECODG) calculations using distance constraint sets derived from possible low energy pairings of beta-strands uniquely identified the native pairing of the beta-sheet in pancreatic trypsin inhibitor (BPTI). These results will be useful for predicting the structure of proteins from their amino acid sequence as well as for the design of proteins containing beta-sheets.
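
A pairwise statistical potential of this kind can be sketched as the negative log ratio of observed to expected pair frequencies. The abstract does not give the exact functional form, so the inverse-Boltzmann form, the composition-based random model, and the pseudocount below are assumptions for illustration.

```python
import math
from collections import Counter


def pair_potentials(pairs, pseudocount=1.0):
    """Return {(a, b): pseudo-energy} for unordered residue pairs.

    pairs: list of (residue, residue) contacts across neighboring strands.
    The pseudo-energy is -ln(observed / expected), where the expected count
    assumes residues pair at random according to their overall frequency.
    """
    pair_counts = Counter(tuple(sorted(p)) for p in pairs)
    residue_counts = Counter(r for p in pairs for r in p)
    total_pairs = len(pairs)
    total_residues = 2 * total_pairs
    residues = sorted(residue_counts)
    potentials = {}
    for i, a in enumerate(residues):
        for b in residues[i:]:
            fa = residue_counts[a] / total_residues
            fb = residue_counts[b] / total_residues
            # Unordered pairs of two different residues can arise two ways.
            expected = total_pairs * fa * fb * (1 if a == b else 2)
            observed = pair_counts[(a, b)]
            potentials[(a, b)] = -math.log(
                (observed + pseudocount) / (expected + pseudocount)
            )
    return potentials


# Hypothetical toy contact list: V-I pairs are over-represented.
toy_pairs = [("V", "I")] * 8 + [("V", "V")] + [("I", "I")]
potentials = pair_potentials(toy_pairs)
```

Pairs observed more often than the random model predicts receive negative (favorable) pseudo-energies, and under-represented pairs receive positive ones.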

**Pair potentials for protein folding: choice of reference states and sensitivity of predicted native states to variations in the interaction schemes.**

We examine the similarities and differences between two widely used knowledge-based potentials, which are expressed as contact matrices (consisting of 210 elements) that give a scale for interaction energies between the naturally occurring amino acid residues. These are the Miyazawa-Jernigan contact interaction matrix M and the potential matrix S derived by Skolnick J et al., 1997, Protein Sci 6:676-688. Although the correlation between the two matrices is good, there is a relatively large dispersion between the elements. We show that when Thr is chosen as a reference solvent within the Miyazawa and Jernigan scheme, the dispersion between the M and S matrices is reduced. The resulting interaction matrix B gives hydrophobicities that are in very good agreement with experiment. The small dispersion between the S and B matrices, which arises due to differing reference states, is shown to have a dramatic effect on the predicted native states of lattice models of proteins. These findings and other arguments are used to suggest that for reliable predictions of protein structures, pairwise additive potentials are not sufficient. We also establish that optimized protein sequences can tolerate relatively large random errors in the pair potentials. We conjecture that three-body interactions may be needed to predict the folds of proteins in a reliable manner.

**Cloning, overexpression, purification, and physicochemical characterization of a cold shock protein homolog from the hyperthermophilic bacterium Thermotoga maritima.**

Thermotoga maritima (Tm) expresses a 7 kDa monomeric protein whose 18 N-terminal amino acids show 81% identity to N-terminal sequences of cold shock proteins (Csps) from Bacillus caldolyticus and Bacillus stearothermophilus. There were only trace amounts of the protein in Thermotoga cells grown at 80 degrees C. Therefore, to perform physicochemical experiments, the gene was cloned in Escherichia coli. A DNA probe was produced by PCR from genomic Tm DNA with degenerate primers developed from the known N-terminus of TmCsp and the known C-terminus of CspB from Bacillus subtilis. Southern blot analysis of genomic Tm DNA allowed the production of a partial gene library, which was used as a template for PCRs with gene- and vector-specific primers to identify the complete DNA sequence. As reported for other csp genes, the 5' untranslated region of the mRNA was anomalously long; it contained the putative Shine-Dalgarno sequence. The coding part of the gene contained 198 bp, i.e., 66 amino acids. The sequence showed 61% identity to CspB from B. caldolyticus and high similarity to all other known Csps. Computer-based homology modeling allowed the conclusion that TmCsp represents a beta-barrel similar to CspB from B. subtilis and CspA from E. coli. As indicated by spectroscopic analysis, analytical gel permeation chromatography, and mass spectrometry, overexpression of the recombinant protein yielded authentic TmCsp with a molecular weight of 7,474 Da. This was in agreement with the results of analytical ultracentrifugation confirming the monomeric state of the protein. The temperature-induced equilibrium transition at 87 degrees C exceeds the maximum growth temperature of Tm and represents the maximal Tm-value reported for Csps so far.

**pKa calculations for class A beta-lactamases: influence of substrate binding.**

Beta-Lactamases are responsible for bacterial resistance to beta-lactams and are thus of major clinical importance. However, the identity of the general base involved in their mechanism of action is still unclear. Two candidate residues, Glu166 and Lys73, have been proposed to fulfill this role. Previous studies support the proposal that Glu166 acts during the deacylation, but there is no consensus on the possible role of this residue in the acylation step. Recent experimental data and theoretical considerations indicate that Lys73 is protonated in the free beta-lactamases, showing that this residue is unlikely to act as a proton abstractor. On the other hand, it has been proposed that the pKa of Lys73 would be dramatically reduced upon substrate binding and would thus be able to act as a base. To check this hypothesis, we performed continuum electrostatic calculations for five wild-type and three beta-lactamase mutants to estimate the pKa of Lys73 in the presence of substrates, both in the Henri-Michaelis complex and in the tetrahedral intermediate. In all cases, the pKa of Lys73 was computed to be above 10, showing that it is unlikely to act as a proton abstractor, even when a beta-lactam substrate is bound in the enzyme active site. The pKa of Lys234 is also raised in the tetrahedral intermediate, thus confirming a probable role of this residue in the stabilization of the tetrahedral intermediate. The influence of the beta-lactam carboxylate on the pKa values of the active-site lysines is also discussed.

**Simplified methods for pKa and acid pH-dependent stability estimation in proteins: removing dielectric and counterion boundaries.**

Much computational research aimed at understanding ionizable group interactions in proteins has focused on numerical solutions of the Poisson-Boltzmann (PB) equation, incorporating protein exclusion zones for solvent and counterions in a continuum model. Poor agreement with measured pKas and pH-dependent stabilities for a (protein, solvent) relative dielectric boundary of (4,80) has led to the adoption of an intermediate (20,80) boundary. It is now shown that a simple Debye-Hückel (DH) calculation, removing both the low dielectric and counterion exclusion regions associated with protein, is equally effective in general pKa calculations. However, a broad-based discrepancy with measured pH-dependent stabilities persists in the absence of ionizable group interactions in the unfolded state. A simple model is introduced for these interactions, with a significantly improved match to experiment that suggests a potential utility in predicting and analyzing the acid pH-dependence of protein stability. The methods are applied to the relative pH-dependent stabilities of the pore-forming domains of colicins A and N. The results relate generally to the well-known preponderance of surface ionizable groups with solvent-mediated interactions. Although numerical PB solutions do not currently have a significant advantage for overall pKa estimations, development based on consideration of microscopic solvation energetics in tandem with the continuum model could combine the large ΔpKas of a subset of ionizable groups with the overall robustness of the DH model.
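
A Debye-Hückel pairwise interaction in a uniform dielectric can be sketched as a screened Coulomb term. This is a rough illustration, not the paper's procedure: the constants are common textbook values for water at about 25 degrees C with a 1:1 electrolyte, and the function names, default dielectric, and example geometry are assumptions.

```python
import math

COULOMB_KCAL = 332.06  # kcal*Angstrom/(mol*e^2), Coulomb constant in these units
RT_LN10_KCAL = 1.364   # kcal/mol per pK unit at about 298 K (assumed temperature)


def debye_kappa(ionic_strength_molar):
    """Inverse Debye length in 1/Angstrom (1:1 electrolyte, water, ~25 C)."""
    return math.sqrt(ionic_strength_molar) / 3.04


def dh_energy(q1, q2, distance, ionic_strength_molar, dielectric=80.0):
    """Screened Coulomb energy in kcal/mol between two point charges.

    q1, q2 are in units of the elementary charge; distance is in Angstrom.
    No low-dielectric protein interior or ion exclusion zone is modeled.
    """
    kappa = debye_kappa(ionic_strength_molar)
    coulomb = COULOMB_KCAL * q1 * q2 / (dielectric * distance)
    return coulomb * math.exp(-kappa * distance)


def energy_to_pk_units(energy_kcal):
    """Convert an interaction energy to its magnitude in pK units."""
    return energy_kcal / RT_LN10_KCAL


if __name__ == "__main__":
    # Hypothetical ion pair 5 Angstrom apart, without and with 0.15 M salt.
    for ionic_strength in (0.0, 0.15):
        energy = dh_energy(+1, -1, 5.0, ionic_strength)
        print(ionic_strength, round(energy, 3),
              round(energy_to_pk_units(energy), 3))
```

Summing such terms over all charged neighbors of a titratable group gives a first estimate of how far its pKa is shifted from the model compound value; the direction of the shift depends on whether the group is acidic or basic.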

#### hierarchical

- Marginal maximum-likelihood procedures for parameter estimation and testing the fit of a hierarchical model for speed and accuracy on test items are presented. (wiley.com)

#### logistic regression models

- Due to severe underreporting of slight-injury crashes, binary and mixed binary logistic regression models were also estimated for two categories of severity: fatal and serious crashes. (surrey.ac.uk)

#### Approaches

- The work presented in this dissertation is a demonstration of statistical modeling approaches to evaluate population variability in anatomy of the knee and function of its tibiofemoral (TF) and patellofemoral (PF) joints. (du.edu)

#### predict

- But DHS isn't the only law enforcement agency looking to statistical modeling to predict crime. (technoccult.net)

#### multivariate

- The model is a composition of two first-level models for dichotomous responses and response times along with multivariate normal models for their item and person parameters. (wiley.com)
- Bayesian methodology using MCMC has been extended along with new material on smoothing models, multivariate responses, missing data, latent normal transformations for discrete responses, structural equation modeling and survival models. (idreambooks.com)

#### parameters

- To test the fit of the model, Lagrange multiplier tests of the assumptions of subpopulation invariance of the item parameters (i.e., no differential item functioning), the shape of the response functions, and three different types of conditional independence were derived. (wiley.com)
- This concept is used in the present study to construct adequate adjusted models that enable predictions of the different roughness parameters characterizing the machining of PEEK composites with PCD and K10 tools. (academicjournals.org)
- The results from both multinomial and binary response models are found to be fairly consistent but the results from the random parameters model seem more reasonable. (surrey.ac.uk)
- Statistical Shape Models capture the shape variation of a training set of shapes and can be registered to an image of an object of the class they represent by simple adjustment of their parameters. (uantwerpen.be)

#### Analyses

- The random and mixed-effects models were used for the statistical analyses. (apta.org)

#### estimation

- The model produces a shape that is an estimation of the shape of the patient's trachea if it were not narrowed. (uantwerpen.be)
- This is important when considering exact and asymptotic self-similar models concurrently in the self-similarity parameter estimation method. (utm.my)
- Due to the need for high accuracy and fast estimation, the Optimization Method (OM) based on the Second Order Self-similarity (SOSS) statistical model was proposed in previous work to estimate the self-similarity parameter. (utm.my)
- An inequality and associated maximization technique in statistical estimation for probabilistic functions of Markov processes. (projecteuclid.org)

#### framework

- Biomod2EZ adds to the functionality of Biomod2 by incorporating a report generation feature, detailed script annotation, and sample dataset/tutorial to ease the transition from ecological niche modeling using a Graphical User Interface to the coding environment of the R framework. (biorxiv.org)

#### predictions

- Three-dimensional (3D) computational models of the bone and cartilage in the knee were characterized using a principal component analysis (PCA) algorithm to understand the primary sources of variability in shape and motion and make predictions from sparse data. (du.edu)
- Statistical models were used to investigate relationships between natural knee anatomy and kinematics and make predictions of both shape and function from sparse data. (du.edu)
- Our current project examines how to choose distinctive (i.e., salient) colors for items on high-clutter and low-clutter maps using predictions of a model of visual search. (arvojournals.org)

#### findings

- Results from tissue specimens of patients with symptomatic OA strikingly resembled our findings from the OA animal model. (bmj.com)
- The DAATS Model: Initial psychometric and statistical findings: A top ten illustration. (usfsp.edu)

#### Methods

- Harvey Goldstein is the Professor of Statistical Methods at the Institute of Education, University of London. (idreambooks.com)

#### adjustment

- In many scientific fields, non-linear regression based models are of great utility to perform curve adjustment of experimental data. (academicjournals.org)
- After adjustment of models, better diabetes-related processes of care, better health status, and non-Medicaid insurance were associated with mammography performance. (diabetesjournals.org)

#### Probability

- A method to calculate coverage probability from uncertainties in radiotherapy via a statistical shape model. (openrepository.com)
- We introduce a novel method of generating a coverage probability matrix, that may be used to determine treatment margins and calculate uncertainties in dose, from this statistical shape model. (openrepository.com)

#### cartilage

- Results In the DMM model, the loss of PKCδ expression prevented cartilage degeneration but exacerbated OA-associated hyperalgesia. (bmj.com)

#### geometric

- The idea is that a model with healthy tracheas only will not be influenced by local geometric variations typical of stenosis. (uantwerpen.be)
- In this paper we describe a technique that may be used to model the geometric uncertainties that accrue during the radiotherapy process. (openrepository.com)

#### 2016

- Smoger, Lowell Matthew, "Statistical Modeling to Investigate Anatomy and Function of the Knee" (2016). (du.edu)
- 2016) that is used to create ensemble ecological niche models using up to 11 different modeling techniques. (biorxiv.org)

#### shape

- We have applied statistical shape models of healthy tracheas to the assessment and stenting of tracheal stenosis. (uantwerpen.be)
- Results from this study were used in a subsequent investigation to build a statistical model of two-dimensional (2D) shape and alignment measures and 6 degree-of-freedom (DOF) kinematics to identify the key measures capable of predicting PF joint motion. (du.edu)
- The ability to reconstruct the 3D implanted patellar bone of a subject with a total knee replacement (TKR) was evaluated by a statistical shape model of the patella and simulated 2D edge profiles in a custom optimization algorithm. (du.edu)

#### data

- A statistical method is described which enables comparisons to be made between two sets of data where each datum can only be expressed as a positive or negative value. (strath.ac.uk)
- Multilevel modelling is now the accepted statistical technique for handling such data and is widely available in computer software packages. (idreambooks.com)
- Two nominal response models have been developed for injury-related crash data: a standard multinomial logit model (MNL) and a mixed logit model. (surrey.ac.uk)
- Using data from in-treatment cone beam CT scans, we simultaneously analyse non-uniform observer delineation variability and organ motion together with patient set-up errors via the creation of a point distribution model (PDM). (openrepository.com)

#### applications

- Key Features: Provides a clear introduction and a comprehensive account of multilevel models. New methodological developments and applications are explored. Written by a leading expert in the field of multilevel methodology. Illustrated throughout with real-life examples, explaining theoretical concepts. (idreambooks.com)
- The potential of graphical models is explored and illustrated through a number of example applications where the genetic element is substantial or dominating. (projecteuclid.org)

#### type

- Using levels of receptor chain expression and known binding affinities, we modeled the assemblage of functional type I and II receptor complexes. (rupress.org)

#### results

- We concluded that in both primary and metastatic pancreatic cancer models, the synthetic gene delivery system can achieve in vivo sst2 gene transfer and results in a significant antitumor effect characterized by an increase of apoptosis and an inhibition of cell proliferation. (aacrjournals.org)
- Biomod2EZ − An R script suite for visualizing projected niche model ensembles and reporting statistical results. (biorxiv.org)

#### natural

- This paper introduces graphical models as a natural environment in which to formulate and solve problems in genetics and related areas. (projecteuclid.org)

#### techniques

- This new edition of Multilevel Statistical Models brings these techniques together, starting from basic ideas and illustrating how more complex models are derived. (idreambooks.com)

#### mouse model

- Because liver tissue from pregnant women is not readily available, in the present study, we investigated the mechanism of such pregnancy-related changes in GLB disposition in a mouse model. (aspetjournals.org)
- The mouse model was used in our study for two reasons. (aspetjournals.org)

#### animal model

- To test this hypothesis, an animal model would be required, because liver tissue from pregnant women is not readily available. (aspetjournals.org)

#### utility

- The utility of statistical modeling is elucidated by the population-based evaluations of the musculoskeletal system described in this work and could continue to inform characteristics related to pathological conditions and large-scale computational evaluations of implant performance. (du.edu)

#### study

- In the present study, in vivo gene transfer of sst2 was investigated in two transplantable models of primary and metastatic pancreatic carcinoma developed in hamsters. (aacrjournals.org)
- In medicine, compartment models have been used to study the dynamic flow of chemicals (nutrients, hormones, drugs, radio-isotopes, etc.) between different organs of the human body. (eclipse.org)

#### different

- For each map, we calculated the model-predicted salience of a pushpin added to the map, given different potential pushpin colors (the 267 colors in the ISCC-NBS standard color system). (arvojournals.org)

#### book

- Perhaps the best reference to learn about the Epidemiological Compartment models defined in STEM is the book by Anderson and May [1]. (eclipse.org)

#### dynamic

- Application of dynamic and statistical models for calculation of runout distance. (europa.eu)

#### expression

- Our previous studies conducted in pancreatic cancer models established in nude mice and hamsters revealed that cloned somatostatin receptor subtype 2 (sst2) gene expression induced both antioncogenic and local antitumor bystander effects in vivo. (aacrjournals.org)

#### Overview

- This paper follows the overview found in The DAATS Model: Where it Comes from and What it is (Wilkerson, 2008). (usfsp.edu)

#### paper

- This paper reviews commonly-applied physical models in the context of weak plume identification and quantification, identifies inherent error sources as well as those introduced by making simplifying assumptions, and indicates research areas. (mdpi.com)
- The primary objective of this paper is therefore to explore factors affecting the severity and frequency of road crashes in Riyadh city using appropriate statistical models aiming to establish effective safety policies ready to be implemented to reduce the severity and frequency of road crashes in Riyadh city. (surrey.ac.uk)

#### Analysis

- For frequency, two count models such as Negative Binomial (NB) models were employed and the unit of analysis was 168 HAIs (wards) in Riyadh city. (surrey.ac.uk)

#### distribution

- A compartment model that deals only with the trajectory of a disease in time implicitly assumes that the population (or populations) in question is so well mixed that there is no need to model the spatial distribution of people. (eclipse.org)

#### areas

- Particular emphasis is given to the relationships among various local computation algorithms which have been developed within the hitherto mostly separate areas of graphical models and genetics. (projecteuclid.org)

#### material

- The flow of material in a compartment model follows certain rules. (eclipse.org)

