A computational screen for methylation guide snoRNAs in yeast. (1/16923)

Small nucleolar RNAs (snoRNAs) are required for ribose 2'-O-methylation of eukaryotic ribosomal RNA. Many of the genes for this snoRNA family have remained unidentified in Saccharomyces cerevisiae, despite the availability of a complete genome sequence. Probabilistic modeling methods akin to those used in speech recognition and computational linguistics were used to computationally screen the yeast genome and identify 22 methylation guide snoRNAs, snR50 to snR71. Gene disruptions and other experimental characterization confirmed their methylation guide function. In total, 51 of the 55 ribose methylated sites in yeast ribosomal RNA were assigned to 41 different guide snoRNAs.  (+info)

Influence of sampling on estimates of clustering and recent transmission of Mycobacterium tuberculosis derived from DNA fingerprinting techniques. (2/16923)

The availability of DNA fingerprinting techniques for Mycobacterium tuberculosis has led to attempts to estimate the extent of recent transmission in populations, using the assumption that groups of tuberculosis patients with identical isolates ("clusters") are likely to reflect recently acquired infections. It is never possible to include all cases of tuberculosis in a given population in a study, and the proportion of isolates found to be clustered will depend on the completeness of the sampling. Using stochastic simulation models based on real and hypothetical populations, the authors demonstrate the influence of incomplete sampling on the estimates of clustering obtained. The results show that as the sampling fraction increases, the proportion of isolates identified as clustered also increases and the variance of the estimated proportion clustered decreases. Cluster size is also important: the underestimation of clustering for any given sampling fraction is greater, and the variability in the results obtained is larger, for populations with small clusters than for those with the same number of individuals arranged in large clusters. A considerable amount of caution should be used in interpreting the results of studies on clustering of M. tuberculosis isolates, particularly when sampling fractions are small.  (+info)

Capture-recapture models including covariate effects. (3/16923)

Capture-recapture methods are used to estimate the incidence of a disease, using a multiple-source registry. Usually, log-linear methods are used to estimate population size, assuming that not all sources of notification are dependent. Where there are categorical covariates, a stratified analysis can be performed. The multinomial logit model has occasionally been used. In this paper, the authors compare log-linear and logit models with and without covariates, and use simulated data to compare estimates from different models. The crude estimate of population size is biased when the sources are not independent. Analyses adjusting for covariates produce less biased estimates. In the absence of covariates, or where all covariates are categorical, the log-linear model and the logit model are equivalent. The log-linear model cannot include continuous variables. To minimize potential bias in estimating incidence, covariates should be included in the design and analysis of multiple-source disease registries.  (+info)

Sequence specificity, statistical potentials, and three-dimensional structure prediction with self-correcting distance geometry calculations of beta-sheet formation in proteins. (4/16923)

A statistical analysis of a representative data set of 169 known protein structures was used to analyze the specificity of residue interactions between spatial neighboring strands in beta-sheets. Pairwise potentials were derived from the frequency of residue pairs in nearest contact, second nearest and third nearest contacts across neighboring beta-strands compared to the expected frequency of residue pairs in a random model. A pseudo-energy function based on these statistical pairwise potentials recognized native beta-sheets among possible alternative pairings. The native pairing was found within the three lowest energies in 73% of the cases in the training data set and in 63% of beta-sheets in a test data set of 67 proteins, which were not part of the training set. The energy function was also used to detect tripeptides, which occur frequently in beta-sheets of native proteins. The majority of native partners of tripeptides were distributed in a low energy range. Self-correcting distance geometry (SECODG) calculations using distance constraints sets derived from possible low energy pairing of beta-strands uniquely identified the native pairing of the beta-sheet in pancreatic trypsin inhibitor (BPTI). These results will be useful for predicting the structure of proteins from their amino acid sequence as well as for the design of proteins containing beta-sheets.  (+info)

Pair potentials for protein folding: choice of reference states and sensitivity of predicted native states to variations in the interaction schemes. (5/16923)

We examine the similarities and differences between two widely used knowledge-based potentials, which are expressed as contact matrices (consisting of 210 elements) that gives a scale for interaction energies between the naturally occurring amino acid residues. These are the Miyazawa-Jernigan contact interaction matrix M and the potential matrix S derived by Skolnick J et al., 1997, Protein Sci 6:676-688. Although the correlation between the two matrices is good, there is a relatively large dispersion between the elements. We show that when Thr is chosen as a reference solvent within the Miyazawa and Jernigan scheme, the dispersion between the M and S matrices is reduced. The resulting interaction matrix B gives hydrophobicities that are in very good agreement with experiment. The small dispersion between the S and B matrices, which arises due to differing reference states, is shown to have dramatic effect on the predicted native states of lattice models of proteins. These findings and other arguments are used to suggest that for reliable predictions of protein structures, pairwise additive potentials are not sufficient. We also establish that optimized protein sequences can tolerate relatively large random errors in the pair potentials. We conjecture that three body interaction may be needed to predict the folds of proteins in a reliable manner.  (+info)

Cloning, overexpression, purification, and physicochemical characterization of a cold shock protein homolog from the hyperthermophilic bacterium Thermotoga maritima. (6/16923)

Thermotoga maritima (Tm) expresses a 7 kDa monomeric protein whose 18 N-terminal amino acids show 81% identity to N-terminal sequences of cold shock proteins (Csps) from Bacillus caldolyticus and Bacillus stearothermophilus. There were only trace amounts of the protein in Thermotoga cells grown at 80 degrees C. Therefore, to perform physicochemical experiments, the gene was cloned in Escherichia coli. A DNA probe was produced by PCR from genomic Tm DNA with degenerated primers developed from the known N-terminus of TmCsp and the known C-terminus of CspB from Bacillus subtilis. Southern blot analysis of genomic Tm DNA allowed to produce a partial gene library, which was used as a template for PCRs with gene- and vector-specific primers to identify the complete DNA sequence. As reported for other csp genes, the 5' untranslated region of the mRNA was anomalously long; it contained the putative Shine-Dalgarno sequence. The coding part of the gene contained 198 bp, i.e., 66 amino acids. The sequence showed 61% identity to CspB from B. caldolyticus and high similarity to all other known Csps. Computer-based homology modeling allowed the conclusion that TmCsp represents a beta-barrel similar to CspB from B. subtilis and CspA from E. coli. As indicated by spectroscopic analysis, analytical gel permeation chromatography, and mass spectrometry, overexpression of the recombinant protein yielded authentic TmCsp with a molecular weight of 7,474 Da. This was in agreement with the results of analytical ultracentrifugation confirming the monomeric state of the protein. The temperature-induced equilibrium transition at 87 degrees C exceeds the maximum growth temperature of Tm and represents the maximal Tm-value reported for Csps so far.  (+info)

pKa calculations for class A beta-lactamases: influence of substrate binding. (7/16923)

Beta-Lactamases are responsible for bacterial resistance to beta-lactams and are thus of major clinical importance. However, the identity of the general base involved in their mechanism of action is still unclear. Two candidate residues, Glu166 and Lys73, have been proposed to fulfill this role. Previous studies support the proposal that Glu166 acts during the deacylation, but there is no consensus on the possible role of this residue in the acylation step. Recent experimental data and theoretical considerations indicate that Lys73 is protonated in the free beta-lactamases, showing that this residue is unlikely to act as a proton abstractor. On the other hand, it has been proposed that the pKa of Lys73 would be dramatically reduced upon substrate binding and would thus be able to act as a base. To check this hypothesis, we performed continuum electrostatic calculations for five wild-type and three beta-lactamase mutants to estimate the pKa of Lys73 in the presence of substrates, both in the Henri-Michaelis complex and in the tetrahedral intermediate. In all cases, the pKa of Lys73 was computed to be above 10, showing that it is unlikely to act as a proton abstractor, even when a beta-lactam substrate is bound in the enzyme active site. The pKa of Lys234 is also raised in the tetrahedral intermediate, thus confirming a probable role of this residue in the stabilization of the tetrahedral intermediate. The influence of the beta-lactam carboxylate on the pKa values of the active-site lysines is also discussed.  (+info)

Simplified methods for pKa and acid pH-dependent stability estimation in proteins: removing dielectric and counterion boundaries. (8/16923)

Much computational research aimed at understanding ionizable group interactions in proteins has focused on numerical solutions of the Poisson-Boltzmann (PB) equation, incorporating protein exclusion zones for solvent and counterions in a continuum model. Poor agreement with measured pKas and pH-dependent stabilities for a (protein, solvent) relative dielectric boundary of (4,80) has lead to the adoption of an intermediate (20,80) boundary. It is now shown that a simple Debye-Huckel (DH) calculation, removing both the low dielectric and counterion exclusion regions associated with protein, is equally effective in general pKa calculations. However, a broad-based discrepancy to measured pH-dependent stabilities is maintained in the absence of ionizable group interactions in the unfolded state. A simple model is introduced for these interactions, with a significantly improved match to experiment that suggests a potential utility in predicting and analyzing the acid pH-dependence of protein stability. The methods are applied to the relative pH-dependent stabilities of the pore-forming domains of colicins A and N. The results relate generally to the well-known preponderance of surface ionizable groups with solvent-mediated interactions. Although numerical PB solutions do not currently have a significant advantage for overall pKa estimations, development based on consideration of microscopic solvation energetics in tandem with the continuum model could combine the large deltapKas of a subset of ionizable groups with the overall robustness of the DH model.  (+info)

... is a bimonthly peer-reviewed scientific journal covering statistical modelling. It is published by SAGE ... "Statistical Modelling". 2014 Journal Citation Reports. Web of Science (Science ed.). Thomson Reuters. 2015. Official website v ... Publications on behalf of the Statistical Modelling Society. The editors-in-chief are Brian D. Marx (Louisiana State University ...
... model Response modeling methodology Scientific model Statistical inference Statistical model specification Statistical model ... More generally, statistical models are part of the foundation of statistical inference. Informally, a statistical model can be ... A statistical model is a special class of mathematical model. What distinguishes a statistical model from other mathematical ... A statistical model is a mathematical model that embodies a set of statistical assumptions concerning the generation of sample ...
... model Sensitivity analysis Spurious relationship Statistical conclusion validity Statistical model selection Statistical model ... After the statistical model is fitted, the validation set is used as a measure of the model's error. If the model fits well on ... model validation is the task of evaluating whether a chosen statistical model is appropriate or not. Oftentimes in statistical ... To combat this, model validation is used to test whether a statistical model can hold up to permutations in the data. This ...
In statistics, model specification is part of the process of building a statistical model: specification consists of selecting ... For more on this topic, see statistical model selection. Abductive reasoning Conceptual model Data analysis Data transformation ... in a task called statistical model validation. Theoretical understanding can then guide the modification of the model in such a ... The statistician Sir David Cox has said, "How [the] translation from subject-matter problem to statistical model is done is ...
Statutes Statistical Modelling Society "Statistical Modelling Society - Executives and Working Groups". Retrieved 15 June 2018 ... Statistical Modelling. The society holds bi-annual elections to elect an Executive Committee. The Statistical Modelling Society ... and applications in statistical modelling; and bring together statisticians working on statistical modelling from various ... will promote statistical modelling as the general framework for the application of statistical ideas; promote important ...
The test statistic is constructed as: c ( β ^ u ′ ) [ ∇ β c ( β ^ u ) V ^ ∇ β c ( β ^ u ) ′ ] − 1 c ( β ^ u ) → d X Q 2 {\ ... This type of model is applied in many economic contexts, especially in modelling the choice-making behavior. For instance, Y i ... Denote a binary response index model as: P [ Y i = 1 ∣ X i ] = G ( X i β ) {\displaystyle P[Y_{i}=1\mid X_{i}]=G(X_{i}\beta ... In most of the cases, it is not easy or even feasible to compute the MLE under the restricted model when c ( β ) {\displaystyle ...
Exponential Random Graph Models (ERGMs) are a family of statistical models for analyzing data from social and other networks. ... To support statistical inference on the processes influencing the formation of network structure, a statistical model should ... Alternative statistical models should reflect the uncertainty associated with a given observation, permit inference about the ... The Exponential family is a broad family of models for covering many types of data, not just networks. An ERGM is a model from ...
Jordan, M. I. (2004). "Graphical Models". Statistical Science. 19: 140-155. doi:10.1214/088342304000000026. Ghahramani, Zoubin ... A graphical model or probabilistic graphical model (PGM) or structured probabilistic model is a probabilistic model for which a ... Classic machine learning models like hidden Markov models, neural networks and newer models such as variable-order Markov ... A restricted Boltzmann machine is a bipartite generative model specified over an undirected graph. The framework of the models ...
Aaron, S. D.; Stephenson, A. L.; Cameron, D. W.; Whitmore, G. A. (2015). "A statistical model to predict one-year risk of death ... An interest in the mathematical properties of first-hitting-times and statistical models and methods for analysis of survival ... Whitmore, G. A.; Neufeldt, A. H. (1970). "An application of statistical models in mental health research". Bull. Math. Biophys ... The model considers the event that the amount of money reaches 0, representing bankruptcy. The model can answer questions such ...
Cox, David R (1972). "Regression Models and Life-Tables". Journal of the Royal Statistical Society, Series B. 34 (2): 187-220. ... Proportional hazards models are a class of survival models in statistics. Survival models relate the time that passes, before ... Often there is an intercept term (also called a constant term or bias term) used in regression models. The Cox model lacks one ... The term Cox regression model (omitting proportional hazards) is sometimes used to describe the extension of the Cox model to ...
A mixed model, mixed-effects model or mixed error-component model is a statistical model containing both fixed effects and ... model Fixed effects model Generalized linear mixed model Linear regression Mixed-design analysis of variance Multilevel model ... non-linear mixed effects models, missing data in mixed effects models, and Bayesian estimation of mixed effects models. Mixed ... Subsequently, mixed modeling has become a major area of statistical research, including work on computation of maximum ...
A hurdle model is a class of statistical models where a random variable is modelled using two parts, the first which is the ... Hurdle models differ from zero-inflated models in that zero-inflated models model the zeros using a two-component mixture model ... Zero-inflated model Truncated normal hurdle model Cragg, John G. (1971). "Some Statistical Models for Limited Dependent ... and a probit model was used to model the zeros. The probit part of the model was said to model the presence of "hurdles" that ...
... are a sequence of increasingly complex models used in statistical machine translation to train a ... extra fertility model Model 4: added relative alignment model Model 5: fixed deficiency problem. Model 6: Model 4 combined with ... The sequence of the six models can be summarized as: Model 1: lexical translation Model 2: additional absolute alignment model ... deficient models). IBM Model 5 reformulates IBM Model 4 by enhancing the alignment model with more training parameters in order ...
An important question in statistical mechanics is the dependence of model behaviour on the dimension of the system. The ... 2 Ising and Potts models". Journal of Statistical Physics. 50 (1-2): 1-40. Bibcode:1988JSP....50....1A. doi:10.1007/BF01022985 ... Consider the Ising model with the Hamiltonian (with N spins) H = − 1 2 ∑ i , j J ( r ( i , j ) ) s i s j {\displaystyle H=-{\ ... The shortcut model starts with a network built on a one-dimensional regular lattice. One then adds edges to create shortcuts ...
Bach, V.; Lieb, E. H.; Solovej, J. P. (1994). "Generalized Hartree-Fock Theory and the Hubbard Model". Journal of Statistical ... Anderson impurity model Bloch's theorem Electronic band structure Solid-state physics Bose-Hubbard model t-J model Heisenberg ... the behavior of the Hubbard model can be qualitatively different from a tight-binding model. For example, the Hubbard model ... The model is named for John Hubbard. The Hubbard model states that each electron experiences competing forces: one pushes it to ...
Luo, Xiyang; Bertozzi, Andrea L. (2017-05-01). "Convergence of the Graph Allen-Cahn Scheme". Journal of Statistical Physics. ... Phase-field models on graphs are a discrete analogue to phase-field models, defined on a graph. They are used in image analysis ... In analogy to continuum phase-field models, where regions with u close to 0 or 1 are models for two phases of the material, ... Graph cuts in computer vision Bertozzi, A.; Flenner, A. (2012-01-01). "Diffuse Interface Models on Graphs for Classification of ...
Journal of Statistical Physics. 141 (3): 459-475. arXiv:0910.0627. Bibcode:2010JSP...141..459A. doi:10.1007/s10955-010-0056-z. ... As opposed to the Erdős-Rényi model, the degree sequence of the configuration model is not restricted to have a Poisson ... This feature of the baseline model contradicts the known properties of empirical networks, but extensions of the model can ... in the configuration model (see the page modularity for details). In the DCM (directed configuration model), each node is given ...
Kinetic exchange models are multi-agent dynamic models inspired by the statistical physics of energy distribution, which try to ... Basic tools used in this type of modelling are probabilistic and statistical methods mostly taken from the kinetic theory of ... Cordier, S.; Pareschi, L.; Toscani, G. (2005). "On a kinetic model for a simple market economy". Journal of Statistical Physics ... The main modelling effort has been put to introduce the concepts of savings, and taxation in the setting of an ideal gas-like ...
... is implemented in the Statistical Parametric Mapping toolbox, in the Matlab function spm_log_evidence_ ... for this reduced model are rapidly computed from the full model using Bayesian model reduction. The hypothesis that the ... A full model is fitted to data using standard approaches. Hypotheses are then tested by defining one or more 'reduced' models ... Bayesian model reduction is a method for computing the evidence and posterior over the parameters of Bayesian models that ...
As with any statistical model it is important to check the model assumptions of a GAM. Residual plots should be examined in the ... "Smoothing parameter and model selection for general smooth models (with discussion)". Journal of the American Statistical ... in the model with such basis expansions we have turned the GAM into a generalized linear model (GLM), with a model matrix that ... In common with most R modelling functions gam expects a model formula to be supplied, specifying the model structure to fit. ...
Statistical method Simultaneous equations model - Type of statistical model Causal map - A network consisting of links or arcs ... Mixture model,lLatent class models[citation needed] Multilevel models, hierarchical models (e.g. people nested in groups)[ ... Fusion validity models item response theory models[citation needed] Latent class models[citation needed] Latent growth modeling ... multi-group modeling, longitudinal modeling, partial least squares path modeling, latent growth modeling and hierarchical or ...
... random-effects models, random parameter models, or split-plot designs) are statistical models of parameters that vary at more ... Multilevel models (also known as hierarchical linear models, linear mixed-effect model, mixed models, nested data models, ... See further Model selection. Multilevel models have the same assumptions as other major general linear models (e.g., ANOVA, ... Multilevel models are a subclass of hierarchical Bayesian models, which are general models with multiple levels of random ...
v t e v t e (Chemical physics, Monte Carlo methods, Statistical mechanics, Computational physics, All stub articles, ... The model belongs to the universality class of directed percolation. The model was modified several times. Ziff RM, Gulari E, ... The model consists of three steps: Adsorption of the reacting species CO and O2 The actual reaction step on the surface: CO + O ... The Ziff-Gulari-Barshad (ZGB) model is a simple Monte Carlo method for catalytic reactions of oxidation of carbon monoxide to ...
"The statistical mechanics of networks". arXiv:cond-mat/0405566. van der Hoorn, Pim; Gabor Lippner; Dmitri Krioukov (2017-10-10 ... Maximum-entropy random graph models are random graph models used to study complex networks subject to the principle of maximum ... as well as the configuration model (CM). and soft configuration model (SCM) (which each have n {\displaystyle n} local ... producing an exponential random graph model (ERGM). Suppose we are building a random graph model consisting of a probability ...
... is that the Ising model is useful for any model of neural function, because a statistical model for neural activity should be ... Ising model Swendsen-Wang algorithm t-J model Two-dimensional critical Ising model Wolff algorithm XY model Z N model See ... Spin models, Exactly solvable models, Statistical mechanics, Lattice models). ... Ward Kuramoto model Maximal evenness Order operator Potts model (common with Ashkin-Teller model) Spin models Square-lattice ...
Statistical Methods and Applications. 21 (3): 335-339. doi:10.1007/s10260-012-0196-1. (Articles lacking in-text citations from ... to say that STAR models nest the SETAR model lacks justification. Unfortunately, whether one should use a SETAR model or a STAR ... The models can be thought of in terms of extension of autoregressive models discussed above, allowing for changes in the model ... The model is usually referred to as the STAR(p) models proceeded by the letter describing the transition function (see below) ...
Journal of Statistical Physics. 162 (5): 1353-1364. doi:10.1007/s10955-015-1412-9. PMC 4761375. PMID 26941467. Mora, Thierry; ... Optimality modeling is the modeling aspect of optimization theory. It allows for the calculation and visualization of the costs ... The results from Parker's experiment agree with this model. One common use of the optimality model is in optimal foraging ... In his model, Zach predicted the optimal height for crows to drop the whelks. To do this, Zach calculated the total distance ...
CAMO Software Statistical Methods. Lee, Michael; Steyvers, Mark; de Young, Mindy; Miller, Brent (2011). "A Model-Based Approach ... A Thurstonian model is a stochastic transitivity model with latent variables for describing the mapping of some continuous ... Thurstonian models have been used as an alternative to generalized linear models in analysis of sensory discrimination tasks. ... Prior to 1999, Thurstonian models were rarely used for modeling tasks involving more than 4 options because of the high- ...
Fey, A.; Levine, L.; Peres, Y. (2010). "Growth Rates and Explosions in Sandpiles". Journal of Statistical Physics. 138 (1-3): ... The Abelian sandpile model (ASM) is the more popular name of the original Bak-Tang-Wiesenfeld model (BTW). BTW model was the ... The extended sandpile model is defined nearly exactly the same as the usual sandpile model (i.e. the original Bak-Tang- ... A strongly related model is the so-called divisible sandpile model, introduced by Levine and Peres in 2008, in which, instead ...
... a semiparametric model is a statistical model that has parametric and nonparametric components. A statistical model is a ... Semiparametric regression Statistical model Generalized method of moments Bickel, P. J.; Klaassen, C. A. J.; Ritov, Y.; Wellner ... eds.), Encyclopedia of Statistical Sciences, Wiley. Oakes, D. (2006), "Semi-parametric models", in Kotz, S.; et al. (eds.), ... These models often use smoothing or kernels. A well-known example of a semiparametric model is the Cox proportional hazards ...

No FAQ available that match "models statistical"

No images available that match "models statistical"