Run computer experiments using the adaptive composite grid algorithm with a Gaussian process model. The algorithm works best when running an experiment that can evaluate thousands of points from a deterministic computer simulation. This package is an implementation of a forthcoming paper by Plumlee, Erickson, Ankenman, et al. For a preprint of the paper, contact the maintainer of this package.
When fitting a set of linear regressions which have some same variables, we can separate the matrix and reduce the computation cost. This package aims to fit a set of repeated linear regressions faster. More details can be found in this blog Lijun Wang (2017) <https://stats.hohoweiya.xyz/regression/2017/09/26/An-R-Package-Fit-Repeated-Linear-Regressions/>.
Computes and tests individual (species, phylogenetic and functional) diversity-area relationships, i.e., how species-, phylogenetic- and functional-diversity varies with spatial scale around the individuals of some species in a community. See applications of these methods in Wiegand et al. (2007) <doi:10.1073/pnas.0705621104> or Chacon-Labella et al. (2016) <doi:10.1007/s00442-016-3547-z>.
This package implements a Gibbs sampler to do linear regression with multiple covariates, multiple responses, Gaussian measurement errors on covariates and responses, Gaussian intrinsic scatter, and a covariate prior distribution which is given by either a Gaussian mixture of specified size or a Dirichlet process with a Gaussian base distribution. Described further in Mantz (2016) <DOI:10.1093/mnras/stv3008>.
This package provides functions and wrappers for using the Multiple Aggregation Prediction Algorithm (MAPA) for time series forecasting. MAPA models and forecasts time series at multiple temporal aggregation levels, thus strengthening and attenuating the various time series components for better holistic estimation of its structure. For details see Kourentzes et al. (2014) <doi:10.1016/j.ijforecast.2013.09.006>.
Nonparametric test of independence between a pair of spatial objects (random fields, point processes) based on random shifts with torus or variance correction. See MrkviÄ ka et al. (2021) <doi:10.1016/j.spasta.2020.100430>, DvoŠák et al. (2022) <doi:10.1111/insr.12503>, DvoŠák and MrkviÄ ka (2024) <doi:10.1080/10618600.2024.2357626>.
This package provides a first implementation of automated parsing of user stories, when used to defined functional requirements for operational research mathematical models. It allows reading user stories, splitting them on the who-what-why template, and classifying them according to the parts of the mathematical model that they represent. Also provides semantic grouping of stories, for project management purposes.
This package provides methods for statistical analysis and reporting preferred by the US Office of Evaluation Sciences (OES). This package prepares data from standard model output objects (such as from \codelm()
and \codeestimatr::lm_robust()
) and creates visualizations of treatment effects from the prepared quantities, according to the standards of the US Office of Evaluation Sciences.
Analyse common types of plant phenotyping data, provide a simplified interface to longitudinal growth modeling and select Bayesian statistics, and streamline use of PlantCV
output. Several Bayesian methods and reporting guidelines for Bayesian methods are described in Kruschke (2018) <doi:10.1177/2515245918771304>, Kruschke (2013) <doi:10.1037/a0029146>, and Kruschke (2021) <doi:10.1038/s41562-021-01177-7>.
This package implements a three-dimensional stochastic model of cancer growth and mutation similar to the one described in Waclaw et al. (2015) <doi:10.1038/nature14971>. Allows for interactive 3D visualizations of the simulated tumor. Provides a comprehensive summary of the spatial distribution of mutants within the tumor. Contains functions which create synthetic sequencing datasets from the generated tumor.
Seven different methods for multiple testing problems. The SGoF-type
methods (see for example, Carvajal Rodrà guez et al., 2009 <doi:10.1186/1471-2105-10-209>; de Uña à lvarez, 2012 <doi:10.1515/1544-6115.1812>; Castro Conde et al., 2015 <doi:10.1177/0962280215597580>) and the BH and BY false discovery rate controlling procedures.
This package provides functions for implementing the targeted gold standard (GS) testing. You provide the true disease or treatment failure status and the risk score, tell TGST the availability of GS tests and which method to use, and it returns the optimal tripartite rules. Please refer to Liu et al. (2013) <doi:10.1080/01621459.2013.810149> for more details.
The ToxCast
Data Analysis Pipeline ('tcpl') is an R package that manages, curve-fits, plots, and stores ToxCast
data to populate its linked MySQL
database, invitrodb'. The package was developed for the chemical screening data curated by the US EPA's Toxicity Forecaster (ToxCast
) program, but tcpl can be used to support diverse chemical screening efforts.
Used to determine which cell types are enriched within gene lists. The package provides tools for testing enrichments within simple gene lists (such as human disease associated genes) and those resulting from differential expression studies. The package does not depend upon any particular Single Cell Transcriptome dataset and user defined datasets can be loaded in and used in the analyses.
This package provides the output of running Salmon on a set of 12 RNA-seq samples from King & Klose, "The pioneer factor OCT4 requires the chromatin remodeller BRG1 to support gene regulatory element function in mouse embryonic stem cells", published in eLIFE
, March 2017. For details on version numbers and how the samples were processed see the package vignette.
Subtyping via Consensus Factor Analysis (SCFA) can efficiently remove noisy signals from consistent molecular patterns in multi-omics data. SCFA first uses an autoencoder to select only important features and then repeatedly performs factor analysis to represent the data with different numbers of factors. Using these representations, it can reliably identify cancer subtypes and accurately predict risk scores of patients.
This package simulates continuous distributions of random vectors using Markov chain Monte Carlo (MCMC). Users specify the distribution by an R function that evaluates the log unnormalized density. Algorithms are random walk Metropolis algorithm (function metrop
), simulated tempering (function temper
), and morphometric random walk Metropolis (function morph.metrop
), which achieves geometric ergodicity by change of variable.
mlr3
enables efficient, object-oriented programming on the building blocks of machine learning. It provides R6
objects for tasks, learners, resamplings, and measures. The package is geared towards scalability and larger datasets by supporting parallelization and out-of-memory data-backends like databases. While mlr3
focuses on the core computational operations, add-on packages provide additional functionality.
Similarly to Schafer's package pan, jomo is a package for multilevel joint modelling multiple imputation http://doi.org/10.1002/9781119942283. Novel aspects of jomo are the possibility of handling binary and categorical data through latent normal variables, the option to use cluster-specific covariance matrices and to impute compatibly with the substantive model.
The snow package provides support for simple parallel computing on a network of workstations using R. A master R process calls makeCluster
to start a cluster of worker processes; the master process then uses functions such as clusterCall
and clusterApply
to execute R code on the worker processes and collect and return the results on the master.
This package contains utilities for the analysis of Michaelian kinetic data. Beside the classical linearization methods (Lineweaver-Burk, Eadie-Hofstee, Hanes-Woolf and Eisenthal-Cornish-Bowden), features include the ability to carry out weighted regression analysis that, in most cases, substantially improves the estimation of kinetic parameters (Aledo (2021) <doi:10.1002/bmb.21522>). To avoid data transformation and the potential biases introduced by them, the package also offers functions to directly fitting data to the Michaelis-Menten equation, either using ([S], v) or (time, [S]) data. Utilities to simulate substrate progress-curves (making use of the Lambert W function) are also provided. The package is accompanied of vignettes that aim to orientate the user in the choice of the most suitable method to estimate the kinetic parameter of an Michaelian enzyme.
This package provides functions provide risk projections of invasive breast cancer based on Gail model according to National Cancer Institute's Breast Cancer Risk Assessment Tool algorithm for specified race/ethnic groups and age intervals. Gail MH, Brinton LA, et al (1989) <doi:10.1093/jnci/81.24.1879>. Marthew PB, Gail MH, et al (2016) <doi:10.1093/jnci/djw215>.
We use a Bayesian approach to run individual patient data meta-analysis and network meta-analysis using JAGS'. The methods incorporate shrinkage methods and calculate patient-specific treatment effects as described in Seo et al. (2021) <DOI:10.1002/sim.8859>. This package also includes user-friendly functions that impute missing data in an individual patient data using mice-related packages.
The framework of causal decomposition of group disparities developed by Yu and Elwert (2023) <doi:10.48550/arXiv.2306.16591>
. This package implements the decomposition estimators that are based on efficient influence functions. For the nuisance functions of the estimators, both parametric and nonparametric options are provided, as well as manual options in case the default models are not satisfying.