General-purpose MCMC and SMC samplers, as well as plot and diagnostic functions for Bayesian statistics, with a particular focus on calibrating complex system models. Implemented samplers include various Metropolis MCMC variants (including adaptive and/or delayed rejection MH), the T-walk, two differential evolution MCMCs, two DREAM MCMCs, and a sequential Monte Carlo (SMC) particle filter.
The encompassing test is developed based on multi-step-ahead predictions of two nested models as in Pitarakis, J. (2023) <doi:10.48550/arXiv.2312.16099>. The statistics are standardised to a normal distribution, and the null hypothesis is that the larger model contains no additional useful information. P-values will be provided in the output.
Interface to TensorFlow Probability, a Python library built on TensorFlow that makes it easy to combine probabilistic models and deep learning on modern hardware (TPU, GPU). TensorFlow Probability includes a wide selection of probability distributions and bijectors, probabilistic layers, variational inference, Markov chain Monte Carlo, and optimizers such as Nelder-Mead, BFGS, and SGLD.
Implementation of a simple algorithm designed for online multivariate changepoint detection of a mean in sparse changepoint settings. The algorithm is based on a modified CUSUM statistic and guarantees control of the type I error on any false discoveries, while featuring O(1) time and O(1) memory updates per series as well as a proven detection delay.
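As a rough illustration of how constant-time, constant-memory updates per series can work, the sketch below uses the textbook two-sided CUSUM recursion for a single series. It is not the modified statistic of the package described above and carries none of its error guarantees; the names `OnlineCusum`, `first_alarm`, `drift`, and `threshold` are hypothetical, and the pre-change mean is assumed known.

```python
from dataclasses import dataclass


@dataclass
class OnlineCusum:
    """Textbook two-sided CUSUM for a shift in mean (illustrative only)."""
    target_mean: float = 0.0   # assumed known pre-change mean
    drift: float = 0.5         # allowance subtracted at every step
    threshold: float = 5.0     # alarm level (hypothetical choice)
    upper: float = 0.0         # running statistic for upward shifts
    lower: float = 0.0         # running statistic for downward shifts

    def update(self, x: float) -> bool:
        """Consume one observation in O(1) time and memory; True means alarm."""
        deviation = x - self.target_mean
        self.upper = max(0.0, self.upper + deviation - self.drift)
        self.lower = max(0.0, self.lower - deviation - self.drift)
        return self.upper > self.threshold or self.lower > self.threshold


def first_alarm(series, **kwargs):
    """Return the index of the first alarm, or None if no alarm is raised."""
    detector = OnlineCusum(**kwargs)
    for index, value in enumerate(series):
        if detector.update(value):
            return index
    return None


# Mean jumps from 0 to 2 at index 20.
alarm_index = first_alarm([0.0] * 20 + [2.0] * 20)
```

For multivariate data, one such detector per series keeps the per-series cost constant; how the series are combined into a joint decision is where a method like the one above differs from this sketch.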
This is a pure dummy interface package that mirrors the MsSparkUtils APIs <https://learn.microsoft.com/en-us/azure/synapse-analytics/spark/microsoft-spark-utilities?pivots=programming-language-r> of Azure Synapse Analytics <https://learn.microsoft.com/en-us/azure/synapse-analytics/> for R users. Customers of Azure Synapse can download this package from CRAN for local development.
Measuring angles between points in a landscape is much easier than measuring distances. When the locations of three points are known, the position of the observer can be determined based solely on the angles between these points as seen by the observer. This task (known as triangulation), however, requires onerous calculations, which this package automates.
The package ABAEnrichment is designed to test for enrichment of user-defined candidate genes in the set of expressed genes in different human brain regions. The core function aba_enrich integrates the expression of the candidate gene set (averaged across donors) and the structural information of the brain using an ontology, both provided by the Allen Brain Atlas project.
The Docker registry server enables you to host your own Docker registry. It also comes with two other utilities:
The digest utility is a tool that generates checksums compatible with various Docker manifest files.
The registry-api-descriptor-template is a tool for generating API specifications from the docs/spec/api.md.tmpl file.
Repeating a math operator at the end of the broken line and at the start of the new line in inline equations is customary in Cyrillic mathematical typography (Russian, for example), but unfortunately LaTeX does not provide such an option. This package solves the problem by extending ideas described in M. I. Grinchuk's "TeX and Russian Traditions of Typesetting", and supports most LaTeX mathematical packages.
Re-analysis of human gene expression data generated on the Affymetrix HG_U133PlusV2 (EH176) and Affymetrix HG_U133A (EH177) platforms. The original data were normalized using robust multiarray averaging (RMA) to obtain an integrated gene expression atlas across diverse biological sample types and conditions. The compendia comprise 9395 arrays for EH176 and 5372 arrays for EH177.
This package provides functions that facilitate the analysis of SNP (single nucleotide polymorphism) data to answer questions regarding captive breeding and relatedness between individuals. dartR.captive is part of the dartRverse suite of packages. Gruber et al. (2018) <doi:10.1111/1755-0998.12745>. Mijangos et al. (2022) <doi:10.1111/2041-210X.13918>.
Computes exact p-values for multinomial goodness-of-fit tests based on multiple test statistics, namely, Pearson's chi-square, the log-likelihood ratio and the probability mass statistic. Implements the algorithm detailed in Resin (2023) <doi:10.1080/10618600.2022.2102026>. Estimates based on the classical asymptotic chi-square approximation or Monte Carlo simulation can also be computed.
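To make the notion of an exact p-value concrete, the sketch below enumerates every possible outcome of a small multinomial sample and sums the probabilities of the outcomes whose statistic is at least as extreme as the observed one. This brute-force enumeration is only an illustration and is not the algorithm of Resin (2023); all function names are hypothetical, the probability mass statistic is taken here as the negative log probability, and all null probabilities are assumed to be strictly positive.

```python
from math import factorial, isclose, log, prod


def compositions(total, parts):
    """Yield every tuple of `parts` non-negative integers summing to `total`."""
    if parts == 1:
        yield (total,)
        return
    for first in range(total + 1):
        for rest in compositions(total - first, parts - 1):
            yield (first,) + rest


def multinomial_pmf(counts, probs):
    """Probability of `counts` under a multinomial with cell probabilities `probs`."""
    coef = factorial(sum(counts))
    for c in counts:
        coef //= factorial(c)
    return coef * prod(p ** c for p, c in zip(probs, counts))


def pearson(counts, probs):
    n = sum(counts)
    return sum((c - n * p) ** 2 / (n * p) for c, p in zip(counts, probs))


def log_likelihood_ratio(counts, probs):
    n = sum(counts)
    return 2 * sum(c * log(c / (n * p)) for c, p in zip(counts, probs) if c > 0)


def probability_mass(counts, probs):
    # Less probable outcomes are more extreme, hence the negative log.
    return -log(multinomial_pmf(counts, probs))


def exact_p_value(observed, probs, statistic, tol=1e-9):
    """Sum the null probabilities of all outcomes at least as extreme as `observed`."""
    threshold = statistic(observed, probs) - tol  # tolerance guards float ties
    return sum(
        multinomial_pmf(outcome, probs)
        for outcome in compositions(sum(observed), len(probs))
        if statistic(outcome, probs) >= threshold
    )


# Three observations, all in the first of two equally likely cells.
p_pearson = exact_p_value((3, 0), (0.5, 0.5), pearson)
```

The number of outcomes grows quickly with the sample size and the number of cells, which is why a dedicated algorithm is needed for anything beyond toy examples.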
Perform calculations for the WHO International Reference Reagents for the microbiome. Using strain, species or genus abundance tables generated through analysis of 16S ribosomal RNA sequencing or shotgun sequencing that included a reference reagent, this package calculates measures of sensitivity, false positive relative abundance, diversity, and similarity based on mean average abundances with respect to the reference reagent.
This package provides a nonparametric, multicore-capable plausible naive Bayes classifier based on Pareto density estimation (PDE), featuring a plausible approach to a pitfall in Bayes' theorem covering low-evidence cases. Stier, Q., Hoffmann, J., and Thrun, M.C.: "Classifying with the Fine Structure of Distributions: Leveraging Distributional Information for Robust and Plausible Naïve Bayes" (2025).
This package provides the function shinyShortcut() that, when given the base directory of a shiny application, will produce an executable file that runs the shiny app directly in the user's default browser. Tested on both Windows and Unix machines. Inspired by and borrowing from <http://www.mango-solutions.com/wp/2017/03/shiny-based-tablet-or-desktop-app/>.
This package provides a tool to plot data with a large sample size using shiny and plotly. Relatively small samples are obtained from the original data using a specific algorithm. The samples are updated according to a user-defined x range. Jonas Van Der Donckt, Jeroen Van Der Donckt, Emiel Deprost (2022) <https://github.com/predict-idlab/plotly-resampler>.
Implementation of Small Area Estimation (SAE) using the Hierarchical Bayesian (HB) method when the auxiliary variable is measured with error under a Beta distribution. The rjags package is employed to obtain parameter estimates. For references, see Rao & Molina (2015) <doi:10.1002/9781118735855>, Ybarra and Lohr (2008) <doi:10.1093/biomet/asn048>, and Ntzoufras (2009, ISBN-10: 1118210352).
Trelliscope is a scalable, flexible, interactive approach to visualizing data (Hafen, 2013 <doi:10.1109/LDAV.2013.6675164>). This package provides methods that make it easy to create a Trelliscope display specification for TrelliscopeJS. High-level functions are provided for creating displays from within tidyverse or ggplot2 workflows. Low-level functions are also provided for creating new interfaces.
The sparse nature of single cell epigenomics data can be overcome using probabilistic modelling methods such as Latent Dirichlet Allocation (LDA). This package allows the probabilistic modelling of cis-regulatory topics (cisTopics) from single cell epigenomics data, and includes functionalities to identify cell states based on the contribution of cisTopics and to explore the nature and regulatory proteins driving them.
This package extends beachmat to support initialization of tatami matrices from HDF5-backed arrays. This allows C++ code in downstream packages to directly call the HDF5 C/C++ library to access array data, without the need for block processing via DelayedArray. Some utilities are also provided for direct creation of an in-memory tatami matrix from an HDF5 file.
Retrying is a general-purpose retrying library to simplify the task of adding retry behavior to just about anything.
Features:
Generic Decorator API.
Specify stop condition (e.g. limit by number of attempts).
Specify wait condition (e.g. exponential backoff sleeping between attempts).
Customize retrying on Exceptions.
Customize retrying on expected returned result.
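The features listed above can be sketched with a small standard-library decorator. This is a hypothetical illustration of the idea, not the Retrying library's actual API: the name `retry` and the parameters `max_attempts`, `base_delay`, `max_delay`, `retry_on_exception`, `retry_on_result`, and `sleep` are assumptions made for this example.

```python
import functools
import time


def retry(max_attempts=3, base_delay=0.1, max_delay=2.0,
          retry_on_exception=lambda exc: True,
          retry_on_result=lambda result: False,
          sleep=time.sleep):
    """Hypothetical retry decorator with a stop and a wait condition."""
    def decorator(func):
        @functools.wraps(func)
        def wrapper(*args, **kwargs):
            for attempt in range(1, max_attempts + 1):
                last_attempt = attempt == max_attempts
                try:
                    result = func(*args, **kwargs)
                except Exception as exc:
                    # Stop condition reached or exception not retryable: re-raise.
                    if last_attempt or not retry_on_exception(exc):
                        raise
                else:
                    # On the final attempt the result is returned as-is.
                    if last_attempt or not retry_on_result(result):
                        return result
                # Wait condition: exponential backoff, capped at max_delay.
                sleep(min(max_delay, base_delay * 2 ** (attempt - 1)))
        return wrapper
    return decorator


# Usage: record the delays instead of sleeping so the example runs instantly.
delays = []
calls = {"count": 0}


@retry(max_attempts=4, base_delay=0.5, sleep=delays.append)
def flaky():
    calls["count"] += 1
    if calls["count"] < 3:
        raise ConnectionError("transient failure")
    return "ok"


outcome = flaky()
```

Passing the sleep function as a parameter is a design choice made here for testability; in real use the default `time.sleep` applies.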
The Chromatograms package defines an efficient infrastructure for storing and handling chromatographic mass spectrometry data. It provides different implementations of *backends* to store and represent the data. Such backends can be optimized for small memory footprint or fast data access/processing. A lazy evaluation queue and chunk-wise processing capabilities ensure efficient analysis of even very large data sets.
Using a Gaussian copula approach, this package generates simulated data mimicking a target real dataset. It supports normal, Poisson, empirical, and DESeq2 (negative binomial with size factors) marginal distributions. It uses a low-rank plus diagonal covariance matrix to efficiently generate omics-scale data. Methods are described in: Yang, Grant, and Brooks (2025) <doi:10.1101/2025.01.31.634335>.
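A minimal sketch of the general idea, assuming empirical marginals only: latent Gaussian draws with covariance W W^T + D (low-rank loadings W plus diagonal noise D) are mapped to uniforms through the normal CDF, then to values of the target data through empirical quantiles. The function name `simulate_copula` and its parameters are hypothetical, and estimating W and D from real data is not shown; this is not the package's implementation.

```python
import random
from statistics import NormalDist


def simulate_copula(target_columns, loadings, noise_var, n_samples, seed=0):
    """Draw rows whose marginals follow the empirical target columns.

    target_columns: one list of observed values per feature
    loadings:       row j holds the low-rank loadings W[j] of feature j
    noise_var:      diagonal entries D[j] of the covariance
    """
    rng = random.Random(seed)
    std_normal = NormalDist()
    n_features = len(target_columns)
    rank = len(loadings[0])
    sorted_cols = [sorted(col) for col in target_columns]
    # Marginal standard deviation implied by Sigma = W W^T + D.
    scale = [
        (sum(w * w for w in loadings[j]) + noise_var[j]) ** 0.5
        for j in range(n_features)
    ]
    samples = []
    for _ in range(n_samples):
        # Only `rank` shared factors are drawn, never a full covariance matrix.
        factors = [rng.gauss(0.0, 1.0) for _ in range(rank)]
        row = []
        for j in range(n_features):
            z = sum(w * f for w, f in zip(loadings[j], factors))
            z += noise_var[j] ** 0.5 * rng.gauss(0.0, 1.0)
            u = std_normal.cdf(z / scale[j])          # uniform on (0, 1)
            col = sorted_cols[j]
            index = min(int(u * len(col)), len(col) - 1)
            row.append(col[index])                    # empirical quantile
        samples.append(row)
    return samples


# Two features driven by one shared factor and no independent noise.
target = [[3, 1, 4, 2], [30, 10, 40, 20]]
simulated = simulate_copula(target, loadings=[[1.0], [1.0]],
                            noise_var=[0.0, 0.0], n_samples=50)
```

Because each draw needs only the shared factors and one independent noise term per feature, the cost per sample is proportional to the number of features times the rank rather than to the square of the number of features.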
This package provides a simple interface to build designs using the package DeclareDesign. In one line of code, users can specify the parameters of individual designs and diagnose their properties. The designers can also be used to compare performance of a given design across a range of combinations of parameters, such as effect size, sample size, and assignment probabilities.