This package produces ANOVA tables in the format used by Judd, McClelland, and Ryan (2017, ISBN: 978-1138819832) in their introductory textbook, Data Analysis. This includes proportional reduction in error and formatting to improve ease the transition between the book and R.
Randomization of presence/absence species distribution raster data with or without including spatial structure for calculating standardized effect sizes and testing null hypothesis. The randomization algorithms are based on classical algorithms for matrices (Gotelli 2000, <doi:10.2307/177478>) implemented for raster data.
Streamline the process of accessing fundamental financial data from the United States Securities and Exchange Commission's ('SEC') Electronic Data Gathering, Analysis, and Retrieval system ('EDGAR') API <https://www.sec.gov/edgar/sec-api-documentation>, transforming it into a tidy, analysis-ready format.
Description: Implementation of topological data analysis methods based on graph-theoretic approaches for discovering topological structures in data. The core algorithm constructs topological spaces from graphs following Nada et al. (2018) <doi:10.1002/mma.4726> "New types of topological structures via graphs".
Create United Nations High Commissioner for Refugees (UNHCR) branded documents, presentations, and reports using R Markdown templates. This package provides customized formats that align with UNHCR's official brand guidelines for creating professional PDF reports, Word documents, PowerPoint presentations, and HTML outputs.
This package provides optimized functions and flexible combinatorial iterators implemented in C++ for solving problems in combinatorics and computational mathematics. It utilizes the RMatrix class from RcppParallel for thread safety. There are combination/permutation functions with constraint parameters that allow for generation of all results of a vector meeting specific criteria. It is capable of generating specific combinations/permutations which sets up nicely for parallelization as well as random sampling. Gmp support permits exploration where the total number of results is large. Additionally, there are several high performance number theoretic functions that are useful for problems common in computational mathematics.
This package provides tools for diagnosing the reproducibility of statistical model outputs under data perturbations. Implements bootstrap, subsampling, and noise-based perturbation schemes and computes coefficient stability, p-value stability, selection stability, prediction stability, and a composite reproducibility index on a 0 to 100 scale. Includes cross-validation ranking stability for model comparison and visualization utilities. Optional backends support robust M-estimation ('MASS') and penalized regression ('glmnet'). Bootstrap perturbation follows Efron and Tibshirani (1993, ISBN:9780412042317); selection stability follows Meinshausen and Buhlmann (2010) <doi:10.1111/j.1467-9868.2010.00740.x>; reproducibility framework follows Peng (2011) <doi:10.1126/science.1213847>.
The r-ggformula introduces a family of graphics functions, gf_point(), gf_density(), and so on, bring the formula interface to ggplot(). This captures and extends the excellent simplicity of the lattice-graphics formula interface, while providing the intuitive capabilities of r-ggplot2.
This package provides infrastructure for seriation with an implementation of several seriation/sequencing techniques to reorder matrices, dissimilarity matrices, and dendrograms. It also provides (optimally) reordered heatmaps, color images and clustering visualizations like dissimilarity plots, and visual assessment of cluster tendency plots (VAT and iVAT).
pass-rotate is a command line utility and python library for rotating passwords on various web services. It makes it easier to rotate your passwords, one at a time or in bulk, when security events or routine upkeep of your online accounts makes it necessary.
Implementation of a clustering method for time series gene expression data based on mixed-effects models with Gaussian variables and non-parametric cubic splines estimation. The method can robustly account for the high levels of noise present in typical gene expression time series datasets.
This package provides a fast scatterplot smoother based on B-splines with second-order difference penalty. Functions for microarray normalization of single-colour data i.e. Affymetrix/Illumina and two-colour data supplied as marray MarrayRaw-objects or limma RGList-objects are available.
Flexible multi-environment trials analysis via MCMC method for Additive Main Effects and Multiplicative Interaction Model (AMMI) for continuous data. Biplot with the averages and regions of confidence can be generated. The chains run in parallel on Linux systems and run serially on Windows.
This package performs efficient and scalable glm best subset selection using a novel implementation of a branch and bound algorithm. To speed up the model fitting process, a range of optimization methods are implemented in RcppArmadillo'. Parallel computation is available using OpenMP'.
This package contains data sets regarding songs on the Billboard Hot 100 list from 1960 to 2016. The data sets include the ranks for the given year, musical features of a lot of the songs and lyrics for several of the songs as well.
Designed to simplify the process of retrieving datasets from the Big Data PE platform using secure token-based authentication. It provides functions for securely storing, retrieving, and managing tokens associated with specific datasets, as well as fetching and processing data using the httr2 package.
Modelling of population growth under static and dynamic environmental conditions. Includes functions for model fitting and making prediction under isothermal and dynamic conditions. The methods (algorithms & models) are based on predictive microbiology (See Perez-Rodriguez and Valero (2012, ISBN:978-1-4614-5519-6)).
This package provides the "comma-free call" operator: %(%'. Use it to call a function without commas between the arguments. Just replace the ( with %(% in a function call, supply your arguments as standard R expressions enclosed by ', and be free of commas (for that call).
An interface to the fast_matrix_market C++ library, this package offers efficient read and write operations for Matrix Market files in R. It supports both sparse and dense matrix formats. Peer-reviewed at rOpenSci (<https://github.com/ropensci/software-review/issues/606>).
This package provides publication-ready volcano plots for visualizing differential expression results, commonly used in RNA-seq and similar analyses. This tool helps create high-quality visual representations of data using the ggplot2 framework Wickham (2016) <doi:10.1007/978-3-319-24277-4>.
LP nonparametric high-dimensional K-sample comparison method that includes (i) confirmatory test, (ii) exploratory analysis, and (iii) options to output a data-driven LP-transformed matrix for classification. The primary reference is Mukhopadhyay, S. and Wang, K. (2020, Biometrika); <arXiv:1810.01724>.
This package provides a system for accurately designing complex light regimes using LEDs. Takes calibration data and user-defined target irradiances and it tells you what intensities to use. For more details see Vong et al. (2025) <doi:10.1101/2025.06.06.658293>.
Implementation of the methodology of Aleshin-Guendel & Sadinle (2022) <doi:10.1080/01621459.2021.2013242>. It handles the general problem of multifile record linkage and duplicate detection, where any number of files are to be linked, and any of the files may have duplicates.
This package provides a modified function bic.glm of the BMA package that can be applied to multinomial logit (MNL) data. The data is converted to binary logit using the Begg & Gray approximation. The package also contains functions for maximum likelihood estimation of MNL.