This package provides nearest-neighbors matching and analysis of case-control data. Cui, Z., Marder, E. P., Click, E. S., Hoekstra, R. M., & Bruce, B. B. (2022) <doi:10.1097/EDE.0000000000001504>.
This package provides a collection of general-purpose helper functions that I (and maybe others) find useful when developing data science software. Includes tools for simulation, data transformation, input validation, and more.
Algorithm of online regularized k-means to deal with online multi(single) view data. The philosophy of the package is described in Guo G. (2024) <doi:10.1016/j.ins.2024.121133>.
This package provides a toolbox for deterministic, probabilistic and privacy-preserving record linkage techniques. Combines the functionality of the Merge ToolBox
(<https://www.record-linkage.de>) with current privacy-preserving techniques.
An iterative feature selection method that internally utilizes various Machine Learning methods that have embedded feature reduction in order to shrink down the feature space into a small and yet robust set.
Perform a Bayesian estimation of the exploratory Sparse Latent Class Model for Binary Data described by Chen, Y., Culpepper, S. A., and Liang, F. (2020) <doi:10.1007/s11336-019-09693-2>.
The Sparse Marginal Epistasis Test is a computationally efficient genetics method which detects statistical epistasis in complex traits; see Stamp et al. (2025, <doi:10.1101/2025.01.11.632557>) for details.
An MCMC algorithm for simultaneous feature selection and classification, and visualization of the selected features and feature interactions. An implementation of SBFC by Krakovna, Du and Liu (2015), <arXiv:1506.02371>
.
We provide functions for computing the decision boundaries for pre-licensure vaccine trials using the Generalized Likelihood Ratio tests proposed by Shih, Lai, Heyse and Chen (2010, <doi:10.1002/sim.4036>).
Computes the Gaussian variational approximation of the Bayesian empirical likelihood posterior. This is an implementation of the function found in Yu, W., & Bondell, H. D. (2023) <doi:10.1080/01621459.2023.2169701>.
The MAIT package contains functions to perform end-to-end statistical analysis of LC/MS Metabolomic Data. Special emphasis is put on peak annotation and in modular function design of the functions.
This package enables you to read and manipulate genome intervals and signals. It provides functionality similar to command-line tool suites within R, enabling interactive analysis and visualization of genome-scale data.
This package provides a simple HTTP client, with tools for making HTTP requests, and mocking HTTP requests. The package is built on R6, and takes inspiration from Ruby's faraday
gem.
This package supports twin models that are able to estimate the dynamic behaviour of the variance components in the classical twin models with respect to age using B-splines and P-splines.
This package provides an easy and simple way to read, write and display bitmap images stored in the TIFF format. It can read and write both files and in-memory raw vectors.
This package provides functions for summarizing and plotting the output from Markov Chain Monte Carlo (MCMC) simulations, as well as diagnostic tests of convergence to the equilibrium distribution of the Markov chain.
Test Statistics for Independence in High-Dimensional Datasets. This package consists of two functions to perform the complete independence test based on test statistics proposed by Bulut (unpublished yet) and suggested by Najarzadeh (2021) <doi: 10.1080/03610926.2019.1702699>. The Bulut's statistic is not sensitive to outliers in high-dimensional data, unlike one of Najarzadeh (2021) <doi: 10.1080/03610926.2019.1702699>. So, the Bulut's statistic can be performed robustly by using RDnp function.
Handle climate data from the DWD ('Deutscher Wetterdienst', see <https://www.dwd.de/EN/climate_environment/cdc/cdc_node_en.html> for more information). Choose observational time series from meteorological stations with selectDWD()
'. Find raster data from radar and interpolation according to <https://bookdown.org/brry/rdwd/raster-data.html>. Download (multiple) data sets with progress bars and no re-downloads through dataDWD()
'. Read both tabular observational data and binary gridded datasets with readDWD()
'.
Detects copy number alteration events in targeted exon sequencing data for tumor samples without matched normal controls. The advantage of this method is that it can be applied to smaller sequencing panels including evaluations of exon, transcript, gene, or even user specified genetic regions of interest. Functions in the package include steps for GC-content correction, calculation of quantile based normal karyotype ranges, and calculation of feature score. Cutoffs for "normal" quantile and score are user-adjustable.
Existing adaptive design methods in clinical trials. The package includes power, stopping boundaries (sample size) calculation functions for two-group group sequential designs, adaptive design with coprimary endpoints, biomarker-informed adaptive design, etc.
This package implements the methodological developments found in Hermes, van Heerwaarden, and Behrouzi (2024) <doi:10.48550/arXiv.2408.10558>
, and allows for the statistical modeling of multi-attribute pairwise comparison data.
The Chinese ID number contains a lot of information, this package helps you get the region, date of birth, age, age based on year, gender, zodiac, constellation information from the Chinese ID number.
This package provides a set of user-friendly wrapper functions for creating consistent graphics and diagrams with lines, common shapes, text, and page settings. Compatible with and based on the R grid package.
Provide the EMU Speech Database Management System (EMU-SDMS) with database management, data extraction, data preparation and data visualization facilities. See <https://ips-lmu.github.io/The-EMU-SDMS-Manual/> for more details.