This package provides methods for visualizing large multivariate datasets using static and interactive scatterplot matrices, parallel coordinate plots, volcano plots, and litre plots. It includes examples for visualizing RNA-sequencing datasets and differentially expressed genes.
This package provides functions to produce rudimentary ASCII graphics directly in the terminal window. This package provides a basic plotting function (and equivalents of curve, density, acf and barplot) as well as a boxplot function.
Statistical and biological validation of clustering results. This package implements Dunn Index, Silhouette, Connectivity, Stability, BHI and BSI. Further information can be found in Brock, G et al. (2008) <doi: 10.18637/jss.v025.i04>.
Lp_solve is software for solving linear, integer and mixed integer programs. This implementation supplies a "wrapper" function in C and some R functions that solve general linear/integer problems, assignment problems, and transportation problems.
RQ (Redis Queue) is a simple Python library for queueing jobs and processing them in the background with workers. It is backed by Redis and it is designed to have a low barrier to entry.
WWTD is a Travis Simulator that lets you run test matrices defined in .travis.yml on your local machine, using rvm, rbenv, or chruby to test different versions of Ruby.
Retime speech signals with a native Waveform Similarity Overlap-Add (WSOLA) implementation translated from the TSM toolbox by Driedger & Müller (2014) <https://www.audiolabs-erlangen.de/content/resources/MIR/TSMtoolbox/2014_DriedgerMueller_TSM-Toolbox_DAFX.pdf>. Design retimings and pitch (f0) transformations with tidy data and apply them via Praat interface. Produce spectrograms, spectra, and amplitude envelopes. Includes implementation of vocalic speech envelope analysis (fft_spectrum) technique and example data (mm1) from Tilsen, S., & Johnson, K. (2008) <doi:10.1121/1.2947626>.
Ridge regression due to Hoerl and Kennard (1970)<DOI:10.1080/00401706.1970.10488634> and generalized ridge regression due to Yang and Emura (2017)<DOI:10.1080/03610918.2016.1193195> with optimized tuning parameters. These ridge regression estimators (the HK estimator and the YE estimator) are computed by minimizing the cross-validated mean squared errors. Both the ridge and generalized ridge estimators are applicable for high-dimensional regressors (p>n), where p is the number of regressors, and n is the sample size.
This package performs all steps in the credit scoring process. This package allows the user to follow all the necessary steps for building an effective scorecard. It provides the user functions for coarse binning of variables, Weights of Evidence (WOE) transformation, variable clustering, custom binning, visualization, and scaling of logistic regression coefficients. The results will generate a scorecard that can be used as an effective credit scoring tool to evaluate risk. For complete details on the credit scoring process, see Siddiqi (2005, ISBN:047175451X).
This package provides a collection of methods for estimating the basic reproduction number (R0) of infectious diseases. Features a web application to interface with the estimators. Uses the models from: Fisman et al. (2013) <DOI:10.1371/journal.pone.0083622>, Bettencourt and Ribeiro (2008) <DOI:10.1371/journal.pone.0002185>, and White and Pagano (2008) <DOI:10.1002/sim.3136>. Includes datasets for Canadian national and provincial COVID-19 case counts provided by Berry et al. (2021) <DOI:10.1038/s41597-021-00955-2>.
The regression discontinuity (RD) design is a popular quasi-experimental design for causal inference and policy evaluation. The rdpower package provides tools to perform power, sample size and MDE calculations in RD designs: rdpower() calculates the power of an RD design, rdsampsi() calculates the required sample size to achieve a desired power and rdmde() calculates minimum detectable effects. See Cattaneo, Titiunik and Vazquez-Bare (2019) <https://rdpackages.github.io/references/Cattaneo-Titiunik-VazquezBare_2019_Stata.pdf> for further methodological details.
Non-linear transformations of data to better discover latent effects. Applies a sequence of three transformations (1) a Gaussianizing transformation, (2) a Z-score transformation, and (3) an outlier removal transformation. A publication describing the method has the following citation: Gregory J. Hunt, Mark A. Dane, James E. Korkola, Laura M. Heiser & Johann A. Gagnon-Bartsch (2020) "Automatic Transformation and Integration to Improve Visualization and Discovery of Latent Effects in Imaging Data", Journal of Computational and Graphical Statistics, <doi:10.1080/10618600.2020.1741379>.
Returns dynamic fit index (DFI) cutoffs for latent variable models that are tailored to the user's model statement, model type, and sample size. This is the counterpart of the Shiny Application, <https://dynamicfit.app>.
An extension to the DPQ package with computations for DPQ (Density (pdf), Probability (cdf) and Quantile) functions, where the functions here partly use the Rmpfr package and hence the underlying MPFR and GMP C libraries.
End-member modelling analysis of grain-size data is an approach to unmix a data set's underlying distributions and their contribution to the data set. EMMAgeo provides deterministic and robust protocols for that purpose.
Generate citations and references for R packages from CRAN or Bioconductor. Supports RIS and BibTeX formats with automatic DOI retrieval from GitHub repositories and published papers. Includes command-line interface for batch processing.
This package provides an efficient C++ code for computing an optimal segmentation model with Poisson loss, up-down constraints, and label constraints, as described by Kaufman et al. (2024) <doi:10.1080/10618600.2023.2293216>.
Efficient computation of the Liu regression coefficient paths, Liu-related statistics and information criteria for a grid of the regularization parameter. The computations are based on the C++ library Armadillo through the R package Rcpp'.
YACFP (Yet Another Convenience Function Package). get_age() is a fast & accurate tool for measuring fractional years between two dates. stale_package_check() tries to identify any library() calls to unused packages.
Allows maximum likelihood fitting of cluster-weighted models, a class of mixtures of regression models with random covariates. Methods are described in Angelo Mazza, Antonio Punzo, Salvatore Ingrassia (2018) <doi:10.18637/jss.v086.i02>.
This package provides a ggplot2 extension for visualizing Chinaâ s map, offering customizable projections, boundary styles, and buffer zones for thematic maps. Suitable for spatial data analysis and enhancing map visualization with flexible styling options.
This package provides functions for downloading of geographic data for use in spatial analysis and mapping. The package facilitates access to climate, crops, elevation, land use, soil, species occurrence, accessibility, administrative boundaries and other data.
Estimates the parameters of a GARCH-X model with exogenous covariates, performs hypothesis tests for the parameters returning the p-values, and uses False Discovery Rate p-value corrections to select the exogenous variables.
This package provides a collection of tools which extract a model documentation from GAMS code and comments. In order to use the package you need to install pandoc and pandoc-citeproc first (<https://pandoc.org/>).