Regularized version of partial least square approaches providing sparse, group, and sparse group versions of partial least square regression models (Liquet, B., Lafaye de Micheaux, P., Hejblum B., Thiebaut, R. (2016) <doi:10.1093/bioinformatics/btv535>). Version of PLS Discriminant analysis is also provided.
Accompanies the book Rainer Schlittgen and Cristina Sattarhoff (2020) <https://www.degruyter.com/view/title/575978> "Angewandte Zeitreihenanalyse mit R, 4. Auflage" . The package contains the time series and functions used therein. It was developed over many years teaching courses about time series analysis.
This package provides a simple type annotation for R that is usable in scripts, in the R console and in packages. It is intended as a convention to allow other packages to use the type information to provide error checking, automatic documentation or optimizations.
Apache Arrow is a cross-language development platform for in-memory data. It specifies a standardized language-independent columnar memory format for flat and hierarchical data, organized for efficient analytic operations on modern hardware. This package provides an R interface to the Arrow C++ library.
Project Raincat is a game developed by Carnegie Mellon students through GCS during the Fall 2008 semester. Raincat features game play inspired from classics Lemmings and The Incredible Machine. The project proved to be an excellent learning experience for the programmers. Everything is programmed in Haskell.
Fit (exponential or diffusion) response-time extended multinomial processing tree (RT-MPT) models by Klauer and Kellen (2018) <doi:10.1016/j.jmp.2017.12.003> and Klauer, Hartmann, and Meyer-Grant (submitted). The RT-MPT class not only incorporate frequencies like traditional multinomial processing tree (MPT) models, but also latencies. This enables it to estimate process completion times and encoding plus motor execution times next to the process probabilities of traditional MPTs. rtmpt is a hierarchical Bayesian framework and posterior samples are sampled using a Metropolis-within-Gibbs sampler (for exponential RT-MPTs) or Hamiltonian-within-Gibbs sampler (for diffusion RT-MPTs).
With this package we provide an easy method to compute robust and conditional Data Envelopment Analysis (DEA), Free Disposal Hull (FDH) and Benefit of the Doubt (BOD) scores. The robust approach is based on the work of Cazals, Florens and Simar (2002) <doi:10.1016/S0304-4076(01)00080-X>. The conditional approach is based on Daraio and Simar (2007) <doi:10.1007/s11123-007-0049-3>. Besides we provide graphs to help with the choice of m. We relay on the Benchmarking package to compute the efficiency scores and on the np package to compute non parametric estimation of similarity among units.
Estimates Pareto-optimal solution for personnel selection with 3 objectives using Normal Boundary Intersection (NBI) algorithm introduced by Das and Dennis (1998) <doi:10.1137/S1052623496307510>. Takes predictor intercorrelations and predictor-objective relations as input and generates a series of solutions containing predictor weights as output. Accepts between 3 and 10 selection predictors. Maximum 2 objectives could be adverse impact objectives. Partially modeled after De Corte (2006) TROFSS Fortran program <https://users.ugent.be/~wdecorte/trofss.pdf> and updated from ParetoR
package described in Song et al. (2017) <doi:10.1037/apl0000240>. For details, see Study 3 of Zhang et al. (2023).
This package contains functions to retrieve, organize, and visualize weather data from the NCEP/NCAR Reanalysis (<https://psl.noaa.gov/data/gridded/data.ncep.reanalysis.html>) and NCEP/DOE Reanalysis II (<https://psl.noaa.gov/data/gridded/data.ncep.reanalysis2.html>) datasets. Data are queried via the Internet and may be obtained for a specified spatial and temporal extent or interpolated to a point in space and time. We also provide functions to visualize these weather data on a map. There are also functions to simulate flight trajectories according to specified behavior using either NCEP wind data or data specified by the user.
Utility functions to retrieve data from the UK National River Flow Archive (<https://nrfa.ceh.ac.uk/>, terms and conditions: <https://nrfa.ceh.ac.uk/costs-terms-and-conditions>). The package contains R wrappers to the UK NRFA data temporary-API. There are functions to retrieve stations falling in a bounding box, to generate a map and extracting time series and general information. The package is fully described in Vitolo et al (2016) "rnrfa: An R package to Retrieve, Filter and Visualize Data from the UK National River Flow Archive" <https://journal.r-project.org/archive/2016/RJ-2016-036/RJ-2016-036.pdf>.
Sensitivity (or recall or true positive rate), false positive rate, specificity, precision (or positive predictive value), negative predictive value, misclassification rate, accuracy, F-score---these are popular metrics for assessing performance of binary classifiers for certain thresholds. These metrics are calculated at certain threshold values. Receiver operating characteristic (ROC) curve is a common tool for assessing overall diagnostic ability of the binary classifier. Unlike depending on a certain threshold, area under ROC curve (also known as AUC), is a summary statistic about how well a binary classifier performs overall for the classification task. The ROCit package provides flexibility to easily evaluate threshold-bound metrics.
This package provides functions required to classify subjects within camera trap field data. The package can handle both images and videos. The authors recommend a two-step approach using Microsoft's MegaDector
model and then a second model trained on the classes of interest.
Computing and visualizing comparative asymptotic timings of different algorithms and code versions. Also includes functionality for comparing empirical timings with expected references such as linear or quadratic, <https://en.wikipedia.org/wiki/Asymptotic_computational_complexity> Also includes functionality for measuring asymptotic memory and other quantities.
This package provides an interface to the algorithm selection benchmark library at <http://www.aslib.net> and the LLAMA package (<https://cran.r-project.org/package=llama>) for building algorithm selection models; see Bischl et al. (2016) <doi:10.1016/j.artint.2016.04.003>.
This package provides tools designed to make it easier for beginner and intermediate users to build and validate binary logistic regression models. Includes bivariate analysis, comprehensive regression output, model fit statistics, variable selection procedures, model validation techniques and a shiny app for interactive model building.
This package provides a bootstrap-based approach to integrate multiple forms of high dimensional genomic data with multiple clinical endpoints. This method is used to find clinically meaningful groups of genomic features, such as genes or pathways. A manuscript describing this method is in preparation.
Implementation of the CNAIM standard in R. Contains a series of algorithms which determine the probability of failure, consequences of failure and monetary risk associated with electricity distribution companies assets such as transformers and cables. Results are visualized in an easy-to-understand risk matrix.
Includes various functions for playing drum sounds. beat()
plays a drum sound from one of the six included drum kits. tempo()
sets spacing between calls to beat()
in bpm. Together the two functions can be used to create many different drum patterns.
Computes the double bootstrap as discussed in McKnight
, McKean
, and Huitema (2000) <doi:10.1037/1082-989X.5.1.87>. The double bootstrap method provides a better fit for a linear model with autoregressive errors than ARIMA when the sample size is small.
An R interface to the codediff JavaScript
library (a copy of which is included in the package, see <https://github.com/danvk/codediff.js> for information). Allows for visualization of the difference between 2 files, usually text files or R scripts, in a browser.
FDR functions for permutation-based estimators, including pi0 as well as FDR confidence intervals. The confidence intervals account for dependencies between tests by the incorporation of an overdispersion parameter, which is estimated from the permuted data. Also included are options for an analog parametric approach.
Includes several statistical methods for the estimation of parameters and high quantiles of river flow distributions. The focus is on regional estimation based on homogeneity assumptions and computed from multivariate observations (multiple measurement stations). For details see Kinsvater et al. (2017) <arXiv:1701.06455>
.
This package provides a model-independent factor importance ranking and selection procedure based on total Sobol indices. Please see Huang and Joseph (2025) <doi:10.1080/00401706.2025.2483531>. This research is supported by U.S. National Science Foundation grants DMS-2310637 and DMREF-1921873.
Performing the different steps of gene set enrichment meta-analysis. It provides different functions that allow the application of meta-analysis based on the combination of effect sizes from different pathways in different studies to obtain significant pathways that are common to all of them.