Robust multivariate methods for high-dimensional data including outlier detection (Filzmoser and Todorov (2013) <doi:10.1016/j.ins.2012.10.017>), robust sparse PCA (Croux et al. (2013) <doi:10.1080/00401706.2012.727746>, Todorov and Filzmoser (2013) <doi:10.1007/978-3-642-33042-1_31>), robust PLS (Todorov and Filzmoser (2014) <doi:10.17713/ajs.v43i4.44>), and robust sparse classification (Ortner et al. (2020) <doi:10.1007/s10618-019-00666-8>).
Microarray Classification is designed for both biologists and statisticians. It offers the ability to train a classifier on a labelled microarray dataset and to then use that classifier to predict the class of new observations. A range of modern classifiers are available, including support vector machines (SVMs), nearest shrunken centroids (NSCs)... Advanced methods are provided to estimate the predictive error rate and to report the subset of genes which appear essential in discriminating between classes.
The IntCal20 radiocarbon calibration curves (Reimer et al. 2020 <doi:10.1017/RDC.2020.68>) are provided as a data package, together with previous IntCal curves (IntCal13, IntCal09, IntCal04, IntCal98), other curves (e.g., NOTCal04 [van der Plicht et al. 2004], Arnold & Libby 1951) and postbomb curves. Also provided are functions to copy the curves into memory, and to read, query and plot the data underlying the IntCal20 curves.
Reads Arena <https://www.arenasimulation.com/> CSV output files and generates nice tables and plots. The package contains a Shiny App that can be used to interactively visualize Arena's results.
This package provides a set of R functions and data sets for the book Introduction to Bayesian Statistics, Bolstad, W.M. (2017), John Wiley & Sons ISBN 978-1-118-09156-2.
This package provides functions for Bayesian data analysis, with datasets from the book "Bayesian Data Analysis (second edition)" by Gelman, Carlin, Stern and Rubin. Not all of the book's datasets are included yet; the authors hope to complete the set soon.
Implementation of an efficient BLAST-like sequence comparison algorithm, written in C++11 and using native R datatypes. Blaster is based on nsearch; see Schmid et al. (2018) <doi:10.1101/399782>.
OpenAI's ChatGPT <https://chat.openai.com/> coding assistant for RStudio. A set of functions and RStudio addins that aim to help the R developer with tedious coding tasks.
This package provides high-performance estimation of the Cox model when an event has more than one cause. It also supports random and fixed effects, tied events, and time-varying variables.
Core functions for simulating quantities of interest from generalised linear models (GLM). This package will form the backbone of a series of other packages that improve the interpretation of GLM estimates.
This package provides a collection of data-limited management procedures that can be tested through management strategy evaluation with the MSEtool package, or applied to fishery data to provide management recommendations.
This package provides functions that support stable prediction and classification with radiomics data through factor-analytic modeling. For details, see Peeters et al. (2019) <doi:10.48550/arXiv.1903.11696>.
Robust analysis using forward search in linear and generalized linear regression models, as described in Atkinson, A.C. and Riani, M. (2000), Robust Diagnostic Regression Analysis, First Edition. New York: Springer.
Automatically perform a reanalysis series on a data set using CNA, and calculate the fit-robustness of the resulting models, as described in Parkkinen and Baumgartner (2021) <doi:10.1177/0049124120986200>.
Processes data from The Social Networks and Fertility Survey, downloaded from <https://dataarchive.lissdata.nl>, including correcting respondent errors and transforming network data into network objects to facilitate analyses and visualisation.
Implementation of the routines from the author's PhD thesis on gradient-free Gradient Boosting (Werner, Tino (2020) "Gradient-Free Gradient Boosting", URL <https://oops.uni-oldenburg.de/id/eprint/4290>).
Data on baby names registered in Chile between 1920 and 2021, according to the Civil Registry Service (Servicio de Registro Civil).
This package provides a ggplot2 extension that adds specialised arrow geometry layers. It offers more arrow options than the standard grid arrows that are built into many line-based geom layers.
Convert the chip ID of GPL2025 <https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GPL2025> to GenBank Accession and ENTREZID <http://www.ncbi.nlm.nih.gov/gene>.
Functions, Shiny apps and data for the book "Introduction to Statistics" by Wolfgang Karl Härdle, Sigbert Klinke, and Bernd Rönz (2015) <doi:10.1007/978-3-319-17704-5>.
Empirical value of the Hellinger correlation, a measure of dependence between two continuous random variables. More details can be found in Geenens and Lafaye De Micheaux (2019) <arXiv:1810.10276v4>.
This package provides in-place operators for R that are equivalent to '+=', '-=', '*=', '/=' in C++. These can be applied to integer or double vectors and matrices. In-place sweep operations are also available.
This package provides functions and data sets to accompany the book Integrated Population Models: Theory and Ecological Applications with R and JAGS by Michael Schaub and Marc Kéry (ISBN: 9780128205648).
Implementation of a KL-based scoring rule to assess the quality of different missing value imputations in the broad sense as introduced in Michel et al. (2021) <arXiv:2106.03742>.