Robust multivariate methods for high dimensional data including outlier detection (Filzmoser and Todorov (2013) <doi:10.1016/j.ins.2012.10.017>), robust sparse PCA (Croux et al. (2013) <doi:10.1080/00401706.2012.727746>, Todorov and Filzmoser (2013) <doi:10.1007/978-3-642-33042-1_31>), robust PLS (Todorov and Filzmoser (2014) <doi:10.17713/ajs.v43i4.44>), and robust sparse classification (Ortner et al. (2020) <doi:10.1007/s10618-019-00666-8>).
The IntCal20 radiocarbon calibration curves (Reimer et al. 2020 <doi:10.1017/RDC.2020.68>) are provided as a data package, together with previous IntCal curves (IntCal13, IntCal09, IntCal04, IntCal98), other curves (e.g., NOTCal04 [van der Plicht et al. 2004], Arnold & Libby 1951) and postbomb curves. Also provided are functions to copy the curves into memory, and to read, query and plot the data underlying the IntCal20 curves.
Reads Arena <https://www.arenasimulation.com/> CSV output files and generates nice tables and plots. The package contains a Shiny App that can be used to interactively visualize Arena's results.
This package provides functions for Bayesian data analysis, with datasets from the book "Bayesian Data Analysis (second edition)" by Gelman, Carlin, Stern and Rubin. Not all datasets are included yet; the remainder are expected to be added later.
Implementation of an efficient BLAST-like sequence comparison algorithm, written in C++11 and using native R datatypes. Blaster is based on nsearch (Schmid et al. (2018) <doi:10.1101/399782>).
This package provides a set of R functions and data sets for the book Introduction to Bayesian Statistics, Bolstad, W.M. (2017), John Wiley & Sons ISBN 978-1-118-09156-2.
Core functions for simulating quantities of interest from generalised linear models (GLM). This package will form the backbone of a series of other packages that improve the interpretation of GLM estimates.
This package provides high-performance estimation of the Cox model when an event has more than one cause. It also supports random and fixed effects, tied events, and time-varying variables.
OpenAI's ChatGPT <https://chat.openai.com/> coding assistant for RStudio. A set of functions and RStudio addins that aim to help the R developer with tedious coding tasks.
This package provides a collection of data-limited management procedures that can be tested through management strategy evaluation with the MSEtool package, or applied to fishery data to provide management recommendations.
Robust analysis using forward search in linear and generalized linear regression models, as described in Atkinson, A.C. and Riani, M. (2000), Robust Diagnostic Regression Analysis, First Edition. New York: Springer.
Processes data from The Social Networks and Fertility Survey, downloaded from <https://dataarchive.lissdata.nl>, including correcting respondent errors and transforming network data into network objects to facilitate analyses and visualisation.
Automatically perform a reanalysis series on a data set using CNA, and calculate the fit-robustness of the resulting models, as described in Parkkinen and Baumgartner (2021) <doi:10.1177/0049124120986200>.
Chilean baby names registered from 1920 to 2021, according to the Civil Registry Service (Servicio de Registro Civil).
This package provides a ggplot2 extension that adds specialised arrow geometry layers. It offers more arrow options than the standard grid arrows that are built into many line-based geom layers.
Implementation of routines of the author's PhD thesis on gradient-free Gradient Boosting (Werner, Tino (2020) "Gradient-Free Gradient Boosting", URL <https://oops.uni-oldenburg.de/id/eprint/4290>).
Convert the chip ID of GPL2025 <https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GPL2025> to GenBank Accession and ENTREZID <http://www.ncbi.nlm.nih.gov/gene>.
Functions, Shiny apps and data for the book "Introduction to Statistics" by Wolfgang Karl Härdle, Sigbert Klinke, and Bernd Rönz (2015) <doi:10.1007/978-3-319-17704-5>.
Empirical value of the Hellinger correlation, a measure of dependence between two continuous random variables. More details can be found in Geenens and Lafaye De Micheaux (2019) <arXiv:1810.10276v4>.
This package provides functions and data sets to accompany the book Integrated Population Models: Theory and Ecological Applications with R and JAGS by Michael Schaub and Marc Kéry (ISBN: 9780128205648).
This package provides in-place operators for R that are equivalent to +=, -=, *=, /= in C++. They can be applied to integer or double vectors and matrices. In-place sweep operations are also available.
Implementation of a KL-based scoring rule to assess the quality of different missing value imputations in the broad sense as introduced in Michel et al. (2021) <arXiv:2106.03742>.
This package provides flexible but lightweight logging facilities for R scripts. Supports priority levels for logs and messages, flagging messages, capturing script output, switching logs, and logging to files or connections.
Routines for assessing multivariate normality. Implements three Wald-type chi-squared tests; non-parametric Anderson-Darling and Cramer-von Mises tests; and the Doornik-Hansen, Royston and Henze-Zirkler tests.