This package provides a framework for multiple hypothesis testing based on distribution of p values. It is well known that the p values come from different distribution for null and alternatives, in this package we provide functions to detect that change. We provide a method for using the change in distribution of p values as a way to detect the true signals in the data.
An interface to the Apache OpenNLP
tools (version 1.5.3). The Apache OpenNLP
library is a machine learning based toolkit for the processing of natural language text written in Java. It supports the most common NLP tasks, such as tokenization, sentence segmentation, part-of-speech tagging, named entity extraction, chunking, parsing, and coreference resolution. See <https://opennlp.apache.org/> for more information.
Applies an objective Bayesian method to the Mb capture-recapture model to estimate the population size N. The Mb model is a class of capture-recapture methods used to account for variations in capture probability due to animal behavior. Under the Mb formulation, the initial capture of an animal may effect the probability of subsequent captures due to their becoming "trap happy" or "trap shy.".
Creating maps for statistical analysis such as proportional circles, choropleth, typology and flows. Some functions use shiny or leaflet technologies for dynamism and interactivity. The great features are : - Create maps in a web environment where the parameters are modifiable on the fly ('shiny and leaflet technologies). - Create interactive maps through zoom and pop-up ('leaflet technology). - Create frozen maps with the possibility to add labels.
Simple method of purging independent variables of mediating effects. First, regress the direct variable on the indirect variable. Then, used the stored residuals as the new purged (direct) variable in the updated specification. This purging process allows for use of a new direct variable uncorrelated with the indirect variable. Please cite the method and/or package using Waggoner, Philip D. (2018) <doi:10.1177/1532673X18759644>.
An implementation of self-exciting point process model for information cascades, which occurs when many people engage in the same acts after observing the actions of others (e.g. post resharings on Facebook or Twitter). It provides functions to estimate the infectiousness of an information cascade and predict its popularity given the observed history. See <http://snap.stanford.edu/seismic/> for more information and datasets.
Implementation of SAPEVO-M, a Group Ordinal Method for Multiple Criteria Decision-Making (MCDM). SAPEVO-M is an acronym for Simple Aggregation of Preferences Expressed by Ordinal Vectors Group Decision Making. This method provides alternatives ranking given decision makers preferences: criteria preferences and alternatives preferences for each criterion.This method is described in Gomes et al. (2020) <doi: 10.1590/0101-7438.2020.040.00226524 >.
This package provides tools to download data series from Banco de España ('BdE
') on tibble format. Banco de España is the national central bank and, within the framework of the Single Supervisory Mechanism ('SSM'), the supervisor of the Spanish banking system along with the European Central Bank. This package is in no way sponsored endorsed or administered by Banco de España'.
Computation of approximate potentials for both gradient and non gradient fields. It is known from physics that only gradient fields, also known as conservative, have a well defined potential function. Here we present an algorithm, based on the classical Helmholtz decomposition, to obtain an approximate potential function for non gradient fields. More information in Rodrà guez-Sánchez (2020) <doi:10.1371/journal.pcbi.1007788>.
EBImage provides general purpose functionality for image processing and analysis. In the context of (high-throughput) microscopy-based cellular assays, EBImage offers tools to segment cells and extract quantitative cellular descriptors. This allows the automation of such tasks using the R programming language and facilitates the use of other tools in the R environment for signal processing, statistical modeling, machine learning and visualization with image data.
Streamline use of the All of Us Researcher Workbench (<https://www.researchallofus.org/data-tools/workbench/>)with tools to extract and manipulate data from the All of Us database. Increase interoperability with the Observational Health Data Science and Informatics ('OHDSI') tool stack by decreasing reliance of All of Us tools and allowing for cohort creation via Atlas'. Improve reproducible and transparent research using All of Us'.
Interactive R tutorials written using learnr for Field (2016), "An Adventure in Statistics", <ISBN:9781446210451>. Topics include general workflow in R and Rstudio', the R environment and tidyverse', summarizing data, model fitting, central tendency, visualising data using ggplot2', inferential statistics and robust estimation, hypothesis testing, the general linear model, comparing means, repeated measures designs, factorial designs, multilevel models, growth models, and generalized linear models (logistic regression).
BEAST2 (<https://www.beast2.org>) is a widely used Bayesian phylogenetic tool, that uses DNA/RNA/protein data and many model priors to create a posterior of jointly estimated phylogenies and parameters. BEAST2 is commonly accompanied by BEAUti 2', Tracer and DensiTree
'. babette provides for an alternative workflow of using all these tools separately. This allows doing complex Bayesian phylogenetics easily and reproducibly from R'.
Markov chain Monte Carlo based inference routines for collapsed latent position cluster models or social networks, which includes searches over the model space (number of clusters in the latent position cluster model). The label switching algorithm used is that of Nobile and Fearnside (2007) <doi:10.1007/s11222-006-9014-7> which relies on the algorithm of Carpaneto and Toth (1980) <doi:10.1145/355873.355883>.
This package provides a very simple syntax for the user to generate custom plot(s) without having to remember complicated ggplot2 syntax. The chartql package uses ggplot2 and manages all the syntax complexities internally. As an example, to generate a bar chart of company sales faceted by product category further faceted by season of the year, we simply write: "CHART bar X category, season Y sales".
This package provides tools for accessing and analyzing eBird
Status and Trends Data Products (<https://science.ebird.org/en/status-and-trends>). eBird
(<https://ebird.org/home>) is a global database of bird observations collected by member of the public. eBird
Status and Trends uses these data to model global bird distributions, abundances, and population trends at a high spatial and temporal resolution.
The EpiSimR
package provides an interactive shiny app based on deterministic compartmental mathematical modeling for simulating and visualizing the dynamics of epidemic and endemic disease spread. It allows users to explore various intervention strategies, including vaccination and isolation, by adjusting key epidemiological parameters. The methodology follows the approach described by Brauer (2008) <doi:10.1007/978-3-540-78911-6_2>. Thanks to shiny package.
Fully robust versions of the elastic net estimator are introduced for linear and binary and multinomial regression, in particular high dimensional data. The algorithm searches for outlier free subsets on which the classical elastic net estimators can be applied. A reweighting step is added to improve the statistical efficiency of the proposed estimators. Selecting appropriate tuning parameters for elastic net penalties are done via cross-validation.
This package provides a framework for predicting retention times in liquid chromatography. Users can train custom models for specific chromatography columns, predict retention times using existing models, or adjust existing models to account for altered experimental conditions. The provided functionalities can be accessed either via the R console or via a graphical user interface. Related work: Bonini et al. (2020) <doi:10.1021/acs.analchem.9b05765>.
Point and interval estimation in dual frame surveys. In contrast to classic sampling theory, where only one sampling frame is considered, dual frame methodology assumes that there are two frames available for sampling and that, overall, they cover the entire target population. Then, two probability samples (one from each frame) are drawn and information collected is suitably combined to get estimators of the parameter of interest.
This package provides R functions to access the API of the project and repository management web application GitLab
'. For many common tasks (repository file access, issue assignment and status, commenting) convenience wrappers are provided, and in addition the full API can be used by specifying request locations. GitLab
is open-source software and can be self-hosted or used on <https://about.gitlab.com>.
The American Community Survey (ACS) <https://www.census.gov/programs-surveys/acs> offers geodatabases with geographic information and associated data of interest to researchers in the area. The goal of this package is to generate objects that allow us to access and consult the information available in various formats, such as in GeoPackage
format or in multidimensional ROLAP (Relational On-Line Analytical Processing) star format.
This package provides a suite of function-building tools centered around a (forward) composition operator, %>>>%, which extends the semantics of the magrittr %>% operator and supports Tidyverse quasiquotation. It enables you to construct composite functions that can be inspected and transformed as list-like objects. In conjunction with %>>>%, a compact function constructor, fn()
, and a partial-application constructor, partial()
, are also provided; both support quasiquotation.
This package provides functions and datasets to support Smilde, Marini, Westerhuis and Liland (2025, ISBN: 978-1-394-21121-0) "Analysis of Variance for High-Dimensional Data - Applications in Life, Food and Chemical Sciences". This implements and imports a collection of methods for HD-ANOVA data analysis with common interfaces, result- and plotting functions, multiple real data sets and four vignettes covering a range different applications.