Potential outliers are identified for all combinations of a dataset's variables. O3 plots are described in Unwin(2019) <doi:10.1080/10618600.2019.1575226>. The available methods are HDoutliers()
from the package HDoutliers', FastPCS()
from the package FastPCS
', mvBACON()
from robustX
', adjOutlyingness()
from robustbase', DectectDeviatingCells()
from cellWise
', covMcd()
from robustbase'.
R functions to access provenance information collected by rdt or rdtLite
'. The information is stored inside a ProvInfo
object and can be accessed through a collection of functions that will return the requested data. The exact format of the JSON created by rdt and rdtLite
is described in <https://github.com/End-to-end-provenance/ExtendedProvJson>
.
This package provides a set of tools for determining the necessary sample size in order to identify the optimal dynamic treatment regime in a sequential, multiple assignment, randomized trial (SMART). Utilizes multiple comparisons with the best methodology to adjust for multiple comparisons. Designed for an arbitrary SMART design. Please see Artman (2018) <doi:10.1093/biostatistics/kxy064> for more details.
Estimates split-half reliabilities for scoring algorithms of cognitive tasks and questionnaires. The splithalfr supports researcher-provided scoring algorithms, with six vignettes illustrating how on included datasets. The package provides four splitting methods (first-second, odd-even, permutated, Monte Carlo), the option to stratify splits by task design, a number of reliability coefficients, and the option to sub-sample data.
Supplies AnnotationHub
with MassBank
metabolite/compound annotations bundled in CompDb
SQLite databases. CompDb
SQLite databases contain general compound annotation as well as fragment spectra representing fragmentation patterns of compounds ions. MassBank
data is retrieved from https://massbank.eu/MassBank
and processed using helper functions from the CompoundDb
Bioconductor package into redistributable SQLite databases.
The package provides statistical tools for detecting differentially abundant proteins in shotgun mass spectrometry-based proteomic experiments with tandem mass tag (TMT) labeling. It provides multiple functionalities, including aata visualization, protein quantification and normalization, and statistical modeling and inference. Furthermore, it is inter-operable with other data processing tools, such as Proteome Discoverer, MaxQuant
, OpenMS
and SpectroMine
.
This package vendors an assortment of useful header-only C++ libraries. Bioconductor packages can use these libraries in their own C++ code by LinkingTo
this package without introducing any additional dependencies. The use of a central repository avoids duplicate vendoring of libraries across multiple R packages, and enables better coordination of version updates across cohorts of interdependent C++ libraries.
This package provides a convenient way to analyze and visualize PICRUSt2 output with pre-defined plots and functions. It allows for generating statistical plots about microbiome functional predictions and offers customization options. It features a one-click option for creating publication-level plots, saving time and effort in producing professional-grade figures. It streamlines the PICRUSt2 analysis and visualization process.
This package provides a collection of functions helpful in learning the basic tenets of Bayesian statistical inference. It contains functions for summarizing basic one and two parameter posterior distributions and predictive distributions. It contains MCMC algorithms for summarizing posterior distributions defined by the user. It also contains functions for regression models, hierarchical models, Bayesian tests, and illustrations of Gibbs sampling.
The CommonMark specification defines a rationalized version of markdown syntax. This package uses the cmark
reference implementation for converting markdown text into various formats including HTML, LaTeX and groff man. In addition, it exposes the markdown parse tree in XML format. The latest version of this package also adds support for Github extensions including tables, autolinks and strikethrough text.
This package provides a simple approach to measure political sophistication based on open-ended survey responses. Discursive sophistication captures the complexity of individual attitude expression by quantifying its relative size, range, and constraint. For more information on the measurement approach see: Kraft, Patrick W. 2023. "Women Also Know Stuff: Challenging the Gender Gap in Political Sophistication." American Political Science Review (forthcoming).
It contains functions for dose calculation for different routes, fitting data to probability distributions, random number generation (Monte Carlo simulation) and calculation of systemic and carcinogenic risks. For more information see the publication: Barrio-Parra et al. (2019) "Human-health probabilistic risk assessment: the role of exposure factors in an urban garden scenario" <doi:10.1016/j.landurbplan.2019.02.005>.
Simplex optimization algorithms as firstly proposed by Spendley et al. (1962) <doi:10.1080/00401706.1962.10490033> and later modified by Nelder and Mead (1965) <doi:10.1093/comjnl/7.4.308> for laboratory and manufacturing processes. The package also provides tools for graphical representation of the simplexes and some example response surfaces that are useful in illustrating the optimization process.
This package provides tools used by organizational researchers for the analysis of multilevel data. Includes four broad sets of tools. First, functions for estimating within-group agreement and reliability indices. Second, functions for manipulating multilevel and longitudinal (panel) data. Third, simulations for estimating power and generating multilevel data. Fourth, miscellaneous functions for estimating reliability and performing simple calculations and data transformations.
This package provides a collection of methods for commonly undertaken analytical tasks, primarily developed for Public Health Scotland (PHS) analysts, but the package is also generally useful to others working in the healthcare space, particularly since it has functions for working with Community Health Index (CHI) numbers. The package can help to make data manipulation and analysis more efficient and reproducible.
Doubly-robust, non-parametric estimators for the transported average treatment effect from Rudolph, Williams, Stuart, and Diaz (2023) <doi:10.48550/arXiv.2304.00117>
and the intent-to-treatment average treatment effect from Rudolph and van der Laan (2017) <doi:10.1111/rssb.12213>. Estimators are fit using cross-fitting and nuisance parameters are estimated using the Super Learner algorithm.
Estimate and return either the traffic speed or the car entries in the city of Thessaloniki using historical traffic data. It's used in transport pilot of the BigDataEurope
project. There are functions for processing these data, training a neural network, select the most appropriate model and predict the traffic speed or the car entries for a selected time date.
This package provides a not uncommon task for quants is to create waterfall charts'. There seems to be no simple way to do this in ggplot2 currently. This package contains a single function (waterfall) that simply draws a waterfall chart in a ggplot2 object. Some flexibility is provided, though often the object created will need to be modified through a theme.
The wavelet and ANN technique have been combined to reduce the effect of data noise. This wavelet-ANN conjunction model is able to forecast time series data with better accuracy than the traditional time series model. This package fits hybrid Wavelet ANN model for time series forecasting using algorithm by Anjoy and Paul (2017) <DOI: 10.1007/s00521-017-3289-9>.
The CTexploreR
package re-defines the list of Cancer Testis/Germline (CT) genes. It is based on publicly available RNAseq databases (GTEx, CCLE and TCGA) and summarises CT genes main characteristics. Several visualisation functions allow to explore their expression in different types of tissues and cancer cells, or to inspect the methylation status of their promoters in normal tissues.
This package is a tool to predict the power of CyTOF
experiments in the context of differential state analyses. The package provides a shiny app with two options to predict the power of an experiment: i. generation of in-sicilico CyTOF
data, using users input ii. browsing in a grid of parameters for which the power was already precomputed.
This package builds on existing tools and adds some simple but extremely useful capabilities for working wth ChIP-Seq
data. The focus is on detecting differential binding windows/regions. One set of functions focusses on set-operations retaining mcols for GRanges objects, whilst another group of functions are to aid visualisation of results. Coercion to tibble objects is also implemented.
Psupertime is supervised pseudotime for single cell RNAseq data. It uses single cell RNAseq data, where the cells have a known ordering. This ordering helps to identify a small number of genes which place cells in that known order. It can be used for discovery of relevant genes, for identification of subpopulations, and characterization of further unknown or differently labelled data.
emacs-ranger
is a minor mode that runs within dired, it emulates many of ranger's features. This minor mode shows a stack of parent directories, and updates the parent buffers, while you're navigating the file system. The preview window takes some of the ideas from Peep-Dired, to display previews for the selected files, in the primary dired buffer.