This implements the Brunton et al (2016; PNAS <doi:10.1073/pnas.1517384113>) sparse identification algorithm for finding ordinary differential equations for a measured system from raw data (SINDy). The package includes a set of additional tools for working with raw data, with an emphasis on cognitive science applications (Dale and Bhat, 2018 <doi:10.1016/j.cogsys.2018.06.020>). See <https://github.com/racdale/sindyr> for examples and updates.
Uses indicator species scores across binary partitions of a sample set to detect congruence in taxon-specific changes of abundance and occurrence frequency along an environmental gradient as evidence of an ecological community threshold. Relevant references include Baker and King (2010) <doi:10.1111/j.2041-210X.2009.00007.x>, King and Baker (2010) <doi:10.1899/09-144.1>, and Baker and King (2013) <doi:10.1899/12-142.1>.
Infer biological pathway activity of cells from single-cell RNA-sequencing data by calculating a pathway score for each cell (pathway genes are specified by the user). It is recommended to have the data in Transcripts-Per-Million (TPM) or Counts-Per-Million (CPM) units for best results. Scores may change when adding cells to or removing cells off the data. SiPSiC
stands for Single Pathway analysis in Single Cells.
This package provides Ion Trap positive ionization mode data in mzML file format. It includes a subset from 500-850 m/z and 1190-1310 seconds, including MS2 and MS3, intensity threshold 100.000; extracts from FTICR Apex III, m/z 400-450; a subset of UPLC - Bruker micrOTOFq data, both mzML and mz5; LC-MSMS and MRM files from proteomics experiments; and PSI mzIdentML example files for various search engines.
This package provides the timing functions tic
and toc
that can be nested. One can record all timings while a complex script is running, and examine the values later. It is also possible to instrument the timing call with custom callbacks. In addition, this package provides class 'Stack', implemented as a vector, and class 'List', implemented as a list, both of whic support operations 'push', 'pop', 'first', 'last' and 'clear'.
Waffle plots are rectangular pie charts that represent a quantity or abundances using colored squares or other symbol. This makes them better at transmitting information as the discrete number of squares is easier to read than the circular area of pie charts. While the original waffle charts were rectangular with 10 rows and columns, with a single square representing 1%, they are nowadays popular in various infographics to visualize any proportional ratios.
Implementation of the bisque strategy for approximate Bayesian posterior inference. See Hewitt and Hoeting (2019) <arXiv:1904.07270>
for complete details. bisque combines conditioning with sparse grid quadrature rules to approximate marginal posterior quantities of hierarchical Bayesian models. The resulting approximations are computationally efficient for many hierarchical Bayesian models. The bisque package allows approximate posterior inference for custom models; users only need to specify the conditional densities required for the approximation.
This package provides constrained triangulation of polygons. Ear cutting (or ear clipping) applies constrained triangulation by successively cutting triangles from a polygon defined by path/s. Holes are supported by introducing a bridge segment between polygon paths. This package wraps the header-only library earcut.hpp <https://github.com/mapbox/earcut.hpp.git> which includes a reference to the method used by Held, M. (2001) <doi:10.1007/s00453-001-0028-4>.
Introduces a Copilot'-like completion experience, but it knows how to talk to the objects in your R environment. ellmer chats are integrated directly into your RStudio and Positron sessions, automatically incorporating relevant context from surrounding lines of code and your global environment (like data frame columns and types). Open the package dialog box with a keyboard shortcut, type your request, and the assistant will stream its response directly into your documents.
Fits a Gaussian process model to data. Gaussian processes are commonly used in computer experiments to fit an interpolating model. The model is stored as an R6 object and can be easily updated with new data. There are options to run in parallel, and Rcpp has been used to speed up calculations. For more info about Gaussian process software, see Erickson et al. (2018) <doi:10.1016/j.ejor.2017.10.002>.
This package implements Individual Conditional Expectation (ICE) plots, a tool for visualizing the model estimated by any supervised learning algorithm. ICE plots refine Friedman's partial dependence plot by graphing the functional relationship between the predicted response and a covariate of interest for individual observations. Specifically, ICE plots highlight the variation in the fitted values across the range of a covariate of interest, suggesting where and to what extent they may exist.
Implementing the interventional effects for mediation analysis for up to 3 mediators. The methods used are based on VanderWeele
, Vansteelandt and Robins (2014) <doi:10.1097/ede.0000000000000034>, Vansteelandt and Daniel (2017) <doi:10.1097/ede.0000000000000596> and Chan and Leung (2020; unpublished manuscript, available on request from the author of this package). Linear regression, logistic regression and Poisson regression are used for continuous, binary and count mediator/outcome variables respectively.
Maximum likelihood estimates (MLE) of the proportions of 5-mC
and 5-hmC
in the DNA using information from BS-conversion, TAB-conversion, and oxBS-conversion
methods. One can use information from all three methods or any combination of two of them. Estimates are based on Binomial model by Qu et al. (2013) <doi:10.1093/bioinformatics/btt459> and Kiihl et al. (2019) <doi:10.1515/sagmb-2018-0031>.
This package provides tools for loading and processing passive acoustic data. Read in data that has been processed in Pamguard (<https://www.pamguard.org/>), apply a suite processing functions, and export data for reports or external modeling tools. Parameter calculations implement methods by Oswald et al (2007) <doi:10.1121/1.2743157>, Griffiths et al (2020) <doi:10.1121/10.0001229> and Baumann-Pickering et al (2010) <doi:10.1121/1.3479549>.
Sonification (or audification) is the process of representing data by sounds in the audible range. This package provides the R function sonify()
that transforms univariate data, sampled at regular or irregular intervals, into a continuous sound with time-varying frequency. The ups and downs in frequency represent the ups and downs in the data. Sonify provides a substitute for R's plot function to simplify data analysis for the visually impaired.
Provide model averaging-based approaches that can be used to predict personalized survival probabilities. The key underlying idea is to approximate the conditional survival function using a weighted average of multiple candidate models. Two scenarios of candidate models are allowed: (Scenario 1) partial linear Cox model and (Scenario 2) time-varying coefficient Cox model. A reference of the underlying methods is Li and Wang (2023) <doi:10.1016/j.csda.2023.107759>.
Classical methods for combining summary data from genome-wide association studies (GWAS) only use marginal genetic effects and power can be compromised in the presence of heterogeneity. subgxe is a R package that implements p-value assisted subset testing for association (pASTA
), a method developed by Yu et al. (2019) <doi:10.1159/000496867>. pASTA
generalizes association analysis based on subsets by incorporating gene-environment interactions into the testing procedure.
This package provides methods for spatial and spatio-temporal smoothing of demographic and health indicators using survey data, with particular focus on estimating and projecting under-five mortality rates, described in Mercer et al. (2015) <doi:10.1214/15-AOAS872>, Li et al. (2019) <doi:10.1371/journal.pone.0210645>, Wu et al. (DHS Spatial Analysis Reports No. 21, 2021), and Li et al. (2023) <doi:10.48550/arXiv.2007.05117>
.
This package provides a set of commonly used distance measures and some additional functions which, although initially not designed for this purpose, can be used to measure the dissimilarity between time series. These measures can be used to perform clustering, classification or other data mining tasks which require the definition of a distance measure between time series. U. Mori, A. Mendiburu and J.A. Lozano (2016), <doi:10.32614/RJ-2016-058>.
This is an alternative mechanism for importing objects from packages. The syntax allows for importing multiple objects from a package with a single command in an expressive way. The import package bridges some of the gap between using library
(or require
) and direct (single-object) imports. Furthermore the imported objects are not placed in the current environment. It is also possible to import objects from stand-alone .R
files.
This package implements core utilities for single-cell RNA-seq data analysis. Contained within are utility functions for working with DE matrices and count matrices, a collection of functions for manipulating and plotting data via ggplot2, and functions to work with cell graphs and cell embeddings. Graph-based methods include embedding kNN cell graphs into a UMAP, collapsing vertices of each cluster in the graph, and propagating graph labels.
Circle Manhattan Plot is an R package that can lay out genome-wide association study P-value results in both traditional rectangular patterns, QQ-plot and novel circular ones. United in only one bull's eye style plot, association results from multiple traits can be compared interactively, thereby to reveal both similarities and differences between signals. Additional functions include: highlight signals, a group of SNPs, chromosome visualization and candidate genes around SNPs.
This package provides a straightforward model to estimate soil migration rates across various soil contexts. Based on the compartmental, vertically-resolved, physically-based mass balance model of Soto and Navas (2004) <doi:10.1016/j.jaridenv.2004.02.003> and Soto and Navas (2008) <doi:10.1016/j.radmeas.2008.02.024>. RadEro
provides a user-friendly interface in R, utilizing input data such as 137Cs inventories and parameters directly derived from soil samples (e.g., fine fraction density, effective volume) to accurately capture the 137Cs distribution within the soil profile. The model simulates annual 137Cs fallout, radioactive decay, and vertical diffusion, with the diffusion coefficient calculated from 137Cs reference inventory profiles. Additionally, it allows users to input custom parameters as calibration coefficients. The RadEro
user manual and protocol, including detailed instructions on how to format input data and configuration files, can be found at the following link: <https://github.com/eead-csic-eesa/RadEro>
.
This package implements anomaly detection as binary classification for cross-sectional data. Uses maximum likelihood estimates and normal probability functions to classify observations as anomalous. The method is presented in the following lecture from the Machine Learning course by Andrew Ng: <https://www.coursera.org/learn/machine-learning/lecture/C8IJp/algorithm/>, and is also described in: Aleksandar Lazarevic, Levent Ertoz, Vipin Kumar, Aysel Ozgur, Jaideep Srivastava (2003) <doi:10.1137/1.9781611972733.3>.