The US EPA ECOTOX database is a freely available database with a treasure of aquatic and terrestrial ecotoxicological data. As the online search interface doesn't come with an API, this package provides the means to easily access and search the database in R. To this end, all raw tables are downloaded from the EPA website and stored in a local SQLite database <doi:10.1016/j.chemosphere.2024.143078>.
Estimates the time-varying reproduction number, rate of spread, and doubling time using a range of open-source tools (Abbott et al. (2020) <doi:10.12688/wellcomeopenres.16006.1>), and current best practices (Gostic et al. (2020) <doi:10.1101/2020.06.18.20134858>). It aims to help users avoid some of the limitations of naive implementations in a framework that is informed by community feedback and is actively supported.
Batch processing framework for ellmer chat models. Provides both sequential and parallel processing of chat interactions with features including tool calling and structured data extraction. Enables workflow management through progress tracking and recovery and automatic retry with backoff. Additional quality-of-life features include verbosity (or echo) control and sound notifications. Parallel processing is implemented via the future framework. Includes methods for retrieving progress status, chat texts, and chat objects.
Homomorphic encryption (Brakerski and Vaikuntanathan (2014) <doi:10.1137/120868669>) using Ring Learning with Errors (Lyubashevsky et al. (2012) <https://eprint.iacr.org/2012/230>) is a form of Learning with Errors (Regev (2005) <doi:10.1145/1060590.1060603>) using polynomial rings over finite fields. Functions to generate the required polynomials (using polynom'), with various distributions of coefficients are provided. Additionally, functions to generate and take coefficient modulo are provided.
Inference of Multiscale graphical models with neighborhood selection approach. The method is based on solving a convex optimization problem combining a Lasso and fused-group Lasso penalties. This allows to infer simultaneously a conditional independence graph and a clustering partition. The optimization is based on the Continuation with Nesterov smoothing in a Shrinkage-Thresholding Algorithm solver (Hadj-Selem et al. 2018) <doi:10.1109/TMI.2018.2829802> implemented in python.
Function ModEstM()
is the only one of this package, it estimates the modes of an empirical univariate distribution. It relies on the stats::density()
function, even for input control. Due to very good performance of the density estimation, computation time is not an issue. The multiple modes are handled using dplyr::group_by()
. For conditions and rates of convergences, see Eddy (1980) <doi:10.1214/aos/1176345080>.
Many tools for making, modifying, mapping, marking, measuring, and motifs and memberships of many different types of networks. All functions operate with matrices, edge lists, and igraph', network', and tidygraph objects, and on one-mode, two-mode (bipartite), and sometimes three-mode networks. The package includes functions for importing and exporting, creating and generating networks, modifying networks and node and tie attributes, and describing and visualizing networks with sensible defaults.
This package provides a Software Development Kit for working with Nixtla''s TimeGPT
', a foundation model for time series forecasting. API is an acronym for application programming interface'; this package allows users to interact with TimeGPT
via the API'. You can set and validate API keys and generate forecasts via API calls. It is compatible with tsibble and base R. For more details visit <https://docs.nixtla.io/>.
The openMSE
package is designed for building operating models, doing simulation modelling and management strategy evaluation for fisheries. openMSE
is an umbrella package for the MSEtool (Management Strategy Evaluation toolkit), DLMtool (Data-Limited Methods toolkit), and SAMtool (Stock Assessment Methods toolkit) packages. By loading and installing openMSE
', users have access to the full functionality contained within these packages. Learn more about openMSE
at <https://openmse.com/>.
This package implements a range of facilities for post-hoc analysis and summarizing linear models, generalized linear models and generalized linear mixed models, including grouping and clustering via pairwise comparisons using graph representations and efficient algorithms for finding maximal cliques of a graph. Includes also non-parametric toos for post-hoc analysis. It has S3 methods for printing summarizing, and producing plots, line and barplots suitable for post-hoc analyses.
Presents an explanatory animation of normal quantile-quantile plots based on a water-filling analogy. The animation presents a normal QQ plot as the parametric plot of the water levels in vases defined by two distributions. The distributions decorate the axes in the normal QQ plot and are optionally shown as vases adjacent to the plot. The package draws QQ plots for several distributions, either as samples or continuous functions.
Data wrangling, pre-processing, and generating automated reports from Colombia's epidemiological surveillance system, SIVIGILA <https://portalsivigila.ins.gov.co/>. It provides a customizable R Markdown template for analysis and automatic generation of epidemiological reports that can be adapted to local, regional, and national contexts. This tool offers a standardized and reproducible workflow that helps to reduce manual labor and potential errors in report generation, improving their efficiency and consistency.
This tiny package contains one function smirnov()
which calculates two scaled taxonomic coefficients, Txy (coefficient of similarity) and Txx (coefficient of originality). These two characteristics may be used for the analysis of similarities between any number of taxonomic groups, and also for assessing uniqueness of giving taxon. It is possible to use smirnov()
output as a distance measure: convert it to distance by "as.dist(1 - smirnov(x))".
This data-driven phylogenetic comparative method fits stabilizing selection models to continuous trait data, building on the ouch methodology of Butler and King (2004) <doi:10.1086/426002>. The main functions fit a series of Hansen models using stepwise AIC, then identify cases of convergent evolution where multiple lineages have shifted to the same adaptive peak. For more information see Ingram and Mahler (2013) <doi:10.1111/2041-210X.12034>.
This package provides a collection of commonly used tools for animal movement and other tracking data. Variously distance, angle, bearing, distance-to, bearing-to and speed are provided for geographic data that can be used directly or within tidyverse syntax. Distances and bearings are calculated using modern geodesic methods as provided by Charles F. F. Karney (2013) <doi:10.1007/s00190-012-0578-z> via the geodist and geosphere packages.
This package implements marginal structural models combined with a latent class growth analysis framework for assessing the causal effect of treatment trajectories. Based on the approach described in "Marginal Structural Models with Latent Class Growth Analysis of Treatment Trajectories" Diop, A., Sirois, C., Guertin, J.R., Schnitzer, M.E., Candas, B., Cossette, B., Poirier, P., Brophy, J., Mésidor, M., Blais, C. and Hamel, D., (2023) <doi:10.1177/09622802231202384>.
Estimates heterogeneous treatment effects using tidy semantics on experimental or observational data. Methods are based on the doubly-robust learner of Kennedy (n.d.) <arXiv:2004.14497>
. You provide a simple recipe for what machine learning algorithms to use in estimating the nuisance functions and tidyhte will take care of cross-validation, estimation, model selection, diagnostics and construction of relevant quantities of interest about the variability of treatment effects.
There are two new network metrics, RWC (random walk centrality) and CBET (counting betweenness). Also available are the normalized versions of those metrics. These measures of centrality and betweenness are particularly useful for the analysis of very dense weighted networks which include loops. Traditional measures do not work as well for those network characteristics. The main reference is DePaolis
at al (2022) <doi:10.1007/s41109-022-00519-2>.
This package provides a model designed for dimensionality reduction and batch effect removal for scRNA-seq
data. It is designed to be massively parallelizable using shared objects that prevent memory duplication, and it can be used with different mini-batch approaches in order to reduce time consumption. It assumes a negative binomial distribution for the data with a dispersion parameter that can be both commonwise across gene both genewise.
This package provides functions for identification and visualization of potential intramolecular triplex patterns in DNA sequence. The main functionality is to detect the positions of subsequences capable of folding into an intramolecular triplex (H-DNA) in a much larger sequence. The potential H-DNA (triplexes) should be made of as many cannonical nucleotide triplets as possible. The package includes visualization showing the exact base-pairing in 1D, 2D or 3D.
This package provides a fast implementation of a key-value store. Environments are commonly used as key-value stores, but every time a new key is used, it is added to R's global symbol table, causing a small amount of memory leakage. This can be problematic in cases where many different keys are used. Fastmap avoids this memory leak issue by implementing the map using data structures in C++.
This package provides tools to visualize simple graphs (networks) based on a transition matrix, utilities to plot flow diagrams, visualizing webs, electrical networks, etc. It also includes supporting material for the book "A practical guide to ecological modelling - using R as a simulation platform" by Karline Soetaert and Peter M.J. Herman (2009) and the book "Solving Differential Equations in R" by Karline Soetaert, Jeff Cash and Francesca Mazzia (2012).
This package can be used to predict the r-species accumulation curve (r-SAC), which is the number of species represented at least r times as a function of the sampling effort. When r = 1, the curve is known as the species accumulation curve, or the library complexity curve in high-throughput genomic sequencing. The package includes both parametric and nonparametric methods, as described by Deng C, et al. (2018).
This package provides HTTP error helpers. Methods are included for general purpose HTTP error handling, as well as individual methods for every HTTP status code, both via status code numbers as well as their descriptive names. It supports the ability to adjust behavior to stop, message or warning. It includes the ability to use a custom whisker template to have any configuration of status code, short description, and verbose message.