Extends the capabilities for flexible partitioning and model-based clustering available in the packages flexclust and flexmix to handle ordinal and mixed-with-ordinal data types via new distance, centroid and driver functions that make various assumptions regarding ordinality. Using them within the flex-scheme allows for easy comparisons across methods.
Statistical tool set for population genetics. The package provides following functions: 1) empirical Bayes estimator of Fst and other measures of genetic differentiation, 2) regression analysis of environmental effects on genetic differentiation using bootstrap method, 3) interfaces to read and manipulate GENEPOP format data files and allele/haplotype frequency format files.
Calculates Agresti's generalized odds ratios. For a randomly selected pair of observations from two groups, calculates the odds that the second group will have a higher scoring outcome than that of the first group. Package provides hypothesis testing for if this odds ratio is significantly different to 1 (equal chance).
Estimate risk caused by two extreme and dependent forcing variables using bivariate extreme value models as described in Zheng, Westra, and Sisson (2013) <doi:10.1016/j.jhydrol.2013.09.054>; Zheng, Westra and Leonard (2014) <doi:10.1002/2013WR014616>; Zheng, Leonard and Westra (2015) <doi:10.2166/hydro.2015.052>.
This package provides tools facilitating access to the macro_data service of the La Societe Nouvelle API. It ensures an easy and fully-disclosed access to all macro-level data used in the La Societe Nouvelle systems and the related metadata. Related API can be accessed from <https://api.lasocietenouvelle.org/>.
This package creates HTML strings to embed tables, images or graphs in pop-ups of interactive maps created with packages like leaflet or mapview'. Handles local images located on the file system or via remote URL. Handles graphs created with lattice or ggplot2 as well as interactive plots created with htmlwidgets'.
This package provides tools to analyse, interpret and understand air pollution data. Data are typically regular time series and air quality measurement, meteorological data and dispersion model output can be analysed. The package is described in Carslaw and Ropkins (2012, <doi:10.1016/j.envsoft.2011.09.008>) and subsequent papers.
Designed to enhance data validation and management processes by employing a set of functions that read a set of rules from a CSV or Excel file and apply them to a dataset. Funded by the National Renewable Energy Laboratory and Possibility Lab, maintained by the Moore Institute for Plastic Pollution Research.
Improving graphics by ameliorating order effects, using Eulerian tours and Hamiltonian decompositions of graphs. References for the methods presented here are C.B. Hurley and R.W. Oldford (2010) <doi:10.1198/jcgs.2010.09136> and C.B. Hurley and R.W. Oldford (2011) <doi:10.1007/s00180-011-0229-5>.
Gene-based association tests using the actual impurity reduction (AIR) variable importance. The function aggregates AIR importance measures from a group of SNPs or probes and outputs a p-value for each gene. The procedures builds upon the method described in <doi:10.1093/Bioinformatics/Bty373> and will be published soon.
Get sun position, sunlight phases (times for sunrise, sunset, dusk, etc.), moon position and lunar phase for the given location and time. Most calculations are based on the formulas given in Astronomy Answers articles about position of the sun and the planets : <https://www.aa.quae.nl/en/reken/zonpositie.html>.
Does prediction in the case of a censored survival outcome, or a regression outcome, using the "supervised principal component" approach. Superpc is especially useful for high-dimensional data when the number of features p dominates the number of samples n (p >> n paradigm), as generated, for instance, by high-throughput technologies.
Fits, spatially predicts and temporally forecasts large amounts of space-time data using [1] Bayesian Gaussian Process (GP) Models, [2] Bayesian Auto-Regressive (AR) Models, and [3] Bayesian Gaussian Predictive Processes (GPP) based AR Models for spatio-temporal big-n problems. Bakar and Sahu (2015) <doi:10.18637/jss.v063.i15>.
Function for the computation of fractal dimension based on mass of soil particle size distribution by Tyler & Wheatcraft (1992) <doi:10.2136/sssaj1992.03615995005600020005x>. It also provides functions for calculation of mean weight and geometric mean diameter of particle size distribution by Perfect et al. (1992) <doi:10.2136/sssaj1992.03615995005600050012x>.
An implementation of a computationally efficient method to fit large-scale interaction models based on the reluctant interaction selection principle. The method and its properties are described in greater depth in Yu, G., Bien, J., and Tibshirani, R.J. (2019) "Reluctant interaction modeling", which is available at <arXiv:1907.08414>
.
Identifies the locations of neurons, and estimates their calcium concentrations over time using the SCALPEL method proposed in Petersen, Ashley; Simon, Noah; Witten, Daniela. SCALPEL: Extracting neurons from calcium imaging data. Ann. Appl. Stat. 12 (2018), no. 4, 2430--2456. <doi:10.1214/18-AOAS1159>. <https://projecteuclid.org/euclid.aoas/1542078051>.
This package provides a set of measures of dissimilarity between time series to perform time series clustering. Metrics based on raw data, on generating models and on the forecast behavior are implemented. Some additional utilities related to time series clustering are also provided, such as clustering algorithms and cluster evaluation metrics.
The goal of vetiver is to provide fluent tooling to version, share, deploy, and monitor a trained model. Functions handle both recording and checking the model's input data prototype, and predicting from a remote API endpoint. The vetiver package is extensible, with generics that can support many kinds of models.
The goal of DELocal is to identify DE genes compared to their neighboring genes from the same chromosomal location. It has been shown that genes of related functions are generally very far from each other in the chromosome. DELocal utilzes this information to identify DE genes comparing with their neighbouring genes.
Pathway Expression Profiles (PEPs) are based on the expression of pathways (defined as sets of genes) as opposed to individual genes. This package converts gene expression profiles to PEPs and performs enrichment analysis of both pathways and experimental conditions, such as "drug set enrichment analysis" and "gene2drug" drug discovery analysis respectively.
Keep track of dates in terms of fractional calendar months per Damien Laker "Time Calculations for Annualizing Returns: the Need for Standardization", The Journal of Performance Measurement, 2008. Model dates as of close of business. Perform date arithmetic in units of "months" and "years". Allow "infinite" dates to model "ultimate" time.
Another Base58 codec implementation. Compared to the base58 crate this is significantly faster at decoding: about 2.4x as fast when decoding 32 bytes, almost the same speed for encoding: about 3% slower when encoding 32 bytes. rust-bs58
is not limited to 128 bytes and supports a configurable alphabet.
Another Base58 codec implementation. Compared to the base58 crate this is significantly faster at decoding: about 2.4x as fast when decoding 32 bytes, almost the same speed for encoding: about 3% slower when encoding 32 bytes. rust-bs58
is not limited to 128 bytes and supports a configurable alphabet.
Haml is a layer on top of HTML or XML that is designed to express the structure of documents using indentation rather than closing tags. It was originally envisioned as a plugin for Ruby on Rails, but it can function as a stand-alone templating engine.