An implementation of neural networks trained with flow-sorted gene expression data to classify cellular phenotypes in single cell RNA-sequencing data. See Chamberlain M et al. (2021) <doi:10.1101/2021.02.01.429207> for more details.
This package provides a tool for cutting data into intervals. Allows singleton intervals. Always includes the whole range of data by default. Flexible labelling. Convenience functions for cutting by quantiles etc. Handles dates, times, units and other vectors.
This package provides functions to speed up work flow for hydrological analysis. Focused on Australian climate data (SILO climate data), hydrological models (eWater
Source) and in particular South Australia (<https://water.data.sa.gov.au> hydrological data).
The systemPipeShiny
(SPS) framework comes with many useful utility functions. However, installing the whole framework is heavy and takes some time. If you like only a few useful utility functions from SPS, install this package is enough.
The idea is to provide a standard interface to users who use both R and Python for building machine learning models. This package provides a scikit-learn's fit, predict interface to train machine learning models in R.
Compute various common mean squared predictive error (MSPE) estimators, as well as several existing variance component predictors as a byproduct, for FH model (Fay and Herriot, 1979) and NER model (Battese et al., 1988) in small area estimation.
This package provides a wrapper for the TexTra
API <https://mt-auto-minhon-mlt.ucri.jgn-x.jp/>, a web service for translating texts between different languages. TexTra
API account is required to use the service.
Draws tornado plots for model sensitivity to univariate changes. Implements methods for many modeling methods including linear models, generalized linear models, survival regression models, and arbitrary machine learning models in the caret package. Also draws variable importance plots.
Common techinical complications such as clogging can result in spurious events and fluorescence intensity shifting, flowCut
is designed to detect and remove technical artifacts from your data by removing segments that show statistical differences from other segments.
Bedgraph files generated by Bisulfite pipelines often come in various flavors. Critical downstream step requires summarization of these files into methylation/coverage matrices. This step of data aggregation is done by Methrix, including many other useful downstream functions.
Rasterize only specific layers of a ggplot2 plot while simultaneously keeping all labels and text in vector format. This allows users to keep plots within the reasonable size limit without losing vector properties of the scale-sensitive information.
This package provides a new class Formula
, which extends the base class formula
. It supports extended formulas with multiple parts of regressors on the right-hand side and/or multiple responses on the left-hand side.
This package provides helpers for reordering factor levels (including moving specified levels to front, ordering by first appearance, reversing, and randomly shuffling), and tools for modifying factor levels (including collapsing rare levels into other, "anonymizing", and manually "recoding").
This package provides kernel-based machine learning methods for classification, regression, clustering, novelty detection, quantile regression and dimensionality reduction. Among other methods kernlab
includes Support Vector Machines, Spectral Clustering, Kernel PCA, Gaussian Processes and a QP solver.
This package provides a reticulate wrapper for the Python package anndata
. It provides a scalable way of keeping track of data and learned annotations. It is used to read from and write to the h5ad file format.
Maximum likelihood computations for Tweedie families, including the series expansion (Dunn and Smyth, 2005; <doi10.1007/s11222-005-4070-y>) and the Fourier inversion (Dunn and Smyth, 2008; <doi:10.1007/s11222-007-9039-6>), and related methods.
Reprotest builds the same source code twice in different environments, and then checks the binaries produced by each build for differences. If any are found, then diffoscope or diff is used to display them in detail for later analysis.
This package provides a portable Shiny tool to explore patient-level electronic health record data and perform chart review in a single integrated framework. This tool supports browsing clinical data in many different formats including multiple versions of the OMOP common data model as well as the MIMIC-III data model. In addition, chart review information is captured and stored securely via the Shiny interface in a REDCap (Research Electronic Data Capture) project using the REDCap API. See the ReviewR
website for additional information, documentation, and examples.
Non-parametric clustering of joint pattern multi-genetic/epigenetic factors. This package contains functions designed to cluster subjects based on gene features including single nucleotide polymorphisms (SNPs), DNA methylation (CPG), gene expression (GE), and covariate data. The novel concept follows the general K-means (Hartigan and Wong (1979) <doi:10.2307/2346830> framework but uses weighted Euclidean distances across the gene features to cluster subjects. This approach is unique in that it attempts to capture all pairwise interactions in an effort to cluster based on their complex biological interactions.
This package provides a pair of functions for calculating mean residual life (MRL) , median residual life, and percentile residual life using the outputs of either the flexsurv package or parameters provided by the user. Input information about the distribution, the given life value, the percentile, and the type of residual life, and the function will return your desired values. For the flexsurv option, the function allows the user to input their own data for making predictions. This function is based on Jackson (2016) <doi:10.18637/jss.v070.i08>.
Classify hemispherical photographs of the plant canopy with algorithms specially developed for such a task and well documented in DÃ az and Lencinas (2015) <doi:10.1109/lgrs.2015.2425931> and DÃ az and Lencinas (2018) <doi:10.1139/cjfr-2018-0006>. It supports non-circular hemispherical photography, such as those acquired with 15mm lenses or with auxiliary fish-eye lenses attached to mobile devices. For smartphone-based hemispherical photography see DÃ az (2023) <doi:10.1111/2041-210x.14059>. Most of the functions also support restricted view photography.
Simulate multivariate data with arbitrary marginal distributions. bigsimr is a package for simulating high-dimensional multivariate data with a target correlation and arbitrary marginal distributions via Gaussian copula. It utilizes the Julia package Bigsimr.jl for its core routines.
Permutational method to incorporate taxonomic uncertainty and some functions to assess its effects on parameters of some widely used multivariate methods in ecology, as explained in Cayuela et al. (2011) <doi:10.1111/j.1600-0587.2009.05899.x>.
Explore and normalize American campaign finance data. Created by the Investigative Reporting Workshop to facilitate work on The Accountability Project, an effort to collect public data into a central, standard database that is more easily searched: <https://publicaccountability.org/>.