RTG Tools is a subset of RTG Core that includes several useful utilities for dealing with VCF files and sequence data. Probably the most interesting is the vcfeval command which performs sophisticated comparison of VCF files.
This is a legacy project, do not use it for new projects. Ruby 2.3 and later should make this obsolete. kgio provides non-blocking I/O methods for Ruby without raising exceptions on EAGAIN and EINPROGRESS.
This package provides a new class Formula, which extends the base class formula. It supports extended formulas with multiple parts of regressors on the right-hand side and/or multiple responses on the left-hand side.
Maximum likelihood computations for Tweedie families, including the series expansion (Dunn and Smyth, 2005; <doi10.1007/s11222-005-4070-y>) and the Fourier inversion (Dunn and Smyth, 2008; <doi:10.1007/s11222-007-9039-6>), and related methods.
This package provides a reticulate wrapper for the Python package anndata. It provides a scalable way of keeping track of data and learned annotations. It is used to read from and write to the h5ad file format.
Rasterize only specific layers of a ggplot2 plot while simultaneously keeping all labels and text in vector format. This allows users to keep plots within the reasonable size limit without losing vector properties of the scale-sensitive information.
This package provides helpers for reordering factor levels (including moving specified levels to front, ordering by first appearance, reversing, and randomly shuffling), and tools for modifying factor levels (including collapsing rare levels into other, "anonymizing", and manually "recoding").
This package provides kernel-based machine learning methods for classification, regression, clustering, novelty detection, quantile regression and dimensionality reduction. Among other methods kernlab includes Support Vector Machines, Spectral Clustering, Kernel PCA, Gaussian Processes and a QP solver.
Reprotest builds the same source code twice in different environments, and then checks the binaries produced by each build for differences. If any are found, then diffoscope or diff is used to display them in detail for later analysis.
miaDash provides a Graphical User Interface for the exploration of microbiome data. This way, no knowledge of programming is required to perform analyses. Datasets can be imported, manipulated, analysed and visualised with a user-friendly interface.
This package provides an R interface for various subsampling algorithms implemented in python packages. Currently, interfaces to the geosketch and scSampler python packages are implemented. In addition it also provides diagnostic plots to evaluate the subsampling.
This package provides functions for handling data from Bioconductor Affymetrix annotation data packages. It produces compact HTML and text reports including experimental data and URL links to many online databases. It allows searching of biological metadata using various criteria.
This package implements a model of per-position sequencing bias in high-throughput sequencing data using a simple Bayesian network, the structure and parameters of which are trained on a set of aligned reads and a reference genome sequence.
This is a package providing efficient operations for single cell ATAC-seq fragments and RNA counts matrices. It is interoperable with standard file formats, and introduces efficient bit-packed formats that allow large storage savings and increased read speeds.
The goal of this package is to generate an attractive and useful website from a source package. pkgdown converts your documentation, vignettes, README file, and more to HTML making it easy to share information about your package online.
This package provides statistical tools for Bayesian structure learning in undirected graphical models for continuous, discrete, and mixed data. It uses a trans-dimensional Markov Chain Monte Carlo (MCMC) approach based on a continuous-time birth-death process.
This is an extension of the testthat package that lets you add parameters to your unit tests. Parameterized unit tests are often easier to read and more reliable, since they follow the DNRY (do not repeat yourself) rule.
This package provides an implementation of multiscale bootstrap resampling for assessing the uncertainty in hierarchical cluster analysis. It provides an AU (approximately unbiased) P-value as well as a BP (bootstrap probability) value for each cluster in a dendrogram.
Functions to help implement the extraction / subsetting / indexing function [ and replacement function [<- of custom matrix-like types (based on S3, S4, etc.), modeled as closely to the base matrix class as possible (with tests to prove it).
This package implements the Differential Evolution algorithm. This algorithm is used for the global optimization of a real-valued function of a real-valued parameter vector. The implementation of DifferentialEvolution in DEoptim interfaces with C code for efficiency.
This package contains several basic utility functions including: moving (rolling, running) window statistic functions, read/write for GIF and ENVI binary files, fast calculation of AUC, LogitBoost classifier, base64 encoder/decoder, round-off-error-free sum and cumsum, etc.
The main function biclust() provides several algorithms to find biclusters in two-dimensional data, spectral, plaid model, xmotifs, and bimax. In addition, the package provides methods for data preprocessing (normalization and discretization), visualization, and validation of bicluster solutions.
This package interacts with a suite of web services for chemical information. Sources include: Alan Wood's Compendium of Pesticide Common Names, Chemical Identifier Resolver, ChEBI, Chemical Translation Service, ChemSpider, ETOX, Flavornet, NIST Chemistry WebBook, OPSIN, PubChem, SRS, Wikidata.
Bedgraph files generated by Bisulfite pipelines often come in various flavors. Critical downstream step requires summarization of these files into methylation/coverage matrices. This step of data aggregation is done by Methrix, including many other useful downstream functions.