The NCI-60 cancer cell line panel has been used over the course of several decades as an anti-cancer drug screen. This panel was developed as part of the Developmental Therapeutics Program (DTP, http://dtp.nci.nih.gov/) of the U.S. National Cancer Institute (NCI). Thousands of compounds have been tested on the NCI-60, which have been extensively characterized by many platforms for gene and protein expression, copy number, mutation, and others (Reinhold, et al., 2012). The purpose of the CellMiner project (http://discover.nci.nih.gov/ cellminer) has been to integrate data from multiple platforms used to analyze the NCI-60 and to provide a powerful suite of tools for exploration of NCI-60 data.
This package provides high level functions for reading Affy .CEL files, phenotypic data, and then computing simple things with it, such as t-tests, fold changes and the like. It makes heavy use of the affy library. It also has some basic scatter plot functions and mechanisms for generating high resolution journal figures.
This package comprises a set of pretrained machine learning models to predict basic immune cell types. This enables to quickly get a first annotation of the cell types present in the dataset without requiring prior knowledge. The package also lets you train using own models to predict new cell types based on specific research needs.
This package provides a model agnostic tool for decomposition of predictions from black boxes. It supports additive attributions and attributions with interactions. The Break Down Table shows contributions of every variable to a final prediction. The Break Down Plot presents variable contributions in a concise graphical way. This package works for classification and regression models.
Pry Doc is a Pry REPL plugin. It provides extended documentation support for the REPL by means of improving the show-doc and show-source commands. With help of the plugin the commands are be able to display the source code and the docs of Ruby methods and classes implemented in C.
Taking a set of sequence motifs as PWMs, test a set of sequences for over-representation of these motifs, as well as any positional features within the set of motifs. Enrichment analysis can be undertaken using multiple statistical approaches. The package also contains core functions to prepare data for analysis, and to visualise results.
This package provides functionality for client-side navigation of the server side file system in shiny apps. In case the app is running locally this gives the user direct access to the file system without the need to "download" files to a temporary location. Both file and folder selection as well as file saving is available.
This package provides a collection of functions dealing with labelled data, like reading and writing data between R and other statistical software packages. This includes easy ways to get, set or change value and variable label attributes, to convert labelled vectors into factors or numeric (and vice versa), or to deal with multiple declared missing values.
Read in activity measurements from standard file formats used by circadian rhythm researchers, currently only ClockLab format, and process and plot the data. The central type of plot is the actogram, as first described in "Activity and distribution of certain wild mice in relation to biotic communities" by MS Johnson (1926) doi:10.2307/1373575.
The nls.lm function provides an R interface to lmder and lmdif from the MINPACK library, for solving nonlinear least-squares problems by a modification of the Levenberg-Marquardt algorithm, with support for lower and upper parameter bounds. The implementation can be used via nls-like calls using the nlsLM function.
ncRNAtools provides a set of basic tools for handling and analyzing non-coding RNAs. These include tools to access the RNAcentral database and to predict and visualize the secondary structure of non-coding RNAs. The package also provides tools to read, write and interconvert the file formats most commonly used for representing such secondary structures.
This package contains a set of functions to perform large-scale analysis of pharmaco-genomic data. These include the PharmacoSet object for storing the results of pharmacogenomic experiments, as well as a number of functions for computing common summaries of drug-dose response and correlating them with the molecular features in a cancer cell-line.
unicorn is an HTTP server for Rack applications designed to only serve fast clients on low-latency, high-bandwidth connections and take advantage of features in Unix/Unix-like kernels. Slow clients should only be served by placing a reverse proxy capable of fully buffering both the the request and response in between unicorn and slow clients.
The package is usable with Affymetrix GeneChip short oligonucleotide arrays, and it can be adapted or extended to other platforms. It is able to modify or replace the grouping of probes in the probe sets. Also, the package contains simple functions to read R connections in the FASTA format and it can create an alternative mapping from sequences.
HDCytoData contains a set of high-dimensional cytometry benchmark datasets. These datasets are formatted into SummarizedExperiment and flowSet Bioconductor object formats, including all required metadata. Row metadata includes sample IDs, group IDs, patient IDs, reference cell population or cluster labels and labels identifying spiked in cells. Column metadata includes channel names, protein marker names, and protein marker classes.
In order to smoothly animate the transformation of polygons and paths, many aspects needs to be taken into account, such as differing number of control points, changing center of rotation, etc. The transformr package provides an extensive framework for manipulating the shapes of polygons and paths and can be seen as the spatial brother to the tweenr package.
This package implements multitaper spectral estimation techniques using prolate spheroidal sequences (Slepians) and sine tapers for time series analysis. It includes an adaptive weighted multitaper spectral estimate, a coherence estimate, Thomson's Harmonic F-test, and complex demodulation. The Slepians sequences are generated efficiently using a tridiagonal matrix solution, and jackknifed confidence intervals are available for most estimates.
R's default conflict management system gives the most recently loaded package precedence. This can make it hard to detect conflicts, particularly when they arise because a package update creates ambiguity that did not previously exist. The conflicted package takes a different approach, making every conflict an error and forcing you to choose which function to use.
Random Jungle is an implementation of Random Forests. It is supposed to analyse high dimensional data. In genetics, it can be used for analysing big Genome Wide Association (GWA) data. Random Forests is a powerful machine learning method. Most interesting features are variable selection, missing value imputation, classifier creation, generalization error estimation and sample proximities between pairs of cases.
Mixedpower uses pilotdata and a linear mixed model fitted with lme4 to simulate new data sets. Power is computed separate for every effect in the model output as the relation of significant simulations to all simulations. More conservative simulations as a protection against a bias in the pilotdata are available as well as methods for plotting the results.
Detect binding sites using motifs IUPAC sequence or bed coordinates and ChIP-seq experiments in bed or bam format. Combine/compare binding sites across experiments, tissues, or conditions. All normalization and differential steps are done using TMM-GLM method. Signal decomposition is done by setting motifs as the centers of the mixture of normal distribution curves.
MSstats package provide tools for preprocessing, summarization and differential analysis of mass spectrometry (MS) proteomics data. Recently, some MS protocols enable acquisition of data sets that result in larger than memory quantitative data. MSstats functions are not able to process such data. MSstatsBig package provides additional converter functions that enable processing larger than memory data sets.
This package vendors an assortment of useful header-only C++ libraries. Bioconductor packages can use these libraries in their own C++ code by LinkingTo this package without introducing any additional dependencies. The use of a central repository avoids duplicate vendoring of libraries across multiple R packages, and enables better coordination of version updates across cohorts of interdependent C++ libraries.
This package provides a convenient way to analyze and visualize PICRUSt2 output with pre-defined plots and functions. It allows for generating statistical plots about microbiome functional predictions and offers customization options. It features a one-click option for creating publication-level plots, saving time and effort in producing professional-grade figures. It streamlines the PICRUSt2 analysis and visualization process.