RcisTarget identifies transcription factor binding motifs (TFBS) over-represented on a gene list. In a first step, RcisTarget selects DNA motifs that are significantly over-represented in the surroundings of the transcription start site (TSS) of the genes in the gene-set. This is achieved by using a database that contains genome-wide cross-species rankings for each motif. The motifs that are then annotated to TFs and those that have a high Normalized Enrichment Score (NES) are retained. Finally, for each motif and gene-set, RcisTarget predicts the candidate target genes (i.e. genes in the gene-set that are ranked above the leading edge).
The motifStack package is designed for graphic representation of multiple motifs with different similarity scores. It works with both DNA/RNA sequence motifs and amino acid sequence motifs. In addition, it provides the flexibility for users to customize the graphic parameters such as the font type and symbol colors.
This package provides an R interface to the Tabler HTML template. tablerDash is a light Bootstrap 4 dashboard template. There are different layouts available such as a one page dashboard or a multi-page template, where the navigation menu is contained in the navigation bar.
This package plots path diagrams from models in lavaan using the plotting functionality from the DiagrammeR package. DiagrammeR provides nice path diagrams via Graphviz, and these functions make it easy to generate these diagrams from a lavaan path model without having to write the DOT language graph specification.
This is an R package to build generic .loom files aligning with the default naming convention of the .loom format and to integrate other data types e.g.: regulons (SCENIC), clusters from Seurat, trajectory information... The package can also be used to extract data from .loom files.
This is a package to infer transmission trees from a dated phylogeny. It includes methods to simulate and analyze outbreaks. The methodology is described in Didelot et al. (2014) and Didelot et al. (2017).
This package provides the prediction() function, a type-safe alternative to predict() that always returns a data frame. The package currently supports common model types (e.g., "lm", "glm") from the stats package, as well as numerous other model classes from other add-on packages.
This package provides methods and object classes for parsing FastQC reports and output summaries from other NGS tools into R. As well as parsing files, multiple plotting methods have been implemented for visualising the parsed data. Plots can be generated as static ggplot objects or interactive plotly objects.
This package provides a set of R functions for identifying and correcting HGNC human gene symbols. In addition, you can identify MGI mouse gene symbols, which have been converted to date format by Excel, withdrawn, or aliased. It also contains functions for reversibly converting between HGNC symbols and valid R names.
Radiance is a web application environment, which is sort of like a web framework, but more general, more flexible. It should let you write personal websites and generally deployable applications easily and in such a way that they can be used on practically any setup without having to undergo special adaptations.
The package aims to facilitate Russian typesetting (based on input using MicroSoft Code Page 1251). Russian hyphenation is selected, and various mathematical commands are set up in Russian style. Furthermore all Cyrillic letters catcodes are set to letter, so that commands with Cyrillic letters in their names may be defined.
Algorithm and tools for in silico pack-TYPE transposon discovery. Filters a given genome for properties unique to DNA transposons and provides tools for the investigation of returned matches. Sequences are input in DNAString format, and ranges are returned as a dataframe (in the format returned by as.dataframe(GRanges)).
This package provides a novel feature selection algorithm for binary classification using support vector machine recursive feature elimination SVM-RFE and t-statistic. In this feature selection process, the selected features are differentially significant between the two classes and also they are good classifier with higher degree of classification accuracy.
Seed Fu is an attempt to once and for all solve the problem of inserting and maintaining seed data in a database. It uses a variety of techniques gathered from various places around the web and combines them to create what is hopefully the most robust seed data system around.
This package provides an R to C/C++ interface that runs the Leiden community detection algorithm to find a basic partition. It runs the equivalent of the leidenalg find_partition() function. This package includes the required source code files from the official leidenalg distribution and functions from the R igraph package.
This package offers a set of functions for extending dendrogram objects in R, letting you visualize and compare trees of hierarchical clusterings. You can adjust a tree's graphical parameters (the color, size, type, etc of its branches, nodes and labels) and visually and statistically compare different dendrograms to one another.
This is a comprehensive package to perform Tensor decomposition based unsupervised feature extraction. It can perform unsupervised feature extraction. It uses tensor decomposition. It is applicable to gene expression, DNA methylation, and histone modification etc. It can perform multiomics analysis. It is also potentially applicable to single cell omics data sets.
This package provides tools to query the U.S. National Library of Medicine's Clinical Trials database. Functions are provided for a variety of techniques for searching the data using range queries, categorical filtering, and by searching for full-text keywords. Minimal graphical tools are also provided for interactively exploring the constructed data.
pairedGSEA makes it simple to run a paired Differential Gene Expression (DGE) and Differencital Gene Splicing (DGS) analysis. The package allows you to store intermediate results for further investiation, if desired. pairedGSEA comes with a wrapper function for running an Over-Representation Analysis (ORA) and functionalities for plotting the results.
This package provides an R interface to Illumina's BaseSpace cloud computing environment, enabling the fast development of data analysis and visualization tools. Besides providing an easy to use set of tools for manipulating the data from BaseSpace, it also facilitates the access to R's rich environment of statistical and data analysis tools.
This package provides utilities for processing the parameters of various statistical models. Beyond computing p values, CIs, and other indices for a wide variety of models, this package implements features like standardization or bootstrapping of parameters and models, feature reduction (feature extraction and variable selection) as well as conversion between indices of effect size.
spacefillr enables generation of random and quasi-random space-filling sequences. It supports the following sequences: Halton, Sobol, Owen-scrambled Sobol, Owen-scrambled Sobol with errors distributed as blue noise, progressive jittered, progressive multi-jittered (PMJ), PMJ with blue noise, PMJ02, and PMJ02 with blue noise. The package also includes a C++ API.
An R package for deeping mining gene co-expression networks in multi-trait expression data. Provides functions for analyzing, comparing, and visualizing WGCNA networks across conditions. multiWGCNA was designed to handle the common case where there are multiple biologically meaningful sample traits, such as disease vs wildtype across development or anatomical region.
The Universal Protein Resource (UniProt) is a comprehensive resource for protein sequence and annotation data. This package provides a collection of functions for retrieving, processing, and re-packaging UniProt web services. The package makes use of UniProt's modernized REST API and allows mapping of identifiers accross different databases.