Enter the query into the form above. You can look for specific version of a package by using @ symbol like this: gcc@10.
API method:
GET /api/packages?search=hello&page=1&limit=20
where search is your query, page is a page number and limit is a number of items on a single page. Pagination information (such as a number of pages and etc) is returned
in response headers.
If you'd like to join our channel search send a patch to ~whereiseveryone/toys@lists.sr.ht adding your channel as an entry in channels.scm.
This package implements the mini-batch k-means algorithm for large datasets, including support for on-disk data representation.
This package implements exact and approximate methods for nearest neighbor detection, in a framework that allows them to be easily switched within Bioconductor packages or workflows. The exact algorithm is implemented using pre-clustering with the k-means algorithm. Functions are also provided to search for all neighbors within a given distance. Parallelization is achieved for all methods using the BiocParallel framework.
This is a package for normalization, testing for differential variability and differential methylation and gene set testing for data from Illumina's Infinium HumanMethylation arrays. The normalization procedure is subset-quantile within-array normalization (SWAN), which allows Infinium I and II type probes on a single array to be normalized together. The test for differential variability is based on an empirical Bayes version of Levene's test. Differential methylation testing is performed using RUV, which can adjust for systematic errors of unknown origin in high-dimensional data by using negative control probes. Gene ontology analysis is performed by taking into account the number of probes per gene on the array, as well as taking into account multi-gene associated probes.
Analyze and visualize Mutation Annotation Format (MAF) files from large scale sequencing studies. This package provides various functions to perform most commonly used analyses in cancer genomics and to create feature rich customizable visualzations with minimal effort.
Logistic Factor Analysis (LFA) is a method for a PCA analogue on Binomial data via estimation of latent structure in the natural parameter.
This package provides tools for processing short read data from ChIPseq experiments.
The package contains functions to infer and visualize cell cycle process using Single-cell RNA-Seq data. It exploits the idea of transfer learning, projecting new data to the previous learned biologically interpretable space. The tricycle provides a pre-learned cell cycle space, which could be used to infer cell cycle time of human and mouse single cell samples. In addition, it also offer functions to visualize cell cycle time on different embeddings and functions to build new reference.
This package provides S4 generic functions needed by Bioconductor proteomics packages.
This package wires together large collections of single-cell RNA-seq datasets, which allows for both the identification of recurrent cell clusters and the propagation of information between datasets in multi-sample or atlas-scale collections. Conos focuses on the uniform mapping of homologous cell types across heterogeneous sample collections. For instance, users could investigate a collection of dozens of peripheral blood samples from cancer patients combined with dozens of controls, which perhaps includes samples of a related tissue such as lymph nodes.
R-msigdb provides the Molecular Signatures Database in a R accessible objects. Signatures are stored in GeneSet class objects form the GSEABase package and the entire database is stored in a GeneSetCollection object. These data are then hosted on the ExperimentHub. Data used in this package was obtained from the MSigDB of the Broad Institute. Metadata for each gene set is stored along with the gene set in the GeneSet class object.
This package uses segmented copy number data to estimate tumor cell percentage and produce copy number plots displaying absolute copy numbers. For this it uses segmented data from the QDNAseq package, which in turn uses a number of dependencies to turn mapped reads into segmented data. ACE will run QDNAseq or use its output rds-file of segmented data. It will subsequently run through all samples in the object(s), for which it will create individual subdirectories. For each sample, it will calculate how well the segments fit (the relative error) to integer copy numbers for each percentage of tumor cells (cells with divergent segments).
This package offers tools to create DNA barcode sets capable of correcting insertion, deletion, and substitution errors. Existing barcodes can be analyzed regarding their minimal, maximal and average distances between barcodes. Finally, reads that start with a (possibly mutated) barcode can be demultiplexed, i.e. assigned to their original reference barcode.
This is a comprehensive package to automatically train and validate a multi-class SVM classifier based on gene expression data. It provides transparent selection of gene markers, their coexpression networks, and an interface to query the classifier.
This package provides a collection of reference expression datasets with curated cell type labels, for use in procedures like automated annotation of single-cell data or deconvolution of bulk RNA-seq.
TreeSummarizedExperiment extends SingleCellExperiment to include hierarchical information on the rows or columns of the rectangular data.
This package provides an annotation database of Homo sapiens genome data. It is derived from the UCSC hg19 genome and based on the "knownGene" track. The database is exposed as a TxDb object.
The sparse nature of single cell epigenomics data can be overruled using probabilistic modelling methods such as Latent Dirichlet Allocation (LDA). This package allows the probabilistic modelling of cis-regulatory topics (cisTopics) from single cell epigenomics data, and includes functionalities to identify cell states based on the contribution of cisTopics and explore the nature and regulatory proteins driving them.
DSS is an R library performing differential analysis for count-based sequencing data. It detects differentially expressed genes (DEGs) from RNA-seq, and differentially methylated loci or regions (DML/DMRs) from bisulfite sequencing (BS-seq). The core of DSS is a dispersion shrinkage method for estimating the dispersion parameter from Gamma-Poisson or Beta-Binomial distributions.
Independent hypothesis weighting (IHW) is a multiple testing procedure that increases power compared to the method of Benjamini and Hochberg by assigning data-driven weights to each hypothesis. The input to IHW is a two-column table of p-values and covariates. The covariate can be any continuous-valued or categorical variable that is thought to be informative on the statistical properties of each hypothesis test, while it is independent of the p-value under the null hypothesis.
This package provides memory efficient string containers, string matching algorithms, and other utilities, for fast manipulation of large biological sequences or sets of sequences.
The package contains 8 BAM files, 1 per sequencing run. Each BAM file was obtained by aligning the reads (paired-end) to the full hg19 genome with TopHat2, and then subsetting to keep only alignments on chr14. See accession number E-MTAB-1147 in the ArrayExpress database for details about the experiment, including links to the published study (by Zarnack et al., 2012) and to the FASTQ files.
This package provides per-exon and per-gene read counts computed for selected genes from RNA-seq data that were presented in the article 'Conservation of an RNA regulatory map between Drosophila and mammals' by Brooks et al., Genome Research 2011.
Rqc is an optimized tool designed for quality control and assessment of high-throughput sequencing data. It performs parallel processing of entire files and produces a report which contains a set of high-resolution graphics.
This package contains whole-genome single cell sequencing data for demonstration purposes in the AneuFinder package.