Enter the query into the form above. You can look for specific version of a package by using @ symbol like this: gcc@10.
API method:
GET /api/packages?search=hello&page=1&limit=20
where search is your query, page is a page number and limit is a number of items on a single page. Pagination information (such as a number of pages and etc) is returned
in response headers.
If you'd like to join our channel webring send a patch to ~whereiseveryone/toys@lists.sr.ht adding your channel as an entry in channels.scm.
This package implements methods for batch correction and integration of scRNA-seq datasets, based on the Seurat anchor-based integration framework. In particular, STACAS is optimized for the integration of heterogeneous datasets with only limited overlap between cell sub-types (e.g. TIL sets of CD8 from tumor with CD8/CD4 T cells from lymphnode), for which the default Seurat alignment methods would tend to over-correct biological differences. The 2.0 version of the package allows the users to incorporate explicit information about cell-types in order to assist the integration process.
This package provides an object for plotting GRanges, RleList, UCSC file formats, and ffTrack objects in multi-track panels.
Ribotaper is a method for defining translated open reading frames (ORFs) using ribosome profiling (ribo-seq) data. This package provides the Ribotaper pipeline.
This is an R package for pre-processing of flow and mass cytometry data. This package includes panel editing or renaming for FCS files, bead-based normalization and debarcoding.
The Spliced Transcripts Alignment to a Reference (STAR) software is based on a previously undescribed RNA-seq alignment algorithm that uses sequential maximum mappable seed search in uncompressed suffix arrays followed by seed clustering and stitching procedure. In addition to unbiased de novo detection of canonical junctions, STAR can discover non-canonical splices and chimeric (fusion) transcripts, and is also capable of mapping full-length RNA sequences.
BioJava is a project dedicated to providing a Java framework for processing biological data. It provides analytical and statistical routines, parsers for common file formats, reference implementations of popular algorithms, and allows the manipulation of sequences and 3D structures. The goal of the biojava project is to facilitate rapid application development for bioinformatics.
This package provides the core libraries.
ravanan is a CWL implementation that is powered by GNU Guix and provides strong reproducibility guarantees. ravanan provides strong caching of intermediate results so the same step of a workflow is never run twice. ravanan captures logs from every step of the workflow for easy tracing back in case of job failures. ravanan currently runs on single machines and on slurm via its API.
Vembrane simultaneously filters variants based on any INFO or FORMAT field, CHROM, POS, ID, REF, ALT, QUAL, FILTER, and the annotation field ANN. When filtering based on ANN, annotation entries are filtered first. If no annotation entry remains, the entire variant is deleted.
JAMM is a peak finder for next generation sequencing datasets (ChIP-Seq, ATAC-Seq, DNase-Seq, etc.) that can integrate replicates and assign peak boundaries accurately. JAMM is applicable to both broad and narrow datasets.
This package provides data for the book "Computational Genomics with R".
Grassroots DICOM (GDCM) is an implementation of the DICOM standard designed to be open source so that researchers may access clinical data directly. GDCM includes a file format definition and a network communications protocol, both of which should be extended to provide a full set of tools for a researcher or small medical imaging vendor to interface with an existing medical database.
Isolator analyzes RNA-Seq experiments. Isolator has a particular focus on producing stable, consistent estimates. It implements a full hierarchical Bayesian model of an entire RNA-Seq experiment. It saves all the samples generated by the sampler, which can be processed to compute posterior probabilities for arbitrarily complex questions, far beyond the confines of pairwise tests. It aggressively corrects for technical effects, such as random priming bias, GC-bias, 3' bias, and fragmentation effects. Compared to other MCMC approaches, it is exceedingly efficient, though generally slower than modern maximum likelihood approaches.
MAGIC is an interactive tool to impute missing values in single-cell sequencing data and to restore the structure of the data. It also provides data pre-processing functionality such as dimensionality reduction and gene expression visualization.
Proteinortho is a tool to detect orthologous genes across different species. For doing so, it compares similarities of given gene sequences and clusters them to find significant groups. The algorithm was designed to handle large-scale data and can be applied to hundreds of species at once.
A tandem repeat in DNA is two or more adjacent, approximate copies of a pattern of nucleotides. Tandem Repeats Finder is a program to locate and display tandem repeats in DNA sequences. In order to use the program, the user submits a sequence in FASTA format. The output consists of two files: a repeat table file and an alignment file. Submitted sequences may be of arbitrary length. Repeats with pattern size in the range from 1 to 2000 bases are detected.
Pando leverages multi-modal single-cell measurements to infer gene regulatory networks using a flexible linear model-based framework. By modeling the relationship between TF-binding site pairs with the expression of target genes, Pando simultaneously infers gene modules and sets of regulatory regions for each transcription factor.
CLIPper is a tool to define peaks in CLIP-seq datasets.
This package is a client implementation of the GA4GH htsget protocol. It provides a simple and reliable way to retrieve genomic data from servers supporting the protocol.
Fastahack is a small application for indexing and extracting sequences and subsequences from FASTA files. The included library provides a FASTA reader and indexer that can be embedded into applications which would benefit from directly reading subsequences from FASTA files. The library automatically handles index file generation and use.
This package provides tools for handling BAM, SAM, Tabix, bgzf, CRAM, CSIv1, CSIv2 and FAI files.
Parabam is a tool for processing sequencing files in parallel. It uses Python's native multiprocessing framework to apply a user defined rule on an input file.
Circe is a Python package for inferring co-accessibility networks from single-cell ATAC-seq data, using skggm for the graphical lasso and python-scanpy for data processing.
This is a collection of functions created and/or curated to aid in the visualization and analysis of single-cell data using R.
This package performs a fast Wilcoxon rank sum test and auROC analysis.