Enter the query into the form above. You can look for specific version of a package by using @ symbol like this: gcc@10.
API method:
GET /api/packages?search=hello&page=1&limit=20
where search is your query, page is a page number and limit is a number of items on a single page. Pagination information (such as a number of pages and etc) is returned
in response headers.
If you'd like to join our channel webring send a patch to ~whereiseveryone/toys@lists.sr.ht adding your channel as an entry in channels.scm.
Mudata is a Python package for multi-omics data analysis. It is designed to provide functionality to load, process, and store multimodal omics data.
This package provides procedures for efficient pythonic random access to fasta subsequences.
FAN-C provides a pipeline for analysing Hi-C data starting at mapped paired-end sequencing reads.
muon is a multimodal omics Python framework.
PiGx BSseq is a data processing pipeline for raw fastq read data of bisulfite experiments; it produces reports on aggregate methylation and coverage and can be used to produce information on differential methylation and segmentation.
This package provides a tool for identifying and removing doublets in single-cell RNA-seq data.
Pyahocorasick is a fast, memory-efficient library for multi-pattern string search. This means that you can find multiple key strings occurrences at once in some input text.
BioJava is a project dedicated to providing a Java framework for processing biological data. It provides analytical and statistical routines, parsers for common file formats, reference implementations of popular algorithms, and allows the manipulation of sequences and 3D structures. The goal of the biojava project is to facilitate rapid application development for bioinformatics.
This package provides the core libraries.
The loom file format is an efficient format for very large omics datasets, consisting of a main matrix, optional additional layers, a variable number of row and column annotations. Loom also supports sparse graphs. This library makes it easy to work with .loom files for single-cell RNA-seq data.
BSeq-sc is a bioinformatics analysis pipeline that leverages single-cell sequencing data to estimate cell type proportion and cell type-specific gene expression differences from RNA-seq data from bulk tissue samples. This is a companion package to the publication "A single-cell transcriptomic map of the human and mouse pancreas reveals inter- and intra-cell population structure." Baron et al. Cell Systems (2016) https://www.ncbi.nlm.nih.gov/pubmed/27667365.
This package provides a C library for parsing local and remote BigWig files.
This package contains some tools for processing BAM files including:
bamsormadup: parallel sorting and duplicate marking
bamcollate2: reads BAM and writes BAM reordered such that alignment or collated by query name
bammarkduplicates: reads BAM and writes BAM with duplicate alignments marked using the BAM flags field
bammaskflags: reads BAM and writes BAM while masking (removing) bits from the flags column
bamrecompress: reads BAM and writes BAM with a defined compression setting. This tool is capable of multi-threading.
bamsort: reads BAM and writes BAM resorted by coordinates or query name
bamtofastq: reads BAM and writes FastQ; output can be collated or uncollated by query name.
This is a collection of utility functions for Seurat. These functions allow the automation and multiplexing of plotting, 3D plotting, visualization of statistics & QC, interaction with the Seurat object. Some functionalities require functions from CodeAndRoll and MarkdownReports libraries.
This package provides a GFF/GTF file parsing utility providing format conversions, region filtering, FASTA sequence extraction and more.
BBKNN is a batch effect removal tool that can be directly used in the Scanpy workflow. It serves as an alternative to scanpy.api.pp.neighbors(), with both functions creating a neighbour graph for subsequent use in clustering, pseudotime and UMAP visualisation. If technical artifacts are present in the data, they will make it challenging to link corresponding cell types across different batches. BBKNN actively combats this effect by splitting your data into batches and finding a smaller number of neighbours for each cell within each of the groups. This helps create connections between analogous cells in different batches without altering the counts or PCA space.
This package provides a client for the OmniPath web service and many other resources. It also includes functions to transform and pretty print some of the downloaded data, functions to access a number of other resources such as BioPlex, ConsensusPathDB, EVEX, Gene Ontology, Guide to Pharmacology (IUPHAR/BPS), Harmonizome, HTRIdb, Human Phenotype Ontology, InWeb InBioMap, KEGG Pathway, Pathway Commons, Ramilowski et al. 2015, RegNetwork, ReMap, TF census, TRRUST and Vinayagam et al. 2011. Furthermore, OmnipathR features a close integration with the NicheNet method for ligand activity prediction from transcriptomics data, and its R implementation nichenetr.
Newick-utils is a suite of utilities for processing phylogenetic trees in Newick format. Functions include re-rooting, extracting subtrees, trimming, pruning, condensing, drawing (ASCII graphics or SVG).
SeqAn is a C++ library of efficient algorithms and data structures for the analysis of sequences with the focus on biological data. It contains algorithms and data structures for string representation and their manipulation, online and indexed string search, efficient I/O of bioinformatics file formats, sequence alignment, and more.
Salmon is a program to produce highly-accurate, transcript-level quantification estimates from RNA-seq data. Salmon achieves its accuracy and speed via a number of different innovations, including the use of lightweight alignments (accurate but fast-to-compute proxies for traditional read alignments) and massively-parallel stochastic collapsed variational inference.
This package provides a library and collection of scripts to work with Illumina paired-end data (for CASAVA 1.8+).
Proteinortho is a tool to detect orthologous genes across different species. For doing so, it compares similarities of given gene sequences and clusters them to find significant groups. The algorithm was designed to handle large-scale data and can be applied to hundreds of species at once.
Picard is a set of Java command line tools for manipulating high-throughput sequencing (HTS) data and formats. Picard is implemented using the HTSJDK Java library to support accessing file formats that are commonly used for high-throughput sequencing data such as SAM, BAM, CRAM and VCF.
This package provides the kentUtils, a selection of bioinformatics utilities used in combination with the UCSC genome browser.
EMBOSS is the "European Molecular Biology Open Software Suite". EMBOSS is an analysis package specially developed for the needs of the molecular biology (e.g. EMBnet) user community. The software automatically copes with data in a variety of formats and even allows transparent retrieval of sequence data from the web. It also provides a number of libraries for the development of software in the field of molecular biology. EMBOSS also integrates a range of currently available packages and tools for sequence analysis into a seamless whole.