Enter the query into the form above. You can look for specific version of a package by using @ symbol like this: gcc@10.
API method:
GET /api/packages?search=hello&page=1&limit=20
where search is your query, page is a page number and limit is a number of items on a single page. Pagination information (such as a number of pages and etc) is returned
in response headers.
If you'd like to join our channel search send a patch to ~whereiseveryone/toys@lists.sr.ht adding your channel as an entry in channels.scm.
Fxtract extracts sequences from a protein or nucleotide fastx (FASTA or FASTQ) file given a subsequence. It uses a simple substring search for basic tasks but can change to using POSIX regular expressions, PCRE, hash lookups or multi-pattern searching as required. By default fxtract looks in the sequence of each record but can also be told to look in the header, comment or quality sections.
PiGx is a collection of genomics pipelines. It includes the following pipelines:
PiGx BSseq for raw fastq read data of bisulfite experiments
PiGx RNAseq for RNAseq samples
PiGx scRNAseq for single cell dropseq analysis
PiGx ChIPseq for reads from ChIPseq experiments
All pipelines are easily configured with a simple sample sheet and a descriptive settings file. The result is a set of comprehensive, interactive HTML reports with interesting findings about your samples.
This package aims to produce high-quality genome browser tracks that are highly customizable. Currently, it is possible to plot: bigwig, bed (many options), bedgraph, links (represented as arcs), and Hi-C matrices. pyGenomeTracks can make plots with or without Hi-C data.
The khmer software is a set of command-line tools for working with DNA shotgun sequencing data from genomes, transcriptomes, metagenomes and single cells. Khmer can make de novo assemblies faster, and sometimes better. Khmer can also identify and fix problems with shotgun data.
This package has been developed under ROpenSci gudelines to integrate conventional and cutting edge cytometry analysis tools under a unified framework. It aims to represent an intuitive and interactive approach to analysing cytometry data in R.
Bio++ is a set of C++ libraries for Bioinformatics, including sequence analysis, phylogenetics, molecular evolution and population genetics. This library provides phylogenetics-related modules.
Chromap is a fast method for aligning and preprocessing high throughput chromatin profiles. Typical use cases include:
trimming sequencing adapters, mapping bulk ATAC-seq or ChIP-seq genomic reads to the human genome and removing duplicates;
trimming sequencing adapters, mapping single cell ATAC-seq genomic reads to the human genome, correcting barcodes, removing duplicates and performing Tn5 shift;
split alignment of Hi-C reads against a reference genome.
Fluff is a Python package that contains several scripts to produce pretty, publication-quality figures for next-generation sequencing experiments.
This is a package that lets you process UMI-4C data from scratch to produce nice plots.
Azimuth utilizes an annotated reference dataset. It automates the processing, analysis, and interpretation. This applies specifically to new single-cell RNA-seq or ATAC-seq experiments. Azimuth leverages a reference-based mapping pipeline that inputs accounts matrix and performs normalization, visualization, cell annotation, and differential expression.
This package provides a framework to process and analyze data from high-throughput sequencing (HTS) assays
This package provides the ASCAT R package that can be used to infer tumour purity, ploidy and allele-specific copy number profiles.
This package contains the Battenberg R package for subclonal copy number estimation, as described by Nik-Zainal et al.
Prodigal runs smoothly on finished genomes, draft genomes, and metagenomes, providing gene predictions in GFF3, Genbank, or Sequin table format. It runs quickly, in an unsupervised fashion, handles gaps, handles partial genes, and identifies translation initiation sites.
MOSAIK is a program for mapping second and third-generation sequencing reads to a reference genome. MOSAIK can align reads generated by all the major sequencing technologies, including Illumina, Applied Biosystems SOLiD, Roche 454, Ion Torrent and Pacific BioSciences SMRT.
Ngesh is a Python library and CLI tool for simulating phylogenetic trees and data. It is intended for benchmarking phylogenetic methods, especially in historical linguistics andstemmatology. The generation of stochastic phylogenetic trees also goes by the name simulationmethods for phylogenetic trees, synthetic data generation, or just phylogenetic tree simulation.
IMP's broad goal is to contribute to a comprehensive structural characterization of biomolecules ranging in size and complexity from small peptides to large macromolecular assemblies, by integrating data from diverse biochemical and biophysical experiments. IMP provides a C++ and Python toolbox for solving complex modeling problems, and a number of applications for tackling some common problems in a user-friendly way.
This package provides a computational toolkit in R for the integration, exploration, and analysis of high-dimensional single-cell cytometry and imaging data.
CENTIPEDE applies a hierarchical Bayesian mixture model to infer regions of the genome that are bound by particular transcription factors. It starts by identifying a set of candidate binding sites, and then aims to classify the sites according to whether each site is bound or not bound by a transcription factor. CENTIPEDE is an unsupervised learning algorithm that discriminates between two different types of motif instances using as much relevant information as possible.
This is the reference implementation of the CWL standards. The CWL open standards are for describing analysis workflows and tools in a way that makes them portable and scalable across a variety of software and hardware environments, from workstations to cluster, cloud, and high performance computing (HPC) environments. CWL is designed to meet the needs of data-intensive science, such as Bioinformatics, Medical Imaging, Astronomy, Physics, and Chemistry. The cwltool is intended to be feature complete and to provide comprehensive validation of CWL files as well as provide other tools related to working with CWL descriptions.
CodingQuarry is a highly accurate, self-training GHMM fungal gene predictor designed to work with assembled, aligned RNA-seq transcripts.
VCFtools is a program package designed for working with VCF files, such as those generated by the 1000 Genomes Project. The aim of VCFtools is to provide easily accessible methods for working with complex genetic variation data in the form of VCF files.
Biopython is a set of tools for biological computation including parsers for bioinformatics files into Python data structures; interfaces to common bioinformatics programs; a standard sequence class and tools for performing common operations on them; code to perform data classification; code for dealing with alignments; code making it easy to split up parallelizable tasks into separate processes; and more.
BLAST is a popular method of performing a DNA or protein sequence similarity search, using heuristics to produce results quickly. It also calculates an “expect value” that estimates how many matches would have occurred at a given score by chance, which can aid a user in judging how much confidence to have in an alignment.