Enter the query into the form above. You can look for specific version of a package by using @ symbol like this: gcc@10.
API method:
GET /api/packages?search=hello&page=1&limit=20
where search is your query, page is a page number and limit is a number of items on a single page. Pagination information (such as a number of pages and etc) is returned
in response headers.
If you'd like to join our channel webring send a patch to ~whereiseveryone/toys@lists.sr.ht adding your channel as an entry in channels.scm.
CNVkit is a Python library and command-line software toolkit to infer and visualize copy number from high-throughput DNA sequencing data. It is designed for use with hybrid capture, including both whole-exome and custom target panels, and short-read sequencing platforms such as Illumina and Ion Torrent.
Biosoup is a C++ collection of header-only data structures used for storage and logging in bioinformatics tools.
SnapTools can operate on snap files the following types of operations:
index the reference genome before alignment;
align reads to the corresponding reference genome;
pre-process by convert pair-end reads into fragments, checking the mapping quality score, alignment and filtration;
create the cell-by-bin matrix.
twobitreader is a Python library for reading .2bit files as used by the UCSC genome browser.
This package implements scalable gene regulatory network inference using tree-based ensemble regressors.
This package provides a simple web interface for the RNA-centric annotation system (RCAS).
This package is a collection of Perl, Python, and R scripts for manipulating 3C/4C/5C/Hi-C data.
This package addresses the challenge of handling large amounts of data that are now routinely generated from DNA sequencing centers. deepTools contains useful modules to process the mapped reads data for multiple quality checks, creating normalized coverage files in standard bedGraph and bigWig file formats, that allow comparison between different files. Finally, using such normalized and standardized files, deepTools can create many publication-ready visualizations to identify enrichments and for functional annotations of the genome.
PyEGA3 is a tool for viewing and downloading files from authorized EGA datasets. It uses the EGA data API and has several key features:
Files are transferred over secure https connections and received unencrypted, so no need for decryption after download.
Downloads resume from where they left off in the event that the connection is interrupted.
Supports file segmenting and parallelized download of segments, improving overall performance.
After download completes, file integrity is verified using checksums.
Implements the GA4GH-compliant htsget protocol for download of genomic ranges for data files with accompanying index files.
The khmer software is a set of command-line tools for working with DNA shotgun sequencing data from genomes, transcriptomes, metagenomes and single cells. Khmer can make de novo assemblies faster, and sometimes better. Khmer can also identify and fix problems with shotgun data.
This is an R package providing additional capabilities and speed for GenomicRanges operations.
This package is a client implementation of the GA4GH htsget protocol. It provides a simple and reliable way to retrieve genomic data from servers supporting the protocol.
This package builds on Seurat's Doheatmap function code to produce a heatmap from a Seurat object with multiple annotation bars.
Mosaicatcher counts Strand-seq reads and classifies strand states of each chromosome in each cell using a Hidden Markov Model.
Pando leverages multi-modal single-cell measurements to infer gene regulatory networks using a flexible linear model-based framework. By modeling the relationship between TF-binding site pairs with the expression of target genes, Pando simultaneously infers gene modules and sets of regulatory regions for each transcription factor.
This package generates a Miami plot with centered chromosome labels. The output is a ggplot2 object. Users can specify which data they want plotted on top vs. bottom, whether to display significance line(s), what colors to give chromosomes, and what points to label.
This package stores motif collections as lists of position frequency matrix (PWMatrixList) objects provided by the TFBSTools package for use in R with packages like motifmatchr or chromVAR.
Bloom-filter-based error correction solution for high-throughput sequencing reads (BLESS) uses a single minimum-sized bloom filter is a correction tool for genomic reads produced by Next-generation sequencing (NGS). BLESS produces accurate correction results with much less memory compared with previous solutions and is also able to tolerate a higher false-positive rate. BLESS can extend reads like DNA assemblers to correct errors at the end of reads.
Cooler is a support library for a sparse, compressed, binary persistent storage format, called cool, used to store genomic interaction data, such as Hi-C contact matrices.
PLINK is a whole genome association analysis toolset, designed to perform a range of basic, large-scale analyses in a computationally efficient manner. The focus of PLINK is purely on analysis of genotype/phenotype data, so there is no support for steps prior to this (e.g. study design and planning, generating genotype or CNV calls from raw data). Through integration with gPLINK and Haploview, there is some support for the subsequent visualization, annotation and storage of results.
This is a C++ wrapper around the Tabix project which abstracts some of the details of opening and jumping in tabix-indexed files.
This package provides data for the book "Computational Genomics with R".
HTSlib is a C library for reading/writing high-throughput sequencing data. It also provides the bgzip, htsfile, and tabix utilities.
Flexbar preprocesses high-throughput nucleotide sequencing data efficiently. It demultiplexes barcoded runs and removes adapter sequences. Moreover, trimming and filtering features are provided. Flexbar increases read mapping rates and improves genome and transcriptome assemblies. It supports next-generation sequencing data in fasta/q and csfasta/q format from Illumina, Roche 454, and the SOLiD platform.