Enter the query into the form above. You can look for specific version of a package by using @ symbol like this: gcc@10.
API method:
GET /api/packages?search=hello&page=1&limit=20
where search is your query, page is a page number and limit is a number of items on a single page. Pagination information (such as a number of pages and etc) is returned
in response headers.
If you'd like to join our channel webring send a patch to ~whereiseveryone/toys@lists.sr.ht adding your channel as an entry in channels.scm.
This is package for including transposable elements in differential enrichment analysis of sequencing datasets. TEtranscripts and TEcount take RNA-seq (and similar data) and annotates reads to both genes and transposable elements. TEtranscripts then performs differential analysis using DESeq2. Note that TEtranscripts and TEcount rely on specially curated GTF files, which are not included due to their size.
Kraken is a taxonomic sequence classifier that assigns taxonomic labels to DNA sequences. Kraken examines the k-mers within a query sequence and uses the information within those k-mers to query a database. That database maps k-mers to the lowest common ancestor (LCA) of all genomes known to contain a given k-mer.
Very fast parallel big-data BLAST XML file parser which can be used as command line utility. Use blastxmlparser to: Parse BLAST XML; filter output; generate FASTA, JSON, YAML, RDF, JSON-LD, HTML, CSV, tabular output etc.
This package can be used to normalize cytometry samples when a control sample is taken along in each of the batches. This is done by first identifying multiple clusters/cell types, learning the batch effects from the control samples and applying quantile normalization on all markers of interest.
BWA-PSSM is a probabilistic short genomic sequence read aligner based on the use of position specific scoring matrices (PSSM). Like many of the existing aligners it is fast and sensitive. Unlike most other aligners, however, it is also adaptible in the sense that one can direct the alignment based on known biases within the data set. It is coded as a modification of the original BWA alignment program and shares the genome index structure as well as many of the command line options.
This package provides several programs that perform operations on SAM/BAM files. All of these programs are built into a single executable called bam.
The alignment module of BioJava provides an API that contains
implementations of dynamic programming algorithms for sequence alignment;
reading and writing of popular alignment file formats;
a single-, or multi- threaded multiple sequence alignment algorithm.
This package provides bioinformatic tools to align, deduplicate, reformat, filter and normalize DNA and RNA-seq data. It includes the following tools: BBMap, a short read aligner for DNA and RNA-seq data; BBNorm, a kmer-based error-correction and normalization tool; Dedupe, a tool to simplify assemblies by removing duplicate or contained subsequences that share a target percent identity; Reformat, to convert reads between fasta/fastq/scarf/fasta+qual/sam, interleaved/paired, and ASCII-33/64, at over 500 MB/s; and BBDuk, a tool to filter, trim, or mask reads with kmer matches to an artifact/contaminant file.
This program searches for and removes remnant adapter sequences from High-Throughput Sequencing (HTS) data and (optionally) trims low quality bases from the 3' end of reads following adapter removal. AdapterRemoval can analyze both single end and paired end data, and can be used to merge overlapping paired-ended reads into (longer) consensus sequences. Additionally, the AdapterRemoval may be used to recover a consensus adapter sequence for paired-ended data, for which this information is not available.
This package provides a toolbox to process, analyze and visualize spatial single-cell expression data.
Arriba is a command-line tool for the detection of gene fusions from RNA-Seq data. It was developed for the use in a clinical research setting. Therefore, short runtimes and high sensitivity were important design criteria. It is based on the fast STAR aligner and the post-alignment runtime is typically just around two minutes. In contrast to many other fusion detection tools which build on STAR, Arriba does not require to reduce the alignIntronMax parameter of STAR to detect small deletions.
Change-O is a collection of tools for processing the output of V(D)J alignment tools, assigning clonal clusters to immunoglobulin (Ig) sequences, and reconstructing germline sequences.
Telomerecat is a tool for estimating the average telomere length (TL) for a paired end, whole genome sequencing (WGS) sample.
Telomerecat is adaptable, accurate and fast. The algorithm accounts for sequencing amplification artifacts, anneouploidy (common in cancer samples) and noise generated by WGS. For a high coverage WGS BAM file of around 100GB telomerecat can produce an estimate in ~1 hour.
PAIRADISE is a method for detecting allele-specific alternative splicing (ASAS) from RNA-seq data. Unlike conventional approaches that detect ASAS events one sample at a time, PAIRADISE aggregates ASAS signals across multiple individuals in a population. By treating the two alleles of an individual as paired, and multiple individuals sharing a heterozygous SNP as replicates, PAIRADISE formulates ASAS detection as a statistical problem for identifying differential alternative splicing from RNA-seq data with paired replicates.
This is an R package to query and expand DisGeNET data, and to visualize the results within R framework. The disgenet2r package is designed to retrieve data from DisGeNET v6.0 (Jan, 2019).
Scregseg (Single-Cell REGulatory landscape SEGmentation) is a tool that facilitates the analysis of single cell ATAC-seq data by an HMM-based segmentation algorithm. Scregseg uses an HMM with Dirichlet-Multinomial emission probabilities to segment the genome either according to distinct relative cross-cell accessibility profiles or (after collapsing the single-cell tracks to pseudo-bulk tracks) to capture distinct cross-cluster accessibility profiles.
This package provides a fast and accurate analysis toolkit for single cell ATAC-seq (Assay for transposase-accessible chromatin using sequencing). Single cell ATAC-seq can resolve the heterogeneity of a complex tissue and reveal cell-type specific regulatory landscapes. However, the exceeding data sparsity has posed unique challenges for the data analysis. This package r-snapatac is an end-to-end bioinformatics pipeline for analyzing large- scale single cell ATAC-seq data which includes quality control, normalization, clustering analysis, differential analysis, motif inference and exploration of single cell ATAC-seq sequencing data.
This package provides necessary tools for the analysis of the genomic interaction data stored in .cool format. This collection of tools includes operations like compartment, insulation or peak calling.
twobitreader is a Python library for reading .2bit files as used by the UCSC genome browser.
This package provides a collection of methods to extract gene programs from single-cell gene expression data using non-negative matrix factorization (NMF). GeneNMF contains functions to directly interact with the Seurat toolkit and derive interpretable gene program signatures.
Straw is library which allows rapid streaming of contact data from .hic files. This package provides Python bindings.
CellTypist is an automated cell type annotation tool for scRNA-seq datasets on the basis of logistic regression classifiers optimised by the stochastic gradient descent algorithm. CellTypist allows for cell prediction using either built-in (with a current focus on immune sub-populations) or custom models, in order to assist in the accurate classification of different cell types and subtypes.
python-cwlformat is a specification and a reference implementation for a very opinionated CWL code formatter. It outputs CWL in a standardized YAML format.
This package provides accelerated functions for the CIRI toolkit. It also provides the ccs executable to scan for circular consensus sequences.