Enter the query into the form above. You can look for specific version of a package by using @ symbol like this: gcc@10.
API method:
GET /api/packages?search=hello&page=1&limit=20
where search is your query, page is a page number and limit is a number of items on a single page. Pagination information (such as a number of pages and etc) is returned
in response headers.
If you'd like to join our channel search send a patch to ~whereiseveryone/toys@lists.sr.ht adding your channel as an entry in channels.scm.
SCENIC (Single-cell regulatory network inference and clustering) is an R package to infer Gene Regulatory Networks and cell types from single-cell RNA-seq data.
The Maxprobes package collects cross-reactive probes of Illumina methylation array 450K and EPIC/850K.
This is a set of R functions that allows you to generate precise figures. This tool will create clean markdown reports about what you just discovered.
This is an R package that integrates the installation of doublet-detection methods. In addition, this tool is used for execution and benchmark of those eight mentioned methods.
t-Stochastic Neighborhood Embedding (t-SNE) is a highly successful method for dimensionality reduction and visualization of high dimensional datasets. A popular implementation of t-SNE uses the Barnes-Hut algorithm to approximate the gradient at each iteration of gradient descent. This package is a Cython wrapper for FIt-SNE.
DoubletFinder identifies doublets by generating artificial doublets from existing scRNA-seq data and defining which real cells preferentially co-localize with artificial doublets in gene expression space. Other DoubletFinder package functions are used for fitting DoubletFinder to different scRNA-seq datasets. For example, ideal DoubletFinder performance in real-world contexts requires optimal pK selection and homotypic doublet proportion estimation. pK selection is achieved using pN-pK parameter sweeps and maxima identification in mean-variance-normalized bimodality coefficient distributions. Homotypic doublet proportion estimation is achieved by finding the sum of squared cell annotation frequencies.
PAIRADISE is a method for detecting allele-specific alternative splicing (ASAS) from RNA-seq data. Unlike conventional approaches that detect ASAS events one sample at a time, PAIRADISE aggregates ASAS signals across multiple individuals in a population. By treating the two alleles of an individual as paired, and multiple individuals sharing a heterozygous SNP as replicates, PAIRADISE formulates ASAS detection as a statistical problem for identifying differential alternative splicing from RNA-seq data with paired replicates.
MUSIC is an algorithm for identification of enriched regions at multiple scales in the read depth signals from ChIP-Seq experiments.
This package provides a GFF/GTF file parsing utility providing format conversions, region filtering, FASTA sequence extraction and more.
The Filtlong package is a tool for filtering long reads by quality. It can take a set of long reads and produce a smaller, better subset. It uses both read length (longer is better) and read identity (higher is better) when choosing which reads pass the filter.
SQUID is Sean Eddy's personal library of C functions and utility programs for sequence analysis.
The ccwl is a concise syntax to express CWL workflows. ccwl is a compiler to generate CWL workflows from concise descriptions in ccwl. It is implemented as an EDSL in the Scheme programming language.
Bio::Kseq provides ruby bindings to the kseq.h FASTA and FASTQ parsing code. It provides a fast iterator over sequences and their quality scores.
Parabam is a tool for processing sequencing files in parallel. It uses Python's native multiprocessing framework to apply a user defined rule on an input file.
This package provides different statistical methods to extract biological activities from omics data within a unified framework.
This package provides a Python client for BioThings API services.
dRep is a Python program for rapidly comparing large numbers of genomes. dRep can also "de-replicate" a genome set by identifying groups of highly similar genomes and choosing the best representative genome for each genome set.
Bandage is a program for visualising de novo assembly graphs. It allows users to interact with the assembly graphs made by de novo assemblers such as Velvet, SPAdes, MEGAHIT and others. De novo assembly graphs contain not only assembled contigs but also the connections between those contigs, which were previously not easily accessible. Bandage visualises assembly graphs, with connections, using graph layout algorithms. Nodes in the drawn graph, which represent contigs, can be automatically labelled with their ID, length or depth. Users can interact with the graph by moving, labelling and colouring nodes. Sequence information can also be extracted directly from the graph viewer. By displaying connections between contigs, Bandage opens up new possibilities for analysing and improving de novo assemblies that are not possible by looking at contigs alone.
Circe is a Python package for inferring co-accessibility networks from single-cell ATAC-seq data, using skggm for the graphical lasso and python-scanpy for data processing.
Pyahocorasick is a fast, memory-efficient library for multi-pattern string search. This means that you can find multiple key strings occurrences at once in some input text.
Sickle is a tool that trims reads based on quality and length thresholds. It uses sliding windows to detect low-quality bases at the 3'-end and high-quality bases at the 5'-end. Additionally, it discards reads based on the length threshold.
python-scanrbp is a Python package that provides the scanRBP tool that loads RNA-protein binding motif PWM and computes the log-odds scores for all the loaded RBPs across a given genomic sequence and draws a heatmap of the scores.
Sylamer is a system for finding significantly over or under-represented words in sequences according to a sorted gene list. Typically it is used to find significant enrichment or depletion of microRNA or siRNA seed sequences from microarray expression data. Sylamer is extremely fast and can be applied to genome-wide datasets with ease. Results are plotted in terms of a significance landscape plot. These plots show significance profiles for each word studied across the sorted genelist.
A tandem repeat in DNA is two or more adjacent, approximate copies of a pattern of nucleotides. Tandem Repeats Finder is a program to locate and display tandem repeats in DNA sequences. In order to use the program, the user submits a sequence in FASTA format. The output consists of two files: a repeat table file and an alignment file. Submitted sequences may be of arbitrary length. Repeats with pattern size in the range from 1 to 2000 bases are detected.