Enter the query into the form above. You can look for specific version of a package by using @ symbol like this: gcc@10.
API method:
GET /api/packages?search=hello&page=1&limit=20
where search is your query, page is a page number and limit is a number of items on a single page. Pagination information (such as a number of pages and etc) is returned
in response headers.
If you'd like to join our channel webring send a patch to ~whereiseveryone/toys@lists.sr.ht adding your channel as an entry in channels.scm.
TSIS is used for detecting transcript isoform switches in time-series data. Transcript isoform switches occur when a pair of alternatively spliced isoforms reverse the order of their relative expression levels. TSIS characterizes the transcript switch by defining the isoform switch time-points for any pair of transcript isoforms within a gene. In addition, this tool describes the switch using five different features or metrics. Also it filters the results with user’s specifications and visualizes the results using different plots for the user to examine further details of the switches.
Ngesh is a Python library and CLI tool for simulating phylogenetic trees and data. It is intended for benchmarking phylogenetic methods, especially in historical linguistics andstemmatology. The generation of stochastic phylogenetic trees also goes by the name simulationmethods for phylogenetic trees, synthetic data generation, or just phylogenetic tree simulation.
SeuratWrappers is a collection of community-provided methods and extensions for Seurat, curated by the Satija Lab at NYGC. These methods comprise functionality not presently found in Seurat, and are able to be updated much more frequently.
The R package rareMETALS2 is an extension of the R package rareMETALS. It was designed to meta-analyze gene-level association tests for binary trait. While rareMETALS offers a near-complete solution for meta-analysis of gene-level tests for quantitative trait, it does not offer the optimal solution for binary trait. The package rareMETALS2 offers improved features for analyzing gene-level association tests in meta-analyses for binary trait.
PyEGA3 is a tool for viewing and downloading files from authorized EGA datasets. It uses the EGA data API and has several key features:
Files are transferred over secure https connections and received unencrypted, so no need for decryption after download.
Downloads resume from where they left off in the event that the connection is interrupted.
Supports file segmenting and parallelized download of segments, improving overall performance.
After download completes, file integrity is verified using checksums.
Implements the GA4GH-compliant htsget protocol for download of genomic ranges for data files with accompanying index files.
DoubletFinder identifies doublets by generating artificial doublets from existing scRNA-seq data and defining which real cells preferentially co-localize with artificial doublets in gene expression space. Other DoubletFinder package functions are used for fitting DoubletFinder to different scRNA-seq datasets. For example, ideal DoubletFinder performance in real-world contexts requires optimal pK selection and homotypic doublet proportion estimation. pK selection is achieved using pN-pK parameter sweeps and maxima identification in mean-variance-normalized bimodality coefficient distributions. Homotypic doublet proportion estimation is achieved by finding the sum of squared cell annotation frequencies.
BWA-PSSM is a probabilistic short genomic sequence read aligner based on the use of position specific scoring matrices (PSSM). Like many of the existing aligners it is fast and sensitive. Unlike most other aligners, however, it is also adaptible in the sense that one can direct the alignment based on known biases within the data set. It is coded as a modification of the original BWA alignment program and shares the genome index structure as well as many of the command line options.
The loom file format is an efficient format for very large omics datasets, consisting of a main matrix, optional additional layers, a variable number of row and column annotations. Loom also supports sparse graphs. This library makes it easy to work with .loom files for single-cell RNA-seq data.
This is a Python module for analyzing cell-hashing/nucleus-hashing data. It is the demultiplexing module of Pegasus, which is used by Cumulus in the demultiplexing step.
This is a collection of functions created and/or curated to aid in the visualization and analysis of single-cell data using R.
Entrez Direct (EDirect) is a method for accessing the National Center for Biotechnology Information's (NCBI) set of interconnected databases (publication, sequence, structure, gene, variation, expression, etc.) from a terminal. Functions take search terms from command-line arguments. Individual operations are combined to build multi-step queries. Record retrieval and formatting normally complete the process.
EDirect also provides an argument-driven function that simplifies the extraction of data from document summaries or other results that are returned in structured XML format. This can eliminate the need for writing custom software to answer ad hoc questions.
This is package for including transposable elements in differential enrichment analysis of sequencing datasets. TEtranscripts and TEcount take RNA-seq (and similar data) and annotates reads to both genes and transposable elements. TEtranscripts then performs differential analysis using DESeq2. Note that TEtranscripts and TEcount rely on specially curated GTF files, which are not included due to their size.
This package is a library to enable flexible and scalable operations on genomic interval dataframes in Python. Bioframe enables access to a rich set of dataframe operations. Working in Python enables rapid visualization and iteration of genomic analyses. The philosophy underlying bioframe is to enable flexible operations. Instead of creating a function for every possible use-case, we encourage users to compose functions to achieve their goals.
The HH-suite is a software package for sensitive protein sequence searching based on the pairwise alignment of hidden Markov models (HMMs).
python-gffutils is a Python package for working with and manipulating the GFF and GTF format files typically used for genomic annotations. The files are loaded into a SQLite database, allowing much more complex manipulation of hierarchical features (e.g., genes, transcripts, and exons) than is possible with plain-text methods alone.
This package provides an implementation of chunked, compressed, N-dimensional arrays for R, Zarr specification version 2 (2024) <doi:10.5281/zenodo.11320255>.
This package provides necessary tools for the analysis of the genomic interaction data stored in .cool format. This collection of tools includes operations like compartment, insulation or peak calling.
This library implements an efficient loopless multiset combination generation algorithm which is (approximately) described in "Loopless algorithms for generating permutations, combinations, and other combinatorial configurations.", G. Ehrlich - Journal of the ACM (JACM), 1973. (Algorithm 7.)
This package provides a deconvolution based on Single Nucleotide Position (SNP) for multiplexed scRNA-seq data. The name vireo stand for Variational Inference for Reconstructing Ensemble Origin by expressed SNPs in multiplexed scRNA-seq data and follows the clone identification from single-cell data named cardelino.
This package provides a set of R functions to parse markdown and other generic helpers.
This package provides a collection of useful functions for working with DNA methylation micro-array data.
CGAT-core is a set of libraries and helper functions used to enable researchers to design and build computational workflows for the analysis of large-scale data-analysis.
PiGx SARS-CoV-2 is a pipeline for analysing data from sequenced wastewater samples and identifying given variants-of-concern of SARS-CoV-2. The pipeline can be used for continuous sampling. The output report will provide an intuitive visual overview about the development of variant abundance over time and location.
Miniasm is a very fast OLC-based de novo assembler for noisy long reads. It takes all-vs-all read self-mappings (typically by minimap) as input and outputs an assembly graph in the GFA format. Different from mainstream assemblers, miniasm does not have a consensus step. It simply concatenates pieces of read sequences to generate the final unitig sequences. Thus the per-base error rate is similar to the raw input reads.