Enter the query into the form above. You can look for specific version of a package by using @ symbol like this: gcc@10.
API method:
GET /api/packages?search=hello&page=1&limit=20
where search is your query, page is a page number and limit is a number of items on a single page. Pagination information (such as a number of pages and etc) is returned
in response headers.
If you'd like to join our channel webring send a patch to ~whereiseveryone/toys@lists.sr.ht adding your channel as an entry in channels.scm.
This package provides Shiny apps for interactive exploration of single-cell data.
SeqAn is a C++ library of efficient algorithms and data structures for the analysis of sequences with the focus on biological data. It contains algorithms and data structures for string representation and their manipulation, online and indexed string search, efficient I/O of bioinformatics file formats, sequence alignment, and more.
SQUID is Sean Eddy's personal library of C functions and utility programs for sequence analysis.
This package is an integrated pipeline for large-scale phylogenetic profiling of genomes and metagenomes. PhyloPhlAn is an accurate, rapid, and easy-to-use method for large-scale microbial genome characterization and phylogenetic analysis at multiple levels of resolution. This software package can assign both genomes and MAGs to SGBs. PhyloPhlAn can reconstruct strain-level phylogenies using clade- specific maximally informative phylogenetic markers, and can also scale to very large phylogenies comprising >17,000 microbial species.
Mudskipper is a tool for projecting genomic alignments to transcriptomic coordinates.
A tandem repeat in DNA is two or more adjacent, approximate copies of a pattern of nucleotides. Tandem Repeats Finder is a program to locate and display tandem repeats in DNA sequences. In order to use the program, the user submits a sequence in FASTA format. The output consists of two files: a repeat table file and an alignment file. Submitted sequences may be of arbitrary length. Repeats with pattern size in the range from 1 to 2000 bases are detected.
This is a simple "libraryfication" of the GFF/GTF parsing code that is used in the Cufflinks codebase. The goal of this library is to provide this functionality without the necessity of drawing in a heavy-weight dependency like SeqAn.
Pegasusio is a Python package for reading or writing single-cell genomics data.
Exonerate is a generic tool for pairwise sequence comparison. It allows the alignment of sequences using a many alignment models, either exhaustive dynamic programming or a variety of heuristics.
This is a package for fast Non-negative Matrix Factorization (NMF) with automatic rank-determination for dimension reduction of single-cell data using Seurat, RcppML nmf, SingleCellExperiments and similar.
This package provides the ASCAT R package that can be used to infer tumour purity, ploidy and allele-specific copy number profiles.
This package allows building the hierarchy of domains starting from Hi-C data. Each hierarchical level is identified by a minimum value of physical insulation between neighboring domains.
This package provides a package that makes it easy to implement sankey, alluvial and sankey bump plots in ggplot2.
Bowtie 2 is a fast and memory-efficient tool for aligning sequencing reads to long reference sequences. It is particularly good at aligning reads of about 50 up to 100s or 1,000s of characters, and particularly good at aligning to relatively long (e.g. mammalian) genomes. Bowtie 2 indexes the genome with an FM Index to keep its memory footprint small: for the human genome, its memory footprint is typically around 3.2 GB. Bowtie 2 supports gapped, local, and paired-end alignment modes.
Skewer implements the bit-masked k-difference matching algorithm dedicated to the task of adapter trimming and it is specially designed for processing next-generation sequencing (NGS) paired-end sequences.
Ngesh is a Python library and CLI tool for simulating phylogenetic trees and data. It is intended for benchmarking phylogenetic methods, especially in historical linguistics andstemmatology. The generation of stochastic phylogenetic trees also goes by the name simulationmethods for phylogenetic trees, synthetic data generation, or just phylogenetic tree simulation.
This package provides a robust, parallelized Python CLI for annotating three prime UTR.
Bioinformaticians often have to convert sequence files between formats and do little manipulations on them, and it's not worth writing scripts for that. Seqmagick is a utility to expose the file format conversion in BioPython in a convenient way. Instead of having a big mess of scripts, there is one that takes arguments.
This package stores motif collections as lists of position frequency matrix (PWMatrixList) objects provided by the TFBSTools package for use in R with packages like motifmatchr or chromVAR.
CellBender is a software package for eliminating technical artifacts from high-throughput single-cell RNA sequencing (scRNA-seq) data.
Parabam is a tool for processing sequencing files in parallel. It uses Python's native multiprocessing framework to apply a user defined rule on an input file.
LibSBML is a library to help you read, write, manipulate, translate, and validate SBML files and data streams. The Systems Biology Markup Language (SBML) is an interchange format for computer models of biological processes. SBML is useful for models of metabolism, cell signaling, and more. It continues to be evolved and expanded by an international community.
This package detects naive associations between omics features and metadata in cross-sectional data-sets using non-parametric tests. In a second step, confounding effects between metadata associated to the same omics feature are detected and labeled using nested post-hoc model comparison tests. The generated output can be graphically summarized using the built-in plotting function.
LoFreq is a fast and sensitive variant-caller for inferring SNVs and indels from next-generation sequencing data. It makes full use of base-call qualities and other sources of errors inherent in sequencing (e.g. mapping or base/indel alignment uncertainty), which are usually ignored by other methods or only used for filtering.