Enter the query into the form above. You can look for specific version of a package by using @ symbol like this: gcc@10.
API method:
GET /api/packages?search=hello&page=1&limit=20
where search is your query, page is a page number and limit is a number of items on a single page. Pagination information (such as a number of pages and etc) is returned
in response headers.
If you'd like to join our channel webring send a patch to ~whereiseveryone/toys@lists.sr.ht adding your channel as an entry in channels.scm.
Cooler is a support library for a sparse, compressed, binary persistent storage format, called cool, used to store genomic interaction data, such as Hi-C contact matrices.
EMBOSS is the "European Molecular Biology Open Software Suite". EMBOSS is an analysis package specially developed for the needs of the molecular biology (e.g. EMBnet) user community. The software automatically copes with data in a variety of formats and even allows transparent retrieval of sequence data from the web. It also provides a number of libraries for the development of software in the field of molecular biology. EMBOSS also integrates a range of currently available packages and tools for sequence analysis into a seamless whole.
MAGIC is an interactive tool to impute missing values in single-cell sequencing data and to restore the structure of the data. It also provides data pre-processing functionality such as dimensionality reduction and gene expression visualization.
python-gffutils is a Python package for working with and manipulating the GFF and GTF format files typically used for genomic annotations. The files are loaded into a SQLite database, allowing much more complex manipulation of hierarchical features (e.g., genes, transcripts, and exons) than is possible with plain-text methods alone.
This package provides version 1.12 of the HTSlib C library for high-throughput sequence analysis. The package is primarily useful to developers of other R packages who wish to make use of HTSlib.
Salmon is a program to produce highly-accurate, transcript-level quantification estimates from RNA-seq data. Salmon achieves its accuracy and speed via a number of different innovations, including the use of lightweight alignments (accurate but fast-to-compute proxies for traditional read alignments) and massively-parallel stochastic collapsed variational inference.
This package adds 3D perspective plotting of points, paths, and line, 3D perspective axes, 3D perspective annotations, and wireframe plots.
PiGx is a collection of genomics pipelines. It includes the following pipelines:
PiGx BSseq for raw fastq read data of bisulfite experiments
PiGx RNAseq for RNAseq samples
PiGx scRNAseq for single cell dropseq analysis
PiGx ChIPseq for reads from ChIPseq experiments
All pipelines are easily configured with a simple sample sheet and a descriptive settings file. The result is a set of comprehensive, interactive HTML reports with interesting findings about your samples.
This program searches for and removes remnant adapter sequences from High-Throughput Sequencing (HTS) data and (optionally) trims low quality bases from the 3' end of reads following adapter removal. AdapterRemoval can analyze both single end and paired end data, and can be used to merge overlapping paired-ended reads into (longer) consensus sequences. Additionally, the AdapterRemoval may be used to recover a consensus adapter sequence for paired-ended data, for which this information is not available.
Bismark is a program to map bisulfite treated sequencing reads to a genome of interest and perform methylation calls in a single step. The output can be easily imported into a genome viewer, such as SeqMonk, and enables a researcher to analyse the methylation levels of their samples straight away. Its main features are:
Bisulfite mapping and methylation calling in one single step
Supports single-end and paired-end read alignments
Supports ungapped and gapped alignments
Alignment seed length, number of mismatches etc are adjustable
Output discriminates between cytosine methylation in CpG, CHG and CHH context
This package is analyzing TCR and BCR sequences using unselected RNA sequencing data, profiled from fluid and solid tissues, including tumors. TRUST4 performs de novo assembly on V, J, C genes including the hypervariable CDR3 and reports consensus contigs of BCR/TCR sequences. TRUST4 then realigns the contigs to IMGT reference gene sequences to identify the corresponding gene and CDR3 details. TRUST4 supports both single-end and paired-end bulk or single-cell sequencing data with any read length.
MACS is an implementation of a ChIP-Seq analysis algorithm for identifying transcript factor binding sites named Model-based Analysis of ChIP-Seq (MACS). MACS captures the influence of genome complexity to evaluate the significance of enriched ChIP regions and it improves the spatial resolution of binding sites through combining the information of both sequencing tag position and orientation.
The FASTX-Toolkit is a collection of command line tools for Short-Reads FASTA/FASTQ files preprocessing.
Next-Generation sequencing machines usually produce FASTA or FASTQ files, containing multiple short-reads sequences. The main processing of such FASTA/FASTQ files is mapping the sequences to reference genomes. However, it is sometimes more productive to preprocess the files before mapping the sequences to the genome---manipulating the sequences to produce better mapping results. The FASTX-Toolkit tools perform some of these preprocessing tasks.
NGS is a domain-specific API for accessing reads, alignments and pileups produced from Next Generation Sequencing. The API itself is independent from any particular back-end implementation, and supports use of multiple back-ends simultaneously.
The Maxprobes package collects cross-reactive probes of Illumina methylation array 450K and EPIC/850K.
This package contains some tools for processing BAM files including:
bamsormadup: parallel sorting and duplicate marking
bamcollate2: reads BAM and writes BAM reordered such that alignment or collated by query name
bammarkduplicates: reads BAM and writes BAM with duplicate alignments marked using the BAM flags field
bammaskflags: reads BAM and writes BAM while masking (removing) bits from the flags column
bamrecompress: reads BAM and writes BAM with a defined compression setting. This tool is capable of multi-threading.
bamsort: reads BAM and writes BAM resorted by coordinates or query name
bamtofastq: reads BAM and writes FastQ; output can be collated or uncollated by query name.
The R package rareMETALS2 is an extension of the R package rareMETALS. It was designed to meta-analyze gene-level association tests for binary trait. While rareMETALS offers a near-complete solution for meta-analysis of gene-level tests for quantitative trait, it does not offer the optimal solution for binary trait. The package rareMETALS2 offers improved features for analyzing gene-level association tests in meta-analyses for binary trait.
BEDOPS is a suite of tools to address common questions raised in genomic studies---mostly with regard to overlap and proximity relationships between data sets. It aims to be scalable and flexible, facilitating the efficient and accurate analysis and management of large-scale genomic data.
BEDOPS provides tools that perform highly efficient and scalable Boolean and other set operations, statistical calculations, archiving, conversion and other management of genomic data of arbitrary scale. Tasks can be easily split by chromosome for distributing whole-genome analyses across a computational cluster.
LibSBML is a library to help you read, write, manipulate, translate, and validate SBML files and data streams. The Systems Biology Markup Language (SBML) is an interchange format for computer models of biological processes. SBML is useful for models of metabolism, cell signaling, and more. It continues to be evolved and expanded by an international community.
Entrez Direct (EDirect) is a method for accessing the National Center for Biotechnology Information's (NCBI) set of interconnected databases (publication, sequence, structure, gene, variation, expression, etc.) from a terminal. Functions take search terms from command-line arguments. Individual operations are combined to build multi-step queries. Record retrieval and formatting normally complete the process.
EDirect also provides an argument-driven function that simplifies the extraction of data from document summaries or other results that are returned in structured XML format. This can eliminate the need for writing custom software to answer ad hoc questions.
This package addresses the challenge of handling large amounts of data that are now routinely generated from DNA sequencing centers. deepTools contains useful modules to process the mapped reads data for multiple quality checks, creating normalized coverage files in standard bedGraph and bigWig file formats, that allow comparison between different files. Finally, using such normalized and standardized files, deepTools can create many publication-ready visualizations to identify enrichments and for functional annotations of the genome.
This package provides a robust, parallelized Python CLI for annotating three prime UTR.
genomepy is designed to provide a simple and straightforward way to download and use genomic data. This includes
searching available data,
showing the available metadata,
automatically downloading, preprocessing and matching data, and
generating optional aligner indexes.
All with sensible, yet controllable defaults.
This is an R package that integrates the installation of doublet-detection methods. In addition, this tool is used for execution and benchmark of those eight mentioned methods.