_            _    _        _         _
      /\ \         /\ \ /\ \     /\_\      / /\
      \_\ \       /  \ \\ \ \   / / /     / /  \
      /\__ \     / /\ \ \\ \ \_/ / /     / / /\ \__
     / /_ \ \   / / /\ \ \\ \___/ /     / / /\ \___\
    / / /\ \ \ / / /  \ \_\\ \ \_/      \ \ \ \/___/
   / / /  \/_// / /   / / / \ \ \        \ \ \
  / / /      / / /   / / /   \ \ \   _    \ \ \
 / / /      / / /___/ / /     \ \ \ /_/\__/ / /
/_/ /      / / /____\/ /       \ \_\\ \/___/ /
\_\/       \/_________/         \/_/ \_____\/
r-scoup 1.4.0
Propagated dependencies: r-matrix@1.7-3 r-biostrings@2.76.0
Channel: guix-bioc
Location: guix-bioc/packages/s.scm (guix-bioc packages s)
Home page: https://github.com/thsadiq/scoup
Licenses: GPL 2+
Synopsis: Simulate Codons with Darwinian Selection Modelled as an OU Process
Description:

An elaborate molecular evolutionary framework that facilitates straightforward simulation of codon genetic sequences subjected to different degrees and/or patterns of Darwinian selection. The model is built upon the fitness landscape paradigm of Sewall Wright, as popularised by the mutation-selection model of Halpern and Bruno. This enables realistic evolutionary process of living organisms to be reproducible seamlessly. For example, an Ornstein-Uhlenbeck fitness update algorithm is incorporated herein. Consequently, otherwise complex biological processes, such as the effect of the interplay between genetic drift and fitness landscape fluctuations on the inference of diversifying selection, may now be investigated with minimal effort. Frequency-dependent and stochastic fitness landscape update techniques are available.

r-mbess 4.9.3
Propagated dependencies: r-boot@1.3-31 r-lavaan@0.6-19 r-mass@7.3-65 r-mnormt@2.1.1 r-nlme@3.1-168 r-openmx@2.22.7 r-sem@3.1-16 r-semtools@0.5-7
Channel: guix
Location: gnu/packages/cran.scm (gnu packages cran)
Home page: https://www3.nd.edu/~kkelley/site/MBESS.html
Licenses: GPL 2 GPL 3
Synopsis: Methods for designing research studies
Description:

This package implements methods that are useful in designing research studies and analyzing data, with particular emphasis on methods that are developed for or used within the behavioral, educational, and social sciences (broadly defined). That being said, many of the methods implemented within MBESS are applicable to a wide variety of disciplines. MBESS has a suite of functions for a variety of related topics, such as effect sizes, confidence intervals for effect sizes (including standardized effect sizes and noncentral effect sizes), sample size planning (from the accuracy in parameter estimation (AIPE), power analytic, equivalence, and minimum-risk point estimation perspectives), mediation analysis, various properties of distributions, and a variety of utility functions.

rtl-sdr 2.0.1
Dependencies: libusb@1.0.25
Channel: guix
Location: gnu/packages/radio.scm (gnu packages radio)
Home page: https://osmocom.org/projects/sdr/wiki/rtl-sdr
Licenses: GPL 2+
Synopsis: Software defined radio driver for Realtek RTL2832U
Description:

DVB-T dongles based on the Realtek RTL2832U can be used as a cheap software defined radio, since the chip allows transferring the raw I/Q samples to the host. rtl-sdr provides drivers for this purpose.

The default Linux driver managing DVB-T dongles as TV devices doesn't work for SDR purposes and clashes with this package. Therefore you must prevent the kernel from loading it automatically by adding the following line to your system configuration:

(kernel-arguments '("modprobe.blacklist=dvb_usb_rtl28xxu"))

To install the rtl-sdr udev rules, you must extend 'udev-service-type' with this package. E.g.: (udev-rules-service 'rtl-sdr rtl-sdr)

r-skewr 1.42.0
Channel: guix-bioc
Location: guix-bioc/packages/s.scm (guix-bioc packages s)
Home page: https://bioconductor.org/packages/skewr
Licenses: GPL 2
Synopsis: Visualize Intensities Produced by Illumina's Human Methylation 450k BeadChip
Description:

The skewr package is a tool for visualizing the output of the Illumina Human Methylation 450k BeadChip to aid in quality control. It creates a panel of nine plots. Six of the plots represent the density of either the methylated intensity or the unmethylated intensity given by one of three subsets of the 485,577 total probes. These subsets include Type I-red, Type I-green, and Type II.The remaining three distributions give the density of the Beta-values for these same three subsets. Each of the nine plots optionally displays the distributions of the "rs" SNP probes and the probes associated with imprinted genes as series of tick marks located above the x-axis.

r-mosbi 1.16.0
Channel: guix-bioc
Location: guix-bioc/packages/m.scm (guix-bioc packages m)
Home page: https://bioconductor.org/packages/mosbi
Licenses: FSDG-compatible
Synopsis: Molecular Signature identification using Biclustering
Description:

This package is a implementation of biclustering ensemble method MoSBi (Molecular signature Identification from Biclustering). MoSBi provides standardized interfaces for biclustering results and can combine their results with a multi-algorithm ensemble approach to compute robust ensemble biclusters on molecular omics data. This is done by computing similarity networks of biclusters and filtering for overlaps using a custom error model. After that, the louvain modularity it used to extract bicluster communities from the similarity network, which can then be converted to ensemble biclusters. Additionally, MoSBi includes several network visualization methods to give an intuitive and scalable overview of the results. MoSBi comes with several biclustering algorithms, but can be easily extended to new biclustering algorithms.

r-exact 3.3
Propagated dependencies: r-rootsolve@1.8.2.4
Channel: guix
Location: gnu/packages/cran.scm (gnu packages cran)
Home page: https://cran.r-project.org/package=Exact
Licenses: GPL 2
Synopsis: Unconditional exact test
Description:

Performs unconditional exact tests and power calculations for 2x2 contingency tables. For comparing two independent proportions, performs Barnard's test (1945) using the original CSM test (Barnard (1947)), using Fisher's p-value referred to as Boschloo's test (1970), or using a Z-statistic (Suissa and Shuster (1985)). For comparing two binary proportions, performs unconditional exact test using McNemar's Z-statistic (Berger and Sidik (2003)), using McNemar's Z-statistic with continuity correction, or using CSM test. Calculates confidence intervals for the difference in proportion.

r-gcrma 2.80.0
Propagated dependencies: r-affy@1.86.0 r-affyio@1.78.0 r-biobase@2.68.0 r-biocmanager@1.30.25 r-biostrings@2.76.0 r-xvector@0.48.0
Channel: guix
Location: gnu/packages/bioconductor.scm (gnu packages bioconductor)
Home page: https://bioconductor.org/packages/gcrma/
Licenses: LGPL 2.1+
Synopsis: Background adjustment using sequence information
Description:

Gcrma adjusts for background intensities in Affymetrix array data which include optical noise and non-specific binding (NSB). The main function gcrma converts background adjusted probe intensities to expression measures using the same normalization and summarization methods as a Robust Multiarray Average (RMA). Gcrma uses probe sequence information to estimate probe affinity to NSB. The sequence information is summarized in a more complex way than the simple GC content. Instead, the base types (A, T, G or C) at each position along the probe determine the affinity of each probe. The parameters of the position-specific base contributions to the probe affinity is estimated in an NSB experiment in which only NSB but no gene-specific binding is expected.

r-cairo 1.6-2
Dependencies: cairo@1.18.4 harfbuzz@11.4.4 icu4c@73.1 libjpeg-turbo@2.1.4 libtiff@4.4.0 zlib@1.3.1
Channel: guix
Location: gnu/packages/cran.scm (gnu packages cran)
Home page: https://www.rforge.net/Cairo/
Licenses: GPL 2
Synopsis: R graphics device using Cairo graphics library
Description:

This package provides a Cairo graphics device that can be use to create high-quality vector (PDF, PostScript and SVG) and bitmap output (PNG, JPEG, TIFF), and high-quality rendering in displays (X11 and Win32). Since it uses the same back-end for all output, copying across formats is WYSIWYG. Files are created without the dependence on X11 or other external programs. This device supports alpha channel (semi-transparent drawing) and resulting images can contain transparent and semi-transparent regions. It is ideal for use in server environments (file output) and as a replacement for other devices that don't have Cairo's capabilities such as alpha support or anti-aliasing. Backends are modular such that any subset of backends is supported.

r-spamm 4.5.0
Dependencies: gsl@2.8
Propagated dependencies: r-backports@1.5.0 r-boot@1.3-31 r-crayon@1.5.3 r-geometry@0.5.2 r-gmp@0.7-5 r-mass@7.3-65 r-matrix@1.7-3 r-minqa@1.2.8 r-nlme@3.1-168 r-nloptr@2.2.1 r-numderiv@2016.8-1.1 r-pbapply@1.7-2 r-proxy@0.4-27 r-rcpp@1.0.14 r-rcppeigen@0.3.4.0.2 r-roi@1.0-1
Channel: guix
Location: gnu/packages/cran.scm (gnu packages cran)
Home page: https://www.r-project.org
Licenses: CeCILL
Synopsis: Mixed-Effect Models, with or without Spatial Random Effects
Description:

Inference based on models with or without spatially-correlated random effects, multivariate responses, or non-Gaussian random effects (e.g., Beta). Variation in residual variance (heteroscedasticity) can itself be represented by a mixed-effect model. Both classical geostatistical models (Rousset and Ferdy 2014 <doi:10.1111/ecog.00566>), and Markov random field models on irregular grids (as considered in the INLA package, <https://www.r-inla.org>), can be fitted, with distinct computational procedures exploiting the sparse matrix representations for the latter case and other autoregressive models. Laplace approximations are used for likelihood or restricted likelihood. Penalized quasi-likelihood and other variants discussed in the h-likelihood literature (Lee and Nelder 2001 <doi:10.1093/biomet/88.4.987>) are also implemented.

r-anota 1.56.0
Propagated dependencies: r-multtest@2.64.0 r-qvalue@2.40.0
Channel: guix
Location: gnu/packages/bioconductor.scm (gnu packages bioconductor)
Home page: https://bioconductor.org/packages/anota/
Licenses: GPL 3
Synopsis: Analysis of translational activity
Description:

Genome wide studies of translational control is emerging as a tool to study various biological conditions. The output from such analysis is both the mRNA level (e.g. cytosolic mRNA level) and the level of mRNA actively involved in translation (the actively translating mRNA level) for each mRNA. The standard analysis of such data strives towards identifying differential translational between two or more sample classes - i.e., differences in actively translated mRNA levels that are independent of underlying differences in cytosolic mRNA levels. This package allows for such analysis using partial variances and the random variance model. As 10s of thousands of mRNAs are analyzed in parallel the library performs a number of tests to assure that the data set is suitable for such analysis.

r-party 1.3-18
Propagated dependencies: r-coin@1.4-3 r-modeltools@0.2-24 r-mvtnorm@1.3-3 r-sandwich@3.1-1 r-strucchange@1.5-4 r-survival@3.8-3 r-zoo@1.8-14
Channel: guix
Location: gnu/packages/cran.scm (gnu packages cran)
Home page: https://party.r-forge.r-project.org
Licenses: GPL 2
Synopsis: Laboratory for recursive partitioning
Description:

This package provides a computational toolbox for recursive partitioning. The core of the package is ctree(), an implementation of conditional inference trees which embed tree-structured regression models into a well defined theory of conditional inference procedures. This non-parametric class of regression trees is applicable to all kinds of regression problems, including nominal, ordinal, numeric, censored as well as multivariate response variables and arbitrary measurement scales of the covariates. Based on conditional inference trees, cforest() provides an implementation of Breiman's random forests. The function mob() implements an algorithm for recursive partitioning based on parametric models (e.g. linear models, GLMs or survival regression) employing parameter instability tests for split selection. Extensible functionality for visualizing tree-structured regression models is available.

r-gdina 2.9.9
Propagated dependencies: r-alabama@2023.1.0 r-ggplot2@3.5.2 r-mass@7.3-65 r-nloptr@2.2.1 r-numderiv@2016.8-1.1 r-rcpp@1.0.14 r-rcpparmadillo@14.4.3-1 r-rsolnp@1.16 r-shiny@1.10.0 r-shinydashboard@0.7.3
Channel: guix
Location: gnu/packages/cran.scm (gnu packages cran)
Home page: https://github.com/Wenchao-Ma/GDINA
Licenses: GPL 3
Synopsis: Generalized DINA model framework
Description:

This package provides a set of psychometric tools for cognitive diagnosis modeling based on the generalized deterministic inputs, noisy and gate (G-DINA) model by de la Torre (2011) doi:10.1007/s11336-011-9207-7 and its extensions, including the sequential G-DINA model by Ma and de la Torre (2016) doi:10.1111/bmsp.12070 for polytomous responses, and the polytomous G-DINA model by Chen and de la Torre doi:10.1177/0146621613479818 for polytomous attributes. Joint attribute distribution can be independent, saturated, higher-order, loglinear smoothed or structured. Q-matrix validation, item and model fit statistics, model comparison at test and item level and differential item functioning can also be conducted. A graphical user interface is also provided.

r-vctrs 0.6.5
Propagated dependencies: r-cli@3.6.5 r-glue@1.8.0 r-lifecycle@1.0.4 r-rlang@1.1.6
Channel: guix
Location: gnu/packages/cran.scm (gnu packages cran)
Home page: https://github.com/r-lib/vctrs
Licenses: GPL 3
Synopsis: Vector helpers
Description:

There are three main goals to the vctrs package:

  1. To propose vec_size() and vec_type() as alternatives to length() and class(). These definitions are paired with a framework for type-coercion and size-recycling.

  2. To define type- and size-stability as desirable function properties, use them to analyse existing base function, and to propose better alternatives. This work has been particularly motivated by thinking about the ideal properties of c(), ifelse(), and rbind().

  3. To provide a new vctr base class that makes it easy to create new S3 vectors. vctrs provides methods for many base generics in terms of a few new vctrs generics, making implementation considerably simpler and more robust.

r-tenet 1.2.0
Channel: guix-bioc
Location: guix-bioc/packages/t.scm (guix-bioc packages t)
Home page: https://github.com/rhielab/TENET
Licenses: GPL 2
Synopsis: R package for TENET (Tracing regulatory Element Networks using Epigenetic Traits) to identify key transcription factors
Description:

TENET identifies key transcription factors (TFs) and regulatory elements (REs) linked to a specific cell type by finding significantly correlated differences in gene expression and RE DNA methylation between case and control input datasets, and identifying the top genes by number of significant RE DNA methylation site links. It also includes many tools for visualization and analysis of the results, including plots displaying and comparing methylation and expression data and methylation site link counts, survival analysis, TF motif searching in the vicinity of linked RE DNA methylation sites, custom TAD and peak overlap analysis, and UCSC Genome Browser track file generation. A utility function is also provided to download methylation, expression, and patient survival data from The Cancer Genome Atlas (TCGA) for use in TENET or other analyses.

r-mapfx 1.6.0
Channel: guix-bioc
Location: guix-bioc/packages/m.scm (guix-bioc packages m)
Home page: https://github.com/HsiaoChiLiao/MAPFX
Licenses: GPL 2
Synopsis: MAssively Parallel Flow cytometry Xplorer (MAPFX): A Toolbox for Analysing Data from the Massively-Parallel Cytometry Experiments
Description:

MAPFX is an end-to-end toolbox that pre-processes the raw data from MPC experiments (e.g., BioLegend's LEGENDScreen and BD Lyoplates assays), and further imputes the ‘missing’ infinity markers in the wells without those measurements. The pipeline starts by performing background correction on raw intensities to remove the noise from electronic baseline restoration and fluorescence compensation by adapting a normal-exponential convolution model. Unwanted technical variation, from sources such as well effects, is then removed using a log-normal model with plate, column, and row factors, after which infinity markers are imputed using the informative backbone markers as predictors. The completed dataset can then be used for clustering and other statistical analyses. Additionally, MAPFX can be used to normalise data from FFC assays as well.

r-powsc 1.18.0
Propagated dependencies: r-summarizedexperiment@1.38.1 r-singlecellexperiment@1.30.1 r-rcolorbrewer@1.1-3 r-pheatmap@1.0.12 r-mast@1.33.0 r-limma@3.64.1 r-ggplot2@3.5.2 r-biobase@2.68.0
Channel: guix-bioc
Location: guix-bioc/packages/p.scm (guix-bioc packages p)
Home page: https://bioconductor.org/packages/POWSC
Licenses: GPL 2
Synopsis: Simulation, power evaluation, and sample size recommendation for single cell RNA-seq
Description:

Determining the sample size for adequate power to detect statistical significance is a crucial step at the design stage for high-throughput experiments. Even though a number of methods and tools are available for sample size calculation for microarray and RNA-seq in the context of differential expression (DE), this topic in the field of single-cell RNA sequencing is understudied. Moreover, the unique data characteristics present in scRNA-seq such as sparsity and heterogeneity increase the challenge. We propose POWSC, a simulation-based method, to provide power evaluation and sample size recommendation for single-cell RNA sequencing DE analysis. POWSC consists of a data simulator that creates realistic expression data, and a power assessor that provides a comprehensive evaluation and visualization of the power and sample size relationship.

r-tfhaz 1.32.0
Propagated dependencies: r-s4vectors@0.46.0 r-orfik@1.30.1 r-iranges@2.42.0 r-genomicranges@1.60.0
Channel: guix-bioc
Location: guix-bioc/packages/t.scm (guix-bioc packages t)
Home page: https://bioconductor.org/packages/TFHAZ
Licenses: Artistic License 2.0
Synopsis: Transcription Factor High Accumulation Zones
Description:

It finds trascription factor (TF) high accumulation DNA zones, i.e., regions along the genome where there is a high presence of different transcription factors. Starting from a dataset containing the genomic positions of TF binding regions, for each base of the selected chromosome the accumulation of TFs is computed. Three different types of accumulation (TF, region and base accumulation) are available, together with the possibility of considering, in the single base accumulation computing, the TFs present not only in that single base, but also in its neighborhood, within a window of a given width. Two different methods for the search of TF high accumulation DNA zones, called "binding regions" and "overlaps", are available. In addition, some functions are provided in order to analyze, visualize and compare results obtained with different input parameters.

r-verso 1.20.0
Propagated dependencies: r-rfast@2.1.5.1 r-data-tree@1.1.0 r-ape@5.8-1
Channel: guix-bioc
Location: guix-bioc/packages/v.scm (guix-bioc packages v)
Home page: https://github.com/BIMIB-DISCo/VERSO
Licenses: FSDG-compatible
Synopsis: Viral Evolution ReconStructiOn (VERSO)
Description:

Mutations that rapidly accumulate in viral genomes during a pandemic can be used to track the evolution of the virus and, accordingly, unravel the viral infection network. To this extent, sequencing samples of the virus can be employed to estimate models from genomic epidemiology and may serve, for instance, to estimate the proportion of undetected infected people by uncovering cryptic transmissions, as well as to predict likely trends in the number of infected, hospitalized, dead and recovered people. VERSO is an algorithmic framework that processes variants profiles from viral samples to produce phylogenetic models of viral evolution. The approach solves a Boolean Matrix Factorization problem with phylogenetic constraints, by maximizing a log-likelihood function. VERSO includes two separate and subsequent steps; in this package we provide an R implementation of VERSO STEP 1.

r-qubic 1.38.0
Propagated dependencies: r-rcpparmadillo@14.4.3-1 r-rcpp@1.0.14 r-matrix@1.7-3 r-biclust@2.0.3.1
Channel: guix-bioc
Location: guix-bioc/packages/q.scm (guix-bioc packages q)
Home page: http://github.com/zy26/QUBIC
Licenses: FSDG-compatible
Synopsis: An R package for qualitative biclustering in support of gene co-expression analyses
Description:

The core function of this R package is to provide the implementation of the well-cited and well-reviewed QUBIC algorithm, aiming to deliver an effective and efficient biclustering capability. This package also includes the following related functions: (i) a qualitative representation of the input gene expression data, through a well-designed discretization way considering the underlying data property, which can be directly used in other biclustering programs; (ii) visualization of identified biclusters using heatmap in support of overall expression pattern analysis; (iii) bicluster-based co-expression network elucidation and visualization, where different correlation coefficient scores between a pair of genes are provided; and (iv) a generalize output format of biclusters and corresponding network can be freely downloaded so that a user can easily do following comprehensive functional enrichment analysis (e.g. DAVID) and advanced network visualization (e.g. Cytoscape).

r-omada 1.12.0
Channel: guix-bioc
Location: guix-bioc/packages/o.scm (guix-bioc packages o)
Home page: https://bioconductor.org/packages/omada
Licenses: GPL 3
Synopsis: Machine learning tools for automated transcriptome clustering analysis
Description:

Symptomatic heterogeneity in complex diseases reveals differences in molecular states that need to be investigated. However, selecting the numerous parameters of an exploratory clustering analysis in RNA profiling studies requires deep understanding of machine learning and extensive computational experimentation. Tools that assist with such decisions without prior field knowledge are nonexistent and further gene association analyses need to be performed independently. We have developed a suite of tools to automate these processes and make robust unsupervised clustering of transcriptomic data more accessible through automated machine learning based functions. The efficiency of each tool was tested with four datasets characterised by different expression signal strengths. Our toolkit’s decisions reflected the real number of stable partitions in datasets where the subgroups are discernible. Even in datasets with less clear biological distinctions, stable subgroups with different expression profiles and clinical associations were found.

r-prone 1.4.0
Channel: guix-bioc
Location: guix-bioc/packages/p.scm (guix-bioc packages p)
Home page: https://github.com/daisybio/PRONE
Licenses: GPL 3+
Synopsis: The PROteomics Normalization Evaluator
Description:

High-throughput omics data are often affected by systematic biases introduced throughout all the steps of a clinical study, from sample collection to quantification. Normalization methods aim to adjust for these biases to make the actual biological signal more prominent. However, selecting an appropriate normalization method is challenging due to the wide range of available approaches. Therefore, a comparative evaluation of unnormalized and normalized data is essential in identifying an appropriate normalization strategy for a specific data set. This R package provides different functions for preprocessing, normalizing, and evaluating different normalization approaches. Furthermore, normalization methods can be evaluated on downstream steps, such as differential expression analysis and statistical enrichment analysis. Spike-in data sets with known ground truth and real-world data sets of biological experiments acquired by either tandem mass tag (TMT) or label-free quantification (LFQ) can be analyzed.

r-decon 1.3-4
Channel: guix
Location: gnu/packages/cran.scm (gnu packages cran)
Home page: https://cran.r-project.org/web/packages/decon/
Licenses: GPL 3+
Synopsis: Deconvolution Estimation in Measurement Error Models
Description:

This package contains a collection of functions to deal with nonparametric measurement error problems using deconvolution kernel methods. We focus two measurement error models in the package: (1) an additive measurement error model, where the goal is to estimate the density or distribution function from contaminated data; (2) nonparametric regression model with errors-in-variables. The R functions allow the measurement errors to be either homoscedastic or heteroscedastic. To make the deconvolution estimators computationally more efficient in R, we adapt the "Fast Fourier Transform" (FFT) algorithm for density estimation with error-free data to the deconvolution kernel estimation. Several methods for the selection of the data-driven smoothing parameter are also provided in the package. See details in: Wang, X.F. and Wang, B. (2011). Deconvolution estimation in measurement error models: The R package decon. Journal of Statistical Software, 39(10), 1-24.

r-mitch 1.22.0
Channel: guix-bioc
Location: guix-bioc/packages/m.scm (guix-bioc packages m)
Home page: https://github.com/markziemann/mitch
Licenses: FSDG-compatible
Synopsis: Multi-Contrast Gene Set Enrichment Analysis
Description:

mitch is an R package for multi-contrast enrichment analysis. At it’s heart, it uses a rank-MANOVA based statistical approach to detect sets of genes that exhibit enrichment in the multidimensional space as compared to the background. The rank-MANOVA concept dates to work by Cox and Mann (https://doi.org/10.1186/1471-2105-13-S16-S12). mitch is useful for pathway analysis of profiling studies with one, two or more contrasts, or in studies with multiple omics profiling, for example proteomic, transcriptomic, epigenomic analysis of the same samples. mitch is perfectly suited for pathway level differential analysis of scRNA-seq data. We have an established routine for pathway enrichment of Infinium Methylation Array data (see vignette). The main strengths of mitch are that it can import datasets easily from many upstream tools and has advanced plotting features to visualise these enrichments.

r-sradb 1.72.0
Propagated dependencies: r-rsqlite@2.3.11 r-rcurl@1.98-1.17 r-r-utils@2.13.0 r-graph@1.86.0
Channel: guix-bioc
Location: guix-bioc/packages/s.scm (guix-bioc packages s)
Home page: https://bioconductor.org/packages/SRAdb
Licenses: Artistic License 2.0
Synopsis: compilation of metadata from NCBI SRA and tools
Description:

The Sequence Read Archive (SRA) is the largest public repository of sequencing data from the next generation of sequencing platforms including Roche 454 GS System, Illumina Genome Analyzer, Applied Biosystems SOLiD System, Helicos Heliscope, and others. However, finding data of interest can be challenging using current tools. SRAdb is an attempt to make access to the metadata associated with submission, study, sample, experiment and run much more feasible. This is accomplished by parsing all the NCBI SRA metadata into a SQLite database that can be stored and queried locally. Fulltext search in the package make querying metadata very flexible and powerful. fastq and sra files can be downloaded for doing alignment locally. Beside ftp protocol, the SRAdb has funcitons supporting fastp protocol (ascp from Aspera Connect) for faster downloading large data files over long distance. The SQLite database is updated regularly as new data is added to SRA and can be downloaded at will for the most up-to-date metadata.

Total results: 7783