This package provides functions for validating the structure and properties of data frames. Answers essential questions about a data set after initial import or modification. What are the unique or missing values? What columns form a primary key? What are the properties of the numeric or categorical columns? What kind of overlap or mapping exists between 2 columns?
Assess Water Quality Trends for Long-Term Monitoring Data in Estuaries using Generalized Additive Models following Wood (2017) <doi:10.1201/9781315370279> and Error Propagation with Mixed-Effects Meta-Analysis following Sera et al. (2019) <doi:10.1002/sim.8362>. Methods are available for model fitting, assessment of fit, annual and seasonal trend tests, and visualization of results.
DoRothEA
is a gene regulatory network containing signed transcription factor (TF) - target gene interactions. DoRothEA
regulons, the collection of a TF and its transcriptional targets, were curated and collected from different types of evidence for both human and mouse. A confidence level was assigned to each TF-target interaction based on the number of supporting evidence.
Visualization functions for spatial transcriptomics data. Includes functions to generate several types of plots, including spot plots, feature (molecule) plots, reduced dimension plots, spot-level quality control (QC) plots, and feature-level QC plots, for datasets from the 10x Genomics Visium and other technological platforms. Datasets are assumed to be in either SpatialExperiment
or SingleCellExperiment
format.
GEOfastq is used to download fastq files from the European Nucleotide Archive (ENA) starting with an accession from the Gene Expression Omnibus (GEO). To do this, sample metadata is retrieved from GEO and the Sequence Read Archive (SRA). SRA run accessions are then used to construct FTP and aspera download links for fastq files generated by the ENA.
This package implements R bindings to C++ code for analyzing single-cell (expression) data, mostly from various libscran libraries. Each function performs an individual step in the single-cell analysis workflow, ranging from quality control to clustering and marker detection. It is mostly intended for other Bioconductor package developers to build more user-friendly end-to-end workflows.
This package provides basic wavelet routines for time series (1D), image (2D) and array (3D) analysis. The code provided here is based on wavelet methodology developed in Percival and Walden (2000); Gencay, Selcuk and Whitcher (2001); the dual-tree complex wavelet transform (DTCWT) from Kingsbury (1999, 2001) as implemented by Selesnick; and Hilbert wavelet pairs (Selesnick 2001, 2002).
This package lets you build complex plots, heatmaps in particular, using natural semantics. Bigger plots can be assembled using directives such as LeftOf
, RightOf
, TopOf
, and Beneath
and more. Other features include clustering, dendrograms and integration with ggplot2 generated grid objects. This package is particularly designed for bioinformaticians to assemble complex plots for publication.
This package contains functions to generate pre-defined summary statistics from activPAL events files. The package also contains functions to produce informative graphics that visualize physical activity behaviour and trends. This includes generating graphs that align physical activity behaviour with additional time based observations described by other data sets, such as sleep diaries and continuous glucose monitoring data.
Automated Characterization of Health Information at Large-Scale Longitudinal Evidence Systems. Creates a descriptive statistics summary for an Observational Medical Outcomes Partnership Common Data Model standardized data source. This package includes functions for executing summary queries on the specified data source and exporting reporting content for use across a variety of Observational Health Data Sciences and Informatics community applications.
Currently, the package provides several functions for plotting and analyzing bibliometric data (JIF, Journal Impact Factor, and paper percentile values), beamplots with citations and percentiles, and three plot functions to visualize the result of a reference publication year spectroscopy (RPYS) analysis performed in the free software CRExplorer (see <http://crexplorer.net>). Further extension to more plot variants is planned.
The congeneric normal-ogive model is a popular model for psychometric data (McDonald
, R. P. (1997) <doi:10.1007/978-1-4757-2691-6_15>). This model estimates the model, calculates theoretical and concrete reliability coefficients, and predicts the latent variable of the model. This is the companion package to Moss (2020) <doi:10.31234/osf.io/nvg5d>.
Computes density function, cumulative distribution function, quantile function and random numbers for a multisection composite distribution specified by the user. Also fits the user specified distribution to a given data set. More details of the package can be found in the following paper submitted to the R journal Wiegand M and Nadarajah S (2017) CompDist
: Multisection composite distributions.
This package provides an extension to the purrr family of mapping functions to apply a function to each combination of elements in a list of inputs. Also includes functions for automatically detecting output type in mapping functions, finding every combination of elements of lists or rows of data frames, and applying multiple models to multiple subsets of a dataset.
Quantify and visualise various measures of chemical diversity and dissimilarity, for phytochemical compounds and other sets of chemical composition data. Importantly, these measures can incorporate biosynthetic and/or structural properties of the chemical compounds, resulting in a more comprehensive quantification of diversity and dissimilarity. For details, see Petrén, Köllner and Junker (2023) <doi:10.1111/nph.18685>.
This package performs sensitivity analysis for the sharp null and attributable effects in matched studies with continuous exposures and binary outcomes as described in Zhang, Small, Heng (2024) <arXiv:2401.06909>
. Two of the functions require installation of the Gurobi optimizer. Please see <https://www.gurobi.com/documentation/9.0/refman/ins_the_r_package.html> for guidance.
Data cleaning scripts typically contain a lot of if this change that type of statements. Such statements are typically condensed expert knowledge. With this package, such data modifying rules are taken out of the code and become in stead parameters to the work flow. This allows one to maintain, document, and reason about data modification rules as separate entities.
This package contains the discrete nonparametric survivor function estimation algorithm of De Gruttola and Lagakos for doubly interval-censored failure time data and the discrete nonparametric survivor function estimation algorithm of Sun for doubly interval-censored left-truncated failure time data [Victor De Gruttola & Stephen W. Lagakos (1989) <doi:10.2307/2532030>] [Jianguo Sun (1995) <doi:10.2307/2533008>].
This package provides functions and classes designed to handle and visualise epidemiological flows between locations. Also contains a statistical method for predicting disease spread from flow data initially described in Dorigatti et al. (2017) <doi:10.2807/1560-7917.ES.2017.22.28.30572>. This package is part of the RECON (<https://www.repidemicsconsortium.org/>) toolkit for outbreak analysis.
Implementation of fused Markov graphical model (FMGM; Park and Won, 2022). The functions include building mixed graphical model (MGM) objects from data, inference of networks using FMGM, stable edge-specific penalty selection (StEPS
) for the determination of penalization parameters, and the visualization. For details, please refer to Park and Won (2022) <doi:10.48550/arXiv.2208.14959>
.
This package provides functions for landmark prediction of a survival outcome incorporating covariate and short-term event information. For more information about landmark prediction please see: Parast, Layla, Su-Chun Cheng, and Tianxi Cai. Incorporating short-term outcome information to predict long-term survival with discrete markers. Biometrical Journal 53.2 (2011): 294-307, <doi:10.1002/bimj.201000150>.
We present a method based on filtering algorithms to estimate the parameters of linear, i.e. the coefficients and the variance of the error term. The proposed algorithms make use of Particle Filters following Ristic, B., Arulampalam, S., Gordon, N. (2004, ISBN: 158053631X) resampling methods. Parameters of logistic regression models are also estimated using an evolutionary particle filter method.
Supports morphological analysis for Japanese by using MeCab
<https://taku910.github.io/mecab/>, Sudachi <https://github.com/WorksApplications/Sudachi>
, Chamame <https://chamame.ninjal.ac.jp/>, or Ginza <https://github.com/megagonlabs/ginza>. Can input a data.frame and obtain all results of MeCab
and the row number of the original data.frame as a text id.
This package provides tools that facilitate ordinary differential equation (ODE) modeling in R'. This package allows one to perform simulations for ODE models that are encoded in the GNU MCSim model specification language (Bois, 2009) <doi:10.1093/bioinformatics/btp162> using ODE solvers from the R package deSolve
(Soetaert et al., 2010) <doi:10.18637/jss.v033.i09>.