This package implements a successive halving and hyperband optimization algorithm for the mlr3 ecosystem. The implementation in mlr3hyperband features improved scheduling and parallelizes the evaluation of configurations. The package includes tuners for hyperparameter optimization in mlr3tuning and optimizers for black-box optimization in bbotk.
Approximate false positive rate control in selection frequency for random forest using the methods described by Ender Konukoglu and Melanie Ganz (2014) <arXiv:1410.2838>
. Methods for calculating the selection frequency threshold at false positive rates and selection frequency false positive rate feature selection.
This package provides tools to support research on vowel covariation. Methods are provided to support Principal Component Analysis workflows (as in Brand et al. (2021) <doi:10.1016/j.wocn.2021.101096> and Wilson Black et al. (2023) <doi:10.1515/lingvan-2022-0086>).
Detect the number and locations of change points. The locations can be either exact or in terms of ranges, depending on the available computational resource. The method is based on Jie Ding, Yu Xiang, Lu Shen, Vahid Tarokh (2017) <doi:10.1109/TSP.2017.2711558>.
Provide users with a convenient way to access and analyze information on endangered plant species in Peru based on `Decreto Supremo N 043-2006-AG - Aprueban categorizacion de especies amenazadas de flora silvestre`<https://sinia.minam.gob.pe/normas/aprueban-categorizacion-especies-amenazadas-flora-silvestre>.
S4 class wrappers for the ODBC and Pool DBI connection, also provides some utilities to paste small datasets to clipboard, rename columns. It is used by the package stacomiR
for connections to the database. Development versions of stacomiR
are available in R-forge.
The Cancer Genome Atlas (TCGA) is a program aimed at improving our understanding of Cancer Biology. Several TCGA Datasets are available online. TCGAretriever helps accessing and downloading TCGA data hosted on cBioPortal
via its Web Interface (see <https://www.cbioportal.org/> for more information).
Obtain historical and near real time data related to stocks, index and currencies from the Yahoo Finance API. This package is community maintained and is not officially supported by Yahoo'. The accuracy of data is only as correct as provided on <https://finance.yahoo.com/>.
This package provides tools for managing SingleCellExperiment
objects as projects. Includes functions for analysis and visualization of single-cell data. Also included is a shiny app for visualization of pre-processed scRNA
data. Supported by NIH grants R01CA137124 and R01EY026661 to David Cobrinik.
PhantasusLite
– a lightweight package with helper functions of general interest extracted from phantasus package. In parituclar it simplifies working with public RNA-seq datasets from GEO by providing access to the remote HSDS repository with the precomputed gene counts from ARCHS4 and DEE2 projects.
This manual explains the C language for use with the GNU Compiler Collection (GCC) on the GNU/Linux system and other systems. We refer to this dialect as GNU C. If you already know C, you can use this as a reference manual.
This package provides statistical methods especially developed to analyze anthropometric data. These methods are aimed at providing effective solutions to some commons problems related to Ergonomics and Anthropometry. They are based on clustering, the statistical concept of data depth, statistical shape analysis and archetypal analysis.
This package defines S4 classes for single-cell genomic data and associated information, such as dimensionality reduction embeddings, nearest-neighbor graphs, and spatially-resolved coordinates. It provides data access methods and R-native hooks to ensure the Seurat object is familiar to other R users.
This package provides various features to streamline and enhance the styling of interactive reactable tables with easy-to-use and highly-customizable functions and themes. Apply conditional formatting to cells with data bars, color scales, color tiles, and icon sets. Utilize custom table themes inspired by popular websites such and bootstrap themes. Apply sparkline line & bar charts (note this feature requires the dataui package which can be downloaded from <https://github.com/timelyportfolio/dataui>). Increase the portability and reproducibility of reactable tables by embedding images from the web directly into cells. Save the final table output as a static image or interactive file.
This package provides a GUI with which the user can construct and interact with Bootstrap methods on Classical Biplots and with Clustering and/or Disjoint Biplot. This GUI is also aimed for estimate any numerical data matrix using the Clustering and Disjoint Principal component (CDPCA) methodology.
Copernicus Digital Elevation Model datasets (DEM) of 90 and 30 meters resolution using the awscli command line tool. The Copernicus (DEM) is included in the Registry of Open Data on AWS (Amazon Web Services) and represents the surface of the Earth including buildings, infrastructure and vegetation.
This package provides a wrapper for the Onboard Data building data API <https://api.onboarddata.io/swagger>. Along with streamlining access to the API, this package simplifies access to sensor time series data, metadata (sensors, equipment, and buildings), and details about the Onboard data model/ontology.
The separate p-values of SNPs, RNA expressions and DNA methylations are calculated by KM regression. The correlation between different omics data are taken into account. This method can be applied to either samples with all three types of omics data or samples with two types.
This package provides a supervised compression method that incorporates the response for reducing big data to a carefully selected subset. Please see Joseph and Mak (2021) <doi:10.1002/sam.11508>. This research is supported by a U.S. National Science Foundation (NSF) grant CMMI-1921646.
Save MultiAssayExperiments
into file artifacts, and load them back into memory. This is a more portable alternative to serialization of such objects into RDS files. Each artifact is associated with metadata for further interpretation; downstream applications can enrich this metadata with context-specific properties.
Genomic coordinates of problematic genomic regions that should be avoided when working with genomic data. GRanges of exclusion regions (formerly known as blacklisted), centromeres, telomeres, known heterochromatin regions, etc. (UCSC gap table data). Primarily for human and mouse genomes, hg19/hg38 and mm9/mm10 genome assemblies.
Builds platform design information packages. These consist of a SQLite database containing feature-level data such as x, y position on chip and featureSet
ID. The database also incorporates featureSet-level
annotation data. The products of this packages are used by the oligo pkg.
Leverage the existing open access TCGA data on Terra with well-established Bioconductor infrastructure. Make use of the Terra data model without learning its complexities. With a few functions, you can copy / download and generate a MultiAssayExperiment
from the TCGA example workspaces provided by Terra.
mlr3pipelines
enriches mlr3
with a diverse set of pipelining operators (PipeOps) that can be composed into graphs. Operations exist for data preprocessing, model fitting, and ensemble learning. Graphs can themselves be treated as mlr3
Learners and can therefore be resampled, benchmarked, and tuned.