Tests for Kaiser-Meyer-Olkin (KMO) and communalities in a dataset. It provides a final sample by removing variables iteratively while keeping track of the variables that were removed in each step. It follows the best practices and assumptions according to Hair, Black, Babin & Anderson (2018, ISBN:9781473756540).
Implementation of functions to assign corresponding common job prestige scores (SIOPS, ISEI), the official job or group title and the ISCO-88 code to given ISCO-08 codes. ISCO-08 is the latest version of the International Standard Classification of Occupations which is used to organise information on labour and jobs.
This package performs a sentiment analysis of textual contents in R. This implementation utilizes various existing dictionaries, such as Harvard IV, or finance-specific dictionaries. Furthermore, it can also create customized dictionaries. The latter uses LASSO regularization as a statistical approach to select relevant terms based on an exogenous response variable.
Curated human breast cancer tissue S4 ExpressionSet datasets from over 16 clinical trials comprising over 2,000 patients. All datasets contain at least one type of outcome variable and treatment information (minimum level: whether they had chemotherapy and whether they had hormonal therapy). Includes code to post-process these datasets.
The necessary external data to run the flowWorkspace and openCyto vignette is found in this package. This data package contains two flowJo workspaces, one diva xml workspace and the associated fcs files, as well as three GatingSets for testing the flowWorkspace, openCyto and CytoML packages.
This package provides a suite of functions to use with regression models, including summaries, residual plots, and factor comparisons. Used as part of the Model Fitting module of iNZight, a graphical user interface providing easy exploration and visualisation of data for students of statistics, available in both desktop and online versions.
Multi-precision library for storing and operating on arbitrarily big integers without loss of precision. It includes a large set of tools to work with them, such as arithmetic and logic operators, modular-arithmetic operators, computational number theory utilities, probabilistic primality tests, factorization algorithms, and random generators of different types of integers.
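As an illustration of the kind of probabilistic primality test mentioned above, here is a minimal Miller-Rabin sketch. It uses Python's built-in arbitrary-precision integers as a stand-in for the library's integer type; the function name `is_probable_prime` and the `rounds` parameter are hypothetical and are not taken from the library's API.

```python
import random

def is_probable_prime(n: int, rounds: int = 20) -> bool:
    """Miller-Rabin test: False means composite, True means probably prime."""
    if n < 2:
        return False
    # Trial division by small primes settles small n and obvious composites.
    for p in (2, 3, 5, 7, 11, 13, 17, 19, 23, 29):
        if n == p:
            return True
        if n % p == 0:
            return False
    # Write n - 1 as d * 2**s with d odd.
    d, s = n - 1, 0
    while d % 2 == 0:
        d //= 2
        s += 1
    for _ in range(rounds):
        a = random.randrange(2, n - 1)  # random base in [2, n - 2]
        x = pow(a, d, n)                # modular exponentiation
        if x in (1, n - 1):
            continue
        for _ in range(s - 1):
            x = pow(x, 2, n)
            if x == n - 1:
                break
        else:
            return False  # a is a witness, so n is certainly composite
    return True  # no witness found in any round
```

A composite number survives one round with probability at most 1/4, so the default of 20 rounds leaves an error probability of at most 4^-20. A prime is never rejected.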
This is a package providing tools to quantify and interpret multiple sources of biological and technical variation in gene expression experiments. It uses a linear mixed model to quantify variation in gene expression attributable to individual, tissue, time point, or technical variables. The package includes dream differential expression analysis for repeated measures.
This package provides utilities for dealing with distributions. Functionality includes sample skewness and kurtosis, log-histogram, tail plots, moments by integration, changing the point about which a moment is calculated, functions for testing distributions using inversion tests and the Massart inequality. Also included is an implementation of the incomplete Bessel K function.
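For orientation, the sample skewness and kurtosis mentioned above can be sketched from central moments. This is a minimal moment-based (uncorrected) version written in Python; the package itself may use different estimators or bias corrections, and the function name `sample_skew_kurtosis` is hypothetical.

```python
def sample_skew_kurtosis(xs):
    """Return (skewness, kurtosis) from the central moments of xs."""
    n = len(xs)
    mean = sum(xs) / n
    # k-th central moment, dividing by n (no small-sample correction).
    m2 = sum((x - mean) ** 2 for x in xs) / n
    m3 = sum((x - mean) ** 3 for x in xs) / n
    m4 = sum((x - mean) ** 4 for x in xs) / n
    skewness = m3 / m2 ** 1.5
    kurtosis = m4 / m2 ** 2  # not excess kurtosis; a normal sample is near 3
    return skewness, kurtosis
```

For the symmetric sample `[1, 2, 3, 4, 5]` the skewness is 0 and the kurtosis is 6.8 / 4 = 1.7.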
An implementation of efficiency first conformal prediction (EFCP) and validity first conformal prediction (VFCP) that demonstrates both validity (coverage guarantee) and efficiency (width guarantee). To learn how to use it, check the vignettes for a quick tutorial. The package is based on the work by Yang Y., Kuchibhotla A. (2021) <arxiv:2104.13871>.
This package provides a dashboard that supports the use of cromwell. Cromwell is a scientific workflow engine for command line users. This package utilizes cromwell REST APIs and provides these convenient functions: timing diagrams for running workflows, cromwell engine status, and a tabular workflow list. For more information about cromwell, visit <http://cromwell.readthedocs.io>.
Support for implicit expansion of arrays in operations involving arrays of mismatching sizes. This pattern is known as "broadcasting" in Python and "implicit expansion" in Matlab and is explained for example in the article "Array programming with NumPy" by C. R. Harris et al. (2020) <doi:10.1038/s41586-020-2649-2>.
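The broadcasting rule referred to above can be sketched as a small shape computation. This follows the NumPy convention (shapes aligned from the trailing dimension, size-1 dimensions stretched); the package may align dimensions differently, and `broadcast_shape` is a hypothetical helper, not part of its API.

```python
from itertools import zip_longest

def broadcast_shape(a, b):
    """Return the shape produced by broadcasting shapes a and b, NumPy style."""
    result = []
    # Walk both shapes from the last dimension; a missing dimension counts as 1.
    for x, y in zip_longest(reversed(a), reversed(b), fillvalue=1):
        if x == y or y == 1:
            result.append(x)
        elif x == 1:
            result.append(y)
        else:
            raise ValueError(f"shapes {a} and {b} are not compatible")
    return tuple(reversed(result))
```

For example, `broadcast_shape((3, 1), (1, 4))` gives `(3, 4)`, while `broadcast_shape((2, 3), (4, 3))` raises a `ValueError` because 2 and 4 differ and neither is 1.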
This package provides a tm Source to create corpora from articles exported from the Dow Jones Factiva content provider as XML or HTML files. It is able to read both text content and meta-data information (including source, date, title, author, subject, geographical coverage, company, industry, and various provider-specific fields).
This package defines an S4 class for storing data from spatial -omics experiments. The class extends SingleCellExperiment to support storage and retrieval of additional information from spot-based and molecule-based platforms, including spatial coordinates, images, and image metadata. A specialized constructor function is included for data from the 10x Genomics Visium platform.
RocBandwidthTest is designed to capture the performance characteristics of buffer copying and kernel read/write operations. The help screen of the benchmark shows various options one can use in initiating copy/read/write operations. In addition, one can also query the topology of the system in terms of memory pools and their agents.
Post Global Financial Crisis derivatives reforms have lifted the veil off over-the-counter (OTC) derivative markets. Swap Execution Facilities (SEFs) and Swap Data Repositories (SDRs) now publish data on swaps that are traded on or reported to those facilities (respectively). This package lets you retrieve this data from supported sources.
This library converts a Float to a String with full control over how many digits after the decimal point are shown and how the remaining digits are rounded. It rounds, floors and ceils the common way (i.e. half up) or the commercial way (i.e. half away from zero).
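The difference between the two rounding conventions only shows up for negative ties. The following is a minimal sketch using Python's `decimal` module, not the library's own code. The function names are hypothetical, and values are passed as strings to avoid binary floating-point artifacts.

```python
from decimal import Decimal, ROUND_FLOOR, ROUND_HALF_UP

def round_half_up(value: str, digits: int) -> str:
    """Common rounding: ties go toward positive infinity."""
    scale = Decimal(10) ** digits
    # floor(x * 10**digits + 0.5) moves every tie upward.
    shifted = (Decimal(value) * scale + Decimal("0.5")).to_integral_value(
        rounding=ROUND_FLOOR
    )
    return f"{shifted / scale:.{digits}f}"

def round_half_away(value: str, digits: int) -> str:
    """Commercial rounding: ties go away from zero."""
    # decimal's ROUND_HALF_UP rounds ties away from zero, despite its name.
    quantum = Decimal(1).scaleb(-digits)
    rounded = Decimal(value).quantize(quantum, rounding=ROUND_HALF_UP)
    return f"{rounded:.{digits}f}"
```

With zero digits, `round_half_up("-2.5", 0)` returns `"-2"` while `round_half_away("-2.5", 0)` returns `"-3"`. Both return `"3"` for `"2.5"`.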
Consider an autoregressive model of order p where the distribution function of the innovations is unknown, but the innovations are independent and symmetrically distributed. The package contains a function named ARMDE which takes X (a vector of n observations) and p (the order of the model) as input arguments and returns the minimum distance estimator of the parameters in the model.
Speeds up exploratory data analysis (EDA) by providing a succinct workflow and interactive visualization tools for understanding which features have relationships to the target (response). Uses binary correlation analysis to determine the relationship. The default correlation method is the Pearson method. Lian Duan, W Nick Street, Yanchi Liu, Songhua Xu, and Brook Wu (2014) <doi:10.1145/2637484>.
Precise knowledge of the binding sites of an RNA-binding protein (RBP) is key to understanding (post-) transcriptional regulatory processes. Here we present a workflow that describes how exact binding sites can be defined from iCLIP data. The package provides functions for binding site definition and result visualization. For details please see the vignette.
Minitest-hooks adds around, before_all, after_all, around_all hooks for Minitest. This allows, for instance, running each suite of specs inside a database transaction, running each spec inside its own savepoint inside that transaction. This can significantly speed up testing for specs that share expensive database setup code.
Estimate bivariate common mean vector under copula models with known correlation. In the current version, available copulas are the Clayton, Gumbel, Frank, Farlie-Gumbel-Morgenstern (FGM), and normal copulas. See Shih et al. (2019) <doi:10.1080/02331888.2019.1581782> and Shih et al. (2021) <under review> for details under the FGM and general copulas, respectively.
This package contains the prepared data that is needed for the shiny application examples in the canvasXpress package. This package also includes datasets used for automated testthat tests. Scotto L, Narayan G, Nandula SV, Arias-Pulido H et al. (2008) <doi:10.1002/gcc.20577>. Davis S, Meltzer PS (2007) <doi:10.1093/bioinformatics/btm254>.