Wraps the StarSpace library <https://github.com/facebookresearch/StarSpace> allowing users to calculate word, sentence, article, document, webpage, link and entity embeddings. By using the embeddings, you can perform text-based multi-label classification, find similarities between texts and categories, do collaborative-filtering based recommendation as well as content-based recommendation, find out relations between entities, calculate graph embeddings as well as perform semi-supervised learning and multi-task learning on plain text. The techniques are explained in detail in the paper: StarSpace: Embed All The Things! by Wu et al. (2017), available at <arXiv:1709.03856>.
This package provides optimized functions and flexible combinatorial iterators implemented in C++ for solving problems in combinatorics and computational mathematics. It utilizes the RMatrix class from RcppParallel for thread safety. There are combination/permutation functions with constraint parameters that allow for generation of all results of a vector meeting specific criteria. It is capable of generating specific combinations/permutations, which sets up nicely for parallelization as well as random sampling. GMP support permits exploration where the total number of results is large. Additionally, there are several high-performance number theoretic functions that are useful for problems common in computational mathematics.
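Generating one specific combination without enumerating its predecessors is what makes parallel chunking and random sampling cheap. A minimal Python sketch of that idea follows; the function name `nth_combination` is hypothetical and this is not the package's API or necessarily its algorithm.

```python
from math import comb


def nth_combination(n, k, index):
    """Return the index-th (0-based) k-combination of range(n) in lexicographic order.

    Hypothetical helper for illustration only.
    """
    if not 0 <= index < comb(n, k):
        raise IndexError("index out of range")
    result = []
    candidate = 0
    for remaining in range(k, 0, -1):
        while True:
            # Number of combinations whose next element is `candidate`.
            block = comb(n - candidate - 1, remaining - 1)
            if index < block:
                break
            # Skip the whole block and try the next candidate element.
            index -= block
            candidate += 1
        result.append(candidate)
        candidate += 1
    return result
```

Because each index maps directly to a combination, a worker can start at any offset, and a random sample only needs random indices.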
This package provides the run-time support library developed by the LLVM project for the OpenMP multi-threaded programming extension. This package notably provides libgomp.so, which has a binary interface compatible with that of libgomp, the GNU Offloading and Multi Processing Library.
Package providing a fast match() replacement for cases that require repeated look-ups. It is slightly faster than R's built-in match() function on the first match against a table, but extremely fast on any subsequent lookup as it keeps the hash table in memory.
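The speed-up on repeated look-ups comes from building the hash table once and reusing it. The following Python sketch illustrates that caching idea only; the class `HashedTable` is hypothetical, and unlike R's match() it returns 0-based positions and None for misses.

```python
class HashedTable:
    """Hypothetical helper: caches a value -> first-position index for a table."""

    def __init__(self, table):
        self.index = {}
        for position, value in enumerate(table):
            # setdefault keeps the first occurrence, mirroring "first match".
            self.index.setdefault(value, position)

    def match(self, values):
        # Each lookup is an average O(1) dictionary access; no rebuild needed.
        return [self.index.get(value) for value in values]
```

Constructing the object pays the hashing cost once; every later call to `match` reuses the stored index.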
cl-ratify is a collection of utilities to perform validation checks and parsing. It is mainly intended for use in web applications, to check form inputs for correctness and automatically parse them into their proper representations or return meaningful errors.
This package performs Bayesian estimation of the additive main effects and multiplicative interaction (AMMI) model. The method is explained in Crossa, J., Perez-Elizalde, S., Jarquin, D., Cotes, J.M., Viele, K., Liu, G. and Cornelius, P.L. (2011) (<doi:10.2135/cropsci2010.06.0343>).
Utility functions to facilitate the import, reporting and analysis of clinical data. Example datasets in SDTM and ADaM format, containing a subset of patients/domains from the CDISC Pilot 01 study, are also available as R datasets to demonstrate the package functionalities.
This package provides tools for linear fitting with complex variables. Includes ordinary least-squares (zlm()) and robust M-estimation (rzlm()), and complex methods for oft-used generics. Originally adapted from the rlm() functions of MASS and the lm() functions of stats.
An implementation of robust estimation in the Cox model. Functionality includes efficient and robust fitting of the Cox proportional hazards regression model in its basic form, where explanatory variables are time independent with one event per subject. The method is based on a smooth modification of the partial likelihood.
This package provides a comprehensive suite of spatial functions created to analyze and assess data heterogeneity and climate variability in spatial datasets. This package is specifically designed to address the challenges associated with characterizing and understanding complex spatial patterns in environmental and climate-related data.
Implementations of the weighted Kozachenko-Leonenko entropy estimator and independence tests based on this estimator (Kozachenko and Leonenko (1987) <http://mi.mathnet.ru/eng/ppi797>). Also includes a goodness-of-fit test for a linear model which is an independence test between covariates and errors.
Individual-based simulations of hybridizing populations, where the accumulation of junctions is tracked. Furthermore, mathematical equations are provided to verify simulation outcomes. Both simulations and mathematical equations are based on Janzen (2018, <doi:10.1101/058107>) and Janzen (2022, <doi:10.1111/1755-0998.13519>).
High-performance MongoDB client based on mongo-c-driver and jsonlite. Includes support for aggregation, indexing, map-reduce, streaming, encryption, enterprise authentication, and GridFS. The online user manual provides an overview of the available methods in the package: <https://jeroen.github.io/mongolite/>.
Deep learning library that extends the mlr3 framework by building upon the torch package. It allows you to conveniently build, train, and evaluate deep learning models without having to worry about low-level details. Custom architectures can be created using the graph language defined in mlr3pipelines.
Useful git hooks for R building on top of the multi-language framework pre-commit for hook management. This package provides git hooks for common tasks like formatting files with styler or spell checking as well as wrapper functions to access the pre-commit executable.
This package implements multinomial CDF (P(N1<=n1, ..., Nk<=nk)) and tail probabilities (P(N1>n1, ..., Nk>nk)), as well as probabilities with both constraints (P(l1<N1<=u1, ..., lk<Nk<=uk)). Uses a method suggested by Bruce Levin (1981) <doi:10.1214/aos/1176345593>.
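For small problems the CDF P(N1<=n1, ..., Nk<=nk) can be computed exactly as N! times the coefficient of x^N in the product of truncated exponential series, one per category. The Python sketch below takes that direct route; the function name `multinomial_cdf` is hypothetical, and it is a plain polynomial convolution, not Levin's approximation and not the package's implementation.

```python
from math import factorial


def multinomial_cdf(n_total, probs, upper):
    """P(N1 <= upper[0], ..., Nk <= upper[k-1]) for a multinomial(n_total, probs).

    Hypothetical helper for illustration only.
    """
    # coeffs[t] is the coefficient of x^t in the running polynomial product.
    coeffs = [1.0] + [0.0] * n_total
    for p, u in zip(probs, upper):
        # Truncated series sum_{j <= u} (p x)^j / j! for this category.
        term = [p ** j / factorial(j) for j in range(min(u, n_total) + 1)]
        new = [0.0] * (n_total + 1)
        for t, c in enumerate(coeffs):
            if c == 0.0:
                continue
            for j, a in enumerate(term):
                if t + j > n_total:
                    break
                new[t + j] += c * a
        coeffs = new
    return factorial(n_total) * coeffs[n_total]
```

When every upper bound is at least N the constraints are vacuous and the result is 1, which gives a quick sanity check.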
Connect R to the PhotosynQ platform (<https://photosynq.org>). It allows users to log in and log out, as well as retrieve project information and project data. It also transforms the received JSON objects into a data frame, which can be used for the final data analysis.
Create a project directory structure, along with typical files for that project. This allows projects to be created quickly and easily, and to be standardized. Designed specifically with scientists in mind (mainly biomedical researchers, but likely applicable to other fields).
This package provides the probability, distribution, and quantile functions and random number generator for the Poisson-Binomial distribution. This package relies on FFTW to implement the discrete Fourier transform, so that it is much faster than the existing implementation of the same algorithm in R.
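The DFT approach evaluates the characteristic function of the sum at the (n+1)-th roots of unity and inverts it to recover the probability mass function. Below is a minimal Python sketch of that idea; it uses a plain O(n^2) transform from the standard library rather than FFTW, and the function name `poisson_binomial_pmf` is hypothetical rather than part of the package.

```python
import cmath


def poisson_binomial_pmf(probs):
    """PMF of the number of successes among independent Bernoulli(probs[i]) trials.

    Hypothetical helper for illustration only.
    """
    size = len(probs) + 1
    omega = 2 * cmath.pi / size
    # Characteristic function at each root of unity exp(i * omega * l).
    char = []
    for l in range(size):
        z = cmath.exp(1j * omega * l)
        value = 1 + 0j
        for p in probs:
            value *= (1 - p) + p * z
        char.append(value)
    # Inverse DFT recovers P(S = k); the imaginary part is rounding noise.
    pmf = []
    for k in range(size):
        total = sum(char[l] * cmath.exp(-1j * omega * l * k) for l in range(size))
        pmf.append((total / size).real)
    return pmf
```

Replacing the two explicit loops with a fast Fourier transform gives the same values with lower cost for long probability vectors.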
An aid for text mining in R, with a syntax that should be familiar to experienced R users. Provides a wrapper for several topic models that take similarly formatted input and give similarly formatted output. Provides additional functionality for analyzing and diagnosing topic models.
The main functions for methylGSA are methylglm and methylRRA. methylGSA implements logistic regression adjusting number of probes as a covariate. methylRRA adjusts multiple p-values of each gene by Robust Rank Aggregation. For more detailed help information, please see the vignette.
This package provides infrastructure for seriation with an implementation of several seriation/sequencing techniques to reorder matrices, dissimilarity matrices, and dendrograms. It also provides (optimally) reordered heatmaps, color images and clustering visualizations like dissimilarity plots, and visual assessment of cluster tendency plots (VAT and iVAT).
The r-ggformula package introduces a family of graphics functions, gf_point(), gf_density(), and so on, that bring the formula interface to ggplot(). This captures and extends the excellent simplicity of the lattice-graphics formula interface, while providing the intuitive capabilities of r-ggplot2.
This package provides various tools to work with prime numbers. It includes: optimised prime sieves, checking for primality, enumerating primes, factorising numbers, and state-of-the-art estimation of upper and lower bounds for the number of primes below n and the k-th prime.
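As an illustration of what a prime sieve does, here is a basic sieve of Eratosthenes in Python. It is a textbook sketch, not the package's optimised implementation, and the function name `primes_up_to` is hypothetical.

```python
def primes_up_to(limit):
    """Return all primes <= limit using a basic sieve of Eratosthenes."""
    if limit < 2:
        return []
    is_prime = [True] * (limit + 1)
    is_prime[0] = is_prime[1] = False
    i = 2
    while i * i <= limit:
        if is_prime[i]:
            # Start at i*i: smaller multiples were crossed out by smaller primes.
            for multiple in range(i * i, limit + 1, i):
                is_prime[multiple] = False
        i += 1
    return [n for n, flag in enumerate(is_prime) if flag]
```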