This package implements two versions of the algorithm namely: stochastic and batch. The package determines also the best number of clusters and offers to the user the best clustering scheme from different results.
Computational functions for player metrics in major league baseball including batting, pitching, fielding, base-running, and overall player statistics. This package is actively maintained with new metrics being added as they are developed.
Library to plot performance profiles (Dolan and More (2002) <doi:10.1007/s101070100263>) and nested performance profiles (Hekmati and Mirhajianmoghadam (2019) <doi:10.19139/soic-2310-5070-679>) for a given data frame.
An interactive document on the topic of naive Bayes classification analysis using rmarkdown and shiny packages. Runtime examples are provided in the package function as well as at <https://kartikeyab.shinyapps.io/NBShiny/>.
Modelling the vegetation, carbon, nitrogen and water dynamics of undisturbed open bog ecosystems in a temperate to sub-boreal climate. The executable of the model can downloaded from <https://github.com/jeroenpullens/NUCOMBog>.
An interactive document on the topic of naive Bayes classification analysis using rmarkdown and shiny packages. Runtime examples are provided in the package function as well as at <https://kartikeyab.shinyapps.io/NBShiny/>.
Parse options from the command-line using a simple, clean syntax. It requires little or no specification and supports short and long options, GNU-, Java- or Microsoft- style syntaxes, verb commands and more.
This package provides a bioinformatics method developed for analyzing the heterogeneity of single-cell populations. Phitest provides an objective and automatic method to evaluate the performance of clustering and quality of cell clusters.
This package provides tools to interact with the Pangaea Database (<https://www.pangaea.de>), including functions for searching for data, fetching datasets by dataset ID', and working with the Pangaea OAI-PMH service.
Allows users to easily build custom docker images <https://docs.docker.com/> from Amazon Web Service Sagemaker <https://aws.amazon.com/sagemaker/> using Amazon Web Service CodeBuild
<https://aws.amazon.com/codebuild/>.
This package provides a sparklyr <https://spark.posit.co/> extension that provides an R interface for XGBoost <https://github.com/dmlc/xgboost> on Apache Spark'. XGBoost is an optimized distributed gradient boosting library.
An interface to OpenSubspace
', an open source framework for evaluation and exploration of subspace clustering algorithms in WEKA (see <http://dme.rwth-aachen.de/de/opensubspace> for more information). Also performs visualization.
This package provides a framework to download, parse, and store text datasets on the disk and load them when needed. Includes various sentiment lexicons and labeled text data sets for classification and analysis.
Set of tools to estimate the probability in the upper tail of the aggregate loss distribution using different methods: Panjer recursion, Monte Carlo simulations, Markov bound, Cantelli bound, Moment bound, and Chernoff bound.
This package provides a set of fast tidy functions for wrangling, completing and summarising date and date-time data. It combines tidyverse syntax with the efficiency of data.table and speed of collapse'.
This package provides a collection of functions used to format taxonomic names in Markdown documents. Those functions work with data structured according to Alvarez and Luebert (2018) <doi:10.3897/bdj.6.e23635>.
This package provides tools that stem and lemmatize text. Stemming is a process that removes endings such as affixes. Lemmatization is the process of grouping inflected forms together as a single base form.
This package provides a fast and simple URL parser package for R'. This package provides functions to parse URLs into their components, such as scheme, user, password, host, port, path, query, and fragment.
Reading and writing sheets of a single Excel file into and from a list of data frames. Eases I/O of tabular data in bioinformatics while keeping them in a human readable format.
This package provides tools to analyze alternative splicing sites, interpret outcomes based on sequence information, select and design primers for site validiation and give visual representation of the event to guide downstream experiments.
This package contains a set of functions to perform large-scale analysis of toxicogenomic data, providing a standardized data structure to hold information relevant to annotation, visualization and statistical analysis of toxicogenomic data.
The package allows users to readily import spatial data obtained from the 10X Xenium Analyzer pipeline. Supported formats include parquet', h5', and mtx files. The package mainly represents data as SpatialExperiment
objects.
This package provides tools for the maximum likelihood estimation of the parameters of a fractionally differenced ARIMA(p,d,q) model (Haslett and Raftery, Appl.Statistics, 1989); it includes inference and basic methods.
This package is an R wrapper around the cubature C library for adaptive multivariate integration over hypercubes. This version provides both hcubature
and pcubature
routines in addition to a vector interface.