Defines classes and methods that can be used to implement genetic algorithms for feature selection. The idea is that we want to select a fixed number of features to combine into a linear classifier that can predict a binary outcome, and can use a genetic algorithm heuristically to select an optimal set of features.
Fits generalized additive models for the location, scale and shape parameters of a generalized extreme value response distribution. The methodology is based on Rigby, R.A. and Stasinopoulos, D.M. (2005), <doi:10.1111/j.1467-9876.2005.00510.x> and implemented using functions from the gamlss package <doi:10.32614/CRAN.package.gamlss>.
This package contains many functions useful for monitoring and reporting the results of clinical trials and other experiments in which treatments are compared. LaTeX is used to typeset the resulting reports, recommended to be in the context of knitr'. The Hmisc', ggplot2', and lattice packages are used by greport for high-level graphics.
This package provides functions to help with creating sparklines in the style of Edward Tufte <https://www.edwardtufte.com/bboard/q-and-a-fetch-msg?msg_id=0001OR&topic_id=1> in ggplot2'. It computes ribbon geoms with the interquartile ranges and points and/or labels at the beginning, end, max, and min points.
We provide a stage-wise selection method using genetic algorithms, designed to efficiently identify main and two-way interactions within high-dimensional linear regression models. Additionally, it implements simulated annealing algorithm during the mutation process. The relevant paper can be found at: Ye, C.,and Yang,Y. (2019) <doi:10.1109/TIT.2019.2913417>.
Holistic generalized linear models (HGLMs) extend generalized linear models (GLMs) by enabling the possibility to add further constraints to the model. The holiglm package simplifies estimating HGLMs using convex optimization. Additional information about the package can be found in the reference manual, the README and the accompanying paper <doi:10.18637/jss.v108.i07>.
Some functions for performing ICA, MICA, Group ICA, and Multilinear ICA are implemented. ICA, MICA/Group ICA, and Multilinear ICA extract statistically independent components from single matrix, multiple matrices, and single tensor, respectively. For the details of these methods, see the reference section of GitHub README.md <https://github.com/rikenbit/iTensor>.
Synthesize images into characteristic features for time-series analysis or machine learning applications. The package was originally intended for monitoring volcanic eruptions in video data by highlighting and extracting regions above the vent associated with plume activity. However, the functions within are general and have wide applications for image processing, analyzing, filtering, and plotting.
Linear ridge regression coefficient's estimation and testing with different ridge related measures such as MSE, R-squared etc. REFERENCES i. Hoerl and Kennard (1970) <doi:10.1080/00401706.1970.10488634>, ii. Halawa and El-Bassiouni (2000) <doi:10.1080/00949650008812006>, iii. Imdadullah, Aslam, and Saima (2017), iv. Marquardt (1970) <doi:10.2307/1267205>.
This package contains a collection of datasets for working with machine learning tasks. It will contain datasets for supervised machine learning Jiang (2020)<doi:10.1016/j.beth.2020.05.002> and will include datasets for classification and regression. The aim of this package is to use data generated around health and other domains.
Facilitates tidy calculation of popular quantitative marketing metrics. It also includes functions for doing analysis that will help marketers and data analysts better understand the drivers and/or trends of these metrics. These metrics include Customer Experience Index <https://go.forrester.com/analytics/cx-index/> and Net Promoter Score <https://www.netpromoter.com/know/>.
R Client for the Microsoft Cognitive Services Text-to-Speech REST API, including voice synthesis. A valid account must be registered at the Microsoft Cognitive Services website <https://azure.microsoft.com/en-us/products/ai-services/> in order to obtain a (free) API key. Without an API key, this package will not work properly.
Picks the suitable cell types in spatial and scRNA-seq data using shrinkage methods. The package includes curated reference gene expression profiles for human and mouse cell types, facilitating immediate application to common spatial transcriptomics or scRNA datasets. Additionally, users can input custom reference data to support tissue- or experiment-specific analyses.
Calculate seat apportionment for legislative bodies with various methods. The algorithms include divisor or highest averages methods (e.g. Jefferson, Webster or Adams), largest remainder methods and biproportional apportionment. Gaffke, N. & Pukelsheim, F. (2008) <doi:10.1016/j.mathsocsci.2008.01.004> Oelbermann, K. F. (2016) <doi:10.1016/j.mathsocsci.2016.02.003>.
Survey to collect data about the social and economic conditions of Indonesian society. This activity aims to include: As a data source for planning and evaluating national, sectoral development programs, and providing indicators for Sustainable Development Goals (TPB), National Medium Term Development Plan (RPJMN), and Nawacita, GDP/GRDP and annual Integrated Institutional Balance Sheet.
This package provides a tidy workflow for generating, estimating, reporting, and plotting structural equation models using lavaan', OpenMx', or Mplus'. Throughout this workflow, elements of syntax, results, and graphs are represented as tidy data, making them easy to customize. Includes functionality to estimate latent class analyses, and to plot dagitty and igraph objects.
Implementation of Time to Target plot based on the work of Ribeiro and Rosseti (2015) <DOI:10.1007/s11590-014-0760-8>, that describe a numerical method that gives the probability of an algorithm A finds a solution at least as good as a given target value in smaller computation time than algorithm B.
Construct and plot objective hierarchies and associated value and utility functions. Evaluate the values and utilities and visualize the results as colored objective hierarchies or tables. Visualize uncertainty by plotting median and quantile intervals within the nodes of objective hierarchies. Get numerical results of the evaluations in standard R data types for further processing.
Processes standard recommendation datasets (e.g., a user-item rating matrix) as input and generates rating predictions and lists of recommended items. Standard algorithm implementations which are included in this package are the following: Global/Item/User-Average baselines, Weighted Slope One, Item-Based KNN, User-Based KNN, FunkSVD, BPR and weighted ALS. They can be assessed according to the standard offline evaluation methodology (Shani, et al. (2011) <doi:10.1007/978-0-387-85820-3_8>) for recommender systems using measures such as MAE, RMSE, Precision, Recall, F1, AUC, NDCG, RankScore and coverage measures. The package (Coba, et al.(2017) <doi: 10.1007/978-3-319-60042-0_36>) is intended for rapid prototyping of recommendation algorithms and education purposes.
The Spectra package defines an efficient infrastructure for storing and handling mass spectrometry spectra and functionality to subset, process, visualize and compare spectra data. It provides different implementations (backends) to store mass spectrometry data. These comprise backends tuned for fast data access and processing and backends for very large data sets ensuring a small memory footprint.
This is a package for de novo identification and extraction of differentially methylated regions (DMRs) from the human genome using Whole Genome Bisulfite Sequencing (WGBS) and Illumina Infinium Array (450K and EPIC) data. It provides functionality for filtering probes possibly confounded by SNPs and cross-hybridisation. It includes GRanges generation and plotting functions.
BiocSet displays different biological sets in a triple tibble format. These three tibbles are element, set, and elementset. The user has the ability to activate one of these three tibbles to perform common functions from the dplyr package. Mapping functionality and accessing web references for elements/sets are also available in BiocSet.
To make it easy to create CONSORT diagrams for the transparent reporting of participant allocation in randomized, controlled clinical trials. This is done by creating a standardized disposition data, and using this data as the source for the creation a standard CONSORT diagram. Human effort by supplying text labels on the node can also be achieved.
Stringr is a consistent, simple and easy to use set of wrappers around the fantastic stringi package. All function and argument names (and positions) are consistent, all functions deal with "NA"'s and zero length vectors in the same way, and the output from one function is easy to feed into the input of another.