_            _    _        _         _
      /\ \         /\ \ /\ \     /\_\      / /\
      \_\ \       /  \ \\ \ \   / / /     / /  \
      /\__ \     / /\ \ \\ \ \_/ / /     / / /\ \__
     / /_ \ \   / / /\ \ \\ \___/ /     / / /\ \___\
    / / /\ \ \ / / /  \ \_\\ \ \_/      \ \ \ \/___/
   / / /  \/_// / /   / / / \ \ \        \ \ \
  / / /      / / /   / / /   \ \ \   _    \ \ \
 / / /      / / /___/ / /     \ \ \ /_/\__/ / /
/_/ /      / / /____\/ /       \ \_\\ \/___/ /
\_\/       \/_________/         \/_/ \_____\/
r-datamedios 1.2.2
Propagated dependencies: r-xml2@1.3.8 r-wordcloud2@0.2.1 r-tidytext@0.4.2 r-stringr@1.5.1 r-rvest@1.0.4 r-rlang@1.1.6 r-purrr@1.0.4 r-plotly@4.10.4 r-magrittr@2.0.3 r-lubridate@1.9.4 r-jsonlite@2.0.0 r-httr@1.4.7 r-ggplot2@3.5.2 r-dt@0.33 r-dplyr@1.1.4
Channel: guix-cran
Location: guix-cran/packages/d.scm (guix-cran packages d)
Home page: https://cran.r-project.org/package=datamedios
Licenses: Expat
Synopsis: Scraping Chilean Media
Description:

This package provides a system for extracting news from Chilean media, specifically through Web Scapping from Chilean media. The package allows for news searches using search phrases and date filters, and returns the results in a structured format, ready for analysis. Additionally, it includes functions to clean the extracted data, visualize it, and store it in databases. All of this can be done automatically, facilitating the collection and analysis of relevant information from Chilean media.

r-datastudio 1.2.1
Propagated dependencies: r-scales@1.4.0 r-ggplot2@3.5.2 r-data-table@1.17.4
Channel: guix-cran
Location: guix-cran/packages/d.scm (guix-cran packages d)
Home page: https://www.maths.ed.ac.uk/~mdecarv/
Licenses: GPL 3+
Synopsis: The Research Data Warehouse of Miguel de Carvalho
Description:

Pulls together a collection of datasets from Miguel de Carvalho research articles. Including, for example: - de Carvalho (2012) <doi:10.1016/j.jspi.2011.08.016>; - de Carvalho et al (2012) <doi:10.1080/03610926.2012.709905>; - de Carvalho et al (2012) <doi:10.1016/j.econlet.2011.09.007>); - de Carvalho and Davison (2014) <doi:10.1080/01621459.2013.872651>; - de Carvalho and Rua (2017) <doi:10.1016/j.ijforecast.2015.09.004>; - de Carvalho et al (2023) <doi:10.1002/sta4.560>; - de Carvalho et al (2022) <doi:10.1007/s13253-021-00469-9>; - Palacios et al (2024) <doi:10.1214/24-BA1420>.

r-datasaurus 0.1.9
Channel: guix
Location: gnu/packages/cran.scm (gnu packages cran)
Home page: https://github.com/lockedata/datasauRus
Licenses: Expat
Synopsis: Datasets from the Datasaurus Dozen
Description:

The Datasaurus Dozen is a set of datasets with the same summary statistics. They retain the same summary statistics despite having radically different distributions. The datasets represent a larger and quirkier object lesson that is typically taught via Anscombe's Quartet (available in the 'datasets' package). Anscombe's Quartet contains four very different distributions with the same summary statistics and as such highlights the value of visualisation in understanding data, over and above summary statistics. As well as being an engaging variant on the Quartet, the data is generated in a novel way. The simulated annealing process used to derive datasets from the original Datasaurus is detailed in "Same Stats, Different Graphs: Generating Datasets with Varied Appearance and Identical Statistics through Simulated Annealing" doi:10.1145/3025453.3025912.

r-datadriftr 1.0.0
Propagated dependencies: r-r6@2.6.1 r-fda-usc@2.2.0 r-doremi@1.0.0
Channel: guix-cran
Location: guix-cran/packages/d.scm (guix-cran packages d)
Home page: https://github.com/ugurdar/datadriftR
Licenses: GPL 2+
Synopsis: Concept Drift Detection Methods for Stream Data
Description:

This package provides a system designed for detecting concept drift in streaming datasets. It offers a comprehensive suite of statistical methods to detect concept drift, including methods for monitoring changes in data distributions over time. The package supports several tests, such as Drift Detection Method (DDM), Early Drift Detection Method (EDDM), Hoeffding Drift Detection Methods (HDDM_A, HDDM_W), Kolmogorov-Smirnov test-based Windowing (KSWIN) and Page Hinkley (PH) tests. The methods implemented in this package are based on established research and have been demonstrated to be effective in real-time data analysis. For more details on the methods, please check to the following sources. KobyliŠska et al. (2023) <doi:10.48550/arXiv.2308.11446>, S. Kullback & R.A. Leibler (1951) <doi:10.1214/aoms/1177729694>, Gama et al. (2004) <doi:10.1007/978-3-540-28645-5_29>, Baena-Garcia et al. (2006) <https://www.researchgate.net/publication/245999704_Early_Drift_Detection_Method>, Frà as-Blanco et al. (2014) <https://ieeexplore.ieee.org/document/6871418>, Raab et al. (2020) <doi:10.1016/j.neucom.2019.11.111>, Page (1954) <doi:10.1093/biomet/41.1-2.100>, Montiel et al. (2018) <https://jmlr.org/papers/volume19/18-251/18-251.pdf>.

r-datanugget 1.3.1
Propagated dependencies: r-rfast@2.1.5.1 r-foreach@1.5.2 r-dosnow@1.0.20
Channel: guix-cran
Location: guix-cran/packages/d.scm (guix-cran packages d)
Home page: https://cran.r-project.org/package=datanugget
Licenses: GPL 2
Synopsis: Create, and Refine Data Nuggets
Description:

Creating, and refining data nuggets. Data nuggets reduce a large dataset into a small collection of nuggets of data, each containing a center (location), weight (importance), and scale (variability) parameter. Data nugget centers are created by choosing observations in the dataset which are as equally spaced apart as possible. Data nugget weights are created by counting the number observations closest to a given data nugget center. We then say the data nugget contains these observations and the data nugget center is recalculated as the mean of these observations. Data nugget scales are created by calculating the trace of the covariance matrix of the observations contained within a data nugget divided by the dimension of the dataset. Data nuggets are refined by splitting data nuggets which have scales or shapes (defined as the ratio of the two largest eigenvalues of the covariance matrix of the observations contained within the data nugget) Reference paper: [1] Beavers, T. E., Cheng, G., Duan, Y., Cabrera, J., Lubomirski, M., Amaratunga, D., & Teigler, J. E. (2024). Data Nuggets: A Method for Reducing Big Data While Preserving Data Structure. Journal of Computational and Graphical Statistics, 1-21. [2] Cherasia, K. E., Cabrera, J., Fernholz, L. T., & Fernholz, R. (2022). Data Nuggets in Supervised Learning. \emphIn Robust and Multivariate Statistical Methods: Festschrift in Honor of David E. Tyler (pp. 429-449). Cham: Springer International Publishing.

r-datrprofile 0.1.0
Propagated dependencies: r-rsqlite@2.3.11 r-odbc@1.6.1 r-dplyr@1.1.4
Channel: guix-cran
Location: guix-cran/packages/d.scm (guix-cran packages d)
Home page: https://github.com/avitaliano/datrProfile
Licenses: GPL 3 FSDG-compatible
Synopsis: Column Profile for Tables and Datasets
Description:

Profiles datasets (collecting statistics and informative summaries about that data) on data frames and ODBC tables: maximum, minimum, mean, standard deviation, nulls, distinct values, data patterns, data/format frequencies.

r-datasetsicr 1.0
Channel: guix-cran
Location: guix-cran/packages/d.scm (guix-cran packages d)
Home page: https://cran.r-project.org/package=datasetsICR
Licenses: GPL 2+
Synopsis: Datasets from the Book "An Introduction to Clustering with R"
Description:

Companion to the book "An Introduction to Clustering with R" by P. Giordani, M.B. Ferraro and F. Martella (Springer, Singapore, 2020). The datasets are used in some case studies throughout the text.

r-datasetjson 0.3.0
Propagated dependencies: r-yyjsonr@0.1.21 r-jsonvalidate@1.5.0 r-hms@1.1.3
Channel: guix-cran
Location: guix-cran/packages/d.scm (guix-cran packages d)
Home page: https://atorus-research.github.io/datasetjson/
Licenses: FSDG-compatible
Synopsis: Read and Write CDISC Dataset JSON Files
Description:

Read, construct and write CDISC (Clinical Data Interchange Standards Consortium) Dataset JSON (JavaScript Object Notation) files, while validating per the Dataset JSON schema file, as described in CDISC (2023) <https://www.cdisc.org/standards/data-exchange/dataset-json>.

r-dataverifyr 0.1.8
Propagated dependencies: r-yaml@2.3.10
Channel: guix-cran
Location: guix-cran/packages/d.scm (guix-cran packages d)
Home page: https://github.com/DavZim/dataverifyr
Licenses: Expat
Synopsis: Lightweight, Flexible, and Fast Data Validation Package that Can Handle All Sizes of Data
Description:

Allows you to define rules which can be used to verify a given dataset. The package acts as a thin wrapper around more powerful data packages such as dplyr', data.table', arrow', and DBI ('SQL'), which do the heavy lifting.

r-datapackage 0.2.3
Propagated dependencies: r-yaml@2.3.10 r-jsonlite@2.0.0 r-iso8601@0.1.2
Channel: guix-cran
Location: guix-cran/packages/d.scm (guix-cran packages d)
Home page: https://github.com/djvanderlaan/datapackage
Licenses: GPL 3
Synopsis: Creating and Reading Data Packages
Description:

Open, read data from and modify Data Packages. Data Packages are an open standard for bundling and describing data sets (<https://datapackage.org>). When data is read from a Data Package care is taken to convert the data as much a possible to R appropriate data types. The package can be extended with plugins for additional data types.

r-datasetsuni 0.1
Channel: guix-cran
Location: guix-cran/packages/d.scm (guix-cran packages d)
Home page: https://cran.r-project.org/package=DataSetsUni
Licenses: GPL 2+
Synopsis: Collection of Univariate Data Sets
Description:

This package provides a collection of widely used univariate data sets of various applied domains on applications of distribution theory. The functions allow researchers and practitioners to quickly, easily, and efficiently access and use these data sets. The data are related to different applied domains and as follows: Bio-medical, survival analysis, medicine, reliability analysis, hydrology, actuarial science, operational research, meteorology, extreme values, quality control, engineering, finance, sports and economics. The total 100 data sets are documented along with associated references for further details and uses.

r-datacompare 1.0.5
Propagated dependencies: r-skimr@2.2.1 r-shinywidgets@0.9.0 r-shinydashboardplus@2.0.5 r-shinydashboard@0.7.3 r-shinycssloaders@1.1.0 r-shiny@1.10.0 r-magrittr@2.0.3 r-htmltools@0.5.8.1 r-golem@0.5.1 r-explore@1.3.5 r-dt@0.33 r-dplyr@1.1.4 r-devtools@2.4.5 r-data-table@1.17.4 r-config@0.3.2 r-arsenal@3.6.3
Channel: guix-cran
Location: guix-cran/packages/d.scm (guix-cran packages d)
Home page: https://github.com/seewe/dataCompare
Licenses: Expat
Synopsis: 'shiny' App to Compare Two Data Frames
Description:

This package provides a tool developed with the Golem framework which provides an easier way to check cells differences between two data frames. The user provides two data frames for comparison, selects IDs variables identifying each row of input data, then clicks a button to perform the comparison. Several R package functions are used to describe the data and perform the comparison in the server of the application. The main ones are comparedf() from arsenal and skim() from skimr'. For more details see the description of comparedf() from the arsenal package and that of skim() from the skimr package.

r-datastreamr 2.0.4
Propagated dependencies: r-stringr@1.5.1 r-logger@0.4.0 r-jsonlite@2.0.0 r-ini@0.3.1 r-httr@1.4.7
Channel: guix-cran
Location: guix-cran/packages/d.scm (guix-cran packages d)
Home page: https://cran.r-project.org/package=DatastreamR
Licenses: GPL 3+
Synopsis: Datastream API
Description:

Access Datastream content through <https://product.datastream.com/dswsclient/Docs/Default.aspx>., our historical financial database with over 35 million individual instruments or indicators across all major asset classes, including over 19 million active economic indicators. It features 120 years of data, across 175 countries â the information you need to interpret market trends, economic cycles, and the impact of world events. Data spans bond indices, bonds, commodities, convertibles, credit default swaps, derivatives, economics, energy, equities, equity indices, ESG, estimates, exchange rates, fixed income, funds, fundamentals, interest rates, and investment trusts. Unique content includes I/B/E/S Estimates, Worldscope Fundamentals, point-in-time data, and Reuters Polls. Alongside the content, sit a set of powerful analytical tools for exploring relationships between different asset types, with a library of customizable analytical functions. In-house timeseries can also be uploaded using the package to comingle with Datastream maintained datasets, use with these analytical tools and displayed in Datastreamâ s flexible charting facilities in Microsoft Office.

r-dataexplorer 0.8.4
Propagated dependencies: r-scales@1.4.0 r-rmarkdown@2.29 r-reshape2@1.4.4 r-networkd3@0.4.1 r-gridextra@2.3 r-ggplot2@3.5.2 r-data-table@1.17.4
Channel: guix-cran
Location: guix-cran/packages/d.scm (guix-cran packages d)
Home page: http://boxuancui.github.io/DataExplorer/
Licenses: Expat
Synopsis: Automate Data Exploration and Treatment
Description:

Automated data exploration process for analytic tasks and predictive modeling, so that users could focus on understanding data and extracting insights. The package scans and analyzes each variable, and visualizes them with typical graphical techniques. Common data processing methods are also available to treat and format data.

r-datacomparer 0.1.4
Propagated dependencies: r-stringi@1.8.7 r-markdown@2.0 r-knitr@1.50 r-dplyr@1.1.4
Channel: guix-cran
Location: guix-cran/packages/d.scm (guix-cran packages d)
Home page: https://github.com/capitalone/dataCompareR
Licenses: ASL 2.0 FSDG-compatible
Synopsis: Compare Two Data Frames and Summarise the Difference
Description:

Easy comparison of two tabular data objects in R. Specifically designed to show differences between two sets of data in a useful way that should make it easier to understand the differences, and if necessary, help you work out how to remedy them. Aims to offer a more useful output than all.equal() when your two data sets do not match, but isn't intended to replace all.equal() as a way to test for equality.

r-datareporter 1.0.5
Dependencies: coreutils@9.1 pandoc@2.19.2 git@2.50.0
Propagated dependencies: r-whoami@1.3.0 r-stringi@1.8.7 r-robustbase@0.99-4-1 r-rmarkdown@2.29 r-rlang@1.1.6 r-pander@0.6.6 r-magrittr@2.0.3 r-htmltools@0.5.8.1 r-haven@2.5.5 r-gridextra@2.3 r-ggplot2@3.5.2
Channel: guix-cran
Location: guix-cran/packages/d.scm (guix-cran packages d)
Home page: https://github.com/ekstroem/dataReporter
Licenses: GPL 2
Synopsis: Reproducible Data Screening Checks and Report of Possible Errors
Description:

Data screening is an important first step of any statistical analysis. dataReporter auto generates a customizable data report with a thorough summary of the checks and the results that a human can use to identify possible errors. It provides an extendable suite of test for common potential errors in a dataset. See Petersen AH, Ekstrøm CT (2019). "dataMaid: Your Assistant for Documenting Supervised Data Quality Screening in R." _Journal of Statistical Software_, *90*(6), 1-38 <doi:10.18637/jss.v090.i06> for more information.

r-datapackager 0.16.1
Propagated dependencies: r-yaml@2.3.10 r-usethis@3.1.0 r-rprojroot@2.0.4 r-roxygen2@7.3.2 r-rmarkdown@2.29 r-pkgload@1.4.0 r-pkgbuild@1.4.8 r-knitr@1.50 r-futile-logger@1.4.3 r-digest@0.6.37 r-desc@1.4.3 r-cli@3.6.5
Channel: guix-cran
Location: guix-cran/packages/d.scm (guix-cran packages d)
Home page: https://github.com/ropensci/DataPackageR
Licenses: Expat
Synopsis: Construct Reproducible Analytic Data Sets as R Packages
Description:

This package provides a framework to help construct R data packages in a reproducible manner. Potentially time consuming processing of raw data sets into analysis ready data sets is done in a reproducible manner and decoupled from the usual R CMD build process so that data sets can be processed into R objects in the data package and the data package can then be shared, built, and installed by others without the need to repeat computationally costly data processing. The package maintains data provenance by turning the data processing scripts into package vignettes, as well as enforcing documentation and version checking of included data objects. Data packages can be version controlled on GitHub', and used to share data for manuscripts, collaboration and reproducible research.

r-datasets-load 2.3.0
Propagated dependencies: r-shiny@1.10.0 r-miniui@0.1.2 r-dt@0.33
Channel: guix-cran
Location: guix-cran/packages/d.scm (guix-cran packages d)
Home page: https://cran.r-project.org/package=datasets.load
Licenses: GPL 3
Synopsis: Graphical Interface for Loading Datasets
Description:

Graphical interface for loading datasets in RStudio from all installed (including unloaded) packages, also includes command line interfaces.

r-dataretrieval 2.7.20
Propagated dependencies: r-xml2@1.3.8 r-whisker@0.4.1 r-sf@1.0-21 r-readr@2.1.5 r-lubridate@1.9.4 r-jsonlite@2.0.0 r-httr2@1.1.2 r-curl@6.2.3
Channel: guix-cran
Location: guix-cran/packages/d.scm (guix-cran packages d)
Home page: https://cran.r-project.org/package=dataRetrieval
Licenses: CC0
Synopsis: Retrieval Functions for USGS and EPA Hydrology and Water Quality Data
Description:

Collection of functions to help retrieve U.S. Geological Survey and U.S. Environmental Protection Agency water quality and hydrology data from web services.

r-datetimeutils 0.6-5
Channel: guix-cran
Location: guix-cran/packages/d.scm (guix-cran packages d)
Home page: https://enricoschumann.net/R/packages/datetimeutils/
Licenses: GPL 3
Synopsis: Utilities for Dates and Times
Description:

Utilities for handling dates and times, such as selecting particular days of the week or month, formatting timestamps as required by RSS feeds, or converting timestamp representations of other software (such as MATLAB and Excel') to R. The package is lightweight (no dependencies, pure R implementations) and relies only on R's standard classes to represent dates and times ('Date and POSIXt'); it aims to provide efficient implementations, through vectorisation and the use of R's native numeric representations of timestamps where possible.

r-datasetsverse 0.1.0
Propagated dependencies: r-timeseriesdatasets@0.1.0 r-oncodatasets@0.1.0 r-meddatasets@0.1.0 r-educationr@0.1.0 r-crimedatasets@0.1.0 r-cli@3.6.5
Channel: guix-cran
Location: guix-cran/packages/d.scm (guix-cran packages d)
Home page: https://github.com/lightbluetitan/datasetsverse
Licenses: GPL 3
Synopsis: Metapackage for Thematic and Domain-Specific Datasets
Description:

This package provides a metapackage that brings together a curated collection of R packages containing domain-specific datasets. It includes time series data, educational metrics, crime records, medical datasets, and oncology research data. Designed to provide researchers, analysts, educators, and data scientists with centralized access to structured and well-documented datasets, this metapackage facilitates reproducible research, data exploration, and teaching applications across a wide range of domains. Included packages: - timeSeriesDataSets': Time series data from economics, finance, energy, and healthcare. - educationR': Datasets related to education, learning outcomes, and school metrics. - crimedatasets': Datasets on global and local crime and criminal behavior. - MedDataSets': Datasets related to medicine, public health, treatments, and clinical trials. - OncoDataSets': Datasets focused on cancer research, survival, genetics, and biomarkers.

r-datametprocess 1.0.5
Propagated dependencies: r-tidyr@1.3.1 r-shiny@1.10.0 r-rlang@1.1.6 r-lubridate@1.9.4 r-dplyr@1.1.4
Channel: guix-cran
Location: guix-cran/packages/d.scm (guix-cran packages d)
Home page: https://github.com/wagnnerms97/DataMetProcess
Licenses: GPL 3+
Synopsis: Meteorological Data Processing
Description:

Set of tools aimed at processing meteorological data, converting hourly recorded data to daily, monthly and annual data.

r-datadictionary 1.0.1
Propagated dependencies: r-tidyselect@1.2.1 r-tidyr@1.3.1 r-tibble@3.2.1 r-openxlsx@4.2.8 r-lubridate@1.9.4 r-labelled@2.14.1 r-haven@2.5.5 r-dplyr@1.1.4 r-chron@2.3-62
Channel: guix-cran
Location: guix-cran/packages/d.scm (guix-cran packages d)
Home page: https://github.com/DoctorBJones/datadictionary
Licenses: Expat
Synopsis: Create a Data Dictionary
Description:

This package creates a data dictionary from any dataframe or tibble in your R environment. You can opt to add variable labels. You can write the object directly to Excel.

r-data-validator 0.2.1
Propagated dependencies: r-tidyr@1.3.1 r-tibble@3.2.1 r-shiny-semantic@0.5.1 r-rmarkdown@2.29 r-rlang@1.1.6 r-r6@2.6.1 r-purrr@1.0.4 r-knitr@1.50 r-htmlwidgets@1.6.4 r-htmltools@0.5.8.1 r-dplyr@1.1.4 r-assertr@3.0.1
Channel: guix-cran
Location: guix-cran/packages/d.scm (guix-cran packages d)
Home page: https://appsilon.github.io/data.validator/
Licenses: Expat
Synopsis: Automatic Data Validation and Reporting
Description:

Validate dataset by columns and rows using convenient predicates inspired by assertr package. Generate good looking HTML report or print console output to display in logs of your data processing pipeline.

Page: 1234
Total results: 88