Toys / Webring for GNU Guix channels

r-text 1.7.0

Propagated dependencies: r-yardstick@1.3.2 r-workflows@1.2.0 r-tune@1.3.0 r-topics@0.62 r-tidyr@1.3.1 r-tibble@3.2.1 r-stringi@1.8.7 r-rsample@1.3.0 r-rlang@1.1.6 r-reticulate@1.42.0 r-recipes@1.3.1 r-purrr@1.0.4 r-parsnip@1.3.2 r-magrittr@2.0.3 r-hardhat@1.4.1 r-ggrepel@0.9.6 r-ggplot2@3.5.2 r-future@1.49.0 r-furrr@0.3.1 r-dplyr@1.1.4 r-cowplot@1.1.3

Channel: guix-cran

Location: guix-cran/packages/t.scm (guix-cran packages t)

Home page: https://r-text.org/

Licenses: GPL 3

Synopsis: Analyses of Text using Transformers Models from HuggingFace, Natural Language Processing and Machine Learning

Description:

Link R with Transformers from Hugging Face to transform text variables to word embeddings; where the word embeddings are used to statistically test the mean difference between set of texts, compute semantic similarity scores between texts, predict numerical variables, and visual statistically significant words according to various dimensions etc. For more information see <https://www.r-text.org>.

r-texter 0.1.9

Propagated dependencies: r-tidytext@0.4.2 r-tidyr@1.3.1 r-textdata@0.4.5 r-stringr@1.5.1 r-stopwords@2.3 r-purrr@1.0.4 r-plyr@1.8.9 r-magrittr@2.0.3 r-ggplot2@3.5.2 r-dplyr@1.1.4

Channel: guix-cran

Location: guix-cran/packages/t.scm (guix-cran packages t)

Home page: https://github.com/simmieyungie/texter

Licenses: Expat

Synopsis: An Easy Text and Sentiment Analysis Library

Description:

Implement text and sentiment analysis with texter'. Generate sentiment scores on text data and also visualize sentiments. texter allows you to quickly generate insights on your data. It includes support for lexicons such as NRC and Bing'.

r-textab 1.0.1

Channel: guix-cran

Location: guix-cran/packages/t.scm (guix-cran packages t)

Home page: https://setzler.github.io/textab/

Licenses: Expat

Synopsis: Create Highly-Customized 'LaTeX' Tables

Description:

Generate LaTeX tables directly from R. It builds LaTeX tables in blocks in the spirit of ggplot2 using the + and / operators for concatenation in the vertical and horizontal dimensions, respectively. It exports tables in the LaTeX tabular environment using .tex code. It can compile .tex code to PDF automatically.

r-textir 2.0-5

Propagated dependencies: r-matrix@1.7-3 r-gamlr@1.13-8 r-distrom@1.0.2

Channel: guix-cran

Location: guix-cran/packages/t.scm (guix-cran packages t)

Home page: http://taddylab.com

Licenses: GPL 3

Synopsis: Inverse Regression for Text Analysis

Description:

Multinomial (inverse) regression inference for text documents and associated attributes. For details see: Taddy (2013 JASA) Multinomial Inverse Regression for Text Analysis <arXiv:1012.2098> and Taddy (2015, AoAS), Distributed Multinomial Regression, <arXiv:1311.6139>. A minimalist partial least squares routine is also included. Note that the topic modeling capability of earlier textir is now a separate package, maptpx'.

r-textcat 1.0-9

Propagated dependencies: r-tau@0.0-26 r-slam@0.1-55

Channel: guix-cran

Location: guix-cran/packages/t.scm (guix-cran packages t)

Home page: https://cran.r-project.org/package=textcat

Licenses: GPL 2

Synopsis: N-Gram Based Text Categorization

Description:

Text categorization based on n-grams.

r-textreg 0.1.5

Propagated dependencies: r-tm@0.7-16 r-rcpp@1.0.14 r-nlp@0.3-2

Channel: guix-cran

Location: guix-cran/packages/t.scm (guix-cran packages t)

Home page: https://cran.r-project.org/package=textreg

Licenses: Expat

Synopsis: n-Gram Text Regression, aka Concise Comparative Summarization

Description:

Function for sparse regression on raw text, regressing a labeling vector onto a feature space consisting of all possible phrases.

r-textrar 0.8.0

Propagated dependencies: r-jsonlite@2.0.0 r-httr@1.4.7

Channel: guix-cran

Location: guix-cran/packages/t.scm (guix-cran packages t)

Home page: https://github.com/matutosi/textrar

Licenses: Expat

Synopsis: Interface to 'TexTra' from R

Description:

This package provides a wrapper for the TexTra API <https://mt-auto-minhon-mlt.ucri.jgn-x.jp/>, a web service for translating texts between different languages. TexTra API account is required to use the service.

r-textdata 0.4.5

Propagated dependencies: r-tibble@3.2.1 r-readr@2.1.5 r-rappdirs@0.3.3 r-fs@1.6.6

Channel: guix-cran

Location: guix-cran/packages/t.scm (guix-cran packages t)

Home page: https://emilhvitfeldt.github.io/textdata/

Licenses: Expat

Synopsis: Download and Load Various Text Datasets

Description:

This package provides a framework to download, parse, and store text datasets on the disk and load them when needed. Includes various sentiment lexicons and labeled text data sets for classification and analysis.

r-textstem 0.1.4

Propagated dependencies: r-textshape@1.7.5 r-textclean@0.9.3 r-stringi@1.8.7 r-snowballc@0.7.1 r-quanteda@4.3.0 r-lexicon@1.2.1 r-korpus-lang-en@0.1-4 r-korpus@0.13-8 r-hunspell@3.0.6 r-dplyr@1.1.4

Channel: guix-cran

Location: guix-cran/packages/t.scm (guix-cran packages t)

Home page: http://github.com/trinker/textstem

Licenses: GPL 2

Synopsis: Tools for Stemming and Lemmatizing Text

Description:

This package provides tools that stem and lemmatize text. Stemming is a process that removes endings such as affixes. Lemmatization is the process of grouping inflected forms together as a single base form.

r-textplot 0.2.2

Propagated dependencies: r-data-table@1.17.4 r-lattice@0.22-7 r-matrix@1.7-3

Channel: guix

Location: gnu/packages/cran.scm (gnu packages cran)

Home page: https://github.com/bnosac/textplot

Licenses: GPL 2

Synopsis: Text Plots

Description:

Visualise complex relations in texts. This is done by providing functionalities for displaying text co-occurrence networks, text correlation networks, dependency relationships as well as text clustering. Feel free to join the effort of providing interesting text visualisations.

r-text2vec 0.6.4

Propagated dependencies: r-data-table@1.17.4 r-digest@0.6.37 r-lgr@0.4.4 r-matrix@1.7-3 r-mlapi@0.1.1 r-r6@2.6.1 r-rcpp@1.0.14 r-rsparse@0.5.3 r-stringi@1.8.7

Channel: guix

Location: gnu/packages/cran.scm (gnu packages cran)

Home page: https://text2vec.org

Licenses: GPL 2+

Synopsis: Text mining framework for R

Description:

This package provides fast and memory-friendly tools for text vectorization, topic modeling (LDA, LSA), word embeddings (GloVe), similarities. It provides a source-agnostic streaming API, which allows researchers to perform analysis of collections of documents which are larger than available RAM. All core functions are parallelized to benefit from multicore machines.

r-text2sdg 1.1.2

Propagated dependencies: r-tidyr@1.3.1 r-tibble@3.2.1 r-text2sdgdata@0.1.1 r-stringr@1.5.1 r-ranger@0.17.0 r-magrittr@2.0.3 r-lifecycle@1.0.4 r-ggplot2@3.5.2 r-dplyr@1.1.4 r-corpustools@0.5.2

Channel: guix-cran

Location: guix-cran/packages/t.scm (guix-cran packages t)

Home page: https://github.com/dwulff/text2sdg

Licenses: GPL 3

Synopsis: Detecting UN Sustainable Development Goals in Text

Description:

The United Nations Sustainable Development Goals (SDGs) have become an important guideline for organisations to monitor and plan their contributions to social, economic, and environmental transformations. The text2sdg package is an open-source analysis package that identifies SDGs in text using scientifically developed query systems, opening up the opportunity to monitor any type of text-based data, such as scientific output or corporate publications. For more information see Meier, Mata & Wulff (2025) <doi:10.32614/RJ-2024-005> and Wulff, Meier & Mata (2024) <doi:10.1007/s11625-024-01516-3>.

r-text2map 0.2.0

Propagated dependencies: r-tibble@3.2.1 r-text2vec@0.6.4 r-stringi@1.8.7 r-rsvd@1.0.5 r-rlang@1.1.6 r-qgraph@1.9.8 r-permute@0.9-7 r-matrix@1.7-3 r-kit@0.0.20 r-igraph@2.1.4 r-foreach@1.5.2 r-fastmatch@1.1-6 r-dplyr@1.1.4 r-doparallel@1.0.17 r-clusterr@1.3.3

Channel: guix-cran

Location: guix-cran/packages/t.scm (guix-cran packages t)

Home page: https://gitlab.com/culturalcartography/text2map

Licenses: Expat

Synopsis: R Tools for Text Matrices, Embeddings, and Networks

Description:

This is a collection of functions optimized for working with with various kinds of text matrices. Focusing on the text matrix as the primary object - represented either as a base R dense matrix or a Matrix package sparse matrix - allows for a consistent and intuitive interface that stays close to the underlying mathematical foundation of computational text analysis. In particular, the package includes functions for working with word embeddings, text networks, and document-term matrices. Methods developed in Stoltz and Taylor (2019) <doi:10.1007/s42001-019-00048-6>, Taylor and Stoltz (2020) <doi:10.1007/s42001-020-00075-8>, Taylor and Stoltz (2020) <doi:10.15195/v7.a23>, and Stoltz and Taylor (2021) <doi:10.1016/j.poetic.2021.101567>.

r-textrank 0.3.1

Propagated dependencies: r-igraph@2.1.4 r-digest@0.6.37 r-data-table@1.17.4

Channel: guix-cran

Location: guix-cran/packages/t.scm (guix-cran packages t)

Home page: https://github.com/bnosac/textrank

Licenses: FSDG-compatible

Synopsis: Summarize Text by Ranking Sentences and Finding Keywords

Description:

The textrank algorithm is an extension of the Pagerank algorithm for text. The algorithm allows to summarize text by calculating how sentences are related to one another. This is done by looking at overlapping terminology used in sentences in order to set up links between sentences. The resulting sentence network is next plugged into the Pagerank algorithm which identifies the most important sentences in your text and ranks them. In a similar way textrank can also be used to extract keywords. A word network is constructed by looking if words are following one another. On top of that network the Pagerank algorithm is applied to extract relevant words after which relevant words which are following one another are combined to get keywords. More information can be found in the paper from Mihalcea, Rada & Tarau, Paul (2004) <https://www.aclweb.org/anthology/W04-3252/>.

r-textshape 1.7.5

Propagated dependencies: r-data-table@1.17.4 r-slam@0.1-55 r-stringi@1.8.7

Channel: guix

Location: gnu/packages/cran.scm (gnu packages cran)

Home page: https://github.com/trinker/textshape

Licenses: GPL 2

Synopsis: Tools for Reshaping Text

Description:

Tools that can be used to reshape and restructure text data.

r-textminer 3.0.6

Propagated dependencies: r-text2vec@0.6.4 r-stringr@1.5.1 r-stopwords@2.3 r-rspectra@0.16-2 r-rcppprogress@0.4.2 r-rcpparmadillo@14.4.3-1 r-rcpp@1.0.14 r-matrix@1.7-3 r-magrittr@2.0.3 r-gtools@3.9.5

Channel: guix-cran

Location: guix-cran/packages/t.scm (guix-cran packages t)

Home page: https://www.rtextminer.com/

Licenses: Expat

Synopsis: Functions for Text Mining and Topic Modeling

Description:

An aid for text mining in R, with a syntax that should be familiar to experienced R users. Provides a wrapper for several topic models that take similarly-formatted input and give similarly-formatted output. Has additional functionality for analyzing and diagnostics for topic models.

r-textpress 1.0.0

Propagated dependencies: r-xml2@1.3.8 r-stringr@1.5.1 r-stringi@1.8.7 r-rvest@1.0.4 r-pbapply@1.7-2 r-matrix@1.7-3 r-lubridate@1.9.4 r-jsonlite@2.0.0 r-httr@1.4.7 r-data-table@1.17.4

Channel: guix-cran

Location: guix-cran/packages/t.scm (guix-cran packages t)

Home page: https://github.com/jaytimm/textpress

Licenses: Expat

Synopsis: Lightweight and Versatile NLP Toolkit

Description:

This package provides a simple Natural Language Processing (NLP) toolkit focused on search-centric workflows with minimal dependencies. The package offers key features for web scraping, text processing, corpus search, and text embedding generation via the HuggingFace API <https://huggingface.co/docs/api-inference/index>.

r-textreuse 0.1.5

Propagated dependencies: r-tidyr@1.3.1 r-tibble@3.2.1 r-stringr@1.5.1 r-rcppprogress@0.4.2 r-rcpp@1.0.14 r-nlp@0.3-2 r-dplyr@1.1.4 r-digest@0.6.37 r-bh@1.87.0-1 r-assertthat@0.2.1

Channel: guix-cran

Location: guix-cran/packages/t.scm (guix-cran packages t)

Home page: https://docs.ropensci.org/textreuse

Licenses: Expat

Synopsis: Detect Text Reuse and Document Similarity

Description:

This package provides tools for measuring similarity among documents and detecting passages which have been reused. Implements shingled n-gram, skip n-gram, and other tokenizers; similarity/dissimilarity functions; pairwise comparisons; minhash and locality sensitive hashing algorithms; and a version of the Smith-Waterman local alignment algorithm suitable for natural language.

r-textutils 0.4-2

Channel: guix-cran

Location: guix-cran/packages/t.scm (guix-cran packages t)

Home page: https://enricoschumann.net/R/packages/textutils/

Licenses: GPL 3

Synopsis: Utilities for Handling Strings and Text

Description:

Utilities for handling character vectors that store human-readable text (either plain or with markup, such as HTML or LaTeX). The package provides, in particular, functions that help with the preparation of plain-text reports, e.g. for expanding and aligning strings that form the lines of such reports. The package also provides generic functions for transforming R objects to HTML and to plain text.

r-textclean 0.9.3

Propagated dependencies: r-data-table@1.17.4 r-english@1.2-6 r-glue@1.8.0 r-lexicon@1.2.1 r-mgsub@1.7.3 r-qdapregex@0.7.10 r-stringi@1.8.7 r-textshape@1.7.5

Channel: guix

Location: gnu/packages/cran.scm (gnu packages cran)

Home page: https://github.com/trinker/textclean

Licenses: GPL 2

Synopsis: Text Cleaning Tools

Description:

Tools to clean and process text. Tools are geared at checking for substrings that are not optimal for analysis and replacing or removing them (normalizing) with more analysis friendly substrings (see Sproat, Black, Chen, Kumar, Ostendorf, & Richards (2001) doi:10.1006/csla.2001.0169) or extracting them into new variables. For example, emoticons are often used in text but not always easily handled by analysis algorithms. The replace_emoticon() function replaces emoticons with word equivalents.

r-texttools 0.1.0

Propagated dependencies: r-data-table@1.17.4

Channel: guix-cran

Location: guix-cran/packages/t.scm (guix-cran packages t)

Home page: https://cran.r-project.org/package=textTools

Licenses: GPL 2+

Synopsis: Functions for Text Cleansing and Text Analysis

Description:

This package provides a framework for text cleansing and analysis. Conveniently prepare and process large amounts of text for analysis. Includes various metrics for word counts/frequencies that scale efficiently. Quickly analyze large amounts of text data using a text.table (a data.table created with one word (or unit of text analysis) per row, similar to the tidytext format). Offers flexibility to efficiently work with text data stored in vectors as well as text data formatted as a text.table.

r-texttinyr 1.1.8

Propagated dependencies: r-rcpparmadillo@14.4.3-1 r-rcpp@1.0.14 r-r6@2.6.1 r-matrix@1.7-3 r-data-table@1.17.4 r-bh@1.87.0-1

Channel: guix-cran

Location: guix-cran/packages/t.scm (guix-cran packages t)

Home page: https://github.com/mlampros/textTinyR

Licenses: GPL 3

Synopsis: Text Processing for Small or Big Data Files

Description:

It offers functions for splitting, parsing, tokenizing and creating a vocabulary for big text data files. Moreover, it includes functions for building a document-term matrix and extracting information from those (term-associations, most frequent terms). It also embodies functions for calculating token statistics (collocations, look-up tables, string dissimilarities) and functions to work with sparse matrices. Lastly, it includes functions for Word Vector Representations (i.e. GloVe', fasttext') and incorporates functions for the calculation of (pairwise) text document dissimilarities. The source code is based on C++11 and exported in R through the Rcpp', RcppArmadillo and BH packages.

r-textometry 0.1.6

Channel: guix-cran

Location: guix-cran/packages/t.scm (guix-cran packages t)

Home page: https://cran.r-project.org/package=textometry

Licenses: GPL 3+

Synopsis: Textual Data Analysis Package Used by the TXM Software

Description:

Statistical exploration of textual corpora using several methods from French Textometrie (new name of Lexicometrie') and French Data Analysis schools. It includes methods for exploring irregularity of distribution of lexicon features across text sets or parts of texts (Specificity analysis); multi-dimensional exploration (Factorial analysis), etc. Those methods are used in the TXM software.

r-texteffect 0.3

Propagated dependencies: r-mass@7.3-65 r-ggplot2@3.5.2 r-boot@1.3-31

Channel: guix-cran

Location: guix-cran/packages/t.scm (guix-cran packages t)

Home page: https://cran.r-project.org/package=texteffect

Licenses: GPL 2+

Synopsis: Discovering Latent Treatments in Text Corpora and Estimating Their Causal Effects

Description:

This package implements the approach described in Fong and Grimmer (2016) <https://aclweb.org/anthology/P/P16/P16-1151.pdf> for automatically discovering latent treatments from a corpus and estimating the average marginal component effect (AMCE) of each treatment. The data is divided into a training and test set. The supervised Indian Buffet Process (sibp) is used to discover latent treatments in the training set. The fitted model is then applied to the test set to infer the values of the latent treatments in the test set. Finally, Y is regressed on the latent treatments in the test set to estimate the causal effect of each treatment.

Page: 1 2

Total results: 35