An interface to the Microsoft 365 (formerly known as Office 365') suite of cloud services, building on the framework supplied by the AzureGraph
package. Enables access from R to data stored in Teams', SharePoint
Online and OneDrive
', including the ability to list drive folder contents, upload and download files, send messages, and retrieve data lists. Also provides a full-featured Outlook email client, with the ability to send emails and manage emails and mail folders.
This package provides a function to perform bias diagnostics on linear mixed models fitted with lmer()
from the lme4 package. Implements permutation tests for assessing the bias of fixed effects, as described in Karl and Zimmerman (2021) <doi:10.1016/j.jspi.2020.06.004>. Karl and Zimmerman (2020) <doi:10.17632/tmynggddfm.1> provide R code for implementing the test using mvglmmRank
output. Development of this package was assisted by GPT o1-preview for code structure and documentation.
Fits mixed Poisson regression models (Poisson-Inverse Gaussian or Negative-Binomial) on data sets with response variables being count data. The models can have varying precision parameter, where a linear regression structure (through a link function) is assumed to hold on the precision parameter. The Expectation-Maximization algorithm for both these models (Poisson Inverse Gaussian and Negative Binomial) is an important contribution of this package. Another important feature of this package is the set of functions to perform global and local influence analysis. See Barreto-Souza and Simas (2016) <doi:10.1007/s11222-015-9601-6> for further details.
Michel Rodange was a Luxembourguish writer and poet who lived in the 19th century. His most notable work is Rodange (1872, ISBN:1166177424), ("Renert oder de Fuuà am Frack an a Ma'nsgrëà t"), but he also wrote many more works, including Rodange, Tockert (1928) <https://www.autorenlexikon.lu/page/document/361/3614/1/FRE/index.html> ("D'Léierchen - Dem Léiweckerche säi Lidd") and Rodange, Welter (1929) <https://www.autorenlexikon.lu/page/document/361/3615/1/FRE/index.html> ("Dem Grow Sigfrid seng Goldkuommer"). This package contains three datasets, each made from the plain text versions of his works available on <https://data.public.lu/fr/datasets/the-works-in-luxembourguish-of-michel-rodange/>.
Many times, you will not find data for all dates. After first January, 2011 you may have next data on 20th January, 2011 and so on. Also available dates may have zero values. Try to gather all such kinds of data in different excel sheets of a single excel file. Every sheet will contain two columns (1st one is dates and second one is the data). After loading all the sheets into different elements of a list, using this you can fill the gaps for all the sheets and mark all the corresponding values as zeros. Here I am talking about daily data. Finally, it will combine all the filled results into one data frame (first column is date and other columns will be corresponding values of your sheets) and give one combined data frame. Number of columns in the data frame will be number of sheets plus one. Then imputation will be done. Daily to monthly and weekly conversion is also possible. More details can be found in Garai and others (2023) <doi:10.13140/RG.2.2.11977.42087>.
This package provides infrastructure to accurately measure and compare the execution time of R expressions.
This package provides a system for Analysis of LSD when there is one missing observation. Methods for this process is described in A.M.Gun,M.K.Gupta,B.Dasgupta(2019,ISBN:81-87567-81-3).
This package provides a system for Analysis of RBD when there is one missing observation. Methods for this process is described in A.M.Gun,M.K.Gupta,B.Dasgupta(2019,ISBN:81-87567-81-3).
This package provides a suite of methods for powerful and robust microbiome data analysis addressing zero-inflation, phylogenetic structure and compositional effects. The methods can be applied to the analysis of other (high-dimensional) compositional data arising from sequencing experiments.
Econometric analysis of multiple-input-multiple-output production technologies with ray-based input distance functions as suggested by Price and Henningsen (2022): "A Ray-Based Input Distance Function to Model Zero-Valued Output Quantities: Derivation and an Empirical Application", <https://ideas.repec.org/p/foi/wpaper/2022_03.html>.
Implementations of various robust and flexible model-based clustering methods for data sets with missing values at random. Two main models are: Multivariate Contaminated Normal Mixture (MCNM, Tong and Tortora, 2022, <doi:10.1007/s11634-021-00476-1>) and Multivariate Generalized Hyperbolic Mixture (MGHM, Wei et al., 2019, <doi:10.1016/j.csda.2018.08.016>). Mixtures via some special or limiting cases of the multivariate generalized hyperbolic distribution are also included: Normal-Inverse Gaussian, Symmetric Normal-Inverse Gaussian, Skew-Cauchy, Cauchy, Skew-t, Student's t, Normal, Symmetric Generalized Hyperbolic, Hyperbolic Univariate Marginals, Hyperbolic, and Symmetric Hyperbolic.
Normally building a GODB is fairly complicated, involving downloading multiple database files and using these to build e.g. a mySQL
database. Accessing this database is also complicated, involving an intimate knowledge of the database in order to construct reliable queries. Here we have a more modest goal, generating GOGOA3, which is a stripped down version of the GODB that is restricted to human genes as designated by the HUGO Gene Nomenclature Committee (HGNC) (see <https://geneontology.org/>). This can be built in a matter of seconds from 2 easily downloaded files (see <https://current.geneontology.org/products/pages/downloads.html> and <https://geneontology.org/docs/download-ontology/>), and it can be queried by e.g. w<-which(GOGOA3[,"HGNC"] %in% hgncList
) where GOGOA3 is a matrix representing the minimalist GODB and hgncList
is a list of gene identifiers. This database will be used in my upcoming package GoMiner
which is based on my previous publication (see Zeeberg, B.R., Feng, W., Wang, G. et al. (2003)<doi:10.1186/gb-2003-4-4-r28>). Relevant .RData files are available from GitHub
(<https://github.com/barryzee/GO>).
An approach to identify microbiome biomarker for time to event data by discovering microbiome for predicting survival and classifying subjects into risk groups. Classifiers are constructed as a linear combination of important microbiome and treatment effects if necessary. Several methods were implemented to estimate the microbiome risk score such as the LASSO method by Robert Tibshirani (1998) <doi:10.1002/(SICI)1097-0258(19970228)16:4%3C385::AID-SIM380%3E3.0.CO;2-3>, Elastic net approach by Hui Zou and Trevor Hastie (2005) <doi:10.1111/j.1467-9868.2005.00503.x>, supervised principle component analysis of Wold Svante et al. (1987) <doi:10.1016/0169-7439(87)80084-9>, and supervised partial least squares analysis by Inge S. Helland <https://www.jstor.org/stable/4616159>. Sensitivity analysis on the quantile used for the classification can also be accessed to check the deviation of the classification group based on the quantile specified. Large scale cross validation can be performed in order to investigate the mostly selected microbiome and for internal validation. During the evaluation process, validation is accessed using the hazard ratios (HR) distribution of the test set and inference is mainly based on resampling and permutations technique.
Response Surface Designs (RSDs) involving factors not all at same levels are called Mixed Level RSDs (or Asymmetric RSDs). In many practical situations, RSDs with asymmetric levels will be more suitable as it explores more regions in the design space. (J.S. Mehta and M.N. Das (1968) <doi:10.2307/1267046>. "Asymmetric rotatable designs and orthogonal transformations").This package contains function named ATORDs_I()
for generating asymmetric third order rotatable designs (ATORDs) based on third order designs given by Das and Narasimham (1962). Function ATORDs_II()
generates asymmetric third order rotatable designs developed using t-design of unequal set sizes, which are smaller in size as compared to design generated by function ATORDs_I()
. In general, third order rotatable designs can be classified into two classes viz., designs that are suitable for sequential experimentation and designs for non-sequential experimentation. The sequential experimentation approach involves conducting the trials step by step whereas, in the non-sequential experimentation approach, the entire runs are executed in one go (M. N. Das and V. Narasimham (1962) <doi:10.1214/AOMS/1177704374>. "Construction of Rotatable Designs through Balanced Incomplete Block Designs"). ATORDs_I()
and ATORDs_II()
functions generate non-sequential asymmetric third order designs. Function named SeqTORD()
generates symmetric sequential third order design in blocks and also gives G-efficiency of the given design. Function named Asymseq()
generates asymmetric sequential third order designs in blocks (M. Hemavathi, Eldho Varghese, Shashi Shekhar and Seema Jaggi (2020) <doi:10.1080/02664763.2020.1864817>. "Sequential asymmetric third order rotatable designs (SATORDs)"). In response surface design, situations may arise in which some of the factors are qualitative in nature (Jyoti Divecha and Bharat Tarapara (2017) <doi:10.1080/08982112.2016.1217338>. "Small, balanced, efficient, optimal, and near rotatable response surface designs for factorial experiments asymmetrical in some quantitative, qualitative factors"). The Function named QualRSD()
generates second order design with qualitative factors along with their D-efficiency and G-efficiency. The function named RotatabilityQ()
calculates a measure of rotatability (measure Q, 0 <= Q <= 1) given by Draper and Pukelshiem(1990) for given a design based on a second order model, (Norman R. Draper and Friedrich Pukelsheim(1990) <doi:10.1080/00401706.1990.10484635>. "Another look at rotatability").
Analysis of de novo copy number variants in trios from high-dimensional genotyping platforms.
This package provides a toolkit for simulating differential microbiome data designed for longitudinal analyses. Several functional forms may be specified for the mean trend. Observations are drawn from a multivariate normal model. The objective of this package is to be able to simulate data in order to accurately compare different longitudinal methods for differential abundance.
Fit growth curves to various known microbial growth models automatically to estimate growth parameters. Growth curves can be plotted with their uncertainty band. Growth models are: modified Gompertz model (Zwietering et al. (1990) <doi:10.1128/aem.56.6.1875-1881.1990>), Baranyi model (Baranyi and Roberts (1994) <doi:10.1016/0168-1605%2894%2990157-0>), Rosso model (Rosso et al. (1993) <doi:10.1006/jtbi.1993.1099>) and linear model (Dantigny (2005) <doi:10.1016/j.ijfoodmicro.2004.10.013>).
Discrete event simulation using both R and C++ (Karlsson et al 2016; <doi:10.1109/eScience.2016.7870915>
). The C++ code is adapted from the SSIM library <https://www.inf.usi.ch/carzaniga/ssim/>, allowing for event-oriented simulation. The code includes a SummaryReport
class for reporting events and costs by age and other covariates. The C++ code is available as a static library for linking to other packages. A priority queue implementation is given in C++ together with an S3 closure and a reference class implementation. Finally, some tools are provided for cost-effectiveness analysis.
Allows the user to estimate transition probabilities for migratory animals between any two phases of the annual cycle, using a variety of different data types. Also quantifies the strength of migratory connectivity (MC), a standardized metric to quantify the extent to which populations co-occur between two phases of the annual cycle. Includes functions to estimate MC and the more traditional metric of migratory connectivity strength (Mantel correlation) incorporating uncertainty from multiple sources of sampling error. For cross-species comparisons, methods are provided to estimate differences in migratory connectivity strength, incorporating uncertainty. See Cohen et al. (2018) <doi:10.1111/2041-210X.12916>, Cohen et al. (2019) <doi:10.1111/ecog.03974>, and Roberts et al. (2023) <doi:10.1002/eap.2788> for details on some of these methods.
This package provides a package containing an environment representing the miRNA-1_0_2Xgain.CDF
file.
Detection of migration events and segments of continuous residence based on irregular time series of location data as published in Chi et al. (2020) <doi:10.1371/journal.pone.0239408>.
Translating mature miRNA
names to different miRBase
versions, sequence retrieval, checking names for validity and detecting miRBase
version of a given set of names (data from http://www.mirbase.org/).
This package provides a comprehensive tool for converting and retrieving the miRNA
Name, Accession, Sequence, Version, History and Family information in different miRBase
versions. It can process a huge number of miRNAs
in a short time without other depends.
To date, a number of methods have been developed for microbiome marker discovery based on metagenomic profiles, e.g. LEfSe
. However, all of these methods have its own advantages and disadvantages, and none of them is considered standard or universal. Moreover, different programs or softwares may be development using different programming languages, even in different operating systems. Here, we have developed an all-in-one R package microbiomeMarker
that integrates commonly used differential analysis methods as well as three machine learning-based approaches, including Logistic regression, Random forest, and Support vector machine, to facilitate the identification of microbiome markers.