June 2022 list of packages of interest
- Introduction
- eurodata: Fast and Easy Eurostat Data Import and Search
- gdiff: Graphical Difference Testing
- aplot: Decorate a 'ggplot' with Associated Information
- SHT: Statistical Hypothesis Testing Toolbox
- gghighlight: Highlight Lines and Points in 'ggplot2'
- pdp: Partial Dependence Plots
- contingencytables: Statistical Analysis of Contingency Tables
- contingencytables: Statistical Analysis of Contingency Tables
- testDriveR: Teaching Data for Statistics and Data Science
- vcd: Visualizing Categorical Data
- gTestsMulti: New Graph-Based Multi-Sample Tests
TOC
- eurodata: Fast and Easy Eurostat Data Import and Search
- gdiff: Graphical Difference Testing
- aplot: Decorate a 'ggplot' with Associated Information
- SHT: Statistical Hypothesis Testing Toolbox
- gghighlight: Highlight Lines and Points in 'ggplot2'
- pdp: Partial Dependence Plots
- contingencytables: Statistical Analysis of Contingency Tables
- testDriveR: Teaching Data for Statistics and Data Science
- vcd: Visualizing Categorical Data
- gTestsMulti: New Graph-Based Multi-Sample Tests
Introduction
Each month I will describe the package that I've discovered or rediscovered and the ones that I've used the most of my time. I will start with the package used in my work and the the one that I would like to try/did not had time to use for work and also fun
Each card is organised as this
Name of the package: short description
mytags: #example tag
links
[cran package link]
[cran vignette link]
[github link]
description from the author/vignette
mynotes
eurodata: Fast and Easy Eurostat Data Import and Search
mytags: #statistics #data analysis
links
[cran package link] https://CRAN.R-project.org/package=eurodata
[vignette link]
[github link] https://github.com/alekrutkowski/eurodata/
description from the author/vignette
Interface to Eurostat’s Bulk Download Facility with fast data.table-based import of data, labels, and metadata. On top of the core functionality, data search and data description/comparison functions are also provided.
mynotes
gdiff: Graphical Difference Testing
mytags: #plot #data analysis #comparison
links
[cran package link] https://CRAN.R-project.org/package=gdiff
[vignette link]
[github link] https://github.com/pmur002/
description from the author/vignette
Functions for performing graphical difference testing. Differences are generated between raster images. Comparisons can be performed between different package versions and between different R versions.
mynotes
aplot: Decorate a 'ggplot' with Associated Information
mytags: #plot #data analysis #ggplot
links
[cran package link] https://CRAN.R-project.org/package=aplot
[vignette link]
[github link] https://github.com/YuLab-SMU/aplot
description from the author/vignette
For many times, we are not just aligning plots as what 'cowplot' and 'patchwork' did. Users would like to align associated information that requires axes to be exactly matched in subplots, e.g. hierarchical clustering with a heatmap. This package provides utilities to aligns associated subplots to a main plot at different sides (left, right, top and bottom) with axes exactly matched.
mynotes
SHT: Statistical Hypothesis Testing Toolbox
mytags: #statistics #data analysis #comparison
links
[cran package link] https://CRAN.R-project.org/package=SHT
[vignette link]
[github link] https://github.com/kisungyou/SHT/issues
description from the author/vignette
We provide a collection of statistical hypothesis testing procedures ranging from classical to modern methods for non-trivial settings such as high-dimensional scenario. For the general treatment of statistical hypothesis testing, see the book by Lehmann and Romano (2005) doi:10.1007/0-387-27605-X. mynotes
gghighlight: Highlight Lines and Points in 'ggplot2'
mytags: #plot #data analysis #ggplot
links
[cran package link] https://CRAN.R-project.org/package=gghighlight
[vignette link] https://cran.r-project.org/web/packages/gghighlight/index.html#:~:text=Introduction%20to%20gghighlight
[github link] https://github.com/yutannihilation/gghighlight/
description from the author/vignette
Make it easier to explore data with highlights.
mynotes
pdp: Partial Dependence Plots
mytags: #plot #data analysis #ggplot
links
[cran package link] https://CRAN.R-project.org/package=pdp
[vignette link] https://cran.r-project.org/web/packages/pdp/vignettes/pdp-approximate.pdf
[github link] https://github.com/yutannihilation/gghighlight/
description from the author/vignette
A general framework for constructing partial dependence (i.e., marginal effect) plots from various types machine learning models in R.
mynotes
contingencytables: Statistical Analysis of Contingency Tables
mytags: #plot #data analysis #ggplot
links
[cran package link] https://contingencytables.com/
[vignette link]
[github link] https://github.com/ocbe-uio/contingencytables/issues
description from the author/vignette
Provides functions to perform statistical inference of data organized in contingency tables. This package is a companion to the "Statistical Analysis of Contingency Tables" book by Fagerland et al. <ISBN 9781466588172>.
mynotes
contingencytables: Statistical Analysis of Contingency Tables
mytags: #plot #data analysis #ggplot
links
[cran package link] https://contingencytables.com/
[vignette link]
[github link] https://github.com/ocbe-uio/contingencytables/issues
description from the author/vignette
Provides functions to perform statistical inference of data organized in contingency tables. This package is a companion to the "Statistical Analysis of Contingency Tables" book by Fagerland et al. <ISBN 9781466588172>.
mynotes
testDriveR: Teaching Data for Statistics and Data Science
mytags: #plot #data analysis #ggplot
links
[cran package link] https://CRAN.R-project.org/package=testDriveR
[vignette link]
[github link] https://github.com/chris-prener/testDriveR
description from the author/vignette
Provides data sets for teaching statistics and data science courses. It includes a sample of data from John Edmund Kerrich's famous coinflip experiment. These are data that I used for teaching SOC 4015 / SOC 5050 at Saint Louis University (SLU). The package also contains an R Markdown template with the required formatting for assignments in my courses SOC 4015, SOC 4650, SOC 5050, and SOC 5650 at SLU.
mynotes
vcd: Visualizing Categorical Data
mytags: #plot #data analysis #ggplot
links
[cran package link] https://CRAN.R-project.org/package=testDriveR
[vignette link] https://cran.r-project.org/web/packages/vcd/vignettes/residual-shadings.pdf
[github link]
description from the author/vignette
Visualization techniques, data sets, summary and inference procedures aimed particularly at categorical data. Special emphasis is given to highly extensible grid graphics. The package was package was originally inspired by the book "Visualizing Categorical Data" by Michael Friendly and is now the main support package for a new book, "Discrete Data Analysis with R" by Michael Friendly and David Meyer (2015).
mynotes
gTestsMulti: New Graph-Based Multi-Sample Tests
mytags: #plot #data analysis #ggplot
links
[cran package link] https://CRAN.R-project.org/package=gTestsMulti
[vignette link]
[github link]
description from the author/vignette
New multi-sample tests for testing whether multiple samples are from the same distribution. They work well particularly for high-dimensional data. Song, H. and Chen, H. (2022) arXiv:2205.13787. mynotes