DataSimilarity
Quantifying Similarity of Datasets and Multivariate Two- And k-Sample Testing
Description
A collection of methods for quantifying the similarity of two or more datasets, many of which can be used for two- or k-sample testing. It provides newly implemented methods as well as wrapper functions for existing methods that enable calling many different methods in a unified framework. The methods were selected from the review and comparison of Stolte et al. (2024) <doi:10.1214/24-SS149>. An empirical comparison of the methods was performed in Stolte et al. (2026) <doi:10.48550/arXiv.2604.11458> for categorical data and in Stolte et al. (2026) <doi:10.48550/arXiv.2604.12327> for numeric data.
Downloads
465
Last 30 days
8369th
1.1K
Last 90 days
2.9K
Last year
Trend: +21.1% (30d vs prior 30d)
17
Last 30 days
40
Last 90 days
160
Last year
Trend: 0% (30d vs prior 30d)
3
Last 7 days
22
Last 30 days
0
All-time
autoCRAN-only: this name is served only by autoCRAN, so the count is exact.
CRAN Check Status
Show all 13 flavors
| Flavor | Status |
|---|---|
| r-devel-linux-x86_64-debian-clang | NOTE |
| r-devel-linux-x86_64-debian-gcc | NOTE |
| r-devel-linux-x86_64-fedora-clang | NOTE |
| r-devel-linux-x86_64-fedora-gcc | NOTE |
| r-devel-windows-x86_64 | OK |
| r-oldrel-macos-arm64 | OK |
| r-oldrel-macos-x86_64 | OK |
| r-oldrel-windows-x86_64 | OK |
| r-patched-linux-x86_64 | NOTE |
| r-release-linux-x86_64 | NOTE |
| r-release-macos-arm64 | OK |
| r-release-macos-x86_64 | OK |
| r-release-windows-x86_64 | OK |
Check details (6 non-OK)
Rd cross-references
Unknown package ‘LPKsample’ in Rd xrefs
Rd cross-references
Unknown package ‘LPKsample’ in Rd xrefs
Rd cross-references
Undeclared packages ‘ranger’, ‘transport’ in Rd xrefs Unknown package ‘LPKsample’ in Rd xrefs
Rd cross-references
Unknown package ‘LPKsample’ in Rd xrefs
Rd cross-references
Unknown package ‘LPKsample’ in Rd xrefs
Rd cross-references
Unknown package ‘LPKsample’ in Rd xrefs
Check History
NOTE 12 OK · 1 NOTE · 0 WARNING · 0 ERROR · 0 FAILURE Jun 11, 2026
Rd cross-references
Unknown package ‘LPKsample’ in Rd xrefs
OK 13 OK · 0 NOTE · 0 WARNING · 0 ERROR · 0 FAILURE Jun 9, 2026
ERROR 12 OK · 0 NOTE · 0 WARNING · 1 ERROR · 0 FAILURE Jun 8, 2026
examples
Running examples in ‘DataSimilarity-Ex.R’ failed
The error most likely occurred in:
> base::assign(".ptime", proc.time(), pos = "CheckExEnv")
> ### Name: YMRZL
> ### Title: Yu et al. (2007) Two-Sample Test
> ### Aliases: YMRZL
> ### Keywords: htest
...[truncated]...
: ‘there is no package called ‘Ecume’’
Error in YMRZL(X1, X2) :
Package "Ecume" required for using method YMRZL().
Execution halted
Examples with CPU (user + system) or elapsed time > 5s
user system elapsed
Jeffreys 4.668 0.012 5.157