Skip to content

DataSimilarity

Quantifying Similarity of Datasets and Multivariate Two- And k-Sample Testing

v0.4.0 · May 15, 2026 · GPL (>= 3)

Description

A collection of methods for quantifying the similarity of two or more datasets, many of which can be used for two- or k-sample testing. It provides newly implemented methods as well as wrapper functions for existing methods that enable calling many different methods in a unified framework. The methods were selected from the review and comparison of Stolte et al. (2024) <doi:10.1214/24-SS149>. An empirical comparison of the methods was performed in Stolte et al. (2026) <doi:10.48550/arXiv.2604.11458> for categorical data and in Stolte et al. (2026) <doi:10.48550/arXiv.2604.12327> for numeric data.

Downloads

CRAN

465

Last 30 days

8369th

1.1K

Last 90 days

2.9K

Last year

Trend: +21.1% (30d vs prior 30d)

r2u CRAN

17

Last 30 days

40

Last 90 days

160

Last year

Trend: 0% (30d vs prior 30d)

autoCRAN

3

Last 7 days

22

Last 30 days

0

All-time

autoCRAN-only: this name is served only by autoCRAN, so the count is exact.

CRAN Check Status

6 NOTE
7 OK
Show all 13 flavors
Flavor Status
r-devel-linux-x86_64-debian-clang NOTE
r-devel-linux-x86_64-debian-gcc NOTE
r-devel-linux-x86_64-fedora-clang NOTE
r-devel-linux-x86_64-fedora-gcc NOTE
r-devel-windows-x86_64 OK
r-oldrel-macos-arm64 OK
r-oldrel-macos-x86_64 OK
r-oldrel-windows-x86_64 OK
r-patched-linux-x86_64 NOTE
r-release-linux-x86_64 NOTE
r-release-macos-arm64 OK
r-release-macos-x86_64 OK
r-release-windows-x86_64 OK
Check details (6 non-OK)
NOTE r-devel-linux-x86_64-debian-clang

Rd cross-references

Unknown package ‘LPKsample’ in Rd xrefs
NOTE r-devel-linux-x86_64-debian-gcc

Rd cross-references

Unknown package ‘LPKsample’ in Rd xrefs
NOTE r-devel-linux-x86_64-fedora-clang

Rd cross-references

Undeclared packages ‘ranger’, ‘transport’ in Rd xrefs
Unknown package ‘LPKsample’ in Rd xrefs
NOTE r-devel-linux-x86_64-fedora-gcc

Rd cross-references

Unknown package ‘LPKsample’ in Rd xrefs
NOTE r-patched-linux-x86_64

Rd cross-references

Unknown package ‘LPKsample’ in Rd xrefs
NOTE r-release-linux-x86_64

Rd cross-references

Unknown package ‘LPKsample’ in Rd xrefs

Check History

NOTE 12 OK · 1 NOTE · 0 WARNING · 0 ERROR · 0 FAILURE Jun 11, 2026
NOTE r-devel-linux-x86_64-fedora-gcc

Rd cross-references

Unknown package ‘LPKsample’ in Rd xrefs
OK 13 OK · 0 NOTE · 0 WARNING · 0 ERROR · 0 FAILURE Jun 9, 2026
ERROR 12 OK · 0 NOTE · 0 WARNING · 1 ERROR · 0 FAILURE Jun 8, 2026
ERROR r-devel-linux-x86_64-debian-gcc

examples

Running examples in ‘DataSimilarity-Ex.R’ failed
The error most likely occurred in:

> base::assign(".ptime", proc.time(), pos = "CheckExEnv")
> ### Name: YMRZL
> ### Title: Yu et al. (2007) Two-Sample Test
> ### Aliases: YMRZL
> ### Keywords: htest 
...[truncated]...
:  ‘there is no package called ‘Ecume’’
Error in YMRZL(X1, X2) : 
  Package "Ecume" required for using method YMRZL().
Execution halted
Examples with CPU (user + system) or elapsed time > 5s
          user system elapsed
Jeffreys 4.668  0.012   5.157
OK 14 OK · 0 NOTE · 0 WARNING · 0 ERROR · 0 FAILURE Mar 10, 2026

Dependency Network

Dependencies Reverse dependencies boot DataSimilarity

Version History

5 tracked
updated 0.4.0 ← 0.3.0 diff May 15, 2026
new 0.3.0 Mar 10, 2026
updated 0.3.0 ← 0.2.0 diff Feb 26, 2026
updated 0.2.0 ← 0.1.1 diff Jun 15, 2025
new 0.1.1 Mar 17, 2025