Skip to content

DataSimilarity

Quantifying Similarity of Datasets and Multivariate Two- And k-Sample Testing

v0.3.0 · Feb 27, 2026 · GPL (>= 3)

Description

A collection of methods for quantifying the similarity of two or more datasets, many of which can be used for two- or k-sample testing. It provides newly implemented methods as well as wrapper functions for existing methods that enable calling many different methods in a unified framework. The methods were selected from the review and comparison of Stolte et al. (2024) <doi:10.1214/24-SS149>. An empirical comparison of the methods for categorical data was performed in Stolte et al. (2025) <doi:10.17877/DE290R-25572>.

Downloads

290

Last 30 days

14187th

729

Last 90 days

2.4K

Last year

Trend: +10.7% (30d vs prior 30d)

CRAN Check Status

14 OK
Show all 14 flavors
Flavor Status
r-devel-linux-x86_64-debian-clang OK
r-devel-linux-x86_64-debian-gcc OK
r-devel-linux-x86_64-fedora-clang OK
r-devel-linux-x86_64-fedora-gcc OK
r-devel-macos-arm64 OK
r-devel-windows-x86_64 OK
r-oldrel-macos-arm64 OK
r-oldrel-macos-x86_64 OK
r-oldrel-windows-x86_64 OK
r-patched-linux-x86_64 OK
r-release-linux-x86_64 OK
r-release-macos-arm64 OK
r-release-macos-x86_64 OK
r-release-windows-x86_64 OK

Check History

OK 14 OK · 0 NOTE · 0 WARNING · 0 ERROR · 0 FAILURE Mar 10, 2026

Dependency Network

Dependencies Reverse dependencies boot DataSimilarity

Version History

new 0.3.0 Mar 10, 2026
updated 0.3.0 ← 0.2.0 diff Feb 26, 2026
updated 0.2.0 ← 0.1.1 diff Jun 15, 2025
new 0.1.1 Mar 17, 2025