zoomerjoin

Superlatively Fast Fuzzy Joins

v0.2.3 · Mar 14, 2026 · MIT + file LICENSE

Description

Lets users fuzzily merge data frames with millions or tens of millions of rows in minutes with low memory usage. The package uses the locality-sensitive hashing algorithms developed by Datar, Immorlica, Indyk and Mirrokni (2004) <doi:10.1145/997817.997857> and Broder (1998) <doi:10.1109/SEQUEN.1997.666900> to avoid comparing every pair of records across the datasets, so fuzzy merges finish in linear time.
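
The idea of avoiding all-pairs comparison can be sketched in a few lines. The following is a minimal Python illustration of MinHash with banding, in the spirit of the Broder reference above; it is not zoomerjoin's implementation or API. The function names, the shingle length, and the band parameters are all illustrative assumptions.

```python
import hashlib
from collections import defaultdict


def shingles(text, k=2):
    """Return the set of character k-grams of a lowercased string."""
    text = text.lower()
    return {text[i:i + k] for i in range(max(len(text) - k + 1, 1))}


def _hash(seed, token):
    """Deterministic 64-bit hash of a token; the seed selects the hash function."""
    digest = hashlib.blake2b(
        token.encode("utf-8"),
        digest_size=8,
        salt=seed.to_bytes(8, "little"),
    ).digest()
    return int.from_bytes(digest, "little")


def minhash_signature(tokens, n_hashes):
    """One minimum per seeded hash function; similar sets tend to share minima."""
    return tuple(min(_hash(seed, t) for t in tokens) for seed in range(n_hashes))


def jaccard(a, b):
    """Exact Jaccard similarity, used only to verify candidate pairs."""
    return len(a & b) / len(a | b)


def candidate_pairs(left, right, n_bands=20, band_width=2):
    """Return (i, j) index pairs whose signatures agree on at least one band.

    Each record is hashed into n_bands buckets, so the work grows with the
    number of records plus the number of bucket collisions, not with
    len(left) * len(right).
    """
    n_hashes = n_bands * band_width
    buckets = defaultdict(list)

    # Index the left table: one bucket key per band of the signature.
    for i, s in enumerate(left):
        sig = minhash_signature(shingles(s), n_hashes)
        for b in range(n_bands):
            key = (b, sig[b * band_width:(b + 1) * band_width])
            buckets[key].append(i)

    # Probe with the right table: only records sharing a bucket are paired.
    pairs = set()
    for j, s in enumerate(right):
        sig = minhash_signature(shingles(s), n_hashes)
        for b in range(n_bands):
            key = (b, sig[b * band_width:(b + 1) * band_width])
            for i in buckets.get(key, ()):
                pairs.add((i, j))
    return pairs


if __name__ == "__main__":
    left = ["Acme Corporation", "Globex Industries"]
    right = ["ACME Corporation Inc", "Globex Industry", "Initech"]
    for i, j in sorted(candidate_pairs(left, right)):
        # Candidates are then checked with the exact similarity measure.
        print(left[i], "|", right[j], "|", jaccard(shingles(left[i]), shingles(right[j])))
```

More bands or narrower bands make the filter more permissive (more candidates, fewer missed matches); fewer or wider bands make it stricter. The values above are arbitrary starting points, not recommendations.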

Downloads

Last 30 days: 595
Rank: 6705th
Last 90 days: 1.3K
Last year: 3.3K
Trend: +51% (30d vs prior 30d)

CRAN Check Status

6 NOTE
8 OK
Flavor Status
r-devel-linux-x86_64-debian-clang NOTE
r-devel-linux-x86_64-debian-gcc NOTE
r-devel-linux-x86_64-fedora-clang NOTE
r-devel-linux-x86_64-fedora-gcc NOTE
r-devel-macos-arm64 OK
r-devel-windows-x86_64 NOTE
r-oldrel-macos-arm64 OK
r-oldrel-macos-x86_64 OK
r-oldrel-windows-x86_64 OK
r-patched-linux-x86_64 NOTE
r-release-linux-x86_64 OK
r-release-macos-arm64 OK
r-release-macos-x86_64 OK
r-release-windows-x86_64 OK
Check details (6 non-OK)
NOTE r-devel-linux-x86_64-debian-clang

compiled code

File ‘zoomerjoin/libs/zoomerjoin.so’:
  Found non-API calls to R: ‘R_NamespaceRegistry’, ‘R_Srcref’,
    ‘R_UnboundValue’

Compiled code should not call non-API entry points in R.

See ‘Writing portable packages’ in the ‘Writing R Extensions’ manual,
and section ‘Moving into C API compliance’ for issues with the use of
non-API entry points.
NOTE r-devel-linux-x86_64-debian-gcc

compiled code

File ‘zoomerjoin/libs/zoomerjoin.so’:
  Found non-API calls to R: ‘R_NamespaceRegistry’, ‘R_UnboundValue’

Compiled code should not call non-API entry points in R.

See ‘Writing portable packages’ in the ‘Writing R Extensions’ manual,
and section ‘Moving into C API compliance’ for issues with the use of
non-API entry points.
NOTE r-devel-linux-x86_64-fedora-clang

compiled code

File ‘zoomerjoin/libs/zoomerjoin.so’:
  Found non-API calls to R: ‘R_NamespaceRegistry’, ‘R_UnboundValue’

Compiled code should not call non-API entry points in R.

See ‘Writing portable packages’ in the ‘Writing R Extensions’ manual,
and section ‘Moving into C API compliance’ for issues with the use of
non-API entry points.
NOTE r-devel-linux-x86_64-fedora-gcc

compiled code

File ‘zoomerjoin/libs/zoomerjoin.so’:
  Found non-API calls to R: ‘R_NamespaceRegistry’, ‘R_Srcref’,
    ‘R_UnboundValue’

Compiled code should not call non-API entry points in R.

See ‘Writing portable packages’ in the ‘Writing R Extensions’ manual,
and section ‘Moving into C API compliance’ for issues with the use of
non-API entry points.
NOTE r-devel-windows-x86_64

compiled code

File 'zoomerjoin/libs/x64/zoomerjoin.dll':
  Found non-API calls to R: 'R_NamespaceRegistry', 'R_Srcref',
    'R_UnboundValue'

Compiled code should not call non-API entry points in R.

See 'Writing portable packages' in the 'Writing R Extensions' manual,
and section 'Moving into C API compliance' for issues with the use of
non-API entry points.
NOTE r-patched-linux-x86_64

compiled code

File ‘zoomerjoin/libs/zoomerjoin.so’:
  Found non-API calls to R: ‘R_MissingArg’, ‘R_NamespaceRegistry’,
    ‘R_Srcref’, ‘R_UnboundValue’

Compiled code should not call non-API entry points in R.

See ‘Writing portable packages’ in the ‘Writing R Extensions’ manual,
and section ‘Moving into C API compliance’ for issues with the use of
non-API entry points.

Check History

NOTE 13 OK · 1 NOTE · 0 WARNING · 0 ERROR · 0 FAILURE Mar 27, 2026
NOTE r-devel-linux-x86_64-debian-gcc

compiled code

File ‘zoomerjoin/libs/zoomerjoin.so’:
  Found non-API calls to R: ‘R_MissingArg’, ‘R_NamespaceRegistry’,
    ‘R_Srcref’, ‘R_UnboundValue’

Compiled code should not call non-API entry points in R.

See ‘Writing portable packages’ in the ‘Writing R Extensions’ manual,
and section ‘Moving into C API compliance’ for issues with the use of
non-API entry points.
OK 14 OK · 0 NOTE · 0 WARNING · 0 ERROR · 0 FAILURE Mar 21, 2026
WARNING 5 OK · 0 NOTE · 9 WARNING · 0 ERROR · 0 FAILURE Mar 10, 2026
WARNING r-devel-linux-x86_64-debian-clang

for unstated dependencies in ‘tests’

'library' or 'require' call not declared from: ‘fuzzyjoin’
WARNING r-devel-linux-x86_64-debian-gcc

for unstated dependencies in ‘tests’

'library' or 'require' call not declared from: ‘fuzzyjoin’
WARNING r-devel-linux-x86_64-fedora-clang

for unstated dependencies in ‘tests’

'library' or 'require' call not declared from: ‘fuzzyjoin’
WARNING r-devel-linux-x86_64-fedora-gcc

for unstated dependencies in ‘tests’

'library' or 'require' call not declared from: ‘fuzzyjoin’
WARNING r-devel-windows-x86_64

for unstated dependencies in ‘tests’

'library' or 'require' call not declared from: 'fuzzyjoin'
WARNING r-patched-linux-x86_64

for unstated dependencies in ‘tests’

'library' or 'require' call not declared from: ‘fuzzyjoin’
WARNING r-release-linux-x86_64

for unstated dependencies in ‘tests’

'library' or 'require' call not declared from: ‘fuzzyjoin’
WARNING r-release-windows-x86_64

for unstated dependencies in ‘tests’

'library' or 'require' call not declared from: 'fuzzyjoin'
WARNING r-oldrel-windows-x86_64

for unstated dependencies in ‘tests’

'library' or 'require' call not declared from: 'fuzzyjoin'

Reverse Dependencies (2)

Dependency Network

Dependencies: dplyr, tibble, tidyr, rlang
Reverse dependencies: firmmatchr, highlightr

Version History

updated 0.2.3 ← 0.2.2 Mar 14, 2026
new 0.2.2 Mar 10, 2026
updated 0.2.2 ← 0.2.1 Jan 29, 2026
updated 0.2.1 ← 0.2.0 Apr 12, 2025
updated 0.2.0 ← 0.1.5 Sep 23, 2024
updated 0.1.5 ← 0.1.4 Jul 1, 2024
new 0.1.4 Jan 30, 2024