Skip to content

refinr

Cluster and Merge Similar Values Within a Character Vector

v0.3.3 · Nov 12, 2023 · GPL-3

Description

These functions take a character vector as input, identify and cluster similar values, and then merge clusters together so their values become identical. The functions are an implementation of the key collision and ngram fingerprint algorithms from the open source tool Open Refine <https://openrefine.org/>. More info on key collision and ngram fingerprint can be found here <https://openrefine.org/docs/technical-reference/clustering-in-depth>.

Downloads

CRAN

305

Last 30 days

12896th

810

Last 90 days

5.5K

Last year

Trend: +45.2% (30d vs prior 30d)

r2u CRAN

40

Last 30 days

137

Last 90 days

427

Last year

Trend: -21.6% (30d vs prior 30d)

CRAN Check Status

13 OK
Show all 13 flavors
Flavor Status
r-devel-linux-x86_64-debian-clang OK
r-devel-linux-x86_64-debian-gcc OK
r-devel-linux-x86_64-fedora-clang OK
r-devel-linux-x86_64-fedora-gcc OK
r-devel-windows-x86_64 OK
r-oldrel-macos-arm64 OK
r-oldrel-macos-x86_64 OK
r-oldrel-windows-x86_64 OK
r-patched-linux-x86_64 OK
r-release-linux-x86_64 OK
r-release-macos-arm64 OK
r-release-macos-x86_64 OK
r-release-windows-x86_64 OK

Check History

OK 14 OK · 0 NOTE · 0 WARNING · 0 ERROR · 0 FAILURE Apr 22, 2026
ERROR 13 OK · 0 NOTE · 0 WARNING · 1 ERROR · 0 FAILURE Apr 18, 2026
ERROR r-devel-windows-x86_64

whether package can be installed

Installation failed.
See 'd:/Rcompile/CRANpkg/local/4.6/refinr.Rcheck/00install.out' for details.
OK 14 OK · 0 NOTE · 0 WARNING · 0 ERROR · 0 FAILURE Mar 10, 2026

Dependency Network

Dependencies Reverse dependencies Rcpp stringdist stringi refinr

Version History

6 tracked
new 0.3.3 Mar 10, 2026
updated 0.3.3 ← 0.3.2 diff Nov 11, 2023
updated 0.3.2 ← 0.3.1 diff Apr 23, 2022
updated 0.3.1 ← 0.3.0 diff Jun 16, 2018
updated 0.3.0 ← 0.2.0 diff May 4, 2018
new 0.2.0 Jan 4, 2018