Skip to content

fuzzylink

Probabilistic Record Linkage Using Pretrained Text Embeddings

v0.4.1 · Feb 23, 2026 · MIT + file LICENSE

Description

Links datasets through fuzzy string matching using pretrained text embeddings. Produces more accurate record linkage when lexical string distance metrics are a poor guide to match quality (e.g., "Patricia" is more lexically similar to "Patrick" than it is to "Trish"). Capable of performing multilingual record linkage. Methods are described in Ornstein (2025) <doi:10.1017/pan.2025.10016>.

Downloads

441

Last 30 days

8619th

1.2K

Last 90 days

3.9K

Last year

Trend: +36.1% (30d vs prior 30d)

CRAN Check Status

14 OK
Show all 14 flavors
Flavor Status
r-devel-linux-x86_64-debian-clang OK
r-devel-linux-x86_64-debian-gcc OK
r-devel-linux-x86_64-fedora-clang OK
r-devel-linux-x86_64-fedora-gcc OK
r-devel-macos-arm64 OK
r-devel-windows-x86_64 OK
r-oldrel-macos-arm64 OK
r-oldrel-macos-x86_64 OK
r-oldrel-windows-x86_64 OK
r-patched-linux-x86_64 OK
r-release-linux-x86_64 OK
r-release-macos-arm64 OK
r-release-macos-x86_64 OK
r-release-windows-x86_64 OK

Check History

OK 14 OK · 0 NOTE · 0 WARNING · 0 ERROR · 0 FAILURE Mar 10, 2026

Dependency Network

Dependencies Reverse dependencies dplyr Rfast reshape2 stringdist stringr httr jsonlite httr2 ranger ellmer fuzzylink

Version History

new 0.4.1 Mar 10, 2026
updated 0.4.1 ← 0.3.0 diff Feb 22, 2026
updated 0.3.0 ← 0.2.5 diff Jan 22, 2026
updated 0.2.5 ← 0.2.4 diff Aug 28, 2025
updated 0.2.4 ← 0.2.1 diff Aug 17, 2025
new 0.2.1 Jun 13, 2025