textreuse

Detect Text Reuse and Document Similarity

v1.0.1 · May 7, 2026 · MIT + file LICENSE

Description

Tools for measuring similarity among documents and detecting passages which have been reused. Implements shingled n-gram, skip n-gram, and other tokenizers; similarity/dissimilarity functions; pairwise comparisons; minhash and locality sensitive hashing algorithms; and a version of the Smith-Waterman local alignment algorithm suitable for natural language.
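
The ideas named in the description (shingled n-grams, a similarity function, and minhash) can be illustrated with a small, language-neutral sketch. This is Python written only to show the concepts; it is not the package's R API, and every function name below (`shingle_ngrams`, `jaccard_similarity`, `minhash_signature`, `minhash_estimate`) is hypothetical. It also assumes whitespace tokenization and SHA-1 based hashing, which are choices made for the example.

```python
import hashlib


def shingle_ngrams(text, n=3):
    """Return the set of overlapping word n-grams (shingles) in text."""
    words = text.lower().split()
    return {" ".join(words[i:i + n]) for i in range(len(words) - n + 1)}


def jaccard_similarity(a, b):
    """Size of the intersection divided by the size of the union."""
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)


def _seeded_hash(seed, shingle):
    """Deterministic 64-bit hash of a shingle under a given seed (assumed scheme)."""
    digest = hashlib.sha1(f"{seed}:{shingle}".encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big")


def minhash_signature(shingles, num_hashes=64):
    """Keep the minimum hash value per seed; shingles must be non-empty."""
    if not shingles:
        raise ValueError("cannot compute a minhash signature of an empty set")
    return [min(_seeded_hash(seed, s) for s in shingles)
            for seed in range(num_hashes)]


def minhash_estimate(sig_a, sig_b):
    """Fraction of matching positions, an estimate of Jaccard similarity."""
    matches = sum(x == y for x, y in zip(sig_a, sig_b))
    return matches / len(sig_a)


doc_a = "the quick brown fox jumps over the lazy dog"
doc_b = "the quick brown fox leaps over the lazy dog"

a = shingle_ngrams(doc_a, n=3)
b = shingle_ngrams(doc_b, n=3)

exact = jaccard_similarity(a, b)
estimate = minhash_estimate(minhash_signature(a), minhash_signature(b))

print(exact)     # 0.4 (4 shared trigrams out of 10 distinct ones)
print(estimate)  # an approximation of the value above
```

The signature has a fixed length regardless of document size, which is what makes minhash useful as an input to locality sensitive hashing: candidate pairs can be found by comparing short signatures instead of full shingle sets.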

Downloads

CRAN (ranked 5262nd)

620 last 30 days · 1.7K last 90 days · 13.9K last year

Trend: -4.6% (30d vs prior 30d)

r2u CRAN

50 last 30 days · 143 last 90 days · 466 last year

Trend: 0% (30d vs prior 30d)

autoCRAN

2 last 7 days · 11 last 30 days · 0 all-time

autoCRAN-only: this name is served only by autoCRAN, so the count is exact.

CRAN Check Status

13 OK
Flavor Status
r-devel-linux-x86_64-debian-clang OK
r-devel-linux-x86_64-debian-gcc OK
r-devel-linux-x86_64-fedora-clang OK
r-devel-linux-x86_64-fedora-gcc OK
r-devel-windows-x86_64 OK
r-oldrel-macos-arm64 OK
r-oldrel-macos-x86_64 OK
r-oldrel-windows-x86_64 OK
r-patched-linux-x86_64 OK
r-release-linux-x86_64 OK
r-release-macos-arm64 OK
r-release-macos-x86_64 OK
r-release-windows-x86_64 OK

Check History

OK 13 OK · 0 NOTE · 0 WARNING · 0 ERROR · 0 FAILURE Jun 9, 2026
ERROR 12 OK · 0 NOTE · 0 WARNING · 1 ERROR · 0 FAILURE Jun 7, 2026
ERROR r-devel-linux-x86_64-debian-gcc

Rd line widths

Fatal error: cannot create 'R_TempDir'
These lines will be truncated in the PDF manual.
OK 12 OK · 0 NOTE · 0 WARNING · 0 ERROR · 0 FAILURE Apr 25, 2026
NOTE 11 OK · 3 NOTE · 0 WARNING · 0 ERROR · 0 FAILURE Apr 22, 2026
NOTE r-oldrel-macos-arm64

LazyData

'LazyData' is specified without a 'data' directory
NOTE r-oldrel-macos-x86_64

LazyData

'LazyData' is specified without a 'data' directory
NOTE r-oldrel-windows-x86_64

LazyData

'LazyData' is specified without a 'data' directory
ERROR 10 OK · 3 NOTE · 0 WARNING · 1 ERROR · 0 FAILURE Apr 18, 2026
ERROR r-devel-windows-x86_64

whether package can be installed

Installation failed.
See 'd:/Rcompile/CRANpkg/local/4.6/textreuse.Rcheck/00install.out' for details.
NOTE r-oldrel-macos-arm64

LazyData

'LazyData' is specified without a 'data' directory
NOTE r-oldrel-macos-x86_64

LazyData

'LazyData' is specified without a 'data' directory
NOTE r-oldrel-windows-x86_64

LazyData

'LazyData' is specified without a 'data' directory
NOTE 11 OK · 3 NOTE · 0 WARNING · 0 ERROR · 0 FAILURE Apr 10, 2026
NOTE r-oldrel-macos-arm64

LazyData

'LazyData' is specified without a 'data' directory
NOTE r-oldrel-macos-x86_64

LazyData

'LazyData' is specified without a 'data' directory
NOTE r-oldrel-windows-x86_64

LazyData

'LazyData' is specified without a 'data' directory
ERROR 10 OK · 3 NOTE · 0 WARNING · 1 ERROR · 0 FAILURE Apr 9, 2026
ERROR r-devel-linux-x86_64-debian-gcc

whether package can be installed

Installation failed.
See ‘/home/hornik/tmp/R.check/r-devel-gcc/Work/PKGS/textreuse.Rcheck/00install.out’ for details.
NOTE r-oldrel-macos-arm64

LazyData

'LazyData' is specified without a 'data' directory
NOTE r-oldrel-macos-x86_64

LazyData

'LazyData' is specified without a 'data' directory
NOTE r-oldrel-windows-x86_64

LazyData

'LazyData' is specified without a 'data' directory
NOTE 11 OK · 3 NOTE · 0 WARNING · 0 ERROR · 0 FAILURE Mar 10, 2026
NOTE r-oldrel-macos-arm64

LazyData

'LazyData' is specified without a 'data' directory
NOTE r-oldrel-macos-x86_64

LazyData

'LazyData' is specified without a 'data' directory
NOTE r-oldrel-windows-x86_64

LazyData

'LazyData' is specified without a 'data' directory

Reverse Dependencies (1)

suggests

Dependency Network

Dependencies: assertthat, digest, dplyr, NLP, Matrix, Rcpp, RcppProgress, stringr, tibble, tidyr

Reverse dependencies: textrank

Version History

8 tracked
updated 1.0.1 ← 0.1.5 May 7, 2026
new 0.1.5 Mar 10, 2026
updated 0.1.5 ← 0.1.4 May 14, 2020
updated 0.1.4 ← 0.1.3 Nov 27, 2016
updated 0.1.3 ← 0.1.2 Mar 27, 2016
updated 0.1.2 ← 0.1.1 Nov 5, 2015
updated 0.1.1 ← 0.1.0 Nov 3, 2015
new 0.1.0 Oct 30, 2015