textreuse

Detect Text Reuse and Document Similarity

v0.1.5 · May 15, 2020 · MIT + file LICENSE

Description

Tools for measuring similarity among documents and detecting passages that have been reused. Implements shingled n-gram, skip n-gram, and other tokenizers; similarity/dissimilarity functions; pairwise comparisons; minhash and locality-sensitive hashing algorithms; and a version of the Smith-Waterman local alignment algorithm suitable for natural language.
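
To make the terms in the description concrete, here is a minimal, language-neutral sketch of shingled word n-grams, Jaccard similarity, and a minhash estimate of that similarity. It is written in Python purely for illustration and is not the package's R API or its implementation: the function names, the choice of BLAKE2b as the hash, and the default of 64 hash functions are all assumptions made for this example.

```python
import hashlib


def shingle_ngrams(text, n=3):
    """Return the set of overlapping word n-grams (shingles) in text."""
    words = text.lower().split()
    return {" ".join(words[i:i + n]) for i in range(len(words) - n + 1)}


def jaccard_similarity(a, b):
    """Jaccard similarity |A & B| / |A | B| of two sets.

    Returns 0.0 when both sets are empty (a convention chosen here).
    """
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)


def _hash64(token, seed):
    """Hypothetical helper: a seeded 64-bit hash built from BLAKE2b."""
    digest = hashlib.blake2b(
        token.encode("utf-8"),
        digest_size=8,
        salt=seed.to_bytes(8, "little"),
    ).digest()
    return int.from_bytes(digest, "big")


def minhash_signature(shingles, num_hashes=64):
    """Keep the minimum hash value per seeded hash function.

    Assumes shingles is non-empty; min() raises ValueError otherwise.
    """
    return [min(_hash64(s, seed) for s in shingles) for seed in range(num_hashes)]


def minhash_estimate(sig_a, sig_b):
    """Estimate Jaccard similarity as the fraction of matching positions."""
    matches = sum(1 for x, y in zip(sig_a, sig_b) if x == y)
    return matches / len(sig_a)


a = shingle_ngrams("the quick brown fox jumps over the lazy dog")
b = shingle_ngrams("the quick brown fox leaps over the lazy dog")

# Exact similarity: 4 shared shingles out of 10 distinct ones.
print(jaccard_similarity(a, b))  # 0.4

# Approximate similarity from fixed-length signatures.
print(minhash_estimate(minhash_signature(a), minhash_signature(b)))
```

The exact Jaccard score compares full shingle sets, while the minhash estimate compares only short signatures. Locality-sensitive hashing builds on such signatures to find candidate pairs without comparing every pair of documents.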

Downloads

Last 30 days: 389 (ranked 9737th)
Last 90 days: 1.7K
Last year: 13.5K

Trend: -10.6% (30d vs prior 30d)

CRAN Check Status

3 NOTE
11 OK
Flavor Status
r-devel-linux-x86_64-debian-clang OK
r-devel-linux-x86_64-debian-gcc OK
r-devel-linux-x86_64-fedora-clang OK
r-devel-linux-x86_64-fedora-gcc OK
r-devel-macos-arm64 OK
r-devel-windows-x86_64 OK
r-oldrel-macos-arm64 NOTE
r-oldrel-macos-x86_64 NOTE
r-oldrel-windows-x86_64 NOTE
r-patched-linux-x86_64 OK
r-release-linux-x86_64 OK
r-release-macos-arm64 OK
r-release-macos-x86_64 OK
r-release-windows-x86_64 OK
Check details (3 non-OK)
NOTE r-oldrel-macos-arm64: LazyData: 'LazyData' is specified without a 'data' directory
NOTE r-oldrel-macos-x86_64: LazyData: 'LazyData' is specified without a 'data' directory
NOTE r-oldrel-windows-x86_64: LazyData: 'LazyData' is specified without a 'data' directory

Check History

Mar 10, 2026: NOTE (11 OK · 3 NOTE · 0 WARNING · 0 ERROR · 0 FAILURE)

Reverse Dependencies (1)

suggests: textrank

Dependency Network

Dependencies: assertthat, digest, dplyr, NLP, Rcpp, RcppProgress, stringr, tibble, tidyr
Reverse dependencies: textrank

Version History

new 0.1.5 Mar 10, 2026
updated 0.1.5 ← 0.1.4 May 14, 2020
updated 0.1.4 ← 0.1.3 Nov 27, 2016
updated 0.1.3 ← 0.1.2 Mar 27, 2016
updated 0.1.2 ← 0.1.1 Nov 5, 2015
updated 0.1.1 ← 0.1.0 Nov 3, 2015
new 0.1.0 Oct 30, 2015