Skip to content

koRpus

Text Analysis with Emphasis on POS Tagging, Readability, and Lexical Diversity

v0.13-9 · Feb 3, 2026 · GPL (>= 3)

Description

A set of tools to analyze texts. Includes, amongst others, functions for automatic language detection, hyphenation, several indices of lexical diversity (e.g., type token ratio, HD-D/vocd-D, MTLD) and readability (e.g., Flesch, SMOG, LIX, Dale-Chall). Basic import functions for language corpora are also provided, to enable frequency analyses (supports Celex and Leipzig Corpora Collection file formats) and measures like tf-idf. Note: For full functionality a local installation of TreeTagger is recommended. It is also recommended to not load this package directly, but by loading one of the available language support packages from the 'l10n' repository <https://undocumeantit.github.io/repos/l10n/>. 'koRpus' also includes a plugin for the R GUI and IDE RKWard, providing graphical dialogs for its basic features. The respective R package 'rkward' cannot be installed directly from a repository, as it is a part of RKWard. To make full use of this feature, please install RKWard from <https://rkward.kde.org> (plugins are detected automatically). Due to some restrictions on CRAN, the full package sources are only available from the project homepage. To ask for help, report bugs, request features, or discuss the development of the package, please subscribe to the koRpus-dev mailing list (<https://korpusml.reaktanz.de>).

Downloads

3.6K

Last 30 days

2114th

12.1K

Last 90 days

70.9K

Last year

Trend: -4.5% (30d vs prior 30d)

CRAN Check Status

3 NOTE
11 OK
Show all 14 flavors
Flavor Status
r-devel-linux-x86_64-debian-clang OK
r-devel-linux-x86_64-debian-gcc OK
r-devel-linux-x86_64-fedora-clang OK
r-devel-linux-x86_64-fedora-gcc OK
r-devel-macos-arm64 OK
r-devel-windows-x86_64 OK
r-oldrel-macos-arm64 NOTE
r-oldrel-macos-x86_64 NOTE
r-oldrel-windows-x86_64 NOTE
r-patched-linux-x86_64 OK
r-release-linux-x86_64 OK
r-release-macos-arm64 OK
r-release-macos-x86_64 OK
r-release-windows-x86_64 OK
Check details (3 non-OK)
NOTE r-oldrel-macos-arm64

package dependencies

Packages suggested but not available for checking:
  'koRpus.lang.de', 'koRpus.lang.es', 'koRpus.lang.fr',
  'koRpus.lang.it', 'koRpus.lang.nl', 'koRpus.lang.pt',
  'koRpus.lang.ru'

Package which this enhances but not available for checking: ‘rkward’
NOTE r-oldrel-macos-x86_64

package dependencies

Packages suggested but not available for checking:
  'koRpus.lang.de', 'koRpus.lang.es', 'koRpus.lang.fr',
  'koRpus.lang.it', 'koRpus.lang.nl', 'koRpus.lang.pt',
  'koRpus.lang.ru'

Package which this enhances but not available for checking: ‘rkward’
NOTE r-oldrel-windows-x86_64

package dependencies

Packages suggested but not available for checking:
  'koRpus.lang.de', 'koRpus.lang.es', 'koRpus.lang.fr',
  'koRpus.lang.it', 'koRpus.lang.nl', 'koRpus.lang.pt',
  'koRpus.lang.ru'

Package which this enhances but not available for checking: 'rkward'

Check History

NOTE 11 OK · 3 NOTE · 0 WARNING · 0 ERROR · 0 FAILURE Mar 10, 2026
NOTE r-oldrel-macos-arm64

package dependencies

Packages suggested but not available for checking:
  'koRpus.lang.de', 'koRpus.lang.es', 'koRpus.lang.fr',
  'koRpus.lang.it', 'koRpus.lang.nl', 'koRpus.lang.pt',
  'koRpus.lang.ru'

Package which this enhances but not available for checking: ‘rkward’
NOTE r-oldrel-macos-x86_64

package dependencies

Packages suggested but not available for checking:
  'koRpus.lang.de', 'koRpus.lang.es', 'koRpus.lang.fr',
  'koRpus.lang.it', 'koRpus.lang.nl', 'koRpus.lang.pt',
  'koRpus.lang.ru'

Package which this enhances but not available for checking: ‘rkward’
NOTE r-oldrel-windows-x86_64

package dependencies

Packages suggested but not available for checking:
  'koRpus.lang.de', 'koRpus.lang.es', 'koRpus.lang.fr',
  'koRpus.lang.it', 'koRpus.lang.nl', 'koRpus.lang.pt',
  'koRpus.lang.ru'

Package which this enhances but not available for checking: 'rkward'

Reverse Dependencies (4)

imports

suggests

Dependency Network

Dependencies Reverse dependencies sylly data.table Matrix koRpus.lang.en tm.plugin.koRpus textstem qdap koRpus

Version History

new 0.13-9 Mar 10, 2026
updated 0.13-9 ← 0.13-8 diff Feb 2, 2026
updated 0.13-8 ← 0.13-7 diff May 16, 2021
updated 0.13-7 ← 0.13-6 diff May 13, 2021
updated 0.13-6 ← 0.13-5 diff May 8, 2021
updated 0.13-5 ← 0.13-4 diff Feb 1, 2021
updated 0.13-4 ← 0.13-3 diff Dec 10, 2020
updated 0.13-3 ← 0.13-2 diff Oct 14, 2020
updated 0.13-2 ← 0.13-1 diff Sep 23, 2020
updated 0.13-1 ← 0.11-5 diff Sep 20, 2020
updated 0.11-5 ← 0.10-2 diff Oct 27, 2018
updated 0.10-2 ← 0.10-1 diff Apr 4, 2017
updated 0.10-1 ← 0.06-5 diff Mar 1, 2017
updated 0.06-5 ← 0.06-4 diff Jun 5, 2016
updated 0.06-4 ← 0.05-6 diff Mar 7, 2016
updated 0.05-6 ← 0.05-5 diff Jun 29, 2015
updated 0.05-5 ← 0.05-4 diff Mar 19, 2014
updated 0.05-4 ← 0.05-3 diff Jan 21, 2014
updated 0.05-3 ← 0.04-40 diff Dec 20, 2013
updated 0.04-40 ← 0.04-36 diff Apr 7, 2013