Skip to content

corpora

Statistics and Data Sets for Corpus Frequency Data

v0.7 · Jun 10, 2025 · GPL-3

Description

Utility functions for the statistical analysis of corpus frequency data. This package is a companion to the open-source course "Statistical Inference: A Gentle Introduction for Computational Linguists and Similar Creatures" ('SIGIL').

Downloads

CRAN

377

Last 30 days

11549th

1.2K

Last 90 days

5.2K

Last year

Trend: -2.8% (30d vs prior 30d)

r2u CRAN

17

Last 30 days

80

Last 90 days

213

Last year

Trend: -39.3% (30d vs prior 30d)

autoCRAN

19

Last 7 days

23

Last 30 days

4

All-time

autoCRAN-only: this name is served only by autoCRAN, so the count is exact.

CRAN Check Status

13 OK
Show all 13 flavors
Flavor Status
r-devel-linux-x86_64-debian-clang OK
r-devel-linux-x86_64-debian-gcc OK
r-devel-linux-x86_64-fedora-clang OK
r-devel-linux-x86_64-fedora-gcc OK
r-devel-windows-x86_64 OK
r-oldrel-macos-arm64 OK
r-oldrel-macos-x86_64 OK
r-oldrel-windows-x86_64 OK
r-patched-linux-x86_64 OK
r-release-linux-x86_64 OK
r-release-macos-arm64 OK
r-release-macos-x86_64 OK
r-release-windows-x86_64 OK

Check History

OK 13 OK · 0 NOTE · 0 WARNING · 0 ERROR · 0 FAILURE Jun 9, 2026
ERROR 12 OK · 0 NOTE · 0 WARNING · 1 ERROR · 0 FAILURE Jun 8, 2026
ERROR r-devel-linux-x86_64-debian-gcc

Rd files

cannot open the connection
problem found in ‘BNCcomparison.Rd’
OK 14 OK · 0 NOTE · 0 WARNING · 0 ERROR · 0 FAILURE Mar 10, 2026

Code

Structure

Lines of code

5,530

Files

82

Compiled share

0%

Has compiled src

No

Language breakdown

R 681 (12.3%)Tests 1,870 (33.8%)Docs 2,979 (53.9%)

API

Exported functions

21

Internal functions

10

Recent export changes

v0.7+2 am.score, builtin.am
v0.6+1 keyness

Testing & CI

Has tests

Yes

Test-to-code ratio

2.75

testthat edition

CI present

No

CI type

[]

PR gated

No

Docs

Return-value doc rate

100%

\dontrun example ratio

0%

Roxygen coverage

100%

Has pkgdown

No

NEWS present

Yes

Health & Security signals

Informational signals; not verdicts.

on.exit coverage

0%

Unsafe pattern score

0

Dep constraint coverage

0%

Secret pattern count

0

Bundled 3rd-party code

2 items

Portability & License

Min R version

3.5.0

System requirements

C++ standard

License

GPL-3

License flags

SPDX valid, OSI approved

History

Versions

7

First release

2005-10-26

Latest release

2025-06-10

Avg cadence

1176 days

Cold removal rate

100%

Dep drift

4

LOC over versions

v0.3-2: 1,064 LOCv0.3-2.1: 1,064 LOCv0.4-3: 1,514 LOCv0.5: 2,561 LOCv0.5-1: 2,560 LOCv0.6: 4,880 LOCv0.7: 5,530 LOC

Per-file churn detail lives in the source pipeline: https://github.com/r-observatory/cran-code-metrics.

Version History

8 tracked
new 0.7 Mar 10, 2026
updated 0.7 ← 0.6 diff Jun 9, 2025
updated 0.6 ← 0.5-1 diff Aug 20, 2023
updated 0.5-1 ← 0.5 diff Mar 3, 2022
updated 0.5 ← 0.4-3 diff Aug 30, 2018
updated 0.4-3 ← 0.3-2.1 diff Apr 3, 2012
updated 0.3-2.1 ← 0.3-2 diff Feb 24, 2009
new 0.3-2 Oct 25, 2005