corpora
Statistics and Data Sets for Corpus Frequency Data
Description
Utility functions for the statistical analysis of corpus frequency data. This package is a companion to the open-source course "Statistical Inference: A Gentle Introduction for Computational Linguists and Similar Creatures" ('SIGIL').
Downloads
377
Last 30 days
11549th
1.2K
Last 90 days
5.2K
Last year
Trend: -2.8% (30d vs prior 30d)
17
Last 30 days
80
Last 90 days
213
Last year
Trend: -39.3% (30d vs prior 30d)
19
Last 7 days
23
Last 30 days
4
All-time
autoCRAN-only: this name is served only by autoCRAN, so the count is exact.
CRAN Check Status
Show all 13 flavors
| Flavor | Status |
|---|---|
| r-devel-linux-x86_64-debian-clang | OK |
| r-devel-linux-x86_64-debian-gcc | OK |
| r-devel-linux-x86_64-fedora-clang | OK |
| r-devel-linux-x86_64-fedora-gcc | OK |
| r-devel-windows-x86_64 | OK |
| r-oldrel-macos-arm64 | OK |
| r-oldrel-macos-x86_64 | OK |
| r-oldrel-windows-x86_64 | OK |
| r-patched-linux-x86_64 | OK |
| r-release-linux-x86_64 | OK |
| r-release-macos-arm64 | OK |
| r-release-macos-x86_64 | OK |
| r-release-windows-x86_64 | OK |
Check History
OK 13 OK · 0 NOTE · 0 WARNING · 0 ERROR · 0 FAILURE Jun 9, 2026
ERROR 12 OK · 0 NOTE · 0 WARNING · 1 ERROR · 0 FAILURE Jun 8, 2026
Rd files
cannot open the connection problem found in ‘BNCcomparison.Rd’
OK 14 OK · 0 NOTE · 0 WARNING · 0 ERROR · 0 FAILURE Mar 10, 2026
Code
Structure
Lines of code
5,530
Files
82
Compiled share
0%
Has compiled src
No
Language breakdown
API
Exported functions
21
Internal functions
10
Recent export changes
Testing & CI
Has tests
Yes
Test-to-code ratio
2.75
testthat edition
–
CI present
No
CI type
[]
PR gated
No
Docs
Return-value doc rate
100%
\dontrun example ratio
0%
Roxygen coverage
100%
Has pkgdown
No
NEWS present
Yes
Health & Security signals
Informational signals; not verdicts.
on.exit coverage
0%
Unsafe pattern score
0
Dep constraint coverage
0%
Secret pattern count
0
Bundled 3rd-party code
2 items
Portability & License
Min R version
3.5.0
System requirements
–
C++ standard
–
License
GPL-3
License flags
SPDX valid, OSI approved
History
Versions
7
First release
2005-10-26
Latest release
2025-06-10
Avg cadence
1176 days
Cold removal rate
100%
Dep drift
4
LOC over versions
Per-file churn detail lives in the source pipeline: https://github.com/r-observatory/cran-code-metrics.