Skip to content

rcorpora

A Collection of Small Text Corpora of Interesting Data

v2.0.1 · Jun 30, 2024 · CC0

Description

A collection of small text corpora of interesting data. It contains all data sets from 'dariusk/corpora'. Some examples: names of animals: birds, dinosaurs, dogs; foods: beer categories, pizza toppings; geography: English towns, rivers, oceans; humans: authors, US presidents, occupations; science: elements, planets; words: adjectives, verbs, proverbs, US president quotes.

Downloads

2.5K

Last 30 days

2698th

7.5K

Last 90 days

22.1K

Last year

Trend: +0.8% (30d vs prior 30d)

CRAN Check Status

5 NOTE
9 OK
Show all 14 flavors
Flavor Status
r-devel-linux-x86_64-debian-clang NOTE
r-devel-linux-x86_64-debian-gcc NOTE
r-devel-linux-x86_64-fedora-clang OK
r-devel-linux-x86_64-fedora-gcc OK
r-devel-macos-arm64 OK
r-devel-windows-x86_64 OK
r-oldrel-macos-arm64 NOTE
r-oldrel-macos-x86_64 NOTE
r-oldrel-windows-x86_64 NOTE
r-patched-linux-x86_64 OK
r-release-linux-x86_64 OK
r-release-macos-arm64 OK
r-release-macos-x86_64 OK
r-release-windows-x86_64 OK
Check details (5 non-OK)
NOTE r-devel-linux-x86_64-debian-clang

CRAN incoming feasibility

Maintainer: ‘Gábor Csárdi <csardi.gabor@gmail.com>’

No Authors@R field in DESCRIPTION.
Please add one, modifying
  Authors@R: c(person(given = "Darius",
                      family = "Kazemi",
                      role = "aut"),
               person(given = "Cole",
                      family = "Willsea",
                      role = "aut"),
               person(given = "Serin",
                      family = "Delaunay",
                      role = "aut"),
               person(given = "Karl",
                      family = "Swedberg",
                      role = "aut"),
               person(given = "Matthew",
                      family = "Rothenberg",
                      role = "aut"),
               person(given = "Greg",
                      family = "Kennedy",
                      role = "aut"),
               person(given = "Nathaniel",
                      family = "Mitchell",
                      role = "aut"),
               person(given = "Javier",
           
...[truncated]...
le = "aut"),
               person(given = "aarón",
                      family = "montoya-moraga",
                      role = "aut"),
               person(given = "Alex",
                      family = "Miller",
                      role = "aut"),
               person(family = "Delacannon",
                      role = "aut"),
               person(given = "Scott",
                      family = "Lieber",
                      role = "aut"),
               person(given = "Pace",
                      family = "Ricciardelli",
                      role = "aut"),
               person(given = "Ruta",
                      family = "Kruliauskaite",
                      role = "aut"),
               person(given = "Scott",
                      family = "Grant",
                      role = "aut"),
               person(given = "Gábor",
                      family = "Csárdi",
                      role = "cre",
                      email = "csardi.gabor@gmail.com"))
as necessary.
NOTE r-devel-linux-x86_64-debian-gcc

CRAN incoming feasibility

Maintainer: ‘Gábor Csárdi <csardi.gabor@gmail.com>’

No Authors@R field in DESCRIPTION.
Please add one, modifying
  Authors@R: c(person(given = "Darius",
                      family = "Kazemi",
                      role = "aut"),
               person(given = "Cole",
                      family = "Willsea",
                      role = "aut"),
               person(given = "Serin",
                      family = "Delaunay",
                      role = "aut"),
               person(given = "Karl",
                      family = "Swedberg",
                      role = "aut"),
               person(given = "Matthew",
                      family = "Rothenberg",
                      role = "aut"),
               person(given = "Greg",
                      family = "Kennedy",
                      role = "aut"),
               person(given = "Nathaniel",
                      family = "Mitchell",
                      role = "aut"),
               person(given = "Javier",
           
...[truncated]...
le = "aut"),
               person(given = "aarón",
                      family = "montoya-moraga",
                      role = "aut"),
               person(given = "Alex",
                      family = "Miller",
                      role = "aut"),
               person(family = "Delacannon",
                      role = "aut"),
               person(given = "Scott",
                      family = "Lieber",
                      role = "aut"),
               person(given = "Pace",
                      family = "Ricciardelli",
                      role = "aut"),
               person(given = "Ruta",
                      family = "Kruliauskaite",
                      role = "aut"),
               person(given = "Scott",
                      family = "Grant",
                      role = "aut"),
               person(given = "Gábor",
                      family = "Csárdi",
                      role = "cre",
                      email = "csardi.gabor@gmail.com"))
as necessary.
NOTE r-oldrel-macos-arm64

installed package size

installed size is  7.6Mb
  sub-directories of 1Mb or more:
    corpora   7.5Mb
NOTE r-oldrel-macos-x86_64

installed package size

installed size is  7.6Mb
  sub-directories of 1Mb or more:
    corpora   7.5Mb
NOTE r-oldrel-windows-x86_64

installed package size

installed size is  7.5Mb
  sub-directories of 1Mb or more:
    corpora   7.4Mb

Check History

NOTE 9 OK · 5 NOTE · 0 WARNING · 0 ERROR · 0 FAILURE Mar 10, 2026
NOTE r-devel-linux-x86_64-debian-clang

CRAN incoming feasibility

Maintainer: ‘Gábor Csárdi <csardi.gabor@gmail.com>’

No Authors@R field in DESCRIPTION.
Please add one, modifying
  Authors@R: c(person(given = "Darius",
                      family = "Kazemi",
                      role = "aut"),
               per
...[truncated]...
         family = "Grant",
                      role = "aut"),
               person(given = "Gábor",
                      family = "Csárdi",
                      role = "cre",
                      email = "csardi.gabor@gmail.com"))
as necessary.
NOTE r-devel-linux-x86_64-debian-gcc

CRAN incoming feasibility

Maintainer: ‘Gábor Csárdi <csardi.gabor@gmail.com>’

No Authors@R field in DESCRIPTION.
Please add one, modifying
  Authors@R: c(person(given = "Darius",
                      family = "Kazemi",
                      role = "aut"),
               per
...[truncated]...
         family = "Grant",
                      role = "aut"),
               person(given = "Gábor",
                      family = "Csárdi",
                      role = "cre",
                      email = "csardi.gabor@gmail.com"))
as necessary.
NOTE r-oldrel-macos-arm64

installed package size

installed size is  7.6Mb
  sub-directories of 1Mb or more:
    corpora   7.5Mb
NOTE r-oldrel-macos-x86_64

installed package size

installed size is  7.6Mb
  sub-directories of 1Mb or more:
    corpora   7.5Mb
NOTE r-oldrel-windows-x86_64

installed package size

installed size is  7.5Mb
  sub-directories of 1Mb or more:
    corpora   7.4Mb

Reverse Dependencies (1)

suggests

ids

Dependency Network

Dependencies Reverse dependencies jsonlite ids rcorpora

Version History

new 2.0.1 Mar 10, 2026
updated 2.0.1 ← 2.0.0 diff Jun 29, 2024
updated 2.0.0 ← 1.2.0 diff Jul 16, 2018
updated 1.2.0 ← 1.1.1 diff May 1, 2016
updated 1.1.1 ← 1.1.0 diff Jul 12, 2015
updated 1.1.0 ← 1.0.1 diff Jun 7, 2015
new 1.0.0 Apr 6, 2015
updated 1.0.1 ← 1.0.0 diff Apr 6, 2015