Skip to content

contentanalysis

Scientific Content and Citation Analysis from PDF Documents

v1.1.1 · Jun 15, 2026 · GPL (>= 3)

Description

Provides comprehensive tools for extracting and analyzing scientific content from PDF documents, including citation extraction, reference matching, text analysis, and bibliometric indicators. Supports multi-column PDF layouts, 'CrossRef' API <https://www.crossref.org/documentation/retrieve-metadata/rest-api/> integration, and advanced citation parsing.

Downloads

CRAN

17.8K

Last 30 days

884th

61.7K

Last 90 days

156.4K

Last year

Trend: -13.9% (30d vs prior 30d)

r2u CRAN

31

Last 30 days

108

Last 90 days

176

Last year

Trend: -31.1% (30d vs prior 30d)

CRAN Check Status

13 OK
Show all 13 flavors
Flavor Status
r-devel-linux-x86_64-debian-clang OK
r-devel-linux-x86_64-debian-gcc OK
r-devel-linux-x86_64-fedora-clang OK
r-devel-linux-x86_64-fedora-gcc OK
r-devel-windows-x86_64 OK
r-oldrel-macos-arm64 OK
r-oldrel-macos-x86_64 OK
r-oldrel-windows-x86_64 OK
r-patched-linux-x86_64 OK
r-release-linux-x86_64 OK
r-release-macos-arm64 OK
r-release-macos-x86_64 OK
r-release-windows-x86_64 OK

Check History

OK 13 OK · 0 NOTE · 0 WARNING · 0 ERROR · 0 FAILURE Jun 30, 2026
ERROR 12 OK · 0 NOTE · 0 WARNING · 1 ERROR · 0 FAILURE Jun 29, 2026
ERROR r-devel-windows-x86_64

re-building of vignette outputs

Error(s) in re-building vignettes:
--- re-building 'introduction.Rmd' using rmarkdown
trying URL 'https://raw.githubusercontent.com/massimoaria/contentanalysis/master/inst/examples/example_paper.pdf'
Content type 'application/octet-stream' length 543
...[truncated]...
nostics:
Can't subset columns that don't exist.
✖ Column `ref_journal` doesn't exist.
--- failed re-building 'introduction.Rmd'

SUMMARY: processing the following file failed:
  'introduction.Rmd'

Error: Vignette re-building failed.
Execution halted
OK 13 OK · 0 NOTE · 0 WARNING · 0 ERROR · 0 FAILURE Jun 28, 2026
ERROR 12 OK · 0 NOTE · 0 WARNING · 1 ERROR · 0 FAILURE Jun 27, 2026
ERROR r-devel-linux-x86_64-debian-gcc

re-building of vignette outputs

Error(s) in re-building vignettes:
  ...
--- re-building ‘introduction.Rmd’ using rmarkdown
trying URL 'https://raw.githubusercontent.com/massimoaria/contentanalysis/master/inst/examples/example_paper.pdf'
Content type 'application/octet-stream' leng
...[truncated]...
nostics:
Can't subset columns that don't exist.
✖ Column `ref_journal` doesn't exist.
--- failed re-building ‘introduction.Rmd’

SUMMARY: processing the following file failed:
  ‘introduction.Rmd’

Error: Vignette re-building failed.
Execution halted
OK 13 OK · 0 NOTE · 0 WARNING · 0 ERROR · 0 FAILURE Jun 9, 2026
ERROR 12 OK · 0 NOTE · 0 WARNING · 1 ERROR · 0 FAILURE Jun 8, 2026
ERROR r-devel-linux-x86_64-debian-gcc

R code for possible problems

Fatal error: cannot create 'R_TempDir'
OK 12 OK · 0 NOTE · 0 WARNING · 0 ERROR · 0 FAILURE Apr 25, 2026
NOTE 11 OK · 3 NOTE · 0 WARNING · 0 ERROR · 0 FAILURE Apr 8, 2026
NOTE r-oldrel-macos-arm64

installed package size

installed size is  6.4Mb
  sub-directories of 1Mb or more:
    doc    4.7Mb
    help   1.4Mb
NOTE r-oldrel-macos-x86_64

installed package size

installed size is  6.4Mb
  sub-directories of 1Mb or more:
    doc    4.7Mb
    help   1.4Mb
NOTE r-oldrel-windows-x86_64

installed package size

installed size is  6.4Mb
  sub-directories of 1Mb or more:
    doc    4.7Mb
    help   1.4Mb
ERROR 10 OK · 3 NOTE · 0 WARNING · 1 ERROR · 0 FAILURE Apr 3, 2026
ERROR r-devel-linux-x86_64-debian-clang

re-building of vignette outputs

Error(s) in re-building vignettes:
  ...
--- re-building ‘introduction.Rmd’ using rmarkdown
trying URL 'https://raw.githubusercontent.com/massimoaria/contentanalysis/master/inst/examples/example_paper.pdf'
Content type 'application/octet-stream' leng
...[truncated]...
nostics:
Can't subset columns that don't exist.
✖ Column `ref_journal` doesn't exist.
--- failed re-building ‘introduction.Rmd’

SUMMARY: processing the following file failed:
  ‘introduction.Rmd’

Error: Vignette re-building failed.
Execution halted
NOTE r-oldrel-macos-arm64

installed package size

installed size is  6.4Mb
  sub-directories of 1Mb or more:
    doc    4.7Mb
    help   1.4Mb
NOTE r-oldrel-macos-x86_64

installed package size

installed size is  6.4Mb
  sub-directories of 1Mb or more:
    doc    4.7Mb
    help   1.4Mb
NOTE r-oldrel-windows-x86_64

installed package size

installed size is  6.4Mb
  sub-directories of 1Mb or more:
    doc    4.7Mb
    help   1.4Mb
NOTE 11 OK · 3 NOTE · 0 WARNING · 0 ERROR · 0 FAILURE Apr 1, 2026
NOTE r-oldrel-macos-arm64

installed package size

installed size is  6.4Mb
  sub-directories of 1Mb or more:
    doc    4.7Mb
    help   1.4Mb
NOTE r-oldrel-macos-x86_64

installed package size

installed size is  6.4Mb
  sub-directories of 1Mb or more:
    doc    4.7Mb
    help   1.4Mb
NOTE r-oldrel-windows-x86_64

installed package size

installed size is  6.4Mb
  sub-directories of 1Mb or more:
    doc    4.7Mb
    help   1.4Mb
ERROR 10 OK · 3 NOTE · 0 WARNING · 1 ERROR · 0 FAILURE Mar 31, 2026
ERROR r-devel-linux-x86_64-debian-gcc

re-building of vignette outputs

Error(s) in re-building vignettes:
  ...
--- re-building ‘introduction.Rmd’ using rmarkdown
trying URL 'https://raw.githubusercontent.com/massimoaria/contentanalysis/master/inst/examples/example_paper.pdf'
Content type 'application/octet-stream' leng
...[truncated]...
nostics:
Can't subset columns that don't exist.
✖ Column `ref_journal` doesn't exist.
--- failed re-building ‘introduction.Rmd’

SUMMARY: processing the following file failed:
  ‘introduction.Rmd’

Error: Vignette re-building failed.
Execution halted
NOTE r-oldrel-macos-arm64

installed package size

installed size is  6.4Mb
  sub-directories of 1Mb or more:
    doc    4.7Mb
    help   1.4Mb
NOTE r-oldrel-macos-x86_64

installed package size

installed size is  6.4Mb
  sub-directories of 1Mb or more:
    doc    4.7Mb
    help   1.4Mb
NOTE r-oldrel-windows-x86_64

installed package size

installed size is  6.4Mb
  sub-directories of 1Mb or more:
    doc    4.7Mb
    help   1.4Mb
NOTE 11 OK · 3 NOTE · 0 WARNING · 0 ERROR · 0 FAILURE Mar 10, 2026
NOTE r-oldrel-macos-arm64

installed package size

installed size is  6.4Mb
  sub-directories of 1Mb or more:
    doc    4.7Mb
    help   1.4Mb
NOTE r-oldrel-macos-x86_64

installed package size

installed size is  5.3Mb
  sub-directories of 1Mb or more:
    doc   4.7Mb
NOTE r-oldrel-windows-x86_64

installed package size

installed size is  6.4Mb
  sub-directories of 1Mb or more:
    doc    4.7Mb
    help   1.4Mb

Reverse Dependencies (1)

imports

Dependency Network

Dependencies Reverse dependencies base64enc dplyr httr2 igraph jsonlite magrittr openalexR (>= 2.0.2) pdftools purrr stringr (>= 1.5.2) tibble tidyr tidytext (>= 0.4.3) visNetwork bibliometrix contentanalysis

Version History

6 tracked
updated 1.1.1 ← 1.1.0 diff Jun 16, 2026
updated 1.1.0 ← 1.0.0 diff May 19, 2026
new 1.0.0 Mar 10, 2026
updated 1.0.0 ← 0.2.1 diff Mar 6, 2026
updated 0.2.1 ← 0.2.0 diff Dec 11, 2025
new 0.2.0 Oct 29, 2025