contentanalysis
Scientific Content and Citation Analysis from PDF Documents
Description
Provides comprehensive tools for extracting and analyzing scientific content from PDF documents, including citation extraction, reference matching, text analysis, and bibliometric indicators. Supports multi-column PDF layouts, 'CrossRef' API <https://www.crossref.org/documentation/retrieve-metadata/rest-api/> integration, and advanced citation parsing.
Downloads
19.3K
Last 30 days
889th
55.9K
Last 90 days
96.7K
Last year
Trend: -0.6% (30d vs prior 30d)
CRAN Check Status
Show all 14 flavors
| Flavor | Status |
|---|---|
| r-devel-linux-x86_64-debian-clang | ERROR |
| r-devel-linux-x86_64-debian-gcc | OK |
| r-devel-linux-x86_64-fedora-clang | OK |
| r-devel-linux-x86_64-fedora-gcc | OK |
| r-devel-macos-arm64 | OK |
| r-devel-windows-x86_64 | OK |
| r-oldrel-macos-arm64 | NOTE |
| r-oldrel-macos-x86_64 | NOTE |
| r-oldrel-windows-x86_64 | NOTE |
| r-patched-linux-x86_64 | OK |
| r-release-linux-x86_64 | OK |
| r-release-macos-arm64 | OK |
| r-release-macos-x86_64 | OK |
| r-release-windows-x86_64 | OK |
Check details (4 non-OK)
re-building of vignette outputs
Error(s) in re-building vignettes:
...
--- re-building ‘introduction.Rmd’ using rmarkdown
trying URL 'https://raw.githubusercontent.com/massimoaria/contentanalysis/master/inst/examples/example_paper.pdf'
Content type 'application/octet-stream' length 543702 bytes (530 KB)
==================================================
downloaded 530 KB
Quitting from introduction.Rmd:216-223 [reference-sources]
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
<error/vctrs_error_subscript_oob>
Error in `analysis$parsed_references[, c("ref_first_author", "ref_year", "ref_journal",
"ref_source")]`:
! Can't subset columns that don't exist.
✖ Column `ref_journal` doesn't exist.
---
Backtrace:
▆
1. ├─utils::head(...)
2. ├─...[]
3. └─tibble:::`[.tbl_df`(...)
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
Error: processing vignette 'introduction.Rmd' failed with diagnostics:
Can't subset columns that don't exist.
✖ Column `ref_journal` doesn't exist.
--- failed re-building ‘introduction.Rmd’
SUMMARY: processing the following file failed:
‘introduction.Rmd’
Error: Vignette re-building failed.
Execution halted
installed package size
installed size is 6.4Mb
sub-directories of 1Mb or more:
doc 4.7Mb
help 1.4Mb
installed package size
installed size is 6.4Mb
sub-directories of 1Mb or more:
doc 4.7Mb
help 1.4Mb
installed package size
installed size is 6.4Mb
sub-directories of 1Mb or more:
doc 4.7Mb
help 1.4Mb
Check History
ERROR 10 OK · 3 NOTE · 0 WARNING · 1 ERROR · 0 FAILURE Apr 3, 2026
re-building of vignette outputs
Error(s) in re-building vignettes: ... --- re-building ‘introduction.Rmd’ using rmarkdown trying URL 'https://raw.githubusercontent.com/massimoaria/contentanalysis/master/inst/examples/example_paper.pdf' Content type 'application/octet-stream' leng ...[truncated]... nostics: Can't subset columns that don't exist. ✖ Column `ref_journal` doesn't exist. --- failed re-building ‘introduction.Rmd’ SUMMARY: processing the following file failed: ‘introduction.Rmd’ Error: Vignette re-building failed. Execution halted
installed package size
installed size is 6.4Mb
sub-directories of 1Mb or more:
doc 4.7Mb
help 1.4Mb
installed package size
installed size is 6.4Mb
sub-directories of 1Mb or more:
doc 4.7Mb
help 1.4Mb
installed package size
installed size is 6.4Mb
sub-directories of 1Mb or more:
doc 4.7Mb
help 1.4Mb
NOTE 11 OK · 3 NOTE · 0 WARNING · 0 ERROR · 0 FAILURE Apr 1, 2026
installed package size
installed size is 6.4Mb
sub-directories of 1Mb or more:
doc 4.7Mb
help 1.4Mb
installed package size
installed size is 6.4Mb
sub-directories of 1Mb or more:
doc 4.7Mb
help 1.4Mb
installed package size
installed size is 6.4Mb
sub-directories of 1Mb or more:
doc 4.7Mb
help 1.4Mb
ERROR 10 OK · 3 NOTE · 0 WARNING · 1 ERROR · 0 FAILURE Mar 31, 2026
re-building of vignette outputs
Error(s) in re-building vignettes: ... --- re-building ‘introduction.Rmd’ using rmarkdown trying URL 'https://raw.githubusercontent.com/massimoaria/contentanalysis/master/inst/examples/example_paper.pdf' Content type 'application/octet-stream' leng ...[truncated]... nostics: Can't subset columns that don't exist. ✖ Column `ref_journal` doesn't exist. --- failed re-building ‘introduction.Rmd’ SUMMARY: processing the following file failed: ‘introduction.Rmd’ Error: Vignette re-building failed. Execution halted
installed package size
installed size is 6.4Mb
sub-directories of 1Mb or more:
doc 4.7Mb
help 1.4Mb
installed package size
installed size is 6.4Mb
sub-directories of 1Mb or more:
doc 4.7Mb
help 1.4Mb
installed package size
installed size is 6.4Mb
sub-directories of 1Mb or more:
doc 4.7Mb
help 1.4Mb
NOTE 11 OK · 3 NOTE · 0 WARNING · 0 ERROR · 0 FAILURE Mar 10, 2026
installed package size
installed size is 6.4Mb
sub-directories of 1Mb or more:
doc 4.7Mb
help 1.4Mb
installed package size
installed size is 5.3Mb
sub-directories of 1Mb or more:
doc 4.7Mb
installed package size
installed size is 6.4Mb
sub-directories of 1Mb or more:
doc 4.7Mb
help 1.4Mb