Skip to content

JATSdecoder

A Metadata and Text Extraction and Manipulation Tool Set

v1.3.0 · Feb 16, 2026 · GPL-3

Description

Provides a function collection to extract metadata, sectioned text and study characteristics from scientific articles in 'NISO-JATS' format. Articles in PDF format can be converted to 'NISO-JATS' with the 'Content ExtRactor and MINEr' ('CERMINE', <https://github.com/CeON/CERMINE>). For convenience, two functions bundle the extraction heuristics: JATSdecoder() converts 'NISO-JATS'-tagged XML files to a structured list with elements title, author, journal, history, 'DOI', abstract, sectioned text and reference list. study.character() extracts multiple study characteristics like number of included studies, statistical methods used, alpha error, power, statistical results, correction method for multiple testing, software used. The function get.stats() extracts all statistical results from text and recomputes p-values for many standard test statistics. It performs a consistency check of the reported with the recalculated p-values. An estimation of the involved sample size is performed based on textual reports within the abstract and the reported degrees of freedom within statistical results. In addition, the package contains some useful functions to process text (text2sentences(), text2num(), ngram(), strsplit2(), grep2()). See Böschen, I. (2021) <doi:10.1007/s11192-021-04162-z> Böschen, I. (2021) <doi:10.1038/s41598-021-98782-3>, Böschen, I. (2023) <doi:10.1038/s41598-022-27085-y>, and Böschen, I. (2024) <doi:10.48550/arXiv.2408.07948>.

Downloads

363

Last 30 days

10508th

1.2K

Last 90 days

3.6K

Last year

Trend: -15.2% (30d vs prior 30d)

CRAN Check Status

14 OK
Show all 14 flavors
Flavor Status
r-devel-linux-x86_64-debian-clang OK
r-devel-linux-x86_64-debian-gcc OK
r-devel-linux-x86_64-fedora-clang OK
r-devel-linux-x86_64-fedora-gcc OK
r-devel-macos-arm64 OK
r-devel-windows-x86_64 OK
r-oldrel-macos-arm64 OK
r-oldrel-macos-x86_64 OK
r-oldrel-windows-x86_64 OK
r-patched-linux-x86_64 OK
r-release-linux-x86_64 OK
r-release-macos-arm64 OK
r-release-macos-x86_64 OK
r-release-windows-x86_64 OK

Check History

OK 14 OK · 0 NOTE · 0 WARNING · 0 ERROR · 0 FAILURE Apr 6, 2026
ERROR 13 OK · 0 NOTE · 0 WARNING · 1 ERROR · 0 FAILURE Apr 3, 2026
ERROR r-devel-linux-x86_64-debian-gcc

package dependencies

Package required but not available: ‘openNLP’

See section ‘The DESCRIPTION file’ in the ‘Writing R Extensions’
manual.
OK 14 OK · 0 NOTE · 0 WARNING · 0 ERROR · 0 FAILURE Mar 10, 2026

Reverse Dependencies (1)

imports

Dependency Network

Dependencies Reverse dependencies NLP openNLP tableParser JATSdecoder

Version History

new 1.3.0 Mar 10, 2026
updated 1.3.0 ← 1.2.2 diff Feb 15, 2026
updated 1.2.2 ← 1.2.1 diff Jan 7, 2026
updated 1.2.1 ← 1.2.0 diff Jul 28, 2025
updated 1.2.0 ← 1.1.2 diff Oct 11, 2023
updated 1.1.2 ← 1.1.1 diff Feb 26, 2023
updated 1.1.1 ← 1.1 diff Oct 20, 2022
new 1.1 Sep 29, 2022