Skip to content

epubr

Read EPUB File Metadata and Text

v0.6.5 · Sep 11, 2024 · MIT + file LICENSE

Description

Provides functions supporting the reading and parsing of internal e-book content from EPUB files. The 'epubr' package provides functions supporting the reading and parsing of internal e-book content from EPUB files. E-book metadata and text content are parsed separately and joined together in a tidy, nested tibble data frame. E-book formatting is not completely standardized across all literature. It can be challenging to curate parsed e-book content across an arbitrary collection of e-books perfectly and in completely general form, to yield a singular, consistently formatted output. Many EPUB files do not even contain all the same pieces of information in their respective metadata. EPUB file parsing functionality in this package is intended for relatively general application to arbitrary EPUB e-books. However, poorly formatted e-books or e-books with highly uncommon formatting may not work with this package. There may even be cases where an EPUB file has DRM or some other property that makes it impossible to read with 'epubr'. Text is read 'as is' for the most part. The only nominal changes are minor substitutions, for example curly quotes changed to straight quotes. Substantive changes are expected to be performed subsequently by the user as part of their text analysis. Additional text cleaning can be performed at the user's discretion, such as with functions from packages like 'tm' or 'qdap'.

Downloads

485

Last 30 days

8052nd

1.1K

Last 90 days

4.4K

Last year

Trend: +61.7% (30d vs prior 30d)

CRAN Check Status

14 OK
Show all 14 flavors
Flavor Status
r-devel-linux-x86_64-debian-clang OK
r-devel-linux-x86_64-debian-gcc OK
r-devel-linux-x86_64-fedora-clang OK
r-devel-linux-x86_64-fedora-gcc OK
r-devel-macos-arm64 OK
r-devel-windows-x86_64 OK
r-oldrel-macos-arm64 OK
r-oldrel-macos-x86_64 OK
r-oldrel-windows-x86_64 OK
r-patched-linux-x86_64 OK
r-release-linux-x86_64 OK
r-release-macos-arm64 OK
r-release-macos-x86_64 OK
r-release-windows-x86_64 OK

Check History

OK 14 OK · 0 NOTE · 0 WARNING · 0 ERROR · 0 FAILURE Mar 10, 2026

Dependency Network

Dependencies Reverse dependencies xml2 xslt magrittr tibble dplyr tidyr epubr

Version History

new 0.6.5 Mar 10, 2026
updated 0.6.5 ← 0.6.4 diff Sep 11, 2024
updated 0.6.4 ← 0.6.3 diff Sep 9, 2023
updated 0.6.3 ← 0.6.2 diff Jun 17, 2021
updated 0.6.2 ← 0.6.1 diff Feb 19, 2021
updated 0.6.1 ← 0.6.0 diff Nov 27, 2019
updated 0.6.0 ← 0.5.0 diff Jan 10, 2019
updated 0.5.0 ← 0.4.1 diff Oct 21, 2018
updated 0.4.1 ← 0.4.0 diff Jul 31, 2018
new 0.4.0 May 29, 2018