Description
Tools to clean and process text. The tools check for substrings that are not optimal for analysis and either normalize them, by replacing them with more analysis-friendly substrings or removing them (see Sproat, Black, Chen, Kumar, Ostendorf, & Richards (2001) <doi:10.1006/csla.2001.0169>), or extract them into new variables. For example, emoticons are common in text but are not always easily handled by analysis algorithms. The replace_emoticon() function replaces emoticons with word equivalents.
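The two operations described above, replacing a substring with a word equivalent and extracting it into a separate variable, can be illustrated with a small sketch. This is written in Python purely to show the idea; it is not the package's implementation, and the lookup table `EMOTICON_WORDS` and both function names are hypothetical.

```python
import re

# Hypothetical lookup table for illustration only; the package's own
# emoticon lexicon is not reproduced here.
EMOTICON_WORDS = {
    ":)": "smiley",
    ":(": "frown",
    ";)": "wink",
    ":D": "big grin",
}

# Build one alternation pattern, longest emoticon first so that a longer
# emoticon is preferred over any shorter one it starts with.
_PATTERN = re.compile(
    "|".join(re.escape(e) for e in sorted(EMOTICON_WORDS, key=len, reverse=True))
)


def replace_emoticons_sketch(text):
    """Normalize: swap each known emoticon for its word equivalent."""
    return _PATTERN.sub(lambda m: EMOTICON_WORDS[m.group(0)], text)


def extract_emoticons_sketch(text):
    """Extract: return the known emoticons found in the text, in order."""
    return _PATTERN.findall(text)
```

Under these assumptions, `replace_emoticons_sketch("great job :)")` yields a string with the emoticon replaced by the word from the table, while `extract_emoticons_sketch` returns the matched emoticons so they can be stored as a new variable alongside the cleaned text.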
Downloads
Last 30 days: 7.4K
Rank: 1444th
Last 90 days: 21.5K
Last year: 121.9K
Trend: +21.2% (30d vs prior 30d)
CRAN Check Status
14 OK
| Flavor | Status |
|---|---|
| r-devel-linux-x86_64-debian-clang | OK |
| r-devel-linux-x86_64-debian-gcc | OK |
| r-devel-linux-x86_64-fedora-clang | OK |
| r-devel-linux-x86_64-fedora-gcc | OK |
| r-devel-macos-arm64 | OK |
| r-devel-windows-x86_64 | OK |
| r-oldrel-macos-arm64 | OK |
| r-oldrel-macos-x86_64 | OK |
| r-oldrel-windows-x86_64 | OK |
| r-patched-linux-x86_64 | OK |
| r-release-linux-x86_64 | OK |
| r-release-macos-arm64 | OK |
| r-release-macos-x86_64 | OK |
| r-release-windows-x86_64 | OK |
Check History
Mar 11, 2026: OK (14 OK · 0 NOTE · 0 WARNING · 0 ERROR · 0 FAILURE)
Mar 10, 2026: NOTE (13 OK · 1 NOTE · 0 WARNING · 0 ERROR · 0 FAILURE)
NOTE
r-patched-linux-x86_64
Rd files
checkRd: (-1) check_text.Rd:31: Lost braces in \itemize; \value handles \item{}{} directly
checkRd: (-1) check_text.Rd:32: Lost braces in \itemize; \value handles \item{}{} directly
checkRd: (-1) check_text.Rd:33: Lost braces in \itemize; \value hand
...[truncated]...
ck_text.Rd:53: Lost braces in \itemize; \value handles \item{}{} directly
checkRd: (-1) replace_html.Rd:12: Lost braces
12 | \item{symbol}{logical. If code{TRUE} the symbols are retained with appropriate
| ^
Reverse Dependencies (9)
Version History
0.9.7 · Mar 10, 2026
0.2.0 · Jan 9, 2017