themis

Extra Recipes Steps for Dealing with Unbalanced Data

v1.0.3 · Jan 22, 2025 · MIT + file LICENSE

Description

A dataset with an uneven number of cases in each class is said to be unbalanced. Many models perform poorly on unbalanced datasets. A dataset can be balanced by increasing the number of minority cases using SMOTE 2011 <doi:10.48550/arXiv.1106.1813>, BorderlineSMOTE 2005 <doi:10.1007/11538059_91>, or ADASYN 2008 <https://ieeexplore.ieee.org/document/4633969>, or by decreasing the number of majority cases using NearMiss 2003 <https://www.site.uottawa.ca/~nat/Workshop2003/jzhang.pdf> or Tomek link removal 1976 <https://ieeexplore.ieee.org/document/4309452>.
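To make the oversampling idea in the description concrete, here is a minimal sketch of SMOTE-style interpolation: each synthetic minority case is placed at a random point on the line segment between an existing minority case and one of its nearest same-class neighbours. It is written in plain Python for illustration only and is not themis's R implementation; the function name `smote_like_oversample`, its parameters, and the toy data are hypothetical.

```python
import math
import random
from collections import Counter


def smote_like_oversample(rows, labels, k=2, seed=0):
    """Grow every smaller class to the size of the largest class.

    Illustrative only: each synthetic row is interpolated between a
    randomly chosen class member and one of its k nearest neighbours
    from the same class. Classes with fewer than 2 members are skipped.
    """
    rng = random.Random(seed)
    counts = Counter(labels)
    target = max(counts.values())
    new_rows, new_labels = list(rows), list(labels)

    for cls, n in counts.items():
        members = [r for r, y in zip(rows, labels) if y == cls]
        if n == target or len(members) < 2:
            continue
        for _ in range(target - n):
            base_idx = rng.randrange(len(members))
            base = members[base_idx]
            # Same-class candidates, excluding the base point itself.
            others = [m for j, m in enumerate(members) if j != base_idx]
            neighbours = sorted(others, key=lambda m: math.dist(base, m))[:k]
            mate = rng.choice(neighbours)
            gap = rng.random()  # position along the segment, in [0, 1)
            synthetic = tuple(b + gap * (m - b) for b, m in zip(base, mate))
            new_rows.append(synthetic)
            new_labels.append(cls)
    return new_rows, new_labels


# Hypothetical toy data: 6 majority cases and 2 minority cases.
rows = [
    (0.0, 0.0), (0.1, 0.2), (0.2, 0.1),
    (0.3, 0.3), (0.2, 0.4), (0.4, 0.2),
    (2.0, 2.0), (2.2, 2.1),
]
labels = ["maj"] * 6 + ["min"] * 2

balanced_rows, balanced_labels = smote_like_oversample(rows, labels)
print(Counter(balanced_labels))
```

The original rows are kept unchanged and only synthetic minority rows are appended, so the result has equal class counts. The undersampling methods named in the description (NearMiss, Tomek link removal) work in the opposite direction, by dropping majority cases, and are not shown here.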

Downloads

12.8K · Last 30 days

1109th

37.6K · Last 90 days

213.2K · Last year

Trend: +9.9% (30d vs prior 30d)

CRAN Check Status

14 OK
Flavor Status
r-devel-linux-x86_64-debian-clang OK
r-devel-linux-x86_64-debian-gcc OK
r-devel-linux-x86_64-fedora-clang OK
r-devel-linux-x86_64-fedora-gcc OK
r-devel-macos-arm64 OK
r-devel-windows-x86_64 OK
r-oldrel-macos-arm64 OK
r-oldrel-macos-x86_64 OK
r-oldrel-windows-x86_64 OK
r-patched-linux-x86_64 OK
r-release-linux-x86_64 OK
r-release-macos-arm64 OK
r-release-macos-x86_64 OK
r-release-windows-x86_64 OK

Check History

Mar 10, 2026 · 14 OK · 0 NOTE · 0 WARNING · 0 ERROR · 0 FAILURE

Reverse Dependencies (8)

Dependency Network

Dependencies: recipes, cli, gower, lifecycle, dplyr, generics, purrr, RANN, rlang, ROSE, tibble, withr, glue, hardhat, vctrs

Reverse dependencies: BioMoR, sae.projection, RStoolbox, caret, fastml, mlr3pipelines, mrIML, orbital

Version History

new 1.0.3 Mar 10, 2026
updated 1.0.3 ← 1.0.2 Jan 22, 2025
updated 1.0.2 ← 1.0.1 Aug 13, 2023
updated 1.0.1 ← 1.0.0 Apr 14, 2023
updated 1.0.0 ← 0.2.2 Jul 1, 2022
updated 0.2.2 ← 0.2.1 May 11, 2022
updated 0.2.1 ← 0.2.0 Apr 12, 2022
updated 0.2.0 ← 0.1.4 Mar 29, 2022
updated 0.1.4 ← 0.1.3 Jun 11, 2021
updated 0.1.3 ← 0.1.2 Nov 11, 2020
updated 0.1.2 ← 0.1.1 Aug 13, 2020
updated 0.1.1 ← 0.1.0 May 16, 2020
new 0.1.0 Jan 12, 2020