scutr

Balancing Multiclass Datasets for Classification Tasks

v0.2.0 · Nov 17, 2023 · MIT + file LICENSE

Description

Imbalanced training datasets degrade the performance of many popular classifiers. Combining oversampling of minority classes with undersampling of majority classes is a useful way to balance training data. This package implements the SCUT (SMOTE and Cluster-based Undersampling Technique) algorithm as described in Agrawal et al. (2015) <doi:10.5220/0005595502260234>. The paper uses model-based clustering and synthetic oversampling to balance multiclass training datasets; this package also provides other resampling methods.
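The idea of combining oversampling and undersampling can be illustrated with a minimal Python sketch. This is not the package's R API or the paper's exact procedure: `balance_to_mean` is a hypothetical helper, the target size is assumed to be the mean class size, minority classes are grown by interpolating between two random points of the same class (a SMOTE-like simplification with no nearest-neighbour search), and majority classes are shrunk by plain random sampling as a stand-in for cluster-based undersampling.

```python
import random
from collections import defaultdict


def balance_to_mean(rows, labels, seed=0):
    """Resample every class to the mean class size.

    Hypothetical helper, not part of scutr. Minority classes grow by
    interpolating between two random same-class points; majority
    classes shrink by random sampling without replacement.
    """
    rng = random.Random(seed)

    # Group the feature rows by their class label.
    by_class = defaultdict(list)
    for row, label in zip(rows, labels):
        by_class[label].append(list(row))

    # Assumed target: the mean number of rows per class.
    target = round(len(rows) / len(by_class))

    out_rows, out_labels = [], []
    for label, members in by_class.items():
        if len(members) > target:
            # Majority class: keep a random subset of the original rows.
            chosen = rng.sample(members, target)
        else:
            # Minority class: keep all rows, then add synthetic ones.
            chosen = list(members)
            while len(chosen) < target:
                a, b = rng.choice(members), rng.choice(members)
                t = rng.random()
                # New point on the segment between a and b.
                chosen.append([x + t * (y - x) for x, y in zip(a, b)])
        out_rows.extend(chosen)
        out_labels.extend([label] * target)
    return out_rows, out_labels


# Toy data: 8 rows of class "a", 3 of class "b", 1 of class "c".
rows = [[float(i), float(i % 3)] for i in range(12)]
labels = ["a"] * 8 + ["b"] * 3 + ["c"] * 1

new_rows, new_labels = balance_to_mean(rows, labels)
counts = {c: new_labels.count(c) for c in sorted(set(new_labels))}
print(counts)  # {'a': 4, 'b': 4, 'c': 4}
```

With 12 rows and 3 classes the assumed target is 4 rows per class, so class "a" is cut from 8 to 4 while "b" and "c" are padded with synthetic rows. A class with a single row can only be duplicated by this sketch, since there is no second point to interpolate towards.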

Downloads

CRAN

297

Last 30 days

15793rd

705

Last 90 days

3.1K

Last year

Trend: +74.7% (30d vs prior 30d)

r2u CRAN

0

Last 30 days

24

Last 90 days

90

Last year

Trend: -100% (30d vs prior 30d)

autoCRAN

7

Last 7 days

16

Last 30 days

2

All-time

autoCRAN-only: this name is served only by autoCRAN, so the count is exact.

CRAN Check Status

13 OK
Flavor Status
r-devel-linux-x86_64-debian-clang OK
r-devel-linux-x86_64-debian-gcc OK
r-devel-linux-x86_64-fedora-clang OK
r-devel-linux-x86_64-fedora-gcc OK
r-devel-windows-x86_64 OK
r-oldrel-macos-arm64 OK
r-oldrel-macos-x86_64 OK
r-oldrel-windows-x86_64 OK
r-patched-linux-x86_64 OK
r-release-linux-x86_64 OK
r-release-macos-arm64 OK
r-release-macos-x86_64 OK
r-release-windows-x86_64 OK

Check History

OK 13 OK · 0 NOTE · 0 WARNING · 0 ERROR · 0 FAILURE Jun 8, 2026
WARNING 12 OK · 0 NOTE · 1 WARNING · 0 ERROR · 0 FAILURE Jun 7, 2026
WARNING r-devel-linux-x86_64-debian-gcc: checking for unstated dependencies in examples: Fatal error: cannot create 'R_TempDir'
OK 14 OK · 0 NOTE · 0 WARNING · 0 ERROR · 0 FAILURE Mar 10, 2026

Code intelligence has not been computed for this package yet.

Code

Structure

Lines of code

1,140

Files

38

Compiled share

0%

Has compiled src

No

Language breakdown

R 505 (44.3%) · Tests 241 (21.1%) · Docs 394 (34.6%)

API

Exported functions

10

Internal functions

1

Recent export changes

v0.1.2: +10 (SCUT, SCUT_parallel, oversample_smote, +7 more)

Testing & CI

Has tests

Yes

Test-to-code ratio

0.48

testthat edition

2

CI present

No

CI type

None

PR gated

No

Docs

Return-value doc rate

100%

\dontrun example ratio

0%

Roxygen coverage

100%

Has pkgdown

No

NEWS present

Yes

Health & Security signals

Informational signals; not verdicts.

on.exit coverage

Unsafe pattern score

0

Dep constraint coverage

0%

Secret pattern count

0

Bundled 3rd-party code

2 items

Portability & License

Min R version

2.10

System requirements

C++ standard

License

MIT + file LICENSE

License flags

SPDX valid, OSI approved

History

Versions

2

First release

2021-06-24

Latest release

2023-11-17

Avg cadence

876 days

Cold removal rate

Dep drift

0

LOC over versions

v0.1.2: 1,129 LOC · v0.2.0: 1,140 LOC

Per-file churn detail lives in the source pipeline: https://github.com/r-observatory/cran-code-metrics.

Reverse Dependencies (1)

imports

Dependency Network

Dependencies: smotefamily, mclust
Reverse dependencies: MantaID

Version History

3 tracked
new 0.2.0 Mar 10, 2026
updated 0.2.0 ← 0.1.2 Nov 17, 2023
new 0.1.2 Jun 23, 2021