Generate quasi Anscombe data sets Type 4: 2 Clusters
Source:R/quasianscombe.R
sim_quasianscombe_set_4.Rd
Data sets Type 4 recreate two cluster keeping the coefficient of the original regression model.
Usage
sim_quasianscombe_set_4(df, rescale_to = c(0.1, 0.2), prop = 0.15)
Arguments
- df
A data frame from
sim_quasianscombe_set_1
(or similar).- rescale_to
Rescale the x value before create the second cluster.
- prop
The proportion of value to modify as the second group/cluster.
Details
This function will:
Disorder the order of
x
values.Rescale the
x
value to specific original quantiles.Then take a proportion of value and translate to left keeping the original mean of
x
.Finally add some value to the associated
y
value and subtract to the complement group to have the same regression model in terms of coefficients.
Examples
df <- sim_quasianscombe_set_1()
dataset4 <- sim_quasianscombe_set_4(df)
dataset4
#> # A tibble: 500 × 2
#> x y
#> <dbl> <dbl>
#> 1 3.70 5.88
#> 2 3.72 4.39
#> 3 3.72 5.07
#> 4 3.73 4.49
#> 5 3.74 5.41
#> 6 3.74 5.00
#> 7 3.75 4.02
#> 8 3.75 5.12
#> 9 3.75 3.99
#> 10 3.75 4.62
#> # … with 490 more rows
plot(dataset4)
plot(sim_quasianscombe_set_4(df, rescale_to = c(0, .1), prop = 0.5))