Plot()
: New parameter ts_method
to specify
the estimation method for generating a forecast. The default is
"es"
for exponential smoothing, the only previously available
method. Can now also specify "lm"
for least squares
linear regression model with seasonality, which de-seasonalizes the data
and then adds seasonality back to the estimated regression line.
Plot()
: To generalize parameter names for multiple
estimation methods, all parameters that began with time_
and es_
now begin with ts_
. For example,
time_unit
is now ts_unit
, and
es_seasons
becomes ts_seasons
.
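As a rough illustration of what the "lm" method described above involves, the following base R sketch de-seasonalizes a quarterly series, fits a least-squares line, and adds the seasonal effects back to the forecast. It is not the lessR implementation: the use of decompose(), the synthetic data, and all variable names are assumptions made for the example.

```r
# Quarterly series with a known linear trend and additive seasonal pattern
# (synthetic data, for illustration only).
season_effect <- c(5, -3, -4, 2)            # one value per quarter, sums to 0
t_idx <- 1:24                               # six years of quarters
y <- 100 + 2 * t_idx + rep(season_effect, times = 6)
y_ts <- ts(y, start = c(2019, 1), frequency = 4)

# 1. Estimate the additive seasonal effects and remove them.
seas <- decompose(y_ts, type = "additive")$seasonal
y_deseason <- as.numeric(y_ts) - as.numeric(seas)

# 2. Fit a least-squares line to the de-seasonalized values.
fit <- lm(y_deseason ~ t_idx)

# 3. Extend the line four quarters ahead, then add seasonality back.
t_new <- 25:28
trend_new <- predict(fit, newdata = data.frame(t_idx = t_new))
seas_new <- as.numeric(seas)[1:4]           # quarters 1 to 4 repeat each year
forecast <- as.numeric(trend_new) + seas_new
```

Because the synthetic series is an exact line plus a repeating pattern, the forecast reproduces both components, which makes the three steps easy to check by hand.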
Plot()
: New parameter value days7
for
ts_unit
. All other time units are based on the 365- or 366-day
year, the interval over which seasons are assessed. Seasonality for
days
is generally not meaningful when assessed over the entire
year rather than over the days of the week. So the time unit of days7
assesses seasonality of days over the 7-day week, such as
more sales on Monday than on Sunday.
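The idea of day-of-week seasonality can be sketched in base R as follows. This only illustrates the concept of a seasonal effect per weekday; the data are synthetic and the computation is an assumption, not how lessR estimates the effects.

```r
# Four weeks of synthetic daily sales, starting on Monday, 2024-01-01.
dates <- seq(as.Date("2024-01-01"), by = "day", length.out = 28)
wday <- as.POSIXlt(dates)$wday              # 0 = Sunday, 1 = Monday, ..., 6 = Saturday
sales <- ifelse(wday == 1, 120, ifelse(wday == 0, 80, 100))

# Mean sales for each day of the week, as a deviation from the overall mean.
day_means <- tapply(sales, wday, mean)
day_effect <- day_means - mean(sales)
```

Here day_effect is positive for Monday and negative for Sunday, the kind of weekly pattern that a yearly seasonal period would not capture.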
Plot()
: New parameter origin_y
to set
the minimum value of the y
axis. The data will not
be truncated: if the value is larger than the smallest y
value, an error is triggered. This parameter complements the existing
parameters origin_x
and scale_y
, which
sets the minimum, maximum, and interval for scaling the y
axis. The value of origin_y is set to 0 by default in relevant situations.
Plot()
: New parameter ts_NA
to specify
treatment of missing values for the y
-variable. By default,
y
missing values, those with value NA
, do not
plot, leaving a blank space. Or, specify a value such as 0 to replace
the NA
to plot that y
-value for the
corresponding date on the x
-axis.
STL()
: Output data structure provided that consists
of four variables: date according to the named date variable and the
extracted trend, season, and error components. Example: s <- STL(Qtr,
Sales) outputs to the data frame s
.
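A data frame of this general shape can be assembled directly from Base R stl(), as in the sketch below. The column names trend, season, and error, the synthetic data, and the choice of s.window = "periodic" are assumptions for illustration and may differ from what STL() returns.

```r
# Synthetic quarterly sales (illustrative data, not from lessR).
set.seed(1)
n <- 40
Qtr <- seq(as.Date("2015-01-01"), by = "quarter", length.out = n)
Sales <- 50 + 0.8 * seq_len(n) + rep(c(4, -2, -5, 3), length.out = n) +
  rnorm(n, sd = 0.5)

# stl() needs a ts object; s.window = "periodic" assumes a stable seasonal pattern.
sales_ts <- ts(Sales, start = c(2015, 1), frequency = 4)
parts <- stl(sales_ts, s.window = "periodic")$time.series

# Assemble four variables: the date plus the trend, season, and error components.
s <- data.frame(
  Qtr = Qtr,
  trend = as.numeric(parts[, "trend"]),
  season = as.numeric(parts[, "seasonal"]),
  error = as.numeric(parts[, "remainder"])
)
```

The three components sum back to the original series, so the data frame can be used to rebuild or adjust the series in later steps.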
pivot()
: When computing a cross-tab table, if the
first variable was a factor the analysis would crash. Now
fixed.
Plot()
: When plotting a time series, x-axis tick
lengths and the margin for value labels are refined.
Plot()
: Refined the extraction of the date variable
and the y
variable from a time series used in internal
processing, such as identifying leap years to provide more generally
accurate dates.
Plot()
: When plotting a time series with multiple
y
values, the y
-axis label now properly
displays.
Plot()
: When plotting a stacked time series, the
vertical legend now displays with the proper colors and in the same
order as the display of the stacked variables.
Plot()
: If plotting daily data with a time unit of
"weeks"
, then dates are plotted as individual dates by days
instead of months.
Plot()
: If plotting a y
categorical
variable and an x
continuous variable, the number of
decimal digits for the displayed statistics is now based on the decimal
digits inherent in the continuous variable instead of set at 1.
Plot()
: For a time series exponential smoothing
forecast, now display SSE and MSE fit indices, the linear trend and
seasonal coefficients, and the obtained smoothing parameter
values.
Plot()
: Improved conversion of character string
numeric dates to R type Date.
Plot()
: Read dates entered as character strings
formatted as:
Plot()
: Forecasting output now written to an output
list object such as p <- Plot(...)
with names out_frcst,
out_fitted, out_coefs, and out_params. This output is accessible for
showing just part of the output, such as p$out_frcst
when
writing R Markdown documents from the results.
STL()
: Set parameter show_range
to
TRUE
to show the range of each component. Previously,
TRUE
was the default.
STL()
: Parameter quiet
added to
suppress text output and parameter do_plot
added to
suppress the visualization.
Plot()
: Exponential smoothing of multiplicative
models now working, indicated by
es_type="multiplicative"
.
Plot()
: Can set style(suggest=FALSE)
for a time series plot.
Plot()
: If a run chart has missing data, a proper
error message is now displayed when attempting to show the runs, which
cannot be computed with missing data.
Plot()
: Turning off the plot with
do_plot=FALSE
, leaving only text output, now
works.
Plot()
: If plotting a run chart with multiple
y
variables, adjacent points are now connected with line
segments for each of the y
variables. Example:
Plot(.Index, c(y1,y2,y3))
.
Plot()
: Exponential smoothing forecasting
implemented with accompanying visualization. New parameters include
time_ahead
for the number of time_units
to
forecast into the future, and time_format
to provide a
specific format for the date variable if not detected correctly by
default. Control aspects of the exponential smoothing estimation and
prediction algorithms with parameters es_level
(alpha),
es_trend
(beta), es_seasons
(gamma),
es_type
for additive or multiplicative seasonality, and
es_PIlevel
for the level of the prediction
intervals.
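The roles of the alpha, beta, and gamma smoothing parameters and of the prediction interval level can be seen in a Base R sketch using HoltWinters() from the stats package. Whether Plot() relies on this function internally is not stated here, so treat the mapping to the es_ parameters as an assumption; the data and the parameter values are synthetic.

```r
# Synthetic monthly series with trend and additive seasonality (illustration only).
set.seed(42)
n <- 72
y <- 200 + 1.5 * seq_len(n) + 10 * sin(2 * pi * seq_len(n) / 12) + rnorm(n, sd = 2)
y_ts <- ts(y, start = c(2018, 1), frequency = 12)

# Fixed smoothing parameters for level (alpha), trend (beta), and seasons (gamma).
# Leaving them unspecified would have HoltWinters() estimate them from the data.
fit <- HoltWinters(y_ts, alpha = 0.3, beta = 0.1, gamma = 0.2,
                   seasonal = "additive")

# Forecast 6 months ahead with 95% prediction intervals.
fc <- predict(fit, n.ahead = 6, prediction.interval = TRUE, level = 0.95)
```

The result fc has one row per forecasted month with the point forecast and the upper and lower interval bounds.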
Plot()
: Character string versions of a date written in a
variety of digit forms, such as “08/18/2024”, are now by default
converted to a variable of type Date
. However, this
conversion is inherently ambiguous, so the time_format
parameter provides a means to specify the precise format if
needed, including other formats such as “August 18, 2024”. Also, a
sequence of four-digit integers within the usual range of dates
converts automatically to a variable of type Date.
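The ambiguity is easy to reproduce with Base R as.Date(), which is used here only to illustrate why an explicit format can be needed; how time_format is passed along inside Plot() is not shown and is not assumed.

```r
# The same digits parse to different dates under different formats.
x <- "03/04/2024"
as_mdy <- as.Date(x, format = "%m/%d/%Y")   # March 4, 2024
as_dmy <- as.Date(x, format = "%d/%m/%Y")   # April 3, 2024

# The example from the text is unambiguous: 18 cannot be a month.
d <- as.Date("08/18/2024", format = "%m/%d/%Y")
bad <- as.Date("08/18/2024", format = "%d/%m/%Y")   # NA, there is no month 18
```

A month-name form such as “August 18, 2024” corresponds to the format "%B %d, %Y", though parsing month names depends on the locale of the R session.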
STL()
: A wrapper for Base R stl()
that
provides additional information and utility:
Date
as the x
-variable and the time series
values as the y
-variable instead of an R time series.
BarChart()
, Histogram()
,
Plot()
: If the x
- or y
- axis
values in the resulting plots all end in 000, then the 000 is replaced
with a K, such as 120000
plotted as an axis value of
120K
.
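The labeling rule can be sketched as a small helper. The function name axis_labels_k is hypothetical and the handling of a zero value is an assumption; this is not the code used inside the plotting functions.

```r
# Hypothetical helper: replace a trailing 000 with K, but only when every
# axis value ends in 000.
axis_labels_k <- function(values) {
  if (all(values %% 1000 == 0) && any(values != 0)) {
    # Keep a plain "0" rather than "0K" (an assumption for this sketch).
    ifelse(values == 0, "0", paste0(values / 1000, "K"))
  } else {
    format(values, scientific = FALSE, trim = TRUE)
  }
}

labels_k <- axis_labels_k(c(40000, 80000, 120000))
labels_plain <- axis_labels_k(c(500, 1000, 1500))
```

The first call abbreviates every label, while the second leaves the labels unchanged because 500 and 1500 do not end in 000.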
LineChart()
: Deprecated for years, now removed as
its functionality has been incorporated into Plot()
and
extended with the x
-variable of type
Date
.
Plot()
: Parameter run
dropped. Instead,
to maintain the position of the variable of interest to be plotted, as
the y
-variable, indicate a run chart by specifying
.Index
as the x
-variable, that is, the first
variable listed. That name, beginning with a period, .
,
indicates that data values for the variable will be created
automatically.
Plot()
: Increase the default size of the dot in the
Cleveland dot plot and related.
Plot()
: When parameter time_unit
is
explicitly set, proper dates are now displayed on the x-axis when there
is a by
variable.
Plot()
: Suggestions are working. To turn off:
style(suggest=FALSE)
.
BarChart()
, Histogram()
,
Plot()
: Parameter names by1
and, where
applicable, by2
, deprecated, replaced with more descriptive
new names facet1
and facet2
.
Plot()
: Using functions from the xts package, when
plotting a time series dates are now formatted according to their
natural unit. For example, when plotting by years, just years are listed
without the month and the day.
Plot()
: When plotting a time series and requesting
a level of time aggregation with the time_unit
parameter
that is more detailed than the available data, the analysis
appropriately terminates with an error message. For example, if the time
series data is monthly and an aggregation of weeks
is
requested, no analysis is done.
Plot()
: For plotting time series data, new parameter
n_date_ticks
added to override the default number of ticks
on the x
axis, the date axis.
Plot()
: For Trellis (facet or lattice) plots of bar
charts and histograms, each panel is labeled with the proper size of
text in the corresponding panel strip.
vignettes
: These extensive examples are moved to the
web to save space as the installed lessR was exceeding the accepted
maximum limit. The web address is given from
library(lessR)
.
BarChart()
: When doing a composite plot, if calling
fill=getColors(...)
for a divergent palette, such as for
one bar for all items on an attitude survey, no longer need to
explicitly add n=
for the number of levels.
getColors()
: Added three pre-defined palettes from
the colorspace
package: rainbow_hcl
,
terrain_hcl
, and heat_hcl
, e.g.,
getColors("rainbow_hcl")
.
getColors()
: Adjusted the margins to better fit the
plot window when the color wheel or rectangle is displayed.
getColors()
: Default Tableau qualitative palette
added as a name to recognize and translate to the colors, also for
visualization functions, such as fill="Tableau"
.
Plot()
: New parameter time_unit
for
when the x-variable is a Date
variable and a time series is
plotted with automatic aggregation of the time unit according to the
specified value, such as "years"
plotted from daily
dates.
Plot()
: New parameter time_agg
for when
the x-variable is a Date
variable and a time series is
plotted with automatic aggregation to specify the type of aggregation
with "sum"
the default.
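Conceptually, aggregating daily dates to years with a sum looks like the Base R sketch below. It uses tapply() on synthetic data purely for illustration and makes no claim about the functions Plot() calls internally.

```r
# Two years of synthetic daily values (illustration only).
dates <- seq(as.Date("2022-01-01"), as.Date("2023-12-31"), by = "day")
y <- rep(1, length(dates))                  # one unit per day keeps totals easy to check

# Aggregate the daily values to years, summing within each year.
year <- format(dates, "%Y")
yearly <- tapply(y, year, sum)
```

Replacing sum with mean in the last line gives the other common form of aggregation.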
Plot()
: Pearson correlation analysis no longer
displayed if the x-variable is a date, that is, if a time series is
plotted.
all visualizations
: Grid lines not affected in some
plots that would add some unintended transparency to the
visualizations.
PieChart()
: User now warned that the
values
family of parameters is changed to
labels
, though the old reference still will work.
Read()
: If quiet=TRUE
, automatically
modified variable names to legal R names are not reported.
all visualizations
: Default colors softened a bit
with default transparency
level moved from 0 to
0.10.
ANOVA()
, BarChart()
,
Histogram()
, PieChart()
, pivot()
,
Plot()
, Regression()
, ttest()
:
Parameter rows
, for subsetting data, deprecated, now named
filter
to be more consistent with other systems, although the
rows
parameter is removed entirely for ANOVA()
,
pivot()
,
Regression()
, and
ttest()
to avoid needing the
rows
parameter as part of the function definition in addition to
filter
.
BarChart()
, Histogram()
,
PieChart()
, Plot()
: Parameter
filter
(formerly rows
), now reports the number
of rows of data before and after the filtering along with the logical
statement that defines the filter.
BarChart()
: Default family of values
parameters now called labels
, such as
labels_color
, to be more consistent with other
systems.
BarChart()
: values_digits
now
labels_decimals
though old value still works.
BarChart()
: Default labels
(formerly
values
) that are labeled on each bar now set at
input
except when aggregating counts, where it remains at
%
.
BarChart()
: When labels
is set to
input
, default decimal digits is now 2, which can be
customized with labels_digits
.
Plot()
: Parameter area_fill
on time
series plots now extends down to the level of the y-variable specified
with scale_y
instead of the minimum value of y.
Plot()
: Binning large data sets now results in
larger bubbles.
dataStockPrice
: Data table updated with stock prices
through May 2024, and trading volume added as the fourth
variable.
BarChart()
: For an unstacked or grouped bar chart,
beside=TRUE
, if there is missing data for the
by
variable, the color distribution across the bars is now
consistent across levels and with the legend.
BarChart()
: For two categorical variables,
x
and by
, evaluation if input data is a
summary (pivot) table of a prior aggregation is now correctly
computed.
interact()
: User-written functions are able to again
use BarChart()
, Histogram()
,
PieChart()
, and Plot()
.
pivot()
: Output variable names now consistently
labeled: variable_stat, e.g., Salary_mean.
Plot()
: Shapes now properly plot with a
by
parameter stratifying according to the specified
categorical variable.
Plot()
: by1
parameter for Trellis
(facet) plot with fit
line specified now properly does not
display text output when quiet=TRUE
.
reshape_wide()
: After the transformation, all other
variables except those specified as the group, response, or ID are
deleted as their values are not evenly distributed over the new wide
format variables.
pivot()
: Fixed a bug on the output. The results were
correct but not formatted correctly.
BarChart()
: Parameter stat_x
added to
specify how to plot the y
-axis if there is no
y
variable specified, with default value of
counts
and option proportion
or
%
. These values are separated from the stat
parameter which now exclusively pertains to the transformation of a
specified y
numerical variable.
BarChart()
, Histogram()
,
Plot()
: Parameters lab_adj
,
margin_adj
, and legend_adj
renamed for clarity
to lab_adjust
, margin_adjust
, and
legend_adjust
.
pivot()
: When pivot table is of counts, using
function table
, the values of n
and
na
are now listed last in the resulting pivot
table.
Plot()
: shape
parameter default for
multiple groups according to parameter by
returns to
"circle"
with the new shape
value
"vary"
if desired to retain a sequence of default shapes
across the different groups.
Plot()
: fit
parameter now set only to
the values that it can analyze such as lm
, so no longer can
be set to TRUE or FALSE.
Read()
and Write()
: Support the modern,
fast, and efficient data file formats from the arrow
package, feather
and parquet
.
details()
: Shows the proportion of all missing data
values in addition to the total, and displays each row of data with
missing data, also obtained with Read()
when
brief=FALSE
.
details()
: miss_zero
parameter
removed.
rename()
: Parameter to
can now specify
a vector of names to change, with the corresponding from
vector of new names.
BarChart()
: If a numerical y variable is specified
for analysis of the original, raw data, then if a value of the
stat
parameter is not specified its value now defaults to
"mean"
.
Plot()
: If a time series is plotted from an x-axis
variable that is not of type Date
, then, setting
segments=TRUE
connects the points (which can vanish if
size=0
), but the line width parameter lwd
was
previously not responsive in this context.
Logit(), Regression()
: For categorical predictor
variables, the conversion to indicator variables now happens before the
analysis. All analyses are now only of numeric variables so that the
correlation matrix, scatterplot matrix, tolerance, and best subset
analyses are now conducted for categorical predictor variables.
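Converting a categorical predictor to indicator variables before the analysis can be sketched with Base R model.matrix(). The data frame and variable names are invented for the example, and the coding shown (first level as the reference) is the Base R default rather than a documented lessR choice.

```r
d <- data.frame(
  y = c(10, 12, 15, 11, 14, 18),
  group = factor(c("a", "b", "c", "a", "b", "c")),
  x = c(1, 2, 3, 4, 5, 6)
)

# Expand the factor into 0/1 indicator columns; the first level is the reference.
X <- model.matrix(~ group + x, data = d)[, -1]   # drop the intercept column

# Every column is now numeric, so a correlation matrix can be computed.
num <- data.frame(y = d$y, X)
r <- cor(num)
```

With all predictors numeric, analyses such as the correlation matrix apply to the indicator variables in the same way as to any other predictor.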
Logit(), Regression()
: Redundant predictor variables
that lead to a singularity are now detected and noted instead of the
function crashing.
Logit()
: For sigmoid function plot for a single
predictor model, right-hand margin adjusted to better display labels of
the target variable of varying lengths.
Regression()
: For the model fit section, the
displayed standard deviation of the response variable is now computed
from the same data from which the model was estimated, which can differ
depending on missing data.
BarChart()
: Missing data is now permitted for the
numerical or y
variable.
pivot()
: Aggregated variables are named the original
name of the aggregated variable concatenated with an underscore and the
name of the operation.
Plot()
: For plots of two or more variables, the
right-margin legend is displayed with a smaller font size and centered
more effectively.
prob_norm()
: Function terminates if value of
lo
is greater than hi
instead of returning a
meaningless result.
getColors()
: Default for parameter
output
is TRUE
only for a direct call from the
console. If embedded in a function call such as for the
fill
parameter, or even if directly called in R Markdown,
then set output=TRUE
to view the calculated
palette.
BarChart()
: Parameter theme
now
works.
getColors()
: Now can be called directly in R
Markdown documents though specify output=TRUE
for output to
be displayed.
pivot()
: Parameter out_names
now works
if the assigned name for a single variable analysis contains the name of
the variable analyzed from the input data frame.
Plot()
: For stacked time series plots, default for
size
of points is now 0.
Plot()
: For lattice plots, if there is only a single
point to plot in a panel, just the point is plotted instead of also
attempting to plot a violin and a box plot, which necessarily
fails.
Plot()
: Variable and axis value labels increased in size
for R Markdown and R by itself analyses.
ANOVA()
, BarChart()
,
Histogram()
, Plot()
,
Regression()
, ttest()
: A tibble that is not in
the user workspace, instead in a package, is now properly converted to a
standard R data frame upon which the internal function’s code depends,
which also permits more flexibility in the packaging of data in general
to these functions, such as with the pipe operator.
ANOVA()
: Overall p-value now available for the
one-way ANOVA as the statistic in an output object, called
p_value
, such as output object a$p_value
in
a <- ANOVA( ... )
.
getColors()
: To address color blindness, the
Okabe-Ito
color palette was added, and available as a
fill
parameter in Plot()
and
BarChart()
.
interact()
: To address color blindness, the
viridis
and Okabe-Ito
were added as
qualitative palettes, such as for the bars of a bar chart.
sort_by()
: New name for the Sort()
function, now deprecated, to maintain consistency of lower-case
functions being utility functions.
Plot()
: For Trellis plots, indicated with
by1
and by2
parameters, assessment of
replications of unique values is done only if 1000 or fewer unique
values to save both space and compute time for information that is not
of such value anyhow.
Plot()
: For a run chart, parameter
run=TRUE
, the successive points in the plot are now by
default joined with line segments, which can be stopped by setting
parameter segments=FALSE
.
Plot()
: Parameters lab_cex
and
axis_cex
for size of value labels and axis labels, scaled
smaller for visualizations in R run alone, without RStudio.
Plot()
: To address color blindness, when there is a
by
grouping, default is now to not only vary the fill color
but also the shape of the plotted points.
Regression()
: Parameter mod
for
moderation analysis in a two-predictor model added.
ttest()
: Parameter quiet
added to turn
off text display at the console if set to TRUE
.
order_by()
: Old function name was
sort_by()
. However, Base R now has a function with that
name, so to avoid confusion, the lessR
function was renamed
to order_by()
.
ANOVA()
: One-way ANOVA with parameter
graphics=FALSE
now works.
BarChart()
, Plot()
: Qualitative color
palette hues
was the default for the default color theme,
colors
, but now also works explicitly when specified as the
value for the fill
parameter.
BarChart()
: Now works for color
parameter specified as a vector.
interact()
: Interactive analyses resets label and
axis plot sizes to larger, now set back when saving the pdf and also
when re-plotting.
PieChart()
: Suggestions for alternative analyses had
wrong lowercase expression piechart()
instead of
PieChart()
.
Plot()
: With a linear fit line, b0 and b1 now
reflect the specified number of digits according to parameter
digits_d
.
Plot()
: For plotting a continuous and a categorical
variable scatterplot, when listing the continuous variable first, the
name of the categorical variable is now listed in the title of the text
output.
Plot()
: For a run chart with parameter
run=TRUE
, parameter scale_x
now properly
allows for a custom set of ticks for the Index
on the
x-axis.
Histogram()
, Plot()
,
Correlation()
, ttest()
: Vastly speeded up for
data sets much over 500 data values, substantial improvement on large
data sets.
Plot()
: Parameters jitter_x
and
jitter_y
are reset for the scatterplot (not VBS plot) to
explicitly set the amount of jitter within the negative and positive
values of their assigned values, or, set to NULL
to
activate the default value of the range of the variable divided by
50.
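The default amount and the symmetric bounds can be illustrated in Base R. The uniform distribution used for the random shift is an assumption of this sketch; the text above specifies only the bounds.

```r
set.seed(7)
x <- c(1, 2, 2, 3, 3, 3, 4, 5)

# Default amount: the range of the variable divided by 50.
jit <- diff(range(x)) / 50                  # (5 - 1) / 50 = 0.08

# Shift each point by a random amount between -jit and +jit
# (uniform noise is an assumption for this sketch).
x_jittered <- x + runif(length(x), min = -jit, max = jit)
```

Setting jit to an explicit value instead of the computed default corresponds to assigning a value to jitter_x or jitter_y.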
Plot()
: Text output for Mahalanobis distance more
nicely formatted.
interact()
: Reading of text files was not detecting
blank data values for variables of type character
as
missing, which now it does.
interact()
: Various revisions of the interface
styles and efficiency increases.
interact()
: A Help
option added for
each analysis, which, when clicked, displays a web page of explanation
of the analysis and listing and definition of each presented
parameter.
interact("BarChart")
: Numerical y
variable and associated stat
parameter added for analysis
of means and related statistics across the levels of the categorical
variable x
.
interact("BarChart")
: The summary table of each
category level paired with a number, to translate into bar height, can
now be read as the input data table.
interact("Plot")
: ID labeling of outliers added.
pivot()
: For a single variable
to
process, the variable name no longer repeated for each column of the
output, one column per statistic computed.
BarChart()
: When plotting a statistic for a
numerical variable, y
, the plotted values are now displayed
at the console with sufficient width.
interact("ScatterPlot")
: Parameter
enhance
now works.
interact("ScatterPlot")
: Choose parameter
enhance
then parameter by
now works.
interact("ScatterPlot")
: Parameters by
and size
now work together and properly written to the R
code file.
interact("Trellis")
: Single VBS now plotted without
a by1
variable for Trellis plots per se.
Regression()
: For parameter Rmd
,
collinearity issue fixed.
ref_group
for Logit()
, new, more
explanatory name for ref
, the level of the binary target
variable explicitly defined as the reference group instead of relying on
the default.
train_test()
has new parameter
matrix_out
which outputs the data structures as matrices
instead of as data frames.
Regression()
: For automatically generating and
processing R markdown files, users can now 100% customize the output
directly by creating their own input files, just one or all eight of the
files.
Regression()
: Added parameters
Rmd_custom
to specify one or more custom input files and
Rmd_dir
to specify a custom directory where the files are
stored.
Regression()
: For output, vars
component added that is a vector of the variable names in the model
beginning with the response variable.
Regression()
: Parameters res_rows
and
pred_rows
changed to the more descriptive
n_res_rows
and n_pred_rows
.
interact("Histogram")
: The bandwidth
slider for the density
option now re-adjusts if a new
variable is selected from the same data set.
interact("Plot")
: Points now plotted
properly.
interact("Plot")
: Fill color properly set if a
by
variable is selected.
Plot()
: Transparency (trans parameter) now works
when size as a variable and by are activated.
Plot()
: for a fit
function, mse now
properly computed.
recode()
: Entire data frame now not written to the
console.
Regression()
: Text enlarged for scatterplot
matrix.
Regression()
: Number of allowable terms in the model
definition now greatly expanded.
Regression()
: When generating a markdown file with
parameter Rmd
, if reading a label file included with
lessR
, the correct data file is now properly listed instead
of the label file.
interact()
: lessR data analysis is now interactive,
with each interactive analysis now displayed for analyzing the user’s
own data, which also includes summary statistics as output with the
option to save the plot to a pdf file plus the corresponding R
code.
BarChart()
: Parameter do_plot
added for
the option of computing summary statistics but not plotting.
Histogram()
: Underlying algorithm more
efficient.
Histogram(..., density=TRUE)
: fill_gen
and color_gen
revised to fill_general
and color_general
, and fill_nrm
and
color_nrm
to fill_normal
and
color_normal
though the old names still accepted, as with
bandwidth
in place of bw
and
include_histogram
in place of
dn.hist
.
Histogram()
: for density curve, provided values for
fill_general
and fill_normal
that are color
names now are converted to the same transparency level as their default
colors.
PieChart()
: value labels now separated from the
category label so that size can be set independently.
pivot()
: Long labels now abbreviated to stay within
column widths.
Plot()
: For numbers < 10000, more decimal digits
for stat outcomes of fit
parameter, such as MSE.
Plot()
: With a by
variable a named,
sequential color range such as “reds” can now be specified in addition
to the qualitative range of “hues”.
tt.brief()
removed, long since replaced by
tt_brief()
.
loading lessR
: Now lessR loads properly if another
previously loaded package had loaded shiny.
Histogram()
: Returns in list structure, such as
h
, saved output from
h <- Histogram(...)
.
Histogram()
: When bin_start
was < 0
the number of displayed digits for the mid point of each bin could be
excessive, now fixed.
Histogram()
: spelling error of out_suggewt to
out_suggest
fixed for output name.
pivot()
: When constructing a table (instead of a
dataframe), NA
’s now properly displayed.
Plot()
: Restored title that indicates scaling of
bubble plot.
Plot()
: Transparency on bubble plots now
works.
Read2()
: Now works properly.
BoxPlot()
, Histogram()
: Recent message
from outlier check from package robustbase
removed.
Logit()
: Confusion matrix now displayed even if all
predictions are for a single outcome category.
pivot()
: If not specified, output defaults to 3
decimal digits, otherwise displays the needed number of decimal digits
to avoid rounding to 0.000.
pivot()
: For consistency, parameter
n_show_group
changed to n_group_show
.
Plot()
: New parameter n_bins
> 1
indicates to bin the numeric x-variable and plot the mean or median of
the numeric y-variable for each bin, with each point size dependent on
the corresponding bin sample size.
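The binning idea can be sketched in Base R with cut() and tapply(). Equal-width bins and the square-root scaling of point size are assumptions made for this example, not a description of what Plot() does with n_bins.

```r
set.seed(3)
x <- runif(500, min = 0, max = 10)
y <- 2 * x + rnorm(500)

n_bins <- 5
bin <- cut(x, breaks = n_bins)              # equal-width bins over the range of x

bin_mean_x <- tapply(x, bin, mean)
bin_mean_y <- tapply(y, bin, mean)          # median is the alternative summary
bin_n <- tapply(y, bin, length)

# Point size grows with the sample size of the bin.
point_size <- 3 * sqrt(bin_n / max(bin_n))
```

The binned summary can then be drawn with, for example, plot(bin_mean_x, bin_mean_y, cex = point_size), giving one point per bin instead of 500 raw points.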
Plot()
: More stats displayed regarding each panel
when doing a fit line.
Read()
: Browsing for data file in RStudio, message
about hidden window removed.
pivot()
: If na_remove
is
FALSE
, the missing data value for the aggregated statistic
was reported as 0 instead of NA
.
Plot()
: y-axis label now evaluated on all axis
values instead of only the maximum value, which, if 1.0, rounds to 1,
only a single digit.
Plot()
: segments=TRUE
now applies to
all scatter plots, not just with by=
.
interact()
: Each interactive display now shows the
underlying function call.
Plot()
: If x
is equally spaced, a line
chart not default if a fit
line.
Plot()
: Parameter sqrt
deprecated for
quat
.
Plot()
: Parameter root
deprecated for
power
.
Plot()
: fit_color
parameter added to
set directly, not just with style()
function.
Plot()
: When curve fitting with fit
parameter, b0
and b1
given for the linearized
line from nonlinear functions.
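For an exponential curve, a linearized line can be obtained by regressing the log of y on x, as in the Base R sketch below. That Plot() linearizes in exactly this way is an assumption; the data are synthetic and follow the curve exactly.

```r
# Synthetic data that follow y = 3 * exp(0.5 * x) exactly (illustration only).
x <- 1:10
y <- 3 * exp(0.5 * x)

# Taking logs turns the exponential curve into a straight line:
#   log(y) = log(3) + 0.5 * x
lin <- lm(log(y) ~ x)
b0 <- unname(coef(lin)[1])                  # intercept of the linearized line, log(3)
b1 <- unname(coef(lin)[2])                  # slope of the linearized line, 0.5
```

Back-transforming with exp(b0) recovers the multiplier 3 of the original curve.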
Prop_test()
: If the hypothesis test of a proportion
is one-sided, the alternative
hypothesis is now
provided.
Histogram()
: density=TRUE
properly
generates console output.
Plot()
: For fit
parameter,
"reciprocal"
option removed as better covered with
"exp"
which also does exponential decay in place of
"reciprocal"
if no by
variable.
Plot()
: fit_color
now works for a
fit
line.
Prop_test()
: parameters such as
alternative
properly passed to R functions
binom.test()
, prop.test()
, and
chisq.test()
.
ttest()
: One-tailed alternatives less
and greater
options work.
ANOVA()
: One-way, scatterplot labels separated more
from plot
Histogram()
: Summary stats headings adjusted to
better align with numerical values
Prop_test()
: Parameter p0
changed to
pi
for consistency with ttest()
,
p0
still works
reshape_long()
: Default name of
Response
changed from "Value"
to
"Response"
style()
: Slightly lighten grayscale bars and points
from "gray35"
to "gray42"
and point
fill
and color
from "gray30"
to
"gray42"
train_test()
: New function to either create training
and testing data from a data frame with x
and
y
combined or separated
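A generic training and testing split looks like the Base R sketch below. It does not call train_test() and does not reproduce its arguments or defaults; the 75% training proportion and all names are assumptions for illustration.

```r
set.seed(123)
d <- data.frame(x1 = rnorm(100), x2 = rnorm(100), y = rnorm(100))

# Randomly assign 75% of the rows to training and the rest to testing.
train_rows <- sample(nrow(d), size = 0.75 * nrow(d))
train <- d[train_rows, ]
test <- d[-train_rows, ]

# "Separated" form: predictors and response held in different objects.
train_x <- train[, c("x1", "x2")]
train_y <- train$y
```

The combined form keeps x and y together in train and test, while the separated form suits functions that take the predictors and the response as distinct arguments.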
Plot()
: a --run-donttest
test example was
improperly configured, now works
Plot()
: Plotting a vector for x
or
y
with fit
now works
ANOVA()
: Interaction plot for 2-way factorial
reflects color theme
BarChart()
: Appropriate error message if
by
and by1
parameters both specified as only
one for one function call is allowed
BarChart()
: Bar chart from a pivot table for two
vars now possible with NA
column
data set
: dataAnova_rbf
data set for
randomized block factorial ANOVA added
data set
: dataAnova_sp
data set for
split-plot design ANOVA added
interact()
: More colors added
interact()
: Trellis plot has added
violin_fill
and box_fill
parameters
pivot()
: by
variables except for
Date
variables by default now converted to factors, though
controlled with parameter: factors
reshape_long()
: If needed, an ID variable is now
provided by default
reshape_long()
: Generated ID
variable
now has a prefix in front of the integer ID by default, can be changed
with the parameter: prefix
reshape_long()
: sep
parameter specifies
a separator between the prefix and the ID integer; output data frame has
ID variable listed first
Transform()
: Previously deprecated, its abbreviation
trans() now removed
details()
: Now properly reports details of a
tibble
version of a data frame
interact()
: Histogram()
fill
now set properly
Plot()
: Panels in Trellis plots with no data now
properly display as blank with grid lines instead of showing an error
message
Plot()
: Interaction plot with
segments=TRUE
of a pivot table now has all
segments
Plot()
: Can now plot one continuous variable with
violin/box/scatter plots with by
, by1
, and
by2
parameters
Plot()
: Bubble plot transparency option now works
with parameter trans
Plot()
: Bubble plot fill
and
color
now properly specified
ANOVA()
: ANOVA 2-way factorial analysis provides
Type II sum of squares for unbalanced designs
ANOVA()
: R-squared and related indices now reported
to three decimal digits
ANOVA()
: Plotted point in interaction plot a little
smaller
ANOVA()
: Scatterplot title removed for more room for
the plot, labels adjusted
ANOVA()
: Scatterplot moved to the first plot
displayed
BarChart()
: Right-margin legend for two variable
chart no longer defaults to abbreviations for title and values, use
legend_abbrev
to specify
Histogram()
: Bars slightly more gray and slightly
less blue
Plot()
: For by
and fit
parameters together, equation, MSE, and R2 given for each fit
line
Regression()
: Analysis of covariance with one
categorical variable and one continuous variable displays the ….
regression line and equation for each level of the categorical variable
superimposed over the scatterplot. Type II SS for the ANOVA table. test
of the interaction of the covariate with the grouping variable
Regression()
: n_cat
parameter added to
allow integer variables to be treated as categorical without declaring
as a formal R factor
Regression()
: kfold
parameter now
applies to models with categorical variables with dummy variables
created automatically
Regression()
: Extra line of spacing between major
sections of output
Regression()
: Grid lines added to
scatterplot
Regression()
: title removed from scatterplot,
remains in text console output
Regression()
: Parameter best_sub
provides the value "Cp"
for Mallows’s "Cp"
statistic in addition to the default value "adjr2"
for
adjusted R-squared
rename()
: New utility function for renaming a
variable in the specified data frame
ANOVA()
: Interaction plot for 2-way factorial
generalizes better beyond two levels; brief output works more reliably
for 2-way factorial
Regression()
: scale_response
parameter
now works, applicable when new_scale
is implemented,
FALSE
by default but if TRUE
then the response
variable y
is also rescaled
ANOVA()
: Better calculation of size of left-margin
for pairwise mean differences plot
Logit()
: For a single predictor, for a given
probability classification threshold, the x
-cutoff value
now provided
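For a single-predictor logistic model, the x value at which the fitted probability equals a given threshold follows from solving b0 + b1 x = logit(threshold). The Base R sketch below uses glm() on synthetic data to show the calculation; it is not taken from the Logit() source.

```r
set.seed(11)
x <- rnorm(200, mean = 50, sd = 10)
p_true <- plogis(-10 + 0.2 * x)
y <- rbinom(200, size = 1, prob = p_true)

fit <- glm(y ~ x, family = binomial)
b0 <- unname(coef(fit)[1])
b1 <- unname(coef(fit)[2])

# Solve b0 + b1 * x = logit(threshold) for x.
threshold <- 0.5
x_cutoff <- (qlogis(threshold) - b0) / b1

# The fitted probability at the cutoff equals the threshold.
p_at_cutoff <- unname(predict(fit, newdata = data.frame(x = x_cutoff),
                              type = "response"))
```

Raising the threshold above 0.5 moves the cutoff to a larger x when the slope is positive, so fewer cases are classified into the target group.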
Logit()
: For a single predictor, annotations added
to the sigmoid curve
Logit()
: ref
parameter for designating
the value of the response variable that is the reference group instead
of relying upon defaults
Logit()
: Rows in the classification table reversed
to correspond to sigmoid curve plot
pivot()
: For consistency with other na
parameters, show_group_na
parameter changed to
na_show_group
Plot()
: Boxplot median line width increased to 2 for
better visibility in the presence of an embedded scatterplot
Logit()
: Bug fixed that occurred for 50% of the
analyses when the response variable has integer input values of 0,1
instead of a factor
such as Man/Woman
Logit()
: Fitted values now properly displayed when
X1_new
, etc. specified
Logit()
: Edge condition that crashed with very
little variance among fitted probabilities now fixed
Plot()
: by1
parameter now works when
x
is categorical
Regression()
: Table of estimates now properly
displayed for indicator variables with large number of characters for
the variable names
BarChart()
: digits_d
parameter added
for text output to control number of decimal digits for two categorical
variable output
BarChart()
: Default gray fill color for the bars
lightened slightly
data()
: dataStockPrice
updated to Jan 1, 2022; dataEmployee_lbl
updated; dataWeightLoss
data file added; dataAnova_1way
data file added; dataAnova_rb
data file added; dataAnova_2way
data file added
factors()
: Now processes tidyverse
tibble
version of a data frame
Help()
: Deprecated, now removed, replaced by
vignettes
interact()
: When prompting for valid names by
passing no arguments, no longer displays an error message
interact()
: More colors added to choice of fill
colors for interactive displays
Logit()
: For a single predictor variable, if the
response y
is non-numeric, then the value set at 1 has the
highest mean, for a positive difference
Logit()
: For a single predictor variable, logistic
curve better labeled
Nest()
: If response variable is a character
variable, automatically converted to a factor
so that the analysis
runs without an error
Plot()
: For Trellis plots, default changed to a
single column (n_col=1
)
Read()
: If a lessR
data file is specified
improperly, the list of valid data files is displayed
Read()
: Read data files in the Open Document
Spreadsheet format, .ods
Read()
: New parameter row_names for a consistent
interface across the various file formats; for csv
files, R
row.names
still works
Read()
: Report of variables with all unique values
moved to brief report
Regression()
: If x
is a factor with two
levels, the regression line is plotted in the scatterplot
Regression()
: Scatterplot can now have customized
axis labels size, set by style()
reshape_long()
: New function that is a simple
wrapper for Base R reshape()
with sensible parameter names
and sensible defaults, and that allows ranges of columns to be specified for the
transfer from wide to long
reshape_wide()
: New function that is a simple
wrapper for Base R reshape()
with sensible parameter names
to convert a long-form data frame to a wide-form
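For context, this is roughly what the underlying Base R `reshape()` calls look like without a wrapper. The data frame and the column names (`id`, `t1`, `t2`, `score`, `time`) are made up for illustration, and the parameter names of `reshape_long()` and `reshape_wide()` themselves are not shown here because they are not given in these notes.

```r
# Wide-form data: one row per id, one column per time point (made-up data)
wide <- data.frame(id = 1:3, t1 = c(5, 6, 7), t2 = c(8, 9, 10))

# Wide to long with Base R reshape()
long <- reshape(wide, direction = "long",
                varying = c("t1", "t2"),  # columns to stack
                v.names = "score",        # name of the stacked value column
                timevar = "time",         # name of the new time index column
                times   = 1:2,
                idvar   = "id")

# Long back to wide
wide_again <- reshape(long, direction = "wide",
                      idvar = "id", timevar = "time", v.names = "score")
```

The number of arguments needed for even this small case suggests why a wrapper with defaults can be convenient.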
Write()
: Parameters – data
and
to
– switched to a more natural order with
data
now the first parameter:
Write(data=, to=, ...)
Write()
: Write data frames in the Open Document
Spreadsheet format, .ods
, with
format="ODS"
Write()
: Write data frames in the SPSS format,
.sav
, with format="SPSS"
ANOVA()
: Family-wise confidence level now reported
in output
BarChart()
: Formatting so that columns do not run
together for some output with integers represented as integers instead
of double
BarChart()
: Specifying one_plot=TRUE
and multiple variables now works
BarChart()
: Custom ylab
now works when
the stat parameter is activated
Histogram()
: For density=TRUE
,
quiet=TRUE
now works
Logit()
: If many variables, such as from created
dummy variables, columns of estimated coefficients now align
correctly
PieChart()
: Text now responds to corresponding style
parameters: lab_cex
, main_cex=1.5
, and
values_size
Plot()
: With a by
or by1
variable, misleading warning message about sort order deleted
ANOVA()
, BarChart()
,
Histogram()
, pivot()
, Plot()
,
Regression()
, ttest()
: Informative error
message added if subsetting with the rows parameter returns no
rows
getColors()
: Little more differentiation between
default sequential colors by lightening a bit more the lightest
color
Histogram()
: Output objects n.bins
and
n.miss
updated to n_bins and n_miss
Histogram()
: density=TRUE
output
provides more summary statistics and suggestions
Plot()
: When specifying a fit line with a by
categorical variable, the sum of squared errors for each category is
better labeled
Plot()
: For a by
categorical variable,
no longer is the correlational analysis done for all the data, but the
SSE is reported for each group
Plot()
: Pairwise correlation output takes up fewer
lines
Plot()
: Variable labels for y
-axis made
a little longer and still fit without breaking into an additional
line
Plot()
: Some suggestions now change depending on
random “coin flip”
Plot()
: Trellis plot default changed from
n_col=1
to n_row=1
Regression()
: Standard deviation of response
variable added to fit information, other stats removed
Regression()
: Parameter fit_line
width
set at 1.5 if plotting without the outlier line and with no standard error
region for the original line
to()
: Parameter same.size
updated to
same_size
, old value still works
BarChart()
: For theme other than
"hues"
, fill
colors the same by default unless
an ordered factor
, then a sequential palette
Correlation()
: Pairwise correlation analysis
properly returns lower and upper bounds of confidence interval about the
sample correlation coefficient
Correlation()
: Pairwise correlation when called from
Correlation()
function works
PieChart()
: Default colors with default color theme,
"colors"
, now has qualitative color palette
"hues"
, the same as bar chart, for a non-ordinal
categorical variable
PieChart()
: Inner circle that creates the hole now
with higher resolution so as to make a smoother circle
PieChart()
: Better placement of the plot title if
there is one
ANOVA()
: Randomized blocks fitted means
visualizations enhanced
ANOVA()
: rb_points
parameter
removed
Histogram()
: For grayscale with a black background,
histogram bars have no transparency and are lighter
interact()
: "PieChart"
option added for
interactive, Shiny display
Logit()
: Grid lines added to logit plot for a single
predictor; transparency of 0.7 added to the points in the logit
plot
pivot()
: If n=0
for an output row or
column, then converting to proportions returns 0’s instead of
NaN
’s
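The NaN arises because a row or column with a total of 0 yields 0/0 when converted to proportions. A minimal base R sketch of the idea, using a made-up count table; the replacement step shown is an assumption about the general approach, not the actual `pivot()` code.

```r
# Made-up 2 x 2 frequency table whose second row has n = 0
counts <- matrix(c(3, 1,
                   0, 0), nrow = 2, byrow = TRUE)

# Row proportions: each cell divided by its row total; 0/0 gives NaN
row_props <- counts / rowSums(counts)
had_nan <- any(is.nan(row_props))

# Replace the undefined proportions with 0
row_props[is.nan(row_props)] <- 0
```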
pivot()
: New parameter show_group_na
with default of TRUE
that will list NA
for
missing data of a grouping variable as a level on the output
PieChart()
: Labels proper size outside of
RStudio
Plot()
: plot_errors
now plots the
connecting segments from each point to the fitted line with gray if
style("gray")
Plot()
: Parameter size
for size of
points now applies to a scatterplot matrix
Plot()
: fill
color for points and
bubbles lightened slightly for gray scale
Plot()
: Trellis line plots increase slightly the
width of the lines
recode()
: Transformed variables converted to be
consistent with their type, e.g., converting a character
variable
to integer
now yields an integer, where before the result was still
character
Regression()
: For parameter new_scale
,
new option is "centered"
Regression()
: For all rescaling, the default is now to rescale only
numeric predictor variables, and only those with more
than two values
Regression()
: If rescaling, set new parameter
rescale_response
to TRUE
to also rescale the
response variable
Regression()
: For model fit, standard deviation and
min and max of the target variable displayed along with the 95% range of
variation of the residuals, with values displayed with commas for large
numbers
Regression()
: Fitted vs residuals plot more
efficiently sized and largest Cook's distance value displayed as a
sub-title
Regression()
: Summary stats of response variable
added to Basic Analysis
Sort()
: Feedback includes names of sorted variables
in place of their index
ttest()
: For one group t-test, density curve now has
dark gray border and value of smd added to the top text display over the
visualization
ANOVA()
: Randomized blocks ANOVA displays correct
visualizations
BarChart()
: Ordinal factors properly display as
color gradients, e.g., "blues"
PieChart()
: theme
other than default
now implemented
Plot()
: For plotting a Trellis plot with a
by
variable, all levels correctly plotted and legend
correct
Plot()
: If x
or y
is
categorical and the other continuous, if a by
variable then
fill
and color
colors properly displayed at
end of display
Sort()
: Internal coding simplified for improved
functionality and generality
ttest()
: pdf_file
now works for one
group t-test of the mean
interact()
: New function for running one of the
following shiny apps for interactive visualizations: "BarChart1",
"BarChart2", "Histogram", "ScatterPlot", "Trellis", e.g.,
interact("BarChart1")
pivot()
: Read a tidyverse tibble
in
addition to standard R data frames
pivot()
: When naming variables, for variable
y
, now y_mean
instead of
y_mn
Plot()
: For x=categorical
and
y=continuous
, or vice versa, no jitter added if there are
no duplications of y
at each level of x
, plus
more jitter added for larger sample size if jitter is needed
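The idea of jittering only when points would otherwise overplot can be sketched in base R as follows. The helper `has_overlap`, the sample data, and the jitter amount are all hypothetical; the rule lessR actually applies, including how jitter scales with sample size, is not specified in these notes.

```r
# Hypothetical helper: TRUE if any (x, y) pair occurs more than once
has_overlap <- function(x, y) any(duplicated(data.frame(x, y)))

x        <- c("a", "a", "b", "b")   # categorical variable
y_unique <- c(1, 2, 1, 2)           # no repeated y within a level of x
y_dup    <- c(1, 1, 2, 3)           # y = 1 repeated within level "a"

x_pos <- as.numeric(factor(x))      # numeric plotting positions for the levels

# Jitter the x positions only if some points coincide
set.seed(1)
x_plot <- if (has_overlap(x, y_dup)) jitter(x_pos, amount = 0.1) else x_pos
```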
BarChart()
: Sort now works properly with a
horizontal plot
pivot()
: Now can have one variable to aggregate and
one aggregation variable named x
Regression()
: Debugging code removed from the k-fold
cross-validation analysis
ANOVA()
: Scatterplot for 1-way ANOVA now has grid
lines and jitter
ANOVA()
: Scatterplot for 1-way ANOVA now has
jitter_x
parameter added to customize the jitter
level
ANOVA()
: Scatterplot for 1-way ANOVA now has means
plotted in a dark red instead of a dark gray
BarChart()
: Tilde removed from each label for a
by
variable in the legend, where it is
nonfunctional
pivot()
: User defined functions accounted for and
named in the output
pivot()
: When analyzing all data values, the
Grand_Stat
label is dropped
Plot()
: Point color and fit line now a little darker
and complements of each other, plot_errors
segments a
little darker as well
Plot()
: For fit="reciprocal"
, fit line
present for all values of x
, even if plot window does not
accommodate the full range, can adjust range with parameter
pad_y
Plot()
: Parameter root
added for
general root transformation beyond the value of 0.5 for the square
root
Plot()
: Parameter fit_power
added to
raise y
to the designated power for the exp
,
root
, and reciprocal
transformations
Plot()
: Correlational analysis no longer displayed
for a non-linear fit line
Plot()
: SSE displayed for corresponding
fit
line (curve)
BarChart()
: Two variable chart with parameters
x
and by
specified now correctly displays a
color range when the style is not the default theme
BarChart()
: Base R text()
function
misleadingly displays text at size 1 if a value of 0 is entered; lessR
functions that rely on the R text() function now fixed by changing an
input value of 0 to 0.01
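The fix described here amounts to a small guard applied before the size reaches `text()`. A minimal sketch, where the helper name `safe_cex` is hypothetical and not a lessR function:

```r
# Hypothetical guard: replace a text size of exactly 0 with 0.01,
# leaving all other sizes unchanged
safe_cex <- function(cex) ifelse(cex == 0, 0.01, cex)

sizes <- safe_cex(c(0, 0.8, 1))
```

The guarded value could then be passed on as the `cex` argument of `text()`, so a requested size of 0 yields effectively invisible text.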
pivot()
: Output variables now properly
named
Plot()
: Trellis plots with by1
and
by2
parameters work in Shiny environment
BarChart()
: If parameter stat
is not
specified, there is now no default if the y
variable is specified without an indication of what statistic to
compute
BarChart()
: Gentle termination with a message if
stats
specifies a transform that results in missing data in
the computed summary table
BarChart()
: When computing a summary
table with the stat
parameter, no longer displays
the summary stats for the x
variable when a by
variable is present
getColors()
: Default qualitative color palette
softened a bit by increasing luminance from 55 to 60
getColors()
: By default, now does not generate
output when called from the visualization parameters
fill
and color
of
BarChart()
, etc.
getColors()
: Output parameter values changed to
logical
, TRUE
and
FALSE
pivot()
: New parameter
table_long
allows a cross-tabs
table to be output in long form
pivot()
: Tabulate computation continues to work,
though no longer documented as it is superseded
BarChart()
: Further adjustment to accommodate very
large legend labels for two variable plots
BarChart()
: If the y values are input, the display
of the values is properly turned off if the
style
option for values is set to
"off"
Plot()
: Default jitter for 1-D scatterplot works
again in VBS plots
Plot()
: Parameter by
works again for
VBS plots, multiple plots on same panel
Plot()
: Parameter color
now works in
conjunction with the by
parameter
Plot()
: Applied to a scatterplot of two continuous
variables: if color
is set to "off"
(or
"transparent"
), a requested fit line
still
displays; means are now properly plotted for a scatterplot of a
continuous and categorical variable when jitter is applied to the
plotted points; default outlier shape in gray scale changes to diamond as
documented
BarChart()
: For a two categorical variable bar
chart, the legend
labels are by default no longer
abbreviated, instead sufficient room is generated by the plot
BarChart()
: New parameter legend_abbrev
allows for the specification of the maximum number of characters to
display for the legend
labels
BarChart()
: For a two categorical variable bar
chart, better default placement of the legend;
new parameter
legend_adj
allows for horizontal adjustment of the
legend
BarChart()
: Specifying the first variable x
as a
vector works again
Histogram()
: Parameter pdf_file
works
Plot()
: Parameter enhance=TRUE
correctly prints least-squares line when fill changed
BarChart()
: When fill or color set with
getColors()
, the value of n, the number of colors, is
provided according to the number of bars
BarChart()
: When horiz is TRUE
, order
of sort changed so that "-"
starts at top
BarChart()
: Space to separate the
x
-axis label when several lines revised; parameter
pdf
renamed to pdf_file
corProp()
: Alias changed from prop
to
cp
, so prop
for new function
Proportion()
Density()
: Parameter pdf
renamed to
pdf_file
Histogram()
: Parameter pdf
renamed to
pdf_file
kurtosis()
: New function for computing kurtosis,
available for the pivot()
function
pivot()
: Choose any two of the following three
parameters: multiple compute functions, multiple values over which to
compute, and multiple grouping (by
) variables
pivot()
: Specifying no by
variables computes
grand total(s) without aggregation
pivot()
: Choose any two of the following three
parameters: multiple compute functions, multiple variables, and multiple
by
-variables.
pivot()
: Sort parameter applies for a single
aggregated numerical variable and specifies the variable or column
number to sort
pivot()
: For each non-decimal digit variable with
unique data values, display
the variable name and the unique values to
better label the output when drilling down into levels of one or more
categorical variables
pivot()
: Compute quantiles()
function
provides a standard data frame
pivot()
: Compute table()
function
provides a cross-classification table for a discrete aggregated
variable
pivot()
: Table frequencies can be converted to
proportions, and row and column proportions
pivot()
: Parameter out_names
to custom-name aggregated variables
pivot()
: Parameter na_by
renamed to
na_by_show
to better communicate its meaning; skew and
kurtosis added to the list of available descriptive statistics
pivot()
: Parameter q_num
specifies
number of intervals for quantiles
pivot()
: Parameter rows
subsets rows of
the data frame for analysis
pivot()
: Parameter value
changed to
variable
pivot()
: Parameters rows_by
and
cols_by
changed to by
and
by_cols
Plot()
: Scatterplot matrix does a loess fit if
fit=TRUE
Plot()
: If multiple x
or y
variables, text correlation analysis displayed for each
combination
Plot()
: If a by
variable, then text
correlation for each level not yet available, so not displayed
Plot()
: Default for plotting a fit line with by
groups is to not plot the standard errors because the plot becomes
overcrowded
Plot()
: Warning displayed if the date field or a
by/by1 field is not ordered
Plot()
: Stacked time series chart of different
shades of the same hue, such as "blues",
now has each sub-plot
with a black border
Plot()
: Trellis plot distinguishes
area_fill
under a line from fill
for a
point
Plot()
: radius
value for bubble plot
frequency matrix dependent on size of
Plot()
: Largest frequency radius
value
displayed for bubble plot frequency matrix
prob_norm()
: Probability is returned as the value of
the function, not just printed
prob_tcut()
: Cutoff is returned as the value of the
function, not just printed
Prop_test()
: New function for the analysis of
proportions: test of homogeneity for one or more samples based on the
proportion of successes for a specific value of a variable, and for a
single sample, goodness-of-fit for a single categorical variable and
test of independence for two categorical variables
Prop_test()
: prop
is alias
recode()
: Changed to lower case function
name
rescale()
: Changed to lower case function
name
Regression()
: Bubble plot form of a scatterplot for
integer
variables with fewer than 10 unique values and a
single predictor variable
skew()
: New function for computing skew, available
for the pivot()
function
style()
: slatered
theme added
style()
: ellipse_fill
for default
"colors"
theme now a pale brown instead of blue
BarChart()
: With more than 30 categories, a message
to indicate how to retrieve now properly displays instead of
crashing
BarChart()
: fill=(count)
would fail for
a large number of levels
BarChart()
: Stacked bars from multiple items now
works with variable labels
BarChart()
: Stacked bars from multiple items now
works for binary responses
Extract()
: Now works if name of data frame is a
function in another context
Extract()
: Now works even if one variable name is
embedded in another name
Plot()
: Forcing an evaluation of a scalar logical on
a vector fixed
Plot()
: theme
parameter works for most
common parameters
Plot()
: Bubble plot frequency matrix text output
works with labels
Plot()
: Bubble plot frequency matrix plot
x
-axis works with binary response
Plot()
: area_fill
works
correctly
ttest()
: For paired analysis, x
-axis
properly labeled Difference