Statistics[FrequencyPlot] - generate frequency plots
|
Calling Sequence
|
|
FrequencyPlot(X, options, plotoptions)
FrequencyPlot['interactive'](X)
|
|
Parameters
|
|
X
|
-
|
data
|
options
|
-
|
(optional) equation(s) of the form option=value where option is one of averageshifted, binbounds, bincount, binwidth, color, discrete, frequencyscale, ignore, maxbins, minbins, or range; specify options for generating the frequency plot
|
plotoptions
|
-
|
options to be passed to the plots[display] command
|
|
|
|
|
Description
|
|
•
|
The first parameter X is either a single data sample - given as a Vector or list - or a list of data samples. Note, that the individual samples may be of variable size.
|
|
|
Options
|
|
|
The options argument can contain one or more of the options shown below. All unrecognized options will be passed to the plots[display] command. See plot[options] for details.
|
•
|
frequencyscale=relative or absolute
|
|
This option controls whether the absolute or relative data frequencies should be plotted. If frequencyscale is set to relative (the default) then the frequency plot will be rescaled to that the area under the plot is equal to 1. Otherwise, the vertical axis gives the exact number of data points in each bin. Using frequencyscale = absolute together with non-constant bin widths (specified using the binbounds option) is not recommended, since the resulting frequency plot may be misleading.
|
|
This option specifies how many frequency plots are used to compute the average. The default value is 1. Values other than 1 are only compatible with binbounds = deduce (the default value for that option).
|
•
|
binbounds=proportional or list or quantiles(...) or quartiles(...) or deciles(...) or percentiles(...) or deduce
|
|
This option specifies the bounds for each of the bins.
|
•
|
binbounds = deduce (the default) specifies that the bins should be equally wide and determined by the bincount, binwidth, maxbins, minbins, and range options. All other values of binbounds override all of these options, unless mentioned otherwise below. This is the only value for binbounds that is compatible with averageshifted being different from 1.
|
•
|
binbounds = [x0, x1, ..., xn] with x0 < x1 < ... < xn specifies that numbers given in the list are the boundaries between the bins. So binbounds = [x0, x1, x2, x3] would specify three bins: from x0 to x1, from x1 to x2, and from x2 to x3. Instead of a list, you can also specify a Vector or other 1-dimensional rtable.
|
•
|
binbounds = quantiles(x0, x1, ..., xn) with 0 <= x0 < x1 < `...` < xn <= 1 is equivalent to specifying the list . See the Quantile help page for details. If the option ignore=true is specified, then it is also passed to the Quantile command.
|
•
|
binbounds = quartiles(x0, x1, ..., xn) or deciles(x0, x1, ..., xn) or percentiles(x0, x1, ..., xn) is similar to quantiles(x0, x1, ..., xn), except the commands used are Quartile, Decile, and Percentile, respectively, instead of Quantile.
|
•
|
binbounds = proportional determines the bin count in the same way as binbounds = deduce, resulting in, say, bins. It then proceeds as binbounds = quantiles(0, 1/n, 2/n, ..., 1). Consequently, every bin has roughly the same number of data points in it.
|
|
The default value is deduce and all available data will be used to generate the frequency plot. If range is specified, all data points outside the given range will be ignored.
|
•
|
bincount=positive or deduce
|
|
This option controls the number of bins for the histogram. If the number of bins is explicitly specified through this option, the maxbins, minbins, and binwidth options will be ignored.
|
•
|
binwidth=positive, Sturges, Scott, or FreedmanDiaconis
|
|
This option controls the bin width (and consequently the number of bins) for the frequency plot. The bin width can be either specified explicitly (which overrides the maxbins and minbins options) or can be calculated using one of three methods: Sturges, Scott, or FreedmanDiaconis. The default value is Sturges.
|
|
This option specifies the maximum number of bins in a frequency plot. If the computed bin width is too small, it will be increased.
|
|
This option specifies the minimum number of bins in a frequency plot. If the computed bin width is too large, it will be decreased.
|
•
|
color=name, list, or range
|
|
This option specifies the color of the frequency plot. When a list or range of colors is given, the first color is used.
|
|
If discrete is set to true, the FrequencyPlot command will tally the data and plot each data point with the corresponding frequency. The default value is false.
|
|
This option is used to specify how to handle non-numeric data. If ignore is set to true all non-numeric items in X will be ignored. The default is false.
|
|
Legend entry for the resulting plot. For information on how to specify s, see plot/typesetting. If s is a list, then it should have as many elements as there are data samples, and the legends and data samples should be given in the corresponding order.
|
|
|
Compatibility
|
|
•
|
The binbounds option was introduced in Maple 16.
|
|
|
Examples
|
|
>
|
|
>
|
|
>
|
|
>
|
|
>
|
|
By default frequencyscale is set to relative.
>
|
|
Ordinary frequency plot. This is the command to create the plot from the Plotting Guide using the data above.
>
|
|
Frequency plot where each point represents equally many data points.
>
|
|
Misleading frequency plot of the same data where specifies bins of different widths and .
>
|
|
| (1) |
>
|
|
Other examples:
>
|
|
>
|
|
>
|
|
>
|
|
>
|
|
The following example shows a sample with extreme outliers.
>
|
|
| (2) |
>
|
|
In this case, it is more useful to consider a smaller range.
>
|
|
In the next example, FrequencyPlot returns an error because the input includes non-numeric data. Use the ignore option to see the plot.
>
|
|
>
|
|
|
|