Histogram - Maple Help

Histograms

Main Concept

A histogram is a graphical representation of a frequency distribution of a sample of data. In a histogram, tabulated frequencies are shown as adjacent rectangles over discrete intervals, known as bins. The area of each rectangle is proportional to the frequency of observations in the interval and the height of a rectangle is equal to the frequency divided by the width of the interval.

The total area of the histogram corresponds to the number of observations in the data sample. If the data has been normalized, the resulting graph displays a relative frequency histogram where each rectangle shows the proportion of observations in its particular interval and the total area of the histogram is equal to 1.
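The relationship between frequency, height, and area described above can be checked with a short sketch. The function name `histogram_heights`, the bin edges, and the sample values are illustrative assumptions, not part of the original page; the last bin is assumed to be closed on the right so that the maximum value is counted.

```python
import bisect

def histogram_heights(data, edges, normalize=False):
    """Return (counts, heights) for bins given by sorted edges (hypothetical helper).

    Each bin is half-open [edges[i], edges[i+1]); the last bin is closed.
    height = frequency / bin width, so area = frequency.
    """
    k = len(edges) - 1
    counts = [0] * k
    for x in data:
        if x < edges[0] or x > edges[-1]:
            continue  # values outside the binned range are ignored
        i = min(bisect.bisect_right(edges, x) - 1, k - 1)
        counts[i] += 1
    total = sum(counts)
    if total == 0:
        raise ValueError("no observations fall inside the bins")
    heights = []
    for i in range(k):
        width = edges[i + 1] - edges[i]
        freq = counts[i] / total if normalize else counts[i]
        heights.append(freq / width)
    return counts, heights

def total_area(edges, heights):
    """Sum of rectangle areas (width times height)."""
    return sum((edges[i + 1] - edges[i]) * h for i, h in enumerate(heights))

sample = [1, 2, 2, 3, 5, 7, 8, 8, 9, 10]   # made-up sample, n = 10
edges = [0, 2, 6, 10]                      # unequal bin widths on purpose
counts, heights = histogram_heights(sample, edges)
_, rel_heights = histogram_heights(sample, edges, normalize=True)
print(counts, heights, total_area(edges, heights))
print(rel_heights, total_area(edges, rel_heights))
```

With unequal bin widths the heights differ from the raw counts, but the total area still equals the number of observations, or 1 in the normalized case.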

The choice of the number of bins is important. However, there is no single best number of bins, because different bin sizes can reveal different features of a data set.

Number of Bins

There are various guidelines for picking the number of bins. In the formulas below, k is the number of bins, h is the bin width, and n is the number of data points. The number of bins, k, can be calculated from a suggested bin width h as follows:

k = $⌈\frac{\text{maximum} - \text{minimum}}{h}⌉$ ,

where the brackets indicate the ceiling function.
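As a minimal sketch, the number of bins k = ⌈(maximum - minimum) / h⌉ can be computed from a suggested bin width h as follows. The function name `bins_from_width` is hypothetical, and returning at least one bin when all values are equal is an added assumption rather than part of the formula.

```python
import math

def bins_from_width(data, h):
    """Number of bins k = ceil((max - min) / h) for a suggested bin width h."""
    if h <= 0:
        raise ValueError("bin width h must be positive")
    k = math.ceil((max(data) - min(data)) / h)
    return max(1, k)  # assumption: use one bin if the data has zero range

sample = [1, 2, 2, 3, 5, 7, 8, 8, 9, 10]   # made-up sample, range = 9
print(bins_from_width(sample, 2))           # ceil(9 / 2) = 5
```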

Square-root choice: The simplest method of deciding on the number of bins is to take the square root of the number of data points.

k = $\sqrt{n}$
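A short sketch of the square-root choice follows. Because the square root is rarely a whole number, the result is rounded up here; that rounding, and the name `sqrt_choice`, are assumptions not stated in the formula above.

```python
import math

def sqrt_choice(n):
    """Square-root choice: k = sqrt(n), rounded up to a whole number of bins."""
    if n < 1:
        raise ValueError("n must be at least 1")
    return math.ceil(math.sqrt(n))

print(sqrt_choice(100))  # sqrt(100) = 10
print(sqrt_choice(50))   # sqrt(50) is about 7.07, rounded up to 8
```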

Sturges' formula: Sturges' formula is derived from a binomial distribution and assumes that the data is normally distributed. It has been known to perform poorly in some cases if n is less than 30 or if the data is not normally distributed.

k = $⌈\log_{2} n + 1⌉$
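Sturges' formula can be sketched in the same way; the function name `sturges` is illustrative only.

```python
import math

def sturges(n):
    """Sturges' formula: k = ceil(log2(n) + 1)."""
    if n < 1:
        raise ValueError("n must be at least 1")
    return math.ceil(math.log2(n) + 1)

for n in (30, 64, 100, 10000):
    print(n, sturges(n))
```

Because k grows only logarithmically with n, the formula suggests relatively few bins even for large samples.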

Scott's normal reference rule: Scott's normal reference rule minimizes the integrated mean squared error of the density estimate and is well suited for random samples of normally distributed data.

h = $\frac{3.5 \hat{\sigma}}{n^{\frac{1}{3}}}$ ,

where $\hat{\sigma}$ is the sample standard deviation.
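A minimal sketch of Scott's bin width, using the sample standard deviation from Python's standard `statistics` module. The function name `scott_width` and the sample values are made up for illustration.

```python
import statistics

def scott_width(data):
    """Scott's normal reference rule: h = 3.5 * sigma_hat / n^(1/3)."""
    n = len(data)
    if n < 2:
        raise ValueError("at least two data points are required")
    sigma_hat = statistics.stdev(data)  # sample standard deviation (n - 1 denominator)
    return 3.5 * sigma_hat / n ** (1 / 3)

sample = [2, 4, 4, 4, 5, 5, 7, 9]   # made-up sample, n = 8
print(scott_width(sample))
```

The resulting width h can then be converted into a number of bins by dividing the range of the data by h and rounding up.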

Freedman-Diaconis' choice: The Freedman-Diaconis choice is based on the interquartile range (IQR). Because it uses the interquartile range rather than the standard deviation, it is less sensitive to outliers in the data than Scott's normal reference rule.

h = $\frac{2 \, \mathrm{IQR}(\text{Sample})}{n^{\frac{1}{3}}}$
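The Freedman-Diaconis width can be sketched as below. Quartile conventions differ between tools; this sketch assumes the default "exclusive" method of `statistics.quantiles`, so other software may return a slightly different IQR. The function name `freedman_diaconis_width` and the samples are illustrative.

```python
import statistics

def freedman_diaconis_width(data):
    """Freedman-Diaconis choice: h = 2 * IQR / n^(1/3)."""
    n = len(data)
    if n < 2:
        raise ValueError("at least two data points are required")
    q1, _median, q3 = statistics.quantiles(data, n=4)  # quartiles, default method
    return 2 * (q3 - q1) / n ** (1 / 3)

clean = [1, 2, 3, 4, 5, 6, 7, 8]
with_outlier = [1, 2, 3, 4, 5, 6, 7, 800]   # largest value replaced by an outlier
print(freedman_diaconis_width(clean))
print(freedman_diaconis_width(with_outlier))
```

In this example the outlier does not move the quartiles, so both calls give the same width, whereas a rule based on the standard deviation would change substantially.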
