Statistics - Maple Programming Help

Online Help

All Products    Maple    MapleSim


Home : Support : Online Help : Statistics and Data Analysis : Statistics Package : Visualization : Statistics/GridPlot

Statistics

  

GridPlot

  

generate a grid of plots

 

Calling Sequence

Parameters

Options

Examples

Compatibility

Calling Sequence

GridPlot( dataset, options, TabulateOptions )

Parameters

dataset

-

Matrix or DataFrame containing 2 or more columns of values

options

-

(optional) equation(s) of the form option=value, where option is one of upper, lower, diagonal, correlation, labels, or plotoptions

TabulateOptions

-

options to be passed to the DocumentTools:-Tabulate command

Options

• 

upper : name, procedure, list(procedure, anything); specifies the type of plot to use in the upper triangle of the grid of plots. The default is Statistics:-ScatterPlot. If this is set to none, the upper triangle is empty. If a procedure or Maple command is entered, the upper triangle is filled with the results of using the procedure on the ith and jth columns of data. If a procedure is entered as the first value of a list, subsequent arguments are passed to the procedure.

• 

lower : name, procedure, list(procedure, anything); specifies the type of plot to use in the lower triangle of the grid of plots. The default is upper, which uses the same output as the upper triangle. If this is set to none, the lower triangle is empty. If a procedure or Maple command is entered, the lower triangle is filled with the results of using the procedure on the ith and jth columns of data. If a procedure is entered as the first value of a list, subsequent arguments are passed to the procedure.

• 

diagonal : name, procedure, list(procedure, anything); specifies the type of plot or value to use on the diagonal of the grid of plots. The default is labels, which uses either the value of the labels option, the DataFrame column names, or a list of values corresponding to the column index value. If a procedure or Maple command is entered, the diagonal is filled with the results of using the procedure on the ith columns of data. In this case, labels are passed as titles to the procedure. If a procedure is entered as the first value of a list, subsequent arguments are passed to the procedure.

• 

labels : list; a list of values corresponding to each of the columns of data. If the dataset is stored in a DataFrame, the labels are automatically generated using the column names and can be overridden using the labels option.

• 

correlation : list(truefalse, truefalse, truefalse); A three element list containing truefalse values that specify if the values from the Statistics:-CorrelationMatrix command should be used. The default is [false, false, false]. The elements in the list correspond to using the values from the correlation matrix on [ upper, lower, diagonal ] respectively.

• 

plotoptions : list(exprseq); A list containing optional arguments of the form, plot attribute = value, to be passed to all plots in GridPlot.

Examples

withStatistics:

The GridPlot command can be used to look for patterns in higher dimensional datasets. In the following example, the columns of a sample data set are plotted against one another in order to look for possible correlation between the columns.

dataSampleUniform0,1,50,3:

GridPlotdata

Tabulate

(1)

1

2

3

 

Global plot options can be passed to all plots in the grid using plotoptions:

GridPlotdata,plotoptions=symbol=solidbox,symbolsize=20,color=Orange,width=40

Tabulate0

(2)

1

2

3

 

The Iris dataset contains measurements in centimeters for several properties of 150 flowers from 3 species of iris. In the following example, the GridPlot command is used to look for patterns between the properties of the flowers.

IrisDataImportMatrixFileTools:-JoinPathdatasets,iris.csv,base=datadir,skiplines=1

IrisData 150 x 5 MatrixData Type: anythingStorage: rectangularOrder: Fortran_order

(3)

IrisLabelsSepal length,Sepal width,Petal length,Petal width

IrisLabelsSepal length,Sepal width,Petal length,Petal width

(4)

The Iris data can also be stored in a DataFrame.

IrisDFDataFrameIrisData..,1..4,columns=IrisLabels

IrisDFSepal lengthSepal widthPetal lengthPetal width15.13.51.40.224.931.40.234.73.21.30.244.63.11.50.2553.61.40.265.43.91.70.474.63.41.40.3853.41.50.2...............

(5)

Since the data is stored in a DataFrame, the GridPlot command can automatically determine the labels for the diagonal of the grid from the column names. The upper and lower options control the types of plots to show on the upper triangle and lower triangle of the matrix respectively. The width option is passed to the DocumentTools:-Tabulate command, and controls the size of the resulting grid of plots.

GridPlotIrisDF,upper='ScatterPlot',lower='AgglomeratedPlot',color=Crimson,plotoptions=color=Grey,width=600,widthmode=pixels

Tabulate1

(6)

Sepal length

Sepal width

Petal length

Petal width

 

Note that global plot options set by the plotoptions option can be locally overridden by specifying plot options in the upper, lower or diagonal arguments.

Additional options such as width and fillcolor are passed to the DocumentTools:-Tabulate command. This means that custom coloring can be applied to the cells of the grid of plots. In the following example, a custom coloring scheme is first created for the upper triangle of plots and passed to the plots:-pointplot command. Next, a custom coloring scheme is applied to the empty lower triangle of the grid that corresponds to a HeatMap for the CorrelationMatrix of the Iris dataset.

SpeciesColorsmapt→piecewiset=setosa,DarkGreen,t=versicolor,MediumSlateBlue,t=virginica,MediumVioletRed,IrisData..,5:

GridPlotIrisData..&comma;1..4&comma;upper&equals;&apos;plots:-pointplot&apos;&comma;color&equals;SpeciesColors&comma;diagonal&equals;&apos;Histogram&apos;&comma;style&equals;polygon&comma;lower&equals;none&comma;labels&equals;IrisLabels&comma;width&equals;600&comma;widthmode&equals;pixels&comma;fillcolor&equals;T&comma;i&comma;j&rarr;piecewisej<i&comma;ColorTools:-ToRGB24ColorTools:-BlendWhite&comma;Grey&comma;Statistics:-CorrelationMatrixIrisData..&comma;1..4i&comma;j&comma;ij&comma;255&comma;255&comma;255

Tabulate2

(7)

 

 

 

 

 

 

 

From using the custom coloring scheme, it can be observed that the cell in the 4th row and 3rd column shows the darkest color. The coloring used colors from white to black to indicate correlation values from 0 to 1, meaning that in this case, their is a higher level of correlation between the Pedal Length and Pedal Width variables.

The correlation option is useful for showing more details on correlation between columns of data and in order to generate Correlograms for multivariate data. For example, if the values from the correlation matrix are used for the lower triangle, the following plot can be generated:

GridPlotIrisData..&comma;1..4&comma;upper&equals;&apos;plots:-pointplot&apos;&comma;color&equals;SpeciesColors&comma;lower&equals;&apos;x&rarr;Statistics:-PieChart &equals;x&comma; &equals;1x&comma;color&equals;CornflowerBlue&comma;WhiteSmoke&comma;title&equals;evalf&lsqb;3&rsqb;x&comma;size&equals;100&comma;100&apos;&comma;correlation&equals;false&comma;true&comma;false&comma;labels&equals;IrisLabels&comma;width&equals;600&comma;widthmode&equals;pixels

Tabulate3

(8)

Sepal length

Sepal width

Petal length

Petal width

 

Compatibility

• 

The Statistics[GridPlot] command was introduced in Maple 2016.

• 

For more information on Maple 2016 changes, see Updates in Maple 2016.

See Also

Statistics

Statistics[Visualization]

Statistics[CorrelationMatrix]

Statistics[Correlation]