HodgesLehmann - Maple Help

All Products Maple MapleSim

Home : Support : Online Help : Statistics and Data Analysis : Statistics Package : Quantities : HodgesLehmann

Statistics

HodgesLehmann

compute Hodges and Lehmann's location estimator

Random Variable Options

Examples

References

Compatibility

	Calling Sequence
	HodgesLehmann(A, ds_options) HodgesLehmann(X, rv_options)

Parameters

A	-	data set or Matrix data set
X	-	algebraic; random variable or distribution
ds_options	-	(optional) equation(s) of the form option=value where option is one of correction, ignore, or weights; specify options for computing Hodges and Lehmann's location statistic of a data set
rv_options	-	(optional) equation of the form numeric=value; specifies options for computing Hodges and Lehmann's location statistic of a random variable

Description

•

The HodgesLehmann function computes a robust measure of the location of the specified data set or random variable, as introduced by Hodges and Lehmann in [2] and independently by Sen in [3]. This statistic is variously called the Hodges-Lehmann-Sen estimator, the Hodges-Lehmann estimator, the Hodges-Lehmann-Sen statistic, or the Hodges-Lehmann statistic.

•	The Hodges-Lehmann statistic, referred to as $HLE$ in the remainder of this help page, is defined for a data set $A_{1}, A_{2}, ..., A_{n}$ as:

$HLE = \frac{Median ([seq (seq (A_{i} + A_{j}, i = 1 .. n), j = 1 .. n)])}{2}$

•	The Hodges-Lehmann statistic is a reasonably robust statistic: it has a fairly high breakdown point (the proportion of arbitrarily large observations it can handle before giving an arbitrarily large result). The breakdown point of $HLE$ is $1 - \frac{\sqrt{2}}{2}$ or about $0.29$ .

•	The Hodges-Lehmann statistic is a measure of location: if $HodgesLehmann (X) = a$ , then for all real constants $α$ and $β$ , we have $HodgesLehmann (α X + β) = α a + β$ .

•

The first parameter can be a data set, a distribution (see Statistics[Distribution]), a random variable, or an algebraic expression involving random variables (see Statistics[RandomVariable]). For a data set $A$ , HodgesLehmann computes the Hodges-Lehmann statistic as defined above. For a distribution or random variable $X$ , HodgesLehmann computes the asymptotic equivalent - the value that the Hodges-Lehmann statistic converges to for ever larger samples of $X$ .

Computation

•	By default, all computations involving random variables are performed symbolically (see option numeric below).

•	All computations involving data are performed in floating-point; therefore, all data provided must have type/realcons and all returned solutions are floating-point, even if the problem is specified with exact values.

•	For more information about computation in the Statistics package, see the Statistics[Computation] help page.

Data Set Options

•	The ds_options argument can contain one or more of the options shown below. More information for some options is available in the Statistics[DescriptiveStatistics] help page.

•

ignore=truefalse -- This option controls how missing data is handled by the HodgesLehmann command. Missing items are represented by undefined or Float(undefined). So, if ignore=false and A contains missing data, the HodgesLehmann command may return undefined. If ignore=true all missing items in A will be ignored. The default value is false.

•	weights=Vector -- Data weights. The number of elements in the weights array must be equal to the number of elements in the original data sample. By default all elements in A are assigned weight $1$ .

Random Variable Options

The rv_options argument can contain one or more of the options shown below. More information for some options is available in the Statistics[RandomVariables] help page.

•	numeric=truefalse -- By default, the Hodges-Lehmann statistic is computed using exact arithmetic. To compute the Hodges-Lehmann statistic numerically, specify the numeric or numeric = true option.

Examples

>	$with (Statistics) &colon;$

Compute the Hodges-Lehmann statistic for a data sample.

>	$s ≔ ⟨1, 5, 2, 2, 7, 4, 1, 6⟩$

$s ≔ [\begin{array}{c} 1 \\ 5 \\ 2 \\ 2 \\ 7 \\ 4 \\ 1 \\ 6 \end{array}]$

(1)

>	$HodgesLehmann (s)$

$3.50000000000000$

(2)

Let's replace two of the values with very large values.

>	$t ≔ copy (s) &colon;$

>	$t [1 .. 2] ≔ 10^{100} &colon;$

$t$

$[\begin{array}{c} 10000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000 \\ 10000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000 \\ 2 \\ 2 \\ 7 \\ 4 \\ 1 \\ 6 \end{array}]$

(3)

>	$HodgesLehmann (t)$

$5.75000000000000$

(4)

The Hodges-Lehmann statistic stays bounded, because it has a high breakdown point.

Compute the Hodges-Lehmann statistic for an exponential distribution.

>	$HodgesLehmann ('Exponential' (1),'numeric')$

$0.839173495008330$

(5)

The symbolic result below evaluates to the same floating point number if the parameter is 1.

>	$HodgesLehmann ('Exponential' (b))$

$- \frac{(LambertW (−1, - \frac{{&ExponentialE;}^{−1}}{2}) + 1) b}{2}$

(6)

>	$evalf (eval (, b = 1))$

$0.8391734950$

(7)

Generate a random sample of size 1000000 from the same distribution and compute the sample's Hodges-Lehmann statistic.

>	$A ≔ Sample ('Exponential' (1), 1000000) &colon;$

>	$HodgesLehmann (A)$

$0.840272671959114$

(8)

Consider the following Matrix data set.

>	$M ≔ Matrix ([[3, 1130, 114694], [4, 1527, 127368], [3, 907, 88464], [2, 878, 96484], [4, 995, 128007]])$

$M ≔ [\begin{array}{c} 3 & 1130 & 114694 \\ 4 & 1527 & 127368 \\ 3 & 907 & 88464 \\ 2 & 878 & 96484 \\ 4 & 995 & 128007 \end{array}]$

(9)

We compute the Hodges-Lehmann statistic for each of the columns.

>	$HodgesLehmann (M)$

$[\begin{array}{c} 3. & 1018.50000000000 & 111926. \end{array}]$

(10)

References

[1] Stuart, Alan, and Ord, Keith. Kendall's Advanced Theory of Statistics. 6th ed. London: Edward Arnold, 1998. Vol. 1: Distribution Theory.

[2] Hodges, Joseph L.; Lehmann, Erich L. Estimation of location based on ranks. Annals of Mathematical Statistics 34 (2), 1963, pp.598-611.

[3] Sen, Pranab K. On the estimation of relative potency in dilution(-direct) assays by distribution-free methods. Biometrics 19(4), 1963, pp.532-552.

Compatibility

•	The Statistics[HodgesLehmann] command was introduced in Maple 18.

•	For more information on Maple 18 changes, see Updates in Maple 18.

Maple

Maple Add-Ons

Math Success Platform

Improving Retention Rates

Maple Flow

MapleSim

Consulting Services

Maple T.A. and Möbius

Education

Industries

Automotive and Aerospace

Robotics

Machine Design & Industrial Automation

Other

Application Areas

Product Pricing

Purchasing

Institutional Student Licensing

Maplesoft Elite Maintenance (EMP)

Support

Product Training

Online Product Help

Webinars & Events

Publications

Content Hubs

Examples & Applications

Community

About Maplesoft

Media Center

User Community

Contact

Online Help

All Products Maple MapleSim

Maple

Powerful math software that is easy to use

Maple Add-Ons

Math Success Platform

Improving Retention Rates

Maple Flow

Engineering calculations & documentation

MapleSim

Advanced System Level Modeling

Consulting Services

Maple T.A. and Möbius

Education

Industries

Automotive and Aerospace

Robotics

Machine Design & Industrial Automation

Other

Application Areas

Product Pricing

Purchasing

Institutional Student Licensing

Maplesoft Elite Maintenance (EMP)

Support

Product Training

Online Product Help

Webinars & Events

Publications

Content Hubs

Examples & Applications

Community

About Maplesoft

Media Center

User Community

Contact

Online Help

All Products Maple MapleSim