Home  >>  Archives  >>  Volume 15 Number 1  >>  st0370

The Stata Journal
Volume 15 Number 1: pp. 77-94



Subscribe to the Stata Journal
cover

Time-efficient algorithms for robust estimators of location, scale, symmetry, and tail heaviness

Wouter Gelade
University of Namur
Centre of Research in the Economics of Development (CRED)
Namur, Belgium
[email protected]
Vincenzo Verardi
University of Namur
Centre of Research in the Economics of Development (CRED)
Namur, Belgium
and Université libre de Bruxelles
ECARES and iCite
Brussels, Belgium
[email protected]
Catherine Vermandele
Université libre de Bruxelles
Laboratoire de Méthodologie du Traitement des Données (LMTD)
Brussels, Belgium
[email protected]
Abstract.  The analysis of the empirical distribution of univariate data often includes the computation of location, scale, skewness, and tail-heaviness measures, which are estimates of specific parameters of the underlying population distribution. Several measures are available, but they differ by Gaussian efficiency, robustness regarding outliers, and meaning in the case of asymmetric distributions. In this article, we briefly compare, for each type of parameter (location, scale, skewness, and tail heaviness), the “classical” estimator based on (centered) moments of the empirical distribution, an estimator based on specific quantiles of the distribution, and an estimator based on pairwise comparisons of the observations. This last one always performs better than the other estimators, particularly in terms of robustness, but it requires a heavy computation time of an order of n^2. Fortunately, as explained in Croux and Rousseeuw (1992, Computational Statistics 1: 411–428), the algorithm of Johnson and Mizoguchi (1978, SIAM Journal of Scientific Computing 7: 147–153) allows one to substantially reduce the computation time to an order of n log n and, hence, allows the use of robust estimators based on pairwise comparisons, even in very large datasets. This has motivated us to program this algorithm for Stata. In this article, we describe the algorithm and the associated commands. We also illustrate the computation of these robust estimators by involving them in a normality test of Jarque–Bera form (Jarque and Bera 1980, Economics Letters 6: 255–259; Brys, Hubert, and Struyf, 2008, Computational Statistics 23: 429–442) using real data.
Terms of use     View this article (PDF)

View all articles by these authors: Wouter Gelade, Vincenzo Verardi, Catherine Vermandele

View all articles with these keywords: mhl, sqn, medcouple, robjb, location, scale, symmetry, tail heaviness, mean, median, skewness, kurtosis, robust estimation

Download citation: BibTeX  RIS

Download citation and abstract: BibTeX  RIS