Normal and lognormal data distribution in geochemistry: death of a myth. Consequences for the statistical treatment of geochemical and environmental data |
| |
Authors: | C Reimann P Filzmoser |
| |
Institution: | (1) Geological Survey of Norway, N-7491 Trondheim, Norway e-mail: Clemens.Reimann@ngu.no, NO;(2) Department of Statistics, Probability Theory and Actuarial Mathematics, Vienna University of Technology, Wiedner Hauptstr. 8–10, A-1040 Vienna, Austria e-mail: P. Filzmoser@tuwien.ac.at, AT |
| |
Abstract: | All variables of several large data sets from regional geochemical and environmental surveys were tested for a normal or
lognormal data distribution. As a general rule, almost all variables (up to more than 50 analysed chemical elements per data
set) show neither a normal or a lognormal data distribution. Even when different transformation methods are used more than
70 % of all variables in every single data set do not approach a normal distribution. Distributions are usually skewed, have
outliers and originate from more than one process. When dealing with regional geochemical or environmental data normal and/or
lognormal distributions are an exception and not the rule. This observation has serious consequences for the further statistical
treatment of geochemical and environmental data. The most widely used statistical methods are all based on the assumption
that the studied data show a normal or lognormal distribution. Neglecting that geochemcial and environmental data show neither
a normal or lognormal distribution will lead to biased or faulty results when such techniques are used.
Received: 21 June 1999 · Accepted: 14 August 1999 |
| |
Keywords: | Normal distribution Lognormal distribution Geochemistry Exploratory data analysis Multivariate normal distribution Robust methods Non-parametric methods Median |
本文献已被 SpringerLink 等数据库收录! |
|