Shifting Boundaries, Shifting Results:

 The Modifiable Areal Unit Problem

Lisa Oliver
Geog 516  2001

Openshaw and Alvanides (1999) warn that GIS users need to seriously consider how the zones of analysis effect results.  If relations between variables change with the selection of different areal units, the reliability of results is called into question. The effect of the selection of areal units on analysis, is termed the modifiable areal unit problem (MAUP), it is formally defined as:

“a problem arising from the imposition of artificial units of spatial reporting on continuous geographical phenomenon resulting in the generation of artificial spatial patterns (1998 Heywood).”

The MAUP had been most prominent in the analysis of socio-economic and epidemiological data (see 1999 Wong, Lasus, Falk; 2000 Nakaya; 1999 Openshaw and Alvandies).  Such areal data cannot be measured at a single point, but must be contained within a boundary to be meaningful.  For example, it is not possible to measure the percent of low birth-weight babies at a single point, this percentage must be calculated within a defined area.  It is the selection of these artificial boundaries and their use in analysis that produces the MAUP.

The effects of the MAUP can be divided into two components: the scale effect and the zonation effect (1995 Armhein).  The scale effect is the variation in numerical results that occurs due to the number of zones used in an analysis. For example, the difference in numerical results between mortality rates by municipality and health area in British Columbia is a scale effect. The zonation effect is the variation in numerical results arising from the grouping of small areas into larger units.  For example, using Canada census data, numerical differences in employment rates between a census tract data and its enumeration area would be a zonation effect

It is necessary to understand the ways in which the MAUP effects the results of statistical analysis.  Caution, however, is required, as there is a random aspect to the effects of the MAUP.  It may be difficult to generalise how different data sets with different spatial units are effected by the MAUP.  This caution aside, the use of small areal units has a tendency to provide unreliable rates because the population used to calculate the rate is smaller. On the other hand, using larger areal units will provide more stable rates but may mask meaningful geographic variation evident with smaller areal units (Nakaya 2000).  Choosing between the scale of zones depends upon the particular use and requirements of the data.    

Armheim (1995) in his study of a region, aggregated data into three scales of analysis and calculated mean, variance, regression and correlation coefficients.  This study found that regression correlation coefficients increase as data is aggregated into larger spatial units.  Intuitively, this makes sense because unstable rates tend to be averaged when aggregated into larger units, thereby increasing correlation coefficients. According to Armheim (1995) mean and variance are more stable as areal units of analysis change. Example 1 shows the change in means that occurs when smaller units are aggregated into larger units. Given the susceptibility of statistical results to change at different scales of analysis ecological fallacy needs to be considered.  For example, it is unlikely that an increase in correlation with larger areal units reflects stronger correlation at the individual level.


Example 1

Modifiable Areal Unit Problem:  Scale Effects (a,b) and Zoning Effects (c,d)

x=8.88; n=9 

 

 

x=8.33;  n=3


a)                                                                                 b)

x=8.47;  n=3 

 

 

x=9.33; n=3
c)                                                                                                                                                                  d)

x   =  mean

These few paragraphs have discussed the MAUP and its effects on analysis.  It has, however, avoided discussing solutions to the problem.  There are two reasons for this omission: (1) researchers have only begun to unpack the effects of the MAUP on analysis and; (2) few generic and practical solutions exist.  The weighing areal units by population, as well as complex statistical procedures are currently being researched to address the MAUP. A simple strategy to deal with the problem, is to undertaking analysis at multiple scales or zones. Despite the lack of solutions, being cognisant of the fact that analysis results may be dependent on the zones used to aggregate data is an important step.

References:

Armhein C 1995 Searching for the elusive aggregation effect: Evidence from statistical simulations. Environment & Planning A, Jan95, Vol. 27 Issue 1, p105

Heywood 1998 Introduction to Geographical Information Systems. New York: Addison Wesley Longman.

Nakaya, T 2000 An information statistical approach to the modifiable areal unit problem in incidence rate maps. Environment & Planning A, Vol. 32 Issue 1

Openshaw S, Alvandies S 1999 Applying geocomputation to the analysis of spatial distributions.  In Longley P, Goodchild M, Maguire D, Rhind, D (eds) Geograhpic Information Systems: Principles and Technical Issues. Vol 1, 2nd ed. New York: John Wiley and Sons Inc. 


Song D, Lasus W 1999 Exploring the variability of segregation index D with scale and zonal systems: an analysis of... H.; Environment & Planning A, Vol. 31 Issue 3, p507 

Links and resources on Scale and the MAUP

http://www.sfu.ca/geog/geog452spring00/

- A scale dependent analysis of gentrification in Vancouver’s Downtown Eastside.  Myself and four others attempted to find evidence of gentrification using City neighbourhoods, census tracts, enumeration areas, and individual properties. BC Assessment data and census data was used.

http://www.edcenter.sdsu.edu/staff/ilya/wmu/567/lect/567_l6.html#maup

-lecture notes on the MAUP

http://www.esri.com/library/userconf/proc97/proc97/to300/pap298/p298.htm

-on-line paper that addresses the modifiable areal unit problem

Goodchild M, Proctor J 1997 Scale in a digital world. Geographical and Environmental Modelling vol 1, no 1