A Cluster Analysis Approach to Comparing Atmospheric Radiation Measurement (ARM) Data with Global Climate Model (GCM) Results

Forrest M. Hoffman, Salil Mahajan, William W. Hargrove, Richard T. Mills, Anthony Del Genio

December 2007

Cluster analysis was employed to compare ARM observational data at the Southern Great Plains (SGP) site with corresponding 6-hourly output from an integration of the Community Climate System Model (CCSM) run under the IPCC SRES A2 scenario for the current decade. Cluster analysis is a technique for classifying multivariate data into distinct regimes or states based on Euclidean distance in a phase space formed from the variables under consideration. A parallel clustering algorithm developed at Oak Ridge National Laboratory (ORNL) and designed for analyzing very large datasets was applied to obtain atmospheric column states from observations at the SGP site and, separately, from CCSM results based on vertical temperature, humidity and wind speed tropospheric profiles and surface pressure. A three way process was implemented to compare ARM data with GCM output, where 1) CCSM output was projected onto states derived from ARM observations, 2) ARM observations were projected onto states derived from CCSM output, and 3) both ARM observations and CCSM output were projected onto states derived from the combination of the two datasets. Comparisons of twelve atmospheric states derived from the combination of ARM observations and CCSM output indicate that distinct singular states exist in each dataset. As shown in the figure, state number 5 has no analog in the ARM observational data while states numbered 1, 3, 7, and 11 are never captured by CCSM. State number 5 is characterized by very high humidity and temperature at the surface, and it has no analog in the observational data. States 1, 3, and 7 have very low frequency in the observations (see frequency plot), so their absence from model predictions does not suggest a problem. However, state 11, which is characterized by high humidity and temperature with very low wind shear, is never predicted by CCSM. In addition, CCSM predicts an over-abundance of state 9 (low humidity and high temperature conditions) while under-representing state 4 (moderate humidity, temperature, and shear conditions). Misrepresentation of atmospheric states in CCSM over the SGP site could have impacts on predictions of cloud formation and hence the local radiation budget.

Mean tropospheric profiles of twelve distinct atmospheric states for ARM observations (left) and corresponding CCSM output (right).
Mean tropospheric profiles of twelve distinct atmospheric states for ARM observations (left) and corresponding CCSM output (right) derived from cluster analysis of the combination of ARM observations and CCSM output. Shown here are temperature profiles (red, bottom axis), specific humidity profiles (blue, top upper axis), and wind speed profiles (green, top lower axis) against height (left vertical axis). Also shown is the surface pressure (asterisk, right vertical axis). The frequency plot (lower right) shows the population density of states from the ARM observations and CCSM results.

References


Contact: Forrest Hoffman (forrest@climatemodeling.org)

This research was sponsored by the U.S. Department of Energy, Office of Science, Office of Biological and Environmental Research (OBER). This research used resources of the National Center for Computational Sciences (NCCS) at Oak Ridge National Laboratory (ORNL), which is managed by UT-Battelle, LLC, for the U.S. Department of Energy under Contract No. DE-AC05-00OR22725.