CRPAQS Surface and Aloft Meteorological RepresentativenessCalifornia Regional PM10/PM2.5 Air Quality Study
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Contact: Charley Knoderer
|
Print Version
Methodology
· Data Selection/Data Acquisition · Parameter-Weighted Distance Calculation · Representativeness Index Calculation Introduction This page details the methodology used to create the representativeness maps. Figure 1 is a schematic diagram of the process.
Figure 1. Process flow diagram for representativeness calculation Data Selection/Data Acquisition Surface Meteorology Wind speed and direction data were queried and downloaded from the Central California Air Quality Studies (CCAQS) database. All hourly and hourly averaged data from 2-m and 10-m towers within the domain with valid QC flags (V2 – valid estimated value, VH – valid hourly data, V0 – valid value) were selected and imported into a local database. For the few areas with both 10-m and 2-m data, only the 10-m data were used. Site locations are shown in Figure 2.
Figure 2. 2-m and 10-m meteorological tower sites used in the analysis
Upper-air Meteorology Aloft wind data were downloaded from the CCAQS database. All hourly averaged data from radar wind profilers within the domain with valid QC flags were selected and imported into a database. Figure 3 shows the locations of these profilers. These data were grouped into 50-m bins.
Figure 3. Wind profiler locations Elevation Elevation data were compiled for the domain from USGS digital elevation maps. Surface Data Surface wind data were grouped and averaged by season and
time of day. For this study, we focused
on winter, defined as November through February. Three time-of-day averages were created, morning
( Upper-air Data The binned upper-air wind data were averaged using the same scheme as the surface data. This produced diurnal vertical profiles. Time-height cross sections of these profiles, such as that shown in Figure 4, were created for each site. Layers were subjectively determined by examining the time-height cross-sections. Generally, three layers were identified: 1. Boundary layer winds, characterized by diurnal fluctuations in wind speed and direction. 2. Mid-boundary layer winds, characterized by a less dramatic diurnal shift in wind speed and direction. 3. Synoptic layer winds, characterized by a more uniform wind speed and direction typical of synoptic scale high and low pressure systems.
Figure 4. Time-height cross-section of Elevation Data USGS digital elevation maps were available at 30-m resolution. Because this resolution was much higher than necessary for this analysis, we reduced the resolution to 300 m. An analysis grid was defined over the entire domain shown in Figure 5, with a nominal grid cell size of 2 km2. All spatial analysis was performed using ESRI’s ArcGIS® software.
Figure 5. Domain used in the analysis (shown in blue) Parameter-Weighted
Distance Calculation The likelihood that a particular grid cell a will have meteorology data similar to data from an observation site at grid cell b is dependant on the data parameter values in the grid cells between a and b. In our analysis the parameters were wind speed, elevation, elevation slope, and wind direction difference angle. The processing described below resulted in a grid of classified values for each parameter that ranged from 1 to 10. These grids were combined to create a single factor at each grid cell. We call this the overall factor grid. This factor represents the “cost” in terms of representativeness of traveling across the cell, relative to other cells. With the overall factor grid, we calculated the parameter-weighted distance (PWD) from each grid cell to the “nearest” meteorology observation site. This calculation also determines the site allocation boundaries. The PWD is determined not by geographic distance but by calculating the cumulative representativeness cost to traverse from the grid cell to the observation site along the “least cost” path. This is not necessarily a straight line. A small PWD at a grid cell means that the wind speed and direction are more likely to be similar to the allocated observation data than a cell with a larger PWD. Surface and Layer 1 For the Surface and Layer 1 data, three factor grids (classified slope, wind speed, and wind direction) were combined to create the overall factor grid, F.
Each of the three factors were weighted differently for each layer and averaging period. The weights, a, were determined using a weight-tuning process described below.
Layer 2 and Layer 3 For these layers, the overall factor grid consisted of the classified elevation grid only. Parameter Preparation The PWD we used required a combination of different parameters together to create a single overall factor at each grid cell. In order to combine our parameters, we gridded the data using the analysis grid and reclassified all values to integers from 1 to 10. The four parameters we used were elevation slope, wind speed, wind direction difference angle, and elevation. Elevation Slope Slope calculations were performed on the 300-m elevation
maps using ArcView Spatial Analyst. The
resulting slopes were then reclassified to a scale of 1 to 10 using “natural
breaks,” an algorithm that groups
values on the basis of least variance within class and maximum variance between
the classes. This method divides the
data into natural clusters of values.
The resulting classified slope grid is shown in Figure 6.
Figure 6. Elevation Slope Classes Wind Speed Each cell in the analysis grid was assigned the average wind speed of the nearest measurement site. These values were then reclassified based on a scale of 1 to 10. The highest wind speed was given a value of 1 and the lowest a value of 10. All values in between were divided into equal-sized subranges. The resulting classified wind speed grid is shown in Figure 7.
Figure 7. Classified wind speed grid for the surface
data winter afternoon averaging period
Wind Direction
Difference Angle For each cell in the analysis grid we calculated the smallest angle between the average wind direction at the nearest measurement site and the direction to that site (see Figure 8). We called this the wind direction difference angle. Note that this angle ranges from 0 – 90°.
Figure 8. Wind direction difference angle These values were then reclassified. Ultimately, grid cells along the direction of the wind have a shorter PWD than those perpendicular to the average wind. This is true especially in canyons, where the wind flow is restricted to channels. Figure 9 shows an example of a classified wind direction difference angle grid.
Figure 9. Classified wind direction difference angle for surface data winter afternoon averaging period Elevation For the upper two layers (Layer 2 and Layer 3), elevation was used as a parameter in place of slope. The elevation values were classified as shown in the table.
For each time of day for the surface and first layer, the parameters were tuned so that station pairs with similar wind data had smaller PWDs than station pairs with relatively dissimilar data. The optimum parameter weights, a, were calculated programmatically. The program cycled through many different parameter-weight combinations and calculated PWDs. For each station location, the PWD to the nearest station was calculated, along with the differences in wind speed and wind direction. Example program output is shown in the table below.
With this information, the best parameter-weight combination was found by doing the following for each combination: 1. Find the PWD value that encompasses 33 % of the site pairs. This is defined as the cutoff value, CV. 2. Define the error weight (EW) by the function:
3. Calculate the total error as the sum of Diff*EW for all data pairs, where Diff is the difference between the two measurements. Do this for both wind speed and wind direction. The total error for each parameter-weight combination was then ranked from lowest to highest for both wind speed and wind direction. The combination that produced the lowest total sum of the two ranks was chosen as the best fit. Representativeness Index Calculation Once the optimum parameter weights were found, the final PWD was calculated for each grid cell. Each grid cell was allocated to the “nearest” tower site in terms of PWD. This site was not necessarily the nearest in terms of geographic distance. The grid cells were finally assigned an Index of Representativeness, which was determined by using the data collected from the parameter-weight tuning program. Site pairs were ordered by increasing PWD. Then we determined if the site pairs had “similar” meteorological data over the averaging period, where similar was defined as average wind speeds within 0.5 m/s and average wind direction within 90° of each other. The cumulative fraction of “similar” site pairs was then plotted as a function of PWD. Figure 10 shows that as the PWD increases, sites are less likely to be similar to each other. We used this plot to set the boundaries of the Representativeness Indices. All grid cells with PWD less than the value at which 50% of site pairs were similar were given a Representativeness Index (RI) of 1–“most representative.” Grid cells with PWD greater than the first bin and less than the value at which 35% of site pairs were similar were given an RI of 2–“less representative.” All grid cells with a greater PWD were assigned an RI of 3–“not representative.”
Figure 10. Representativeness plot for surface data winter morning averaging period |
|||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Prepared by
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||