*
Yubao Liu , Wei Yu, Francois Vandenberghe, Andrea Hahmann, Tom Warner and Scott Swerdlin National Center for Atmospheric Research/RAP, Boulder, Colorado
In collaboration with the Army Test and Evaluation Command (ATEC), NCAR/RAL has been developing a multi-scale, rapid-cycling, Real-Time Four-Dimensional Data Assimilation (FDDA) and forecasting system (RTFDDA). In the last five years, the system has undergone continuous enhancement, and has been successfully deployed at six Army test ranges. It has also been used to support operations of various homeland-security missions, including the anti-terrorism support during SLC2002 and Athens-2004 Olympics, and Joint-Urban 2003 Field Experiment in Oklahoma City, and a number of other government and international applications. Some recent improvements to this system are described in Liu et al. (2002) and Bowers et al. (2003).
Mesoscale (10 -2000 km) meteorological data assimilation and prediction are challenging due to the sparseness of observations, especially in the upper-air atmosphere. In the past 15 years, a number of instruments, e.g. wind profilers, commercial aircraft reports, satellite measurements, Doppler radars, and most recently, TAMDAR (Tropospheric Airborne Meteorological Data Reporting), have been developed to enhance the upper-air observations. Various surface mesonets from public agencies and private companies make a very high density and frequency of surface weather observations available for mesoscale data analysis and forecast. The diverse observations from these conventional and unconventional, synoptic and asynoptic, observation platforms have been incorporated into the RTFDDA model system. In this paper, impacts of different observation platforms are evaluated and compared with numerical studies using the RTFDDA system. Sensitivity experiments are also carried out with sub-groups of observations to study the relative roles of different observations, such as surface data versus upper-air data, temperature versus winds observations, and in-situ observations versus remote-sensing volume measurements.
The RTFDDA is an MM5/WRF-based, multi-scale (grid-sizes varying from 0.5 km to 45 km), rapid cycling,
* Corresponding author address: Yubao Liu, National Center for Atmospheric Research, P. O. Box 3000, Boulder, CO 80307-3000. Phone: (303) 497-8211 Email: yliu@ucar.edu
real-time, four-dimensional data assimilation and forecasting system. By employing continuous data assimilation through the Newtonian-relaxation method, the system produces four-dimensional, dynamically and physically consistent, analyses and short-term (0 -48 hour) weather forecasts. The forecast system can be cycled (forecasts initiated) at time intervals of 1 -12 hours, and the use of continuous data assimilation minimizes the spin-up problem that can be associated with intermittent data assimilation.
Data assimilation and forecasting on the meso-beta and gamma scales face many challenges. Observations are sparse relative to grid sizes, and they are sometimes available irregularly in time and space. Local circulations associated with orography and other surface heterogeneities may experience serious "spin-up" problems if improperly initialized. There are also spin-up problems associated with cloud and precipitation. The shortage of observations and small simulation domains also make it extremely important to properly simulate the synoptic weather processes as well, and their forcing through lateral boundary.
The RTFDDA system was designed to address these challenges. It makes use of two-way nested grids to model multi-scale interactions, from synoptic-scale to meso-gamma-scale circulations. Data assimilation is performed on all domains. Because the sparse observations are not sufficient to produce accurate three-dimensional analyses of the complex circulations, conventional intermittent analysis methods are not applicable. Instead, the so-called "station nudging" method was employed in the RTFDDA system. A basic station nudging procedure was firstly introduced to the MM4(5) system by Stauffer and Seaman (1994). It was adapted to real-time applications in RTFDDA by Cram et al. (2001), and has undergone significant enhancement since then (Liu et al 2002).
The station-nudging approach allows sequential insertion of each observation into a continuously running, full-physics, MM5/WRF model, with proper temporal and spatial weights. This algorithm is especially adequately for assimilating irregular observations such as ACARS and TAMDAR weather reports along flight tracks. As observations are nudged into the model solution, the model state is forced toward the observations. The non-observed model variables adjust under the constraints imposed by the full equations of the model. As time progresses, the model spreads the observation information in three-dimensional space according to the local weather circulations. This four-dimensional, multivariable analysis method allows the model physics to represent the local and synoptic conditions in regions where no observations are available, and corrects the model solution in the regions where observations exist. Experiments show that this four-dimensional data assimilation method is not only capable of properly assimilating information from sparse observation networks, but it is also well behaved when observations are abundant.
The conventional data types assimilated by the system include the twice-daily radiosondes and pibal winds, hourly surface observations at synoptic stations, data from ships and buoys, and data from other special reports disseminated by WMO (World Meteorological Organization). High-frequency measurements from special observation platforms at the Army test ranges, such as 15-minute averages from surface mesonet stations and boundary-layer profilers, are utilized, as are range radiosonde soundings. The four-dimensional data assimilation approach used here makes full use of the above conventional and special observations that are available at both synoptic and asynoptic times. Nevertheless, these data sources still only represent a very limited portion of the atmospheric at the time and space scales of interest. For example, between the 12-hourly synoptic radiosonde observation times, we hardly obtain any upper-air information from these data sources. It is obvious that incorporating more data sources should be one of the major objectives in enhancing this system.
The capability of the RTFDDA system to utilize observations at irregular times and locations gives us the flexibility to explore the usefulness of a few advanced, non-conventional observation platforms that have emerged in recent years. At surface, the high-density and high-frequency observation from a large number of public and private agencies, are collected in the MADIS (Meteorological Assimilation Data Ingest System) data set of the NOAA/FSL. A total of about 18,000 surface weather stations over the CONUS are collected in real-time at present. For upper-air, in collaboration with NOAA/NES-DIS (National Environmental Satellite, Data, and Information Service; Chris Velden and Jaime Daniels), the experimental hourly rapid-scan GOES 8 and 9 (now 10 and 12) winds that are derived from visible, IR and water vapor images, were ingested into the system. Beginning at about the same time, data from the NPN (NOAA Profiler Network) and the CAP (Cooperative Agency Profilers) network, disseminated by NOAA/FSL, were incorporated. Aircraft reports (ACARS/AMDAR -Aircraft Communications Addressing and Reporting System/Air-craft Meteorological Data Relay) processed and disseminated by NOAA/FSL were also successfully tested with the RTFDDA system. Recently, NEXRAD VAD, Radio-metrics Radiometers, NASA/JPL QuikScat sea surface winds, and AirDat LLC TAMDAR data are also incorporated into the RTFDDA system. These new data sources enhance greatly the time and space coverage, and with their high quality and real-time availability the data sources fill the temporal gap between the synoptic radiosonde observations. Fig. 1a summarizes the observations used in the Army White Sand Missile Range (WSMR) RTFDDA system at 00Z 28 July 2003. Fig. 1b

Fig.1 (a) Observations used in the Army’s White Sand Missile Range (WSMR) RTFDDA system at 00Z 28 July 2003. (b). WSMR RTFDDA system domain configuration.
shows the model domain configuration. The sum of the observations from the new upp-air platforms is more than five times the total number of observations from radiosonde reports in Domain 1 (radiosondes have 70 levels, on average). There are significant additions of data to the areas of the fine-mesh grids also. It is worth pointing out that there are more ACARS and TAMDAR reports and satellite winds reports in the daytime than at night. The reported data numbers are only those that passed rigorous quality control checks, and were used in the system during an 80-minute time window.
Measurements from different platforms and sensors possess different qualities, and temporal and spatial densities. The sample volumes can be very different. And a platform may observe only one or two weather variables. Therefore, it is important to investigate how these observations with their particular attributes affect the RTFDDA forecast skill. Numerical experiments were carried out to study the impact of 1) subsets of the observations from different platforms, 2) surface versus upper-air data, and 3) different variables. In this section, results from the experiments with the Army’s WSMR RTFDDA system, running with a single coarse domain (grid size of 30 km) for a 6-day period, will be presented. In all experiments shown below, the RTFDDA system ran with 3-hourly cycling (forecast initiation frequency), with 12 hour forecasts in each cycle. The 6-day model runs are evaluated by verifying the model 10-12 hour forecasts against all hourly surface observations and 12 hour upper-air forecasts against all 00Z and 12Z radiosonde observations. Note that the shorter-duration forecasts, and the analyses, are more accurate (Cram et al. 2001). Finally, we point out that these experiments were conducted a few months ago when only 3-hourly satellite winds (rather than hourly) and a subset of ACARS reports (those from United Airlines only) were available.
Two sets of experiments, with two different weather scenarios respectively, were carried out to study the data impact. The first set is made up of a total of 8 experiments. The second set of the experiments contains 20 experiments with a summer convection case in which severe storms wiping through the northeastern States. We are in process of analyzing model output of the second set. This paper focuses on the first set. The experiments were run with the same version of the system, with the same model physics. Each experiment was run with a subset of the observations. The experiment abbreviations are listed below,
A). CTRL --control experiment, uses all observations,
B). NOobs --no observations are assimilated,
C). SFConly --uses (all types of) surface observations only,
D). PROFSFC --the same as SFConly, but with the addition of wind-profiler data,
E). SATWSFC --the same as SFConly, but with the addition of satellite winds,
F). UPRonly --uses (all types of) upper-air observations only,
G). TQonly --assimilates (both surface and upper-air) temperature and moisture only, and
H). WINDonly --assimilates (both surface and upper-air) wind only.
As will be seen from the model results, the control experiment (CTRL) that made use of the full data set, and the experiment NOobs that used no observations at all, represent the upper and lower bounds of the system performance, respectively.
One interest of the data assimilation and NWP (Numerical Weather Prediction) communities is the impact of surface and upper-air observations on forecast skill. Specifically, it is important to investigate how effectively surface data can correct the model error near the surface, and how these data influence the three-dimensional upper-air forecast. Similarly, we are also interested in how upper-air observations can correct the upper-air model error, and how these data affect the forecast near the surface. The continuous FDDA approach allows the surface layer and the free atmosphere to interact through the PBL dynamics. Thus, surface-observation information can propagate into the upper-atmosphere, and vice versa.
Four experiments, CTRL, SFConly, UPRonly and NOobs are compared here. The verification results (Figs. 2 and 3) are very interesting. First of all, as one could anticipate, by fully assimilating all observations, the CTRL runs produced the best results in terms of both upper-air and surface forecast verification statistics. Conversely, the runs without assimilation of any observations, NOobs, generated the largest forecast errors. Assimilating surface observations only, SFConly, dramatically improves the quality of the surface analyses and forecasts. As compared with NOobs, SFConly reduces the moisture RMSE error of the 10-12 h surface forecasts by 20-35% and temperature by ~0.5-1C (about 20-30%). The surface winds are minimally affected. Similarly, UPRonly was able to improve the upper-air model wind, temperature and moisture forecasts. The RMSE (Root Mean-Squared is little effect on the surface winds. Unlike in the upper-air, the TQonly runs lead to much larger corrections of the surface temperature and moisture than those seen in the WINDonly runs.



It is worth pointing out that, aside from the 12-hourly radiosonde observations, the upper-air observations in the experiment are mostly dominated by wind observations (from wind profilers and satellite winds). Thus, the different effects of assimilating temperature, moisture and winds individually, seen in the current study, may result from the different data densities (dataset size). Nevertheless, it is interesting to point out that the larger effect of assimilating winds, relative to temperature, with our station-nudging approach agrees with many earlier studies that used the grid-nudging method.
The station-nudging approach of the NCAR/ATEC Real-time FDDA and forecast system (RTFDDA) provides a flexible method for incorporating all observations taken at irregular times and spatial locations. New unconventional data sources were introduced to the system, and many model features were refined to improve the RTFDDA analyses and forecasts. A set of eight comparison experiments were conducted to investigate the RTFDDA model response to the assimilation of the observations from satellite wind measurements and wind-profiler networks, from upper-air and surface platforms, and from temperature and moisture observations and wind observations. It can be concluded that, 1) Adding new data sources will always reduce the error, though to varying extents, no matter which variable is provided; 2) observations that are well distributed temporally and spatially appear to perform better than those clustered in time and space; 3) upper-air wind observations are most important and effective in driving the model toward the correct evolution (This indicates that data sources such as NESDIS satellite winds, FSL wind profilers and the FSL ACARS aircraft observations are especially valuable); and 4) surface observations (of temperature, moisture and winds) are critical to maintaining the proper evolution and accuracy of the model surface states and the physical structure of the lower troposphere; and lastly and interestingly, the simulations that employ the complete set of observations tend to produce the best analyses and forecasts.
Finally, we noted that, although the above conclusions appear to be robust for this case, it is important to investigate how the value of the each platform changes with synoptic weather and geographic locations by conducting more numerical experiments. The second group experiment mentioned earlier is one effort aiming at this. Also more complete data sources were tested in the second set. The result from this set of experiments will be reported at the conference and future publications.
The authors would like to thank Wayne Bresky and Jaime Daniels for providing with the GOES satellite winds, Bill Moninger with ACARS data, Patty Miller and Michael Barth with MADIS data, Vicki Peck and Gene Legg with NASA QuikScat data, and Mark Anderson and his team with TAMDAR data.
Bowers, J., S. Swerdlin, T. Warner, Y. Liu, L. Carson, A. Bourgeois,
H. Fisher, D. Hahn, S. Low-Nam, D. Rife, T. Sandblom, B. Sharman, R. Sheu, F. Vandenberghe and M. Xu, 2003: Enhancement to and new application of, the ATEC data assimilation and forecast system. BACIMO-2003 conf., Monterey, CA.
Cram, J.M., Y. Liu, S. Low-Nam, R-S. Sheu, L. Carson, C.A. Davis, T. Warner, J.F. Bowers, 2001: An operational mesoscale RT-FDDA analysis and forecasting system. Preprints 18th WAF and 14th NWP Confs., Ft. Lauderdale, AMS, Boston, MA.
Liu, Y., S. Low-Nam, R. Sheu, L. Carson, C. Davis, T. Warner, S. Swerdlin, J. Bowers, M. Xu, H.-M. Hsu and D. Rife, 2002: Performance and enhancement of the NCAR/ATEC mesoscale FDDA and forecast system. 15th Conference on Numerical Weather Prediction , San Antonio, Taxes, 399-402. 12-16 August, 2002.
Stauffer, D.R., and N. L. Seaman, 1994: Multiscale four-dimensional data assimilation. J. Appl. Meteor., 33, 416-434.