Global DEM and Error Propagation

Table of Contents

  1. Objective
  2. Software and Documentation
  3. NASA Data Portal
  4. SAGA Analysis
    1. Mosaic
    2. Hillshade
    3. Sink drainage and removal
    4. Flow accumulation
    5. Channel Network
  5. SAGA automation with Batch Script
  6. Comparing ASTER and SRTM  

 

1. Objective

This lab has two learning goals: 1) to become familiar with the open source terrain analysis software SAGA and 2) to learn to examine and compare raster datasets with a critical lens. SAGA is an opensource terrain analysis software has been around since 2004, and it is ideal for physical geography analysis using raster data. I will use two datasets, ASTER and SRTM, and through the analysis I will identify the dataset that is better suited for the study region.  


 

2. Software and Documentation

2.1 Software used

2.2 Documentation Referenced


 

3. NASA Earthdata Portal

The NASA Earthdata Portal is a great resource for downloading high resolution digital elevation models from anywhere in the world. It is free to use, but require an account. For this lab, I will be examining Mount Kilimanjaro in Tanzania using two datasets: Shuttle Radar Topography Mission (SRTM) and the Advanced Spaceborne Thermal Emission and Reflection Radiometer (ASTER). SRTM data was obtained in the Space Shuttle missions, where during orbit, two radar sensors were placed at a significant distance apart to measure the elevation of the terrain from parallax. ASTER, a joint US and Japanese venture uses a satellite to capture stereoscopic visible and infrared images. For both datasets, I will be using rasters at a resolution of 1 arcsecond, i.e. 30 meters per pixel width, which should provide enough definition. In the later sections, I will discuss the strengths and weaknesses of the datasets for our study region of Mount Kilimanjaro.  


 

4. SAGA Analysis

An important note: Be sure to save the file often, as SAGA can crash unexpectedly.

4.1 Mosaic

Let us import the raster layers to SAGA. If there are multiple rasters covering the area of interest, as is the case with mine, there will be an obvious discontinuity between the rasters when opened in the same map. Fortunately, this is (probably) not due to the dataset, but the representation of the data, as colors are assigned based on the relative highs and lows of each raster. To fix this, I will make a mosaic of the rasters Tools -> Mosaicking. Choose the bilinear interpolation for the sampling method, since this is a quantitative raster. I will chose the appropriate bounding box coordinates to crop my final mosaic. Lastly, I will reproject the layer to the correct UTM zone Tools -> Projection -> Proj.4 -> UTM Projection (Grid).

Before Mosaic
After Mosaic
 

4.2 Hillshade

Hillshading is a great place to start, as it facilitates the visualisation of our data. Tools -> Terrain Analysis -> Lighting, Visibility -> Analytical Hillshading. Parameters should be modified as desired. One thing to note is that the default position of the sun is often at an angle that is physically impossible i.e. rays from the north in the northern hemisphere. Especially in fields such as cartography, it is important to make a conscious decision about the placement of the sun, whether to favor realism or legibility (the default setting looks natural to right-handed individuals who illuminate their desk from the upper left position).

 

4.3 Sink drainage and removal

Let us perform a preprocessing step to prepare our raster for hydrological analysis. First, I run the sink drainage route tool to detect sinks based on unusual water flow Tools -> Terrain Analysis -> Preprocessing -> Sink Drainage Route. Sinks are usually due to noise in the dataset that causes bumps in the terrain:

Sink drainage output
Zoomed in

To get accurate water flow route and channels, these sinks must be removed using the sink removal tool Tools -> Terrain Analysis -> Preprocessing -> Sink Removal. The output is identical to the original DEM to the naked eye.

4.4 Flow accumulation

Next, I will run the flow accumulation function Tools -> Terrain Analysis -> Hydrology -> Flow Accumulation (Top-Down). This tool maps out, for each cell, how many cells contribute to its water flow. This is helpful in detecting where streams start and how they build up.

Flow Accumulation Output
Zoom in
 

4.5 Channel Network

Finally, a useful tool is the channel network tool that creates both a raster and vector layer of water channels. They can be exported to be used in other GIS software. Tools -> Terrain Analysis -> Channels -> Chanel Network

Channel network output
Zoom in
   

 

5. SAGA automation with Batch Script

SAGA analysis does not have to be done on the graphic user interface. Instead, there is a command-line tool, so one could easily write up a batch script to automate the analysis I did in the above section. This requires a little getting-used-to but it is exceedingly straightforward.

Firstly, I will set all appropriate directories so that the command module can find the SAGA commands, input files, and output directory. Then, I include the tools to be executed. All the SAGA tools in command line format is available on this website. Alternatively, right clicking on any tool in SAGA will reveal a copy as command-line option that is usually more convenient. Finally, I reference the website and include all appropriate inputs and outputs for each command, save and run!

Useful trick

Using variables with the set function can minimize mistakes and make the batch file easily adaptable. For example, the input and output directories can be defined using a variable, so that if these directories should change, it is only necessary to modify it once: no need to go through each command to change them.

The files I used are available for download here:


 

6. Comparing ASTER and SRTM

Now that we have successfully automated the process, we can easily run a different dataset. I ran both the SRTM and ASTER rasters through the batch file. However, before doing that, I had to mosaic and reproject the rasters. The batch file for this process can be downloaded here.

Once I ran all of the SAGA analysis, I first took the difference of the rasters using Grid -> Calculus -> Grid Difference. This was done to show where the two rasters disagreed:

Two places stick out immedeately. The first is near the peak of Mount Kilimanjaro. The second is the lake.

Zoom in to Mt. Kilimanjaro
Zoom in to the lakes

The difference around the lakes is plain and simple: it signifies the difference in calibration. However, the peak is curious. Let us take a close look at the hillshade around this region. I visualized the following in QGIS:

Clearly, there is something going on in the SRTM layer. It appears as though huge chunks of data are missing, and filled in with interpolated results. Let us look at the num file for this region. Num files are metadata raster files that signify the provenance of the data. Indeed, the problematic area was derived from another source: GMTED2010 7.5 arcsecond. Compared to the 1 arcsecond SRTM data, the resolution is 8 times worse which explains the lack of detail. Perhaps the SRTM could not capture data here because of perpetual clouds over the valleys, but this is only speculation.

When we look at the channel network in the mountain peak region, both the SRTM and ASTER reveal errors. It seems like there is something about high elevation that makes data capture difficult. However, SRTM stands out as being much more affected (blue line)

Looking at the hillshade revealed another problematic area in the SRTM. There appears to be unnatural striations in the data. This is not present in the ASTER dataset. I presume this has something to do with error caused while the shuttle was orbiting and scanning the region.

The striations were severe enough to affect the channel simulations in this region.

Finally, I noticed channel network gets confused in extremely flat areas. This might not have to do so much with the accuracy of the DEM; small differences in elevation will have significant effects in how the channel networks are drawn. Examine this close-up of a farm and a lake. The channel networks do badly in the flat farmland, and go haywire on the lake.

 

 

 

7. Which is better?

It is quite clear from these comparisons that ASTER data is a much more reliable digital elevation model compared to the SRTM, at least in the region around Mount Kilimanjaro.