Downloading Data from the Global Canopy Height Map

Meta and the World Resources Institute have released a global canopy height map with a resolution of 1 meter, marking a significant advancement for forest monitoring and terrestrial biodiversity management. Although it’s impressive to view these data online, many professionals and enthusiasts may wish to work directly with this data on their local systems. Below, we explain how to download these valuable data using the AWS Command Line Interface (AWS CLI).

Step 1: Installing AWS CLI

For Windows users, although the process is similar on other operating systems:

Visit the AWS CLI website to download the appropriate installer for your operating system at AWS CLI.

For Windows, use this direct link to download: AWS CLI for Windows.
Follow the instructions to install the software on your computer.

Step 2: Accessing AWS CLI Resources

Open the Terminal: This can be done by navigating to the folder where you will store the data, right-clicking, and selecting ‘Open in Terminal’. Alternatively, you can access it via the Start menu by searching for ‘cmd’ or ‘command prompt’.

Navigate to the desired directory using the cd command, for example, cd E:\AWS.
Access the resources using the following AWS CLI command (no AWS account required):

aws s3 ls --no-sign-request s3://dataforgood-fb-data/forests/v1/

Step 3: Listing the Available Data

To explore the data available in a specific directory, such as alsgedi_global_v6_float:

aws s3 ls s3://dataforgood-fb-data/forests/v1/alsgedi_global_v6_float/ --no-sign-request | Select-Object -First 10

To access a specific subdirectory such as chm, simply add the path to the previous command:

aws s3 ls s3://dataforgood-fb-data/forests/v1/alsgedi_global_v6_float/chm/ --no-sign-request | Select-Object -First 10

Step 4: Downloading the Data

To download a complete dataset, such as alsgedi_global_v6_float, use the following command, which will start copying the data to your local directory:

aws s3 cp s3://dataforgood-fb-data/forests/v1/alsgedi_global_v6_float/ . --recursive --no-sign-request

It is also crucial to download the tile index file, which allows identification of the exact location corresponding to each downloaded image. This facilitates efficient organization and use of the data in subsequent applications and analyses:

aws s3 cp --no-sign-request s3://dataforgood-fb-data/forests/v1/alsgedi_global_v6_float/tiles.geojson tiles.geojson

Once downloaded, the files are ready to be opened and utilized in any GIS application, such as QGIS or ArcGIS, allowing for advanced manipulation and analysis.

siki

May 2, 2024 at 8:02 am

Thanks for the concise tutorial. But how do I download a specific tile.I have the tile.geojson, but I am not sure how to find the number/code of the tile I am interested in.

Thank you in advance!

Est

June 12, 2024 at 10:38 pm

Hi Siki

You can use the code to download the tiles you want just change the numbers to the tile you want.

aws s3 cp –no-sign-request s3://dataforgood-fb-data/forests/v1/alsgedi_global_v6_float/chm/311003100.tif 311003100.tif

Jam
July 16, 2024 at 11:58 am

Thank you for your detailed teaching, but I still cannot download the data of a single piece, for example: 132123231.tif.
After entering and executing, only “Unknown options: 132123231.tif” appears.
The entered statement is “aws s3 cp –no-sign-request s3://dataforgood-fb-data/forests/v1/alsgedi_global_v6_float/chm/132123231.tif 132123231.tif”
Could you please give me some guidance? Thank you.
1. Dr F
  November 4, 2024 at 9:59 am
  
  this works for me:
  
  aws s3 cp s3://dataforgood-fb-data/forests/v1/alsgedi_global_v6_float/chm/120230011.tif 120230011.tif –no-sign-request
JPM
August 11, 2024 at 6:02 am

Hi, thanks for the useful crib-sheet. I downloaded the tiles.geojson and used QGIS to display the label of the tiles I am interested in over my area of interest but the command still doesn’t work

I get a different error message when I try to use aws s3 cp –no-sign-request s3://dataforgood-fb-data/forests/v1/alsgedi_global_v6_float/chm/tile_number.tif
The user-provided path –no-sign-request does not exist.

The tile_number is the label I see from tiles.geojson.

Please would you advise? Does this mean that I have to sign up to AWS account from now on?