Exercise 2: Processing parameters in requests and processors

Task

  • Create processor script that reads parameters from a velocity template to specify the bands to be combined into an RGB
  • Create a request and generate a false-colour image from B8, B4, B3
  • Inspect output

Material

  • script s2-irg-process
  • parameter velocity template s2-irg-parameters.vm
  • request template s2-irg-request.json
  • see /home/martin/training/parameters/ on ehproduction02

Step 1: Additional processor

There can be several processors in a processor package. Copy the additional script and the velocity template to the processor package from the previous exercise on HDFS, i.e. to

/calvalus/home/<username>/software/gdal-script-1.0

e.g.

/calvalus/home/martin/software/gdal-script-1.0

Additional explanation: At runtime on a compute node, Calvalus selects all files from the processor package that share the processor name prefix and copies them into the working directory. While copying, Calvalus strips off the prefix. s2-irg-process will become just process, s2-irg-parameters.vm will become parameters while in addition inserting parameter values.

# look into the velocity template and into the script to understand what it will do
less /home/martin/training/parameters/s2-irg-parameters.vm
less /home/martin/training/parameters/s2-irg-process
hdfs dfs -put -f /home/martin/training/parameters/s2-irg-parameters.vm /calvalus/home/<username>/software/<processor-package-name>-1.0/
hdfs dfs -put -f /home/martin/training/parameters/s2-irg-process /calvalus/home/<username>/software/<processor-package-name>-1.0/
ls -l /calvalus/home/<username>/software/<processor-package-name>-1.0

Step 2: Request with processor parameters

Write and submit a processing request.

Copy the template into the special-requests directory of your instance.

cd ~/training3-inst
cp /home/martin/training/parameters/s2-irg-request.json special-requests/
# edit special-requests/s2-irg-request.json
{
    "productionType"    : "processing",
    "productionName"    : "",

    "inputPath"         : "/calvalus/eodata/S2_L1C/v5/${yyyy}/${MM}/${dd}/S2.*_T34VFL_.*.zip",
    "dateRanges"        : "[2024-06-03:2024-06-03]",

    "processorName"     : "s2-irg",
    "processorParameters": {
        "parameters": {
            "red": "",
            "green": "",
            "blue": ""
        }
    },

    "outputDir"         : "",

    "queue"             : "general",
    "attempts"          : "1",
    "failurePercent"    : "0",
    "timeout"           : "1200",
    "executableMemory"  : "4096",

    "processorBundles"  : "",
    "calvalus"          : "calvalus-2.26",
    "snap"              : "snap-9.3cv"
}
  • Insert some name of your job in the Hadoop queue into productionName, e.g. Parameter test <username>
  • Insert three band names that have the same resolution in Sentinel-2 MSI L1C into the values of red, green, blue, e.g. B08, B04, B03
  • Insert an output dir starting with /calvalus/home/<username>/ into outputDir, e.g. /calvalus/home/martin/parameter-test
  • Insert the path to your processor bundle into processorBundles, i.e. /calvalus/home/<username>/software/gdal-script-1.0

Step 4: Request submission

Submit your request.

If you have logged in again after the previous exercise

. mytraining3

Then, submit the request with the Calvalus Hadoop Tool cht. You may use -a with cht to submit the requst asynchronously without waiting for it to finish.

cht -a special-requests/s2-irg-request.json

You can monitor status using the job id reported by submission with

cht --status job_xxxxx_yyy

In case of failure you can use the same commands as listed in exercise 1:

yarn application -list -appStates FAILED | grep <username>
yarn logs -applicationId application_<nnnnn>_<mmm> -log_files stderr,stdout | less

Try to find out what is wrong, correct it, and re-submit your request. Please, ask if you do not succeed.

Step 5: Result inspection

Download the result (e.g. with filezilla) and open it in a viewer, e.g. in QGIS. Adjust the colour scale (e.g. to values between 300 and 5000 per band) to make visible the ground.