source:
flex_extract.git/For_developers/Sphinx/Source/Documentation/Input/jobscript.rst
@
47be2684
Last change on this file since 47be2684 was 6931f61, checked in by anphi <anne.philipp@…>, 4 years ago | |
---|---|
|
|
File size: 5.4 KB |
The job script job.ksh
The job script is a Korn-shell script which will be created at runtime for each flex_extract execution in the application modes remote and gateway.
It is based on the submitscript.template template file stored in the Templates directory. This template is generated in the installation process from a jobscript.template template file.
Flex_extract uses the Python package genshi to generate the Korn-shell script from the template files by substituting the individual parameters. These individual parameters are marked by $$ in jobscript.template.
The job script has a number of settings for the batch system which are fixed, and differentiates between the ecgate and the cca/ccb server system to load the necessary modules for the environment when submitted to the batch queue.
The submission is done by the ECaccess tool from within flex_extract with the command ecaccess-job-submit.
What does the job script do?
- It sets necessary batch system parameters.
- It prepares the job environment at the ECMWF servers by loading the necessary library modules.
- It sets some environment variables for the single session.
- It creates the directory structure in the user's $SCRATCH file system.
- It creates a CONTROL file on the ECMWF servers whith the parameters set before creating the job.ksh. Flex_extract has a set of parameters which are passed to the job script with their default or the user-defined values. It also sets CONTROL as an environment variable.
- Flex_extract is started from within the work directory of the new directory structure by calling the submit.py script. It sets new paths for input and output directories and the recently generated CONTROL file.
- At the end, it checks whether the script has returned an error or not, and emails the log file to the user.
Example job.ksh
#!/bin/ksh # ON ECGB: # start with ecaccess-job-submit -queueName ecgb NAME_OF_THIS_FILE on gateway server # start with sbatch NAME_OF_THIS_FILE directly on machine #SBATCH --workdir=/scratch/ms/at/km4a #SBATCH --qos=normal #SBATCH --job-name=flex_ecmwf #SBATCH --output=flex_ecmwf.%j.out #SBATCH --error=flex_ecmwf.%j.out #SBATCH --mail-type=FAIL #SBATCH --time=12:00:00 ## CRAY specific batch requests ##PBS -N flex_ecmwf ##PBS -q np ##PBS -S /usr/bin/ksh ## -o /scratch/ms/at/km4a/flex_ecmwf.${PBS_JOBID}.out ## job output is in .ecaccess_DO_NOT_REMOVE ##PBS -j oe ##PBS -V ##PBS -l EC_threads_per_task=24 ##PBS -l EC_memory_per_task=32000MB set -x export VERSION=7.1 case ${HOST} in *ecg*) module unload grib_api module unload emos module load python3 module load eccodes module load emos/455-r64 export PATH=${PATH}:${HOME}/flex_extract_v7.1/Source/Python ;; *cca*) module switch PrgEnv-cray PrgEnv-intel module load python3 module load eccodes module load emos/455-r64 export SCRATCH=${TMPDIR} export PATH=${PATH}:${HOME}/flex_extract_v7.1/Source/Python ;; esac cd ${SCRATCH} mkdir -p python$$ cd python$$ export CONTROL=CONTROL cat >${CONTROL}<<EOF accmaxstep 24 acctime 18 acctype FC accuracy 24 addpar None area 74.0/-24.0/10.0/60.0 basetime None cds_api None controlfile CONTROL_CERA cwc 1 dataset None date_chunk 3 debug 1 destination <specificname>@genericSftp doubleelda 0 dpdeta 1 dtime 3 ec_api None ecfsdir ectmp:/${USER}/econdemand/ ecgid at ecstorage 0 ectrans 1 ecuid km4a end_date 20000809 eta 1 etadiff 0 etapar 77 expver 1 format GRIB1 gateway srvx8.img.univie.ac.at gauss 0 gaussian grib2flexpart 0 grid 1.0/1.0 inputdir <path-to-flex_extract>/flex_extract_v7.1/run/workspace install_target None job_chunk 1 job_template job.temp left -24. level 91 levelist 1/to/91 logicals gauss omega omegadiff eta etadiff dpdeta cwc wrf grib2flexpart ecstorage ectrans debug oper request public purefc rrint doubleelda lower 10. mailfail ${USER} mailops ${USER} marsclass EP maxstep 0 number 000 omega 0 omegadiff 0 oper 0 outputdir <path-to-flex_extract>/flex_extract_v7.1/run/workspace prefix CE public 0 purefc 0 queue ecgate request 2 resol 159 right 60. rrint 0 smooth 0 start_date 20000809 step 00 00 00 00 00 00 00 00 stream ENDA time 00 03 06 09 12 15 18 21 type AN AN AN AN AN AN AN AN upper 74. wrf 0 EOF submit.py --controlfile=${CONTROL} --inputdir=./work --outputdir=./work 1> prot 2>&1 if [ $? -eq 0 ] ; then l=0 for muser in `grep -i MAILOPS ${CONTROL}`; do if [ ${l} -gt 0 ] ; then mail -s flex.${HOST}.$$ ${muser} <prot fi l=$((${l}+1)) done else l=0 for muser in `grep -i MAILFAIL ${CONTROL}`; do if [ ${l} -gt 0 ] ; then mail -s "ERROR! flex.${HOST}.$$" ${muser} <prot fi l=$((${l}+1)) done fi
.. toctree:: :hidden: :maxdepth: 2