CMAQ version 5.3beta UNC3 (March 2019 release) Technical Documentation

From CMASWIKI
Jump to: navigation, search

Release Notes for CMAQv5.3beta_UNC3 - March 2019

The Community Multiscale Air Quality (CMAQ) modeling system Version 5.3 (CMAQv5.3) is scheduled for release in July 2019.
To facilitate community contributions to this model release, a “Beta_UNC3” version was created and made available to external collaborators/developers in March 2019. This Beta_UNC3-version will undergo extensive application and evaluation over the next few months, which may lead to additional changes before the final release.

The CMAQv5.3beta_UNC3 repository is available from https://github.com/bnmurphy/CMAQ/tree/UNC3.

Complete documentation for CMAQv5.3beta, including build instructions and technical features are available from the CMAQv5.3beta GitHub page https://github.com/USEPA/CMAQ/tree/5.3.b2.

Release Testing

The CMAS Center will test the CMAQv5.3beta_UNC3 release package with several versions and configurations of the Portland Group, Intel, and GNU Fortran compilers. In addition to different serial and parallel configurations, the release was tested under different science configurations. The results of the CMAS Center tests will include run times for different compilers and configurations are provided below. All tests were conducted with the U.S. EPA SE52 12km domain July 11, 2011 test dataset. This dataset is distributed with CMAQv5.3beta to use for benchmarking the model installation and includes new EPIC files.

Two classes of tests:

  • Compiler tests used the default benchmark configuration with different compilers and MPI configurations.
  • Configuration tests used the gcc or intel vXX OpenMPI compiler to generate executables that exercise different scientific configurations of the release software.

Compiler flags:

  • GCC: mpif90 -c -ffixed-form -ffixed-line-length-132 -funroll-loops -finit-character=32 -O3 -funroll-loops -finit-character=32 -Wtabs -Wsurprising
  • Intel: mpif90 -c -fixed -132 -O3 -fno-alias -mp1 -fp-model source

Note: this website contains explanation of why timing varies depending on the mpi flavor that is being used, and suggests flags to pin or bind tasks to a core set of cpus,and also options to use the infiniband. https://www.chpc.utah.edu/documentation/software/mpilibraries.php http://www.nersc.gov/users/computational-systems/cori/running-jobs/example-batch-scripts/

Specs for Dogwood machine: https://help.unc.edu/files/2018/03/dogwood.cheatsheet.pdf
QA plots are generated on longleaf using the interactive queue

R script for comparing model output results from different compilers

  • module load r/3.4.3
  • setenv R_LIBS /nas/longleaf/home/lizadams/R/x86_64-pc-linux-gnu-library/3.4
  • cd /proj/ie/proj/CMAS/CMAQ/CMAQv5.3_branch_UNC3/CMAQ_QA

If you are on the login node, your prompt will contain the login node id:

  • [lizadams@longleaf-login2 ~]

Start an interactive queue (don't run on the login node), your environment variables and R module will be set up

  • srun -p interact --pty /bin/csh

Your login prompt will be something like

  • [lizadams@off03 CMAQ_QA]$

Run the R script that compares model output

  • R CMD BATCH plot_max_diff_compiler_sens_v5.3beta.R

Plotting the difference in values between the base and sensitivity cases at the column, row, layer of the maximum difference between the variables. | Compilation Testing Plots

Table 1. CMAQv5.3beta compilation testing manifest on UNC Dogwood Cluster
Scenario Compiler netCDF I/O API MPI_YN (#P) Module Timing(1 Lay, 12 Var) Timing(35 Lay, 12 Var) Timing (35 Lay, 215 Var) Notes
Gfortran OpenMPI Gfortran version 6.3 4.6.2 3.2 Y (64) openmpi_3.0.0/gcc_6.3.0 2087.7 2346.2 time mpirun --oversubscribe -np $NPROCS, using srun didn't work
Gfortran MPICH Gfortran version 4.8.1 4.6.0 3.2 Y (64) mpich/3.2.1_1 61776.8 Linuxbrew gcc (it is performing like it is running on 1 processor..)
Intel OpenMPI Intel Fortran version 17.2 4.1.3 3.2 Y (64) openmpi_3.0.0/intel_17.2 373.8 534.5
Intel OpenMPI Intel Fortran version 17.2 4.1.3 3.2 Y (128) openmpi_3.0.0/intel_17.2 242.1 375
Intel OpenMPI Intel Fortran version 17.2 4.1.3 3.2 Y (256) openmpi_3.0.0/intel_17.2 184.4 211.1
Intel MVAPICH2 Intel Fortran version 17.2 4.1.3 3.2 Y (64) mvapich2_2.3rc1/intel_17.2 361.50 setenv MV2_ENABLE_AFFINITY 0 see:https://github.com/USEPA/CMAQ/issues/118
Intel MVAPICH2 Intel Fortran version 17.2 4.1.3 3.2 Y (128) mvapich2_2.3rc1/intel_17.2 247.3 setenv MV2_ENABLE_AFFINITY 0 see:https://github.com/USEPA/CMAQ/issues/118
Intel MVAPICH2 Intel Fortran version 17.2 4.1.3 3.2 Y (256) mvapich2_2.3rc1/intel_17.2 216.5 339.2 setenv MV2_ENABLE_AFFINITY 0 see:https://github.com/USEPA/CMAQ/issues/118
Table 2. CMAQv5.3beta configuration testing manifest
Scenario Description Mechanism Notes Timing (64 PE)
Benchmark Case Online emissions processing, inline photolysis, inline lightning using hourly NLDN strike file, windblown dust, surface HONO, bidirectional NH3, no BIDI HG, no potential vorticity scaling cb6r3_ae7_aq Pending; LTNGNO InLine, LTNGPARM = N, LOG_START = 2.0
Halogen Chemistry Same as Benchmark case with halogen chemistry enabled cb6r3_ae7_aq Pending. Turned off the diagnostic file logging.
No Bidi Same as Benchmark with Hg and NH3 BiDi deactivated cb6r3_ae7_aq Pending. set CTM_HGBIDI = N; set ABFLUX = N
Process Analysis Benchmark case with IPR and IRR ros3 Pending. Switch to Rosenbrock solver because EBI solver not supported by PA module; ran with inline process analysis
MOSAIC Benchmark case with MOSAIC and additional stomatal flux files activated cb6r3_ae7_aq Pending. set CTM_MOSAIC = Y; set CTM_FST = Y
New Mechanism Test Benchmark case with toluene and chlorine chemistry cb05tucl_ae6 Pending.
Potential vorticity UTLS exchange Benchmark case with scaling for STE cb6r3_ae7_aq Pending: Uncomment potvortO3 in build. Need PV variable in METCRO3D file
Dust Benchmark case with dust, including new MODIS FP input cb6r3_ae7_aq Pending. setenv CTM_WB_DUST Y; setenv CTM_ERODE_AGLAND Y; setenv CTM_WBDUST_BELD BELD3
Hourly NLDN Benchmark with lightning NOx calculated using hourly bNLDN strikes cb6r3_ae7_aq Pending; LTNGNO InLine, LTNGPARM = Y, USE_NLDN Y
POA Sensitivity Benchmark with new POA mechanism cb6r3_ae7_aq #set potvortO3 #> potential vorticity from the free troposphere