• Simulating urban air quality over the Clermont-Ferrand agglomeration in France using a pre-constructed chain of meta-models.
• MCMC simulations using the air quality meta-modeling chain with observation data on traffic flow and pollutant concentrations, to estimate the distribution of the input parameters. Various simulations will be necessary to calibrate the appropriate initial parameter distributions and the maximum likelihood estimator.
• Uncertainty quantification method coded in Python which will be transferable to other computational domains given appropriate data and pre-constructed meta-model.
• Maps and graphics representing urban air quality and uncertain distributions of model parameters.
• Results to be presented at scientific conferences.
{Empty}
{Empty}
{Empty}
Can work with any level
{Empty}
Stonehill College
{Empty}
NE-MGHPCC
05/10/2021
No
Already behind5Start date is flexible
{Empty}
{Empty}
{Empty}
{Empty}
{Empty}
{Empty}
{Empty}
The results of this project will lead to an article on Uncertainty Quantification for Urban Air Quality Modeling, which we plan to submit to a leading journal in the field. I expect that a significant portion of the work towards this publication has already been done, but need access to computational resources to continue exploring numerical results. An article has recently been published [cite] on the meta-modeling chain used in this study.
In addition to experience helping researchers transition from individual computing resources to off-site cluster resources, the student facilitator will have the opportunity to learn about the computational requirements and methods used in this project. This includes model order reduction, which remains a topic of interest despite increases in computational capacity, data assimilation for the calibration and improvement of models, and uncertainty quantification by Markov chain Monte Carlo methods. This project also involves international collaboration with the French company NUMTECH, a leader in environmental modeling and data science.
{Empty}
{Empty}
My goal is to perform MCMC simulations for the input parameters of a month-long hourly air quality meta-model simulation to study the uncertainty in these parameters. In practice MCMC simulations are generally done with at least 100,000 iterations, where the nth simulation depends on the previous, precluding the possibility of pseudo-parallel computations by dividing the iterations among multiple workstation machines. Each iteration calls a chain of meta-models, for which the initialization requires approximately 20 minutes and should thus be done only once at the initialization of the MCMC loop. Each iteration of the MCMC chain should compute one month of hourly meta-model simulations, requiring up to approximately 42GB of RAM. The MCMC iterations produce outputs of shape (100000,2520,469), and without parallelization, each iteration requires 2520 meta-model computations totaling 191.5 seconds. Adjustments to the meta-model could reduce the RAM requirements, but would simultaneously reduce the precision of the reduced-order approximations. The MCMC outputs could be written at each iteration (as opposed to holding the entire array in working memory) with correct method compatible with parallel computing.
{Empty}