Menu

Modeling the Air we Breathe

March 25, 2020

reporting by Helen Hill
Researchers at Harvard’s School of Public Health and elsewhere are using computers housed at the MGHPCC to fine-scale air pollution estimation to help guide policymakers.
Over the past several decades the air we breathe has been getting cleaner but as amounts of air pollution fall, understanding and quantifying the health impact of low-level exposure to help guide future regulation becomes more challenging.
Nitrogen dioxide (NO2) is one of a group of highly reactive gases known as the nitrogen oxides. The major source of atmospheric NO2 in the United States is mobile emissions (from cars, trucks, and buses) although power plants and other instances of large-scale fossil fuel combustion are also important. Nitrogen dioxide is recognized as an important air pollutant because it contributes to the formation of photochemical smog, but increased levels of NO2 also can have significant direct health impacts. The main effect of breathing in raised levels of NO2 is the increased likelihood of respiratory problems. Breathing in NO2 inflames the lining of the lungs and has been linked to more frequent and more intense attacks of asthma in sufferers and may also reduce immunity to lung infections.
Qian Di, formerly a doctoral student in the Dominici Group in Harvard’s T.H. Chan School of Public Health, uses environmental modeling to understand the distribution of air pollutants in tandem with analyzing health data to estimate the health effects of air pollution and other environmental factors.
In a new paper, Di and co-authors report on their effort to model, with high spatiotemporal resolution, the nationwide distribution of NO2 over a 16 year period and how their integration of different predictor variables and fitting algorithms can achieve an improved air pollution modeling framework allowing researchers to interpolate NO2 levels away from monitoring stations. The paper was recently published in the journal Environmental Science and Technology.
“When epidemiologists try to analyze the association between human health and air pollution (including NO2), they often encounter the issue of inadequate monitoring data,” Di explains. “That is, many people live in areas without air quality monitoring sites so that we cannot assess the health impact of air pollution on them. Using a machine learning + big data approach to estimate NO2 for the entire country with high spatiotemporal resolution as we did in our study provides that coverage and sufficient exposure data for epidemiologists to analyze the health effect.”
In their paper, Di et al describe using an ensemble model to integrate multiple machine learning algorithms, including neural network, random forest, and gradient boosting, with a variety of predictor variables, including chemical transport models, to assess NO2 levels in a model covering the entire contiguous US with daily predictions on a 1-km-level grid for the period 2000 to 2016. With around 20 TB of input data, Di says their study would not have been possible without access to Harvard University’s Odyssey research computing cluster (housed at the MGHPCC.) No existing study has simultaneously achieved such high spatial and temporal resolution over such a large domain.
“The highly distributed nature of NO2 sources and sinks nationwide results in local hotspots and a highly heterogeneous distribution,” says Di. “NO2 modeling, therefore, needs to capture small-scale variations, and that can be challenging. Other factors like the height of the planetary boundary layer, wind speed, and wind direction all influence concentrations in any location, with substantial variations from day to day, even given similar emissions. Hence accurate modeling must also capture these temporal patterns.”
“The very high spatiotemporal resolution of our NO2 estimate allows for the examination of the health effects of NO2 in unmonitored areas,” Di explains. “While we found the highest NO2 levels along highways and in cities, we also observed that while the NO2 level is dropping nationwide, it is dropping differently in rural and urban areas and that urban areas have higher NO2 levels.”
To their surprise, the team found many economists were interested in our work, too. “They need such air pollution estimation to analyze the impact of environmental policies,” says Di. “They often face similar problems of inadequate monitoring data, and our work provides a solution.”
Qian was previously involved in similar work modeling PM2.5 (particulate matter,) and has another paper on ozone modeling to be submitted very soon.
Story image: courtesy pixnio.com

About the Researcher

Qian Di


Having now completed his doctorate at Harvard’s T.H. Chan School of Public Health, Qian Di is now an Assistant Professor at Tsinghua University’s School of Medicine. He holds a BS in remote sensing and geographic information system from Peking University and an MS in geography from Pennsylvania State University.
 

Publications Mentioned

Qian Di, Heresh Amini, Liuhua ShiItai Kloog, Rachel Silvern, James Kelly, M. Benjamin Sabath, Christine Choirat, Petros Koutrakis, Alexei Lyapustin, Yujie Wang, Loretta J. Mickley, Joel Schwartz (2020), Assessing NO2 Concentration and Model Uncertainty with High Spatiotemporal Resolution across the Contiguous United States Using Ensemble Model Averaging, Environmental Science and Technology,  doi: 10.1021/acs.est.9b03358
Qian Di, Heresh Amini, Liuhua Shi, Itai Kloog, Rachel Silvern, James Kelly, M. Benjamin Sabath, Christine Choirat, Petros Koutrakis, Alexei Lyapustin, Yujie Wang, Loretta J. Mickley, Joel Schwartz (2019), An ensemble-based model of PM2.5 concentration across the contiguous United States with high spatiotemporal resolution, Environmental International, doi: 10.1016/j.envint.2019.104909

Links

Dominici Lab
Health data science: Study strengthens link between air pollution and premature death Harvard T.H. Chan School of Public Health video

Research projects

A Future of Unmanned Aerial Vehicles
Yale Budget Lab
Volcanic Eruptions Impact on Stratospheric Chemistry & Ozone
The Rhode Island Coastal Hazards Analysis, Modeling, and Prediction System
Towards a Whole Brain Cellular Atlas
Tornado Path Detection
The Kempner Institute – Unlocking Intelligence
The Institute for Experiential AI
Taming the Energy Appetite of AI Models
Surface Behavior
Studying Highly Efficient Biological Solar Energy Systems
Software for Unreliable Quantum Computers
Simulating Large Biomolecular Assemblies
SEQer – Sequence Evaluation in Realtime
Revolutionizing Materials Design with Computational Modeling
Remote Sensing of Earth Systems
QuEra at the MGHPCC
Quantum Computing in Renewable Energy Development
Pulling Back the Quantum Curtain on ‘Weyl Fermions’
New Insights on Binary Black Holes
NeuraChip
Network Attached FPGAs in the OCT
Monte Carlo eXtreme (MCX) – a Physically-Accurate Photon Simulator
Modeling Hydrogels and Elastomers
Modeling Breast Cancer Spread
Measuring Neutrino Mass
Investigating Mantle Flow Through Analyses of Earthquake Wave Propagation
Impact of Marine Heatwaves on Coral Diversity
IceCube: Hunting Neutrinos
Genome Forecasting
Global Consequences of Warming-Induced Arctic River Changes
Fuzzing the Linux Kernel
Exact Gravitational Lensing by Rotating Black Holes
Evolution of Viral Infectious Disease
Evaluating Health Benefits of Stricter US Air Quality Standards
Ephemeral Stream Water Contributions to US Drainage Networks
Energy Transport and Ultrafast Spectroscopy Lab
Electron Heating in Kinetic-Alfvén-Wave Turbulence
Discovering Evolution’s Master Switches
Dexterous Robotic Hands
Developing Advanced Materials for a Sustainable Energy Future
Detecting Protein Concentrations in Assays
Denser Environments Cultivate Larger Galaxies
Deciphering Alzheimer’s Disease
Dancing Frog Genomes
Cyber-Physical Communication Network Security
Avoiding Smash Hits
Analyzing the Gut Microbiome
Adaptive Deep Learning Systems Towards Edge Intelligence
Accelerating Rendering Power
ACAS X: A Family of Next-Generation Collision Avoidance Systems
Neurocognition at the Wu Tsai Institute, Yale
Computational Modeling of Biological Systems
Computational Molecular Ecology
Social Capital and Economic Mobility
All Research Projects

Collaborative projects

ALL Collaborative PROJECTS

Outreach & Education Projects

See ALL Scholarships
100 Bigelow Street, Holyoke, MA 01040