Smog disaster forecasting using social web data and physical sensor data

Jiaoyan Chen, Huajun Chen, Daning Hu, Jeff Z. Pan, Yalin Zhou

Research output: Chapter in Book/Report/Conference proceedingConference contribution

5 Citations (Scopus)

Abstract

Smog disaster is a type of air pollution event that negatively affects people's life and health. Forecasting smog disasters may largely reduce potential loss that they may cause. However, it is a great challenge since smog disasters are often caused by many complex factors. With the availability of huge amounts of data from the social web and physical sensors, covering information of air quality, meteorology, social event, human mobility, people's opinion, etc., it becomes possible to utilize such big data to forecast smog disasters. Especially, we can investigate the effect of social activities in smog disaster forecasting with the help of social web, which is ignored in traditional studies. In this paper, we propose a big data approach named B-Smog for smog disaster forecasting. It mainly has two components: 1) features extraction from multiple data sources to model the factors that indicate the appearance or disappearance of a smog disaster like traffic condition, human mobility, weather condition and air pollution transportation; 2) learning and predicting with heterogeneous features in multiple views. For the second component, we propose a prediction model based on an ensemble learning framework and artificial neural networks (ANNs), which achieves high accuracy in this application and can also be applied to other similar problems. We present the effectiveness of B-Smog through two cases studies in Beijing and Shanghai, and evaluate the accuracy of the prediction model through comparing it with some baselines. Moreover, the empirical findings of our study can also support decision making in smog disaster management.

Original languageEnglish
Title of host publication2015 IEEE International Conference on Big Data, IEEE Big Data 2015
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages991-998
Number of pages8
ISBN (Electronic)9781479999255
DOIs
Publication statusPublished - 22 Dec 2015
Event3rd IEEE International Conference on Big Data, IEEE Big Data 2015 - Santa Clara, United States
Duration: 29 Oct 20151 Nov 2015

Conference

Conference3rd IEEE International Conference on Big Data, IEEE Big Data 2015
CountryUnited States
CitySanta Clara
Period29/10/151/11/15

Fingerprint

Disasters
Sensors
Air pollution
Meteorology
Air quality
Feature extraction
Decision making
Health
Availability
Neural networks

Keywords

  • Big Data
  • Ensemble Learning
  • Feature Extraction
  • Smog Disaster Forecasting
  • Social Media

ASJC Scopus subject areas

  • Computer Networks and Communications
  • Computer Science Applications
  • Information Systems
  • Software

Cite this

Chen, J., Chen, H., Hu, D., Pan, J. Z., & Zhou, Y. (2015). Smog disaster forecasting using social web data and physical sensor data. In 2015 IEEE International Conference on Big Data, IEEE Big Data 2015 (pp. 991-998). [7363850] Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/BigData.2015.7363850

Smog disaster forecasting using social web data and physical sensor data. / Chen, Jiaoyan; Chen, Huajun; Hu, Daning; Pan, Jeff Z.; Zhou, Yalin.

2015 IEEE International Conference on Big Data, IEEE Big Data 2015. Institute of Electrical and Electronics Engineers Inc., 2015. p. 991-998 7363850.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Chen, J, Chen, H, Hu, D, Pan, JZ & Zhou, Y 2015, Smog disaster forecasting using social web data and physical sensor data. in 2015 IEEE International Conference on Big Data, IEEE Big Data 2015., 7363850, Institute of Electrical and Electronics Engineers Inc., pp. 991-998, 3rd IEEE International Conference on Big Data, IEEE Big Data 2015, Santa Clara, United States, 29/10/15. https://doi.org/10.1109/BigData.2015.7363850
Chen J, Chen H, Hu D, Pan JZ, Zhou Y. Smog disaster forecasting using social web data and physical sensor data. In 2015 IEEE International Conference on Big Data, IEEE Big Data 2015. Institute of Electrical and Electronics Engineers Inc. 2015. p. 991-998. 7363850 https://doi.org/10.1109/BigData.2015.7363850
Chen, Jiaoyan ; Chen, Huajun ; Hu, Daning ; Pan, Jeff Z. ; Zhou, Yalin. / Smog disaster forecasting using social web data and physical sensor data. 2015 IEEE International Conference on Big Data, IEEE Big Data 2015. Institute of Electrical and Electronics Engineers Inc., 2015. pp. 991-998
@inproceedings{cdca4ca2bb6449158ebff2f9a77392ce,
title = "Smog disaster forecasting using social web data and physical sensor data",
abstract = "Smog disaster is a type of air pollution event that negatively affects people's life and health. Forecasting smog disasters may largely reduce potential loss that they may cause. However, it is a great challenge since smog disasters are often caused by many complex factors. With the availability of huge amounts of data from the social web and physical sensors, covering information of air quality, meteorology, social event, human mobility, people's opinion, etc., it becomes possible to utilize such big data to forecast smog disasters. Especially, we can investigate the effect of social activities in smog disaster forecasting with the help of social web, which is ignored in traditional studies. In this paper, we propose a big data approach named B-Smog for smog disaster forecasting. It mainly has two components: 1) features extraction from multiple data sources to model the factors that indicate the appearance or disappearance of a smog disaster like traffic condition, human mobility, weather condition and air pollution transportation; 2) learning and predicting with heterogeneous features in multiple views. For the second component, we propose a prediction model based on an ensemble learning framework and artificial neural networks (ANNs), which achieves high accuracy in this application and can also be applied to other similar problems. We present the effectiveness of B-Smog through two cases studies in Beijing and Shanghai, and evaluate the accuracy of the prediction model through comparing it with some baselines. Moreover, the empirical findings of our study can also support decision making in smog disaster management.",
keywords = "Big Data, Ensemble Learning, Feature Extraction, Smog Disaster Forecasting, Social Media",
author = "Jiaoyan Chen and Huajun Chen and Daning Hu and Pan, {Jeff Z.} and Yalin Zhou",
note = "This work is funded by NSFC 61473260, national key S&T Special Projects 2015ZX03003012-004 and Y-B2013120143 of Huawei.",
year = "2015",
month = "12",
day = "22",
doi = "10.1109/BigData.2015.7363850",
language = "English",
pages = "991--998",
booktitle = "2015 IEEE International Conference on Big Data, IEEE Big Data 2015",
publisher = "Institute of Electrical and Electronics Engineers Inc.",
address = "United States",

}

TY - GEN

T1 - Smog disaster forecasting using social web data and physical sensor data

AU - Chen, Jiaoyan

AU - Chen, Huajun

AU - Hu, Daning

AU - Pan, Jeff Z.

AU - Zhou, Yalin

N1 - This work is funded by NSFC 61473260, national key S&T Special Projects 2015ZX03003012-004 and Y-B2013120143 of Huawei.

PY - 2015/12/22

Y1 - 2015/12/22

N2 - Smog disaster is a type of air pollution event that negatively affects people's life and health. Forecasting smog disasters may largely reduce potential loss that they may cause. However, it is a great challenge since smog disasters are often caused by many complex factors. With the availability of huge amounts of data from the social web and physical sensors, covering information of air quality, meteorology, social event, human mobility, people's opinion, etc., it becomes possible to utilize such big data to forecast smog disasters. Especially, we can investigate the effect of social activities in smog disaster forecasting with the help of social web, which is ignored in traditional studies. In this paper, we propose a big data approach named B-Smog for smog disaster forecasting. It mainly has two components: 1) features extraction from multiple data sources to model the factors that indicate the appearance or disappearance of a smog disaster like traffic condition, human mobility, weather condition and air pollution transportation; 2) learning and predicting with heterogeneous features in multiple views. For the second component, we propose a prediction model based on an ensemble learning framework and artificial neural networks (ANNs), which achieves high accuracy in this application and can also be applied to other similar problems. We present the effectiveness of B-Smog through two cases studies in Beijing and Shanghai, and evaluate the accuracy of the prediction model through comparing it with some baselines. Moreover, the empirical findings of our study can also support decision making in smog disaster management.

AB - Smog disaster is a type of air pollution event that negatively affects people's life and health. Forecasting smog disasters may largely reduce potential loss that they may cause. However, it is a great challenge since smog disasters are often caused by many complex factors. With the availability of huge amounts of data from the social web and physical sensors, covering information of air quality, meteorology, social event, human mobility, people's opinion, etc., it becomes possible to utilize such big data to forecast smog disasters. Especially, we can investigate the effect of social activities in smog disaster forecasting with the help of social web, which is ignored in traditional studies. In this paper, we propose a big data approach named B-Smog for smog disaster forecasting. It mainly has two components: 1) features extraction from multiple data sources to model the factors that indicate the appearance or disappearance of a smog disaster like traffic condition, human mobility, weather condition and air pollution transportation; 2) learning and predicting with heterogeneous features in multiple views. For the second component, we propose a prediction model based on an ensemble learning framework and artificial neural networks (ANNs), which achieves high accuracy in this application and can also be applied to other similar problems. We present the effectiveness of B-Smog through two cases studies in Beijing and Shanghai, and evaluate the accuracy of the prediction model through comparing it with some baselines. Moreover, the empirical findings of our study can also support decision making in smog disaster management.

KW - Big Data

KW - Ensemble Learning

KW - Feature Extraction

KW - Smog Disaster Forecasting

KW - Social Media

UR - http://www.scopus.com/inward/record.url?scp=84963726288&partnerID=8YFLogxK

U2 - 10.1109/BigData.2015.7363850

DO - 10.1109/BigData.2015.7363850

M3 - Conference contribution

SP - 991

EP - 998

BT - 2015 IEEE International Conference on Big Data, IEEE Big Data 2015

PB - Institute of Electrical and Electronics Engineers Inc.

ER -