首页 | 本学科首页   官方微博 | 高级检索  
     检索      

基于网络微博的地震宏观异常信息提取研究——以芦山地震为例
引用本文:张群燕,黄健熙,张晓东,苏晓慧,张旭.基于网络微博的地震宏观异常信息提取研究——以芦山地震为例[J].震灾防御技术,2013,8(4):459-467.
作者姓名:张群燕  黄健熙  张晓东  苏晓慧  张旭
作者单位:中国农业大学信息与电气工程学院,北京 100083;中国农业大学信息与电气工程学院,北京 100083;中国农业大学信息与电气工程学院,北京 100083;中国农业大学信息与电气工程学院,北京 100083;中国农业大学信息与电气工程学院,北京 100083
基金项目:国家“十二五”科技支撑计划课题(2012BAK19B04-03)
摘    要:微博平台有用户群大、公众参与性强、实时性等优点,同时微博平台信息又具有信息真伪难辨、地址信息模糊等缺点.本文以芦山地震为例,针对微博内容如何提取和地址如何定位两方面进行了分析研究,对于如何在网络微博平台中及时的提取地震宏观异常信息,提出了聚焦爬虫技术,并对微博地址进行了分类,同时将正向最大匹配和特征词地址分词的中文地址匹配模型应用于地址信息的提取和地址匹配中;最后将不同的地址类别定位为不同的行政级别,使微博平台和微博信息得到了充分的利用.通过研究认识到微博信息在反应震前异常的发生趋势方面有一定的参考价值(动物异常和气象异常所占比例较大),是不能被忽略的:地址方面可以看出异常随着时间的逼近有向震中聚集的趋势,有一定的参考价值.

关 键 词:微博平台  聚焦爬虫  地震宏观异常  分词技术  地址匹配
收稿时间:8/9/2013 12:00:00 AM

Micro-blog Application in Extracting Information of Earthquake Macro-anomalies
Zhang Qunyan,Huang Jianxi,Zhang Xiaodong,Su Xiaohui and Zhang Xu.Micro-blog Application in Extracting Information of Earthquake Macro-anomalies[J].Technology for Earthquake Disaster Prevention,2013,8(4):459-467.
Authors:Zhang Qunyan  Huang Jianxi  Zhang Xiaodong  Su Xiaohui and Zhang Xu
Institution:College of Information and Electrical Engineering, China Agricultural University, Beijing 100083, China;College of Information and Electrical Engineering, China Agricultural University, Beijing 100083, China;College of Information and Electrical Engineering, China Agricultural University, Beijing 100083, China;College of Information and Electrical Engineering, China Agricultural University, Beijing 100083, China;College of Information and Electrical Engineering, China Agricultural University, Beijing 100083, China
Abstract:Micro-blog is characterized with large user groups, strong public participation and real-time information although it is often difficult to verify the accuracy and authenticity of the information. Taking Lushan earthquake as an example, we performed analysis on how to extract the information and how to locate the addresses. Firstly, we use the focused crawling technique to extract the Macro-anomalies from Micro-blog in time. Then we put the addresses into four classifications and chose the model of Maximum Matching from beginning and characteristic words segmentation as the algorithm in the address matching perform. Finally, we locate the different address classification to the different administrative units which makes full utilization of the micro-blog platform and information. Our results suggest that the information on micro-blog may provide some references regarding to animal behavior anomaly and the weather anomaly.
Keywords:Micro-blog  Focused crawling technique  Earthquake macro-anomalies  Word segmentation algorithm  Address matching
本文献已被 维普 等数据库收录!
点击此处可从《震灾防御技术》浏览原始摘要信息
点击此处可从《震灾防御技术》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号