首页 | 本学科首页   官方微博 | 高级检索  
     检索      


A deep learning architecture for semantic address matching
Authors:Yue Lin  Yuyang Wu  Qingyun Du  Tao Liu
Institution:1. School of Resource and Environmental Sciences, Wuhan University, Wuhan, China;2. Department of Geography, The Ohio State University, Columbus, OH, USAORCID Iconhttps://orcid.org/0000-0001-8568-7734;3. School of Geography and Information Engineering, China University of Geosciences, Wuhan, China;4. School of Resource and Environmental Sciences, Wuhan University, Wuhan, ChinaORCID Iconhttps://orcid.org/0000-0003-4615-2029;5. Faculty of Geomatics, Lanzhou Jiaotong University, Lanzhou, China
Abstract:ABSTRACT

Address matching is a crucial step in geocoding, which plays an important role in urban planning and management. To date, the unprecedented development of location-based services has generated a large amount of unstructured address data. Traditional address matching methods mainly focus on the literal similarity of address records and are therefore not applicable to the unstructured address data. In this study, we introduce an address matching method based on deep learning to identify the semantic similarity between address records. First, we train the word2vec model to transform the address records into their corresponding vector representations. Next, we apply the enhanced sequential inference model (ESIM), a deep text-matching model, to make local and global inferences to determine if two addresses match. To evaluate the accuracy of the proposed method, we fine-tune the model with real-world address data from the Shenzhen Address Database and compare the outputs with those of several popular address matching methods. The results indicate that the proposed method achieves a higher matching accuracy for unstructured address records, with its precision, recall, and F1 score (i.e., the harmonic mean of precision and recall) reaching 0.97 on the test set.
Keywords:Geocoding  deep neural network  machine learning  semantic matching  word2vec
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号