首页 | 本学科首页   官方微博 | 高级检索  
     检索      


A weighted multi-attribute method for matching user-generated Points of Interest
Authors:Grant McKenzie  Krzysztof Janowicz  Benjamin Adams
Institution:1. Department of Geography, The University of California, Santa Barbara, CA, USAgrant.mckenzie@geog.ucsb.edu;3. Department of Geography, The University of California, Santa Barbara, CA, USA;4. Department of Computer Science, Centre for eResearch, The University of Auckland, New Zealand
Abstract:To a large degree, the attraction of Big Data lies in the variety of its heterogeneous multi-thematic and multi-dimensional data sources and not merely its volume. To fully exploit this variety, however, requires conflation. This is a two-step process. First, one has to establish identity relations between information entities across different data sources; and second, attribute values have to be merged according to certain procedures that avoid logical contradictions. The first step, also called matching, can be thought of as a weighted combination of common attributes according to some similarity measures. In this work, we propose such a matching based on multiple attributes of Points of Interest (POI) from the Location-based Social Network Foursquare and the local directory service Yelp. While both contain overlapping attributes that can be used for matching, they have specific strengths and weaknesses that make their conflation desirable. For instance, Foursquare offers information about user check-ins to places, while Yelp specializes in user-contributed reviews. We present a weighted multi-attribute matching strategy, evaluate its performance, and discuss application areas that benefit from a successful matching. Finally, we also outline how the established POI matches can be stored as Linked Data on the Semantic Web. Our strategy can automatically match 97% of randomly selected Yelp POI to their corresponding Foursquare entities.
Keywords:Points of Interest  matching  Linked Data  volunteered geographic information
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号