首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 734 毫秒
1.
Multidimensional Similarity Measuring for Semantic Trajectories   总被引:1,自引:0,他引:1       下载免费PDF全文
Most existing approaches aiming at measuring trajectory similarity are focused on two‐dimensional sequences of points, called raw trajectories. However, recent proposals have used background geographic information and social media data to enrich these trajectories with a semantic dimension, giving rise to the concept of semantic trajectories. Only a few works have proposed similarity measures for semantic trajectories or multidimensional sequences, having limitations such as predefined weight of the dimensions, sensitivity to noise, tolerance for gaps with different sizes, and the prevalence of the worst dimension similarity. In this article we propose MSM, a novel similarity measure for multidimensional sequences that overcomes the aforementioned limitations by considering and weighting the similarity in all dimensions. MSM is evaluated through an extensive experimental study that, based on a seed trajectory, creates sets of semantic trajectories with controlled transformations to introduce different kinds and levels of dissimilarity. For each set, we compute the similarity between the seed and the transformed trajectories, using different measures. The results showed that MSM was more robust and efficient than related approaches in the domain of semantic trajectories.  相似文献   

2.
MASTER: A multiple aspect view on trajectories   总被引:1,自引:0,他引:1  
For many years trajectory data have been treated as sequences of space‐time points or stops and moves. However, with the explosion of the Internet of Things and the flood of big data generated on the Internet, such as weather channels and social network interactions, which can be used to enrich mobility data, trajectories become more and more complex, with multiple and heterogeneous data dimensions. The main challenge is how to integrate all this information with trajectories. In this article we introduce a new concept of trajectory, called multiple aspect trajectory, propose a robust conceptual and logical data model that supports a vast range of applications, and, differently from state‐of‐the‐art methods, we propose a storage solution for efficient multiple aspect trajectory queries. The main strength of our data model is the combination of simplicity and expressive power to represent heterogeneous aspects, ranging from simple labels to complex objects. We evaluate the proposed model in a tourism scenario and compare its query performance against the state‐of‐the‐art spatio‐temporal database SECONDO extension for symbolic trajectories.  相似文献   

3.
In order to better understand the movement of an object with respect to a region, we propose a formal model of the evolving spatial relationships that transition between local topologies with respect to a trajectory and a region as well as develop a querying mechanism to analyze movement patterns. We summarize 12 types of local topologies built on trajectory‐region intersections, and derive their transition graph; then we capture and model evolving local topologies with two types of trajectory‐region strings, a movement string and a stop‐move string. The stop‐move string encodes the stop information further during a trajectory than the movement string. Such a string‐format expression of trajectory‐region movement, although conceptually simple, carries unprecedented information for effectively interpreting how trajectories move with respect to regions. We also design the corresponding Finite State Automations for a movement string as well as a stop‐move string, which are used not only to recognize the language of trajectory‐region strings, but also to deal effectively with trajectory‐region pattern queries. When annotated with the time information of stops and intersections, a trajectory‐region movement snapshot and its evolution during a time interval can be inferred, and even the relationships among trajectories with respect to the same region can be explored.  相似文献   

4.
Trajectory similarity measurement is a basic and vital task in trajectory data mining, which has attracted extensive research in the past decades. Recent works focused on the sequence and hierarchy property of trajectories to construct similarity measurements. However, these methods ignore the user information on the visiting locations, such as semantic and time distribution. In light of this, a novel trajectory similarity measurement based on Node-Sequence Hierarchical Digraph (NSHD) framework is proposed in this article. We first propose a Time-Weighted Stay Point Detection (TWSPD) method to extract real visiting locations of users more accurately. Then, the nodes of digraph are obtained by clustering users' stay points and the edges of digraph are sequence information that users move between these nodes. An Advanced Earth Mover's Distance (AEMD) is proposed to measure the node similarity between users, considering visiting time distribution and semantic information simultaneously. Both node and sequence similarities are used to calculate the similarity score to obtain the final trajectory similarity measurement. Experiments on Geolife and T-Drive datasets show that our proposed method offers competitive performance with mean reciprocal rank values reaching 96.01 and 81.26%, which outperforms related trajectory similarity measurements by more than 10 and 15%.  相似文献   

5.
为识别城市交通中的频繁路径,本文提出了一种出租车轨迹数据的频繁轨迹识别方法。该方法首先对轨迹数据进行轨迹压缩,以降低计算复杂度;然后基于最长公共子序列和动态时间规整算法进行轨迹相似性度量计算,利用计算得到的轨迹间相似度生成距离矩阵;最后将生成的距离矩阵结合HDBSCAN算法进行聚类得到频繁轨迹。选取厦门岛内两个区域进行试验分析,结果表明,该方法能够识别出轨迹数据集中的频繁轨迹,进而得到城市区域之间通行的频繁路径,对道路规划、路径优化与推荐、交通治理等应用提供帮助。  相似文献   

6.
Mobile user identification aims at matching different mobile devices of the same user using trajectory data, which has attracted extensive research in recent years. Most of the previous work extracted trajectory features based on regular grids, which will lead to incorrect feature representation due to lack of geographic information. Besides, most trajectory similarity models only considered one single distance measure to calculate the similarity between users, which ignore the connection between different distance measures and may lead to some false matches. In light of this, we present a novel user identification method based on road networks and multiple distance measures in this article. The proposed method segments a city map into several grids and road segments based on road networks. Then it extracts location and road information of trajectories to jointly construct user features. Multiple distance measures are fused by a discriminant model to improve the effect of user identification. Experiments on real GPS trajectory datasets show that our proposed method outperforms related similarity measure methods and is stable for mobile user identification. Meanwhile, our method can also achieve good identification results even on sparse trajectory datasets.  相似文献   

7.
Enormous quantities of trajectory data are collected from many sources, such as GPS devices and mobile phones, as sequences of spatio‐temporal points. These data can be used in many application domains such as traffic management, urban planning, tourism, bird migration, and so on. Raw trajectory data, as generated by mobile devices have very little or no semantics, and in most applications a higher level of abstraction is needed to exploit these data for decision making. Although several different methods have been proposed so far for trajectory querying and mining, there are no software tools to help the end user with semantic trajectory data analysis. In this article we present a software architecture for semantic trajectory data mining as well as the first software prototype to enrich trajectory data with both semantic information and data mining. As a prototype we extend the Weka data mining toolkit with the module Weka‐STPM, which is interoperable with databases constructed under OGC specifications. We tested Weka‐STPM with real geographic databases, and trajectory data stored under the Postgresql/PostGIS DBMS.  相似文献   

8.
Several works have been proposed in the last few years for raw trajectory data analysis, and some attempts have been made to define trajectories from a more semantic point of view. Semantic trajectory data analysis has received significant attention recently, but the formal definition of semantic trajectory, the set of aspects that should be considered to semantically enrich trajectories and a conceptual data model integrating these aspects from a broad sense is still missing. This article presents a semantic trajectory conceptual data model named CONSTAnT, which defines the most important aspects of semantic trajectories. We believe that this model will be the foundation for the design of semantic trajectory databases, where several aspects that make a trajectory “semantic” are taken into account. The proposed model includes the concepts of semantic subtrajectory, semantic points, geographical places, events, goals, environment and behavior, to create a general concept of semantic trajectory. The proposed model is the result of several years of work by the authors in an effort to add more semantics to raw trajectory data for real applications. Two application examples and different queries show the flexibility of the model for different domains.  相似文献   

9.
Recent urban studies have used human mobility data such as taxi trajectories and smartcard data as a complementary way to identify the social functions of land use. However, little work has been conducted to reveal how multi‐modal transportation data impact on this identification process. In our study, we propose a data‐driven approach that addresses the relationships between travel behavior and urban structure: first, multi‐modal transportation data are aggregated to extract explicit statistical features; then, topic modeling methods are applied to transform these explicit statistical features into latent semantic features; and finally, a classification method is used to identify functional zones with similar latent topic distributions. Two 10‐day‐long “big” datasets from the 2,370 bicycle stations of the public bicycle‐sharing system, and up to 9,992 taxi cabs within the core urban area of Hangzhou City, China, as well as point‐of‐interest data are tested to reveal the extent to which different travel modes contribute to the detection and understanding of urban land functions. Our results show that: (1) using latent semantic features delineated from the topic modeling process as the classification input outperforms approaches using explicit statistical features; (2) combining multi‐modal data visibly improves the accuracy and consistency of the identified functional zones; and (3) the proposed data‐driven approach is also capable of identifying mixed land use in the urban space. This work presents a novel attempt to uncover the hidden linkages between urban transportation patterns with urban land use and its functions.  相似文献   

10.
With fast growth of all kinds of trajectory datasets, how to effectively manage the trajectory data of moving objects has received a lot of attention. This study proposes a spatio‐temporal data integrated compression method of vehicle trajectories based on stroke paths coding compression under the road stroke network constraint. The road stroke network is first constructed according to the principle of continuous coherence in Gestalt psychology, and then two types of Huffman tree—a road strokes Huffman tree and a stroke paths Huffman tree—are built, based respectively on the importance function of road strokes and vehicle visiting frequency of stroke paths. After the vehicle trajectories are map matched to the spatial paths in the road network, the Huffman codes of the road strokes and stroke paths are used to compress the trajectory spatial paths. An opening window algorithm is used to simplify the trajectory temporal data depicted on a time–distance polyline by setting the maximum allowable speed difference as the threshold. Through analysis of the relative spatio‐temporal relationship between the preceding and latter feature tracking points, the spatio‐temporal data of the feature tracking points are all converted to binary codes together, accordingly achieving integrated compression of trajectory spatio‐temporal data. A series of comparative experiments between the proposed method and representative state‐of‐the‐art methods are carried out on a real massive taxi trajectory dataset from five aspects, and the experimental results indicate that our method has the highest compression ratio. Meanwhile, this method also has favorable performance in other aspects: compression and decompression time overhead, storage space overhead, and historical dataset training time overhead.  相似文献   

11.
充分利用出租车GPS时空轨迹数据分布广和时效性强的特点,提出一种基于车载GPS轨迹数据的路网拓扑自动变化检测新方法。该方法首先利用向量相似性度量模型,度量GPS轨迹向量与路网局部拓扑向量之间的相似性,检测疑似道路拓扑变化点,然后通过比较疑似道路拓扑变化点与路网拓扑关系,完成新增、废弃、改建等道路变化,实现基于车载GPS轨迹的路网拓扑自动变化检测。实验结果表明,该方法不仅有效地检测出道路新增、道路废弃与道路改扩建等变化,而且能利用出租车实时和大范围分布特点来实现城市路网大范围实时变化检测。  相似文献   

12.
Mobility and spatial interaction data have become increasingly available due to the wide adoption of location‐aware technologies. Examples of mobility data include human daily activities, vehicle trajectories, and animal movements, among others. In this article we focus on a special type of mobility data, i.e. origin‐destination pairs, and present a new approach to the discovery and understanding of spatio‐temporal patterns in the movements. Specifically, to extract information from complex connections among a large number of point locations, the approach involves two steps: (1) spatial clustering of massive GPS points to recognize potentially meaningful places; and (2) extraction and mapping of the flow measures of clusters to understand the spatial distribution and temporal trends of movements. We present a case study with a large dataset of taxi trajectories in Shenzhen, China to demonstrate and evaluate the methodology. The contribution of the research is two‐fold. First, it presents a new methodology for detecting location patterns and spatial structures embedded in origin‐destination movements. Second, the approach is scalable to large data sets and can summarize massive data to facilitate pattern extraction and understanding.  相似文献   

13.
Understanding diverse characteristics of human mobility provides profound knowledge of urban dynamics and complexity. Human movements are recorded in a variety of data sources and each describes unique mobility characteristics. Revealing similarity and difference in mobility data sources facilitates grasping comprehensive human mobility patterns. This study introduces a new method to measure similarities on two origin–destination (OD) matrices by spatially extending an image‐assessment tool, the structural similarity index (SSIM). The new measurement, spatially weighted SSIM (SpSSIM), utilizes weight matrices to overcome the SSIM sensitivity issue due to the ordering of OD pairs by explicitly defining spatial adjacency. To evaluate SpSSIM, we compared performances between SSIM and SpSSIM with resampling the orders of OD pairs and conducted bootstrapping to test the statistical significance of SpSSIM. As a case study, we compared OD matrices generated from three data sources in San Diego County, CA: U.S. Census‐based Longitudinal Employer–Household Dynamics Origin–Destination employment statistics, Twitter, and Instagram. The case study demonstrated that SpSSIM was able to capture similarities of mobility patterns between datasets that varied by distance. Some regions showed local dissimilarity while the global index indicated they were similar. The results enhance the understanding of complex mobility patterns from various datasets, including social media.  相似文献   

14.
Tracking facilities on smartphones generate enormous amounts of GPS trajectories, which provide new opportunities to study movement patterns and improve transportation planning. Converting GPS trajectories into semantically meaningful trips is attracting increasing research effort with respect to the development of algorithms, frameworks, and software tools. There are, however, few works focused on designing new semantic enrichment functionalities taking privacy into account. This article presents a raster‐based framework which not only detects significant stop locations, segments GPS records into stop/move structures, and brings semantic insights to trips, but also provides possibilities to anonymize users’ movements and sensitive stay/move locations into raster cells/regions so that a multi‐level data sharing structure is achieved for a variety of data sharing purposes.  相似文献   

15.
移动轨迹聚类方法研究综述   总被引:6,自引:2,他引:4  
轨迹数据是人类移动行为的表征,能够映射出人的出行模式和社会属性等信息。怎样有效挖掘轨迹数据蕴藏的人类活动规律一直是研究的热点。通过轨迹聚类发现行为相似的类簇,从而探究群体的移动模式是轨迹挖掘和深度应用常见的方法之一。本文首先根据轨迹数据的特点,将轨迹数据模型分为轨迹点模型和轨迹段模型,并据此定义相应的相似性度量:空间相似性度量和时空相似性度量;然后,对两类模型的聚类方法进行了综述,并总结不同聚类算法的优缺点,以期为不同应用选取聚类算法提供科学依据;最后对移动轨迹数据聚类方法研究的发展趋势进行了讨论。  相似文献   

16.
Providing Geographical Information Systems with mechanisms for processing geo‐data based on their semantics may help to solve problems like heterogeneity. This is because GIS could process geo‐data focusing on their meaning and not on their syntax and/or structure. An important aspect for achieving these objectives is the establishment of an automatic means of correspondence between geo‐data and their conceptualization in Higher Levels Ontologies (HLO). In this article, a new type of Ontology is proposed (Data‐Representation Ontology (DRO)). This Ontology describes the semantic embedded in geo‐data, which cannot be represented in current types of Ontologies. Across this Ontology, heterogeneous geographical data can be integrated in the semantic space contributing positively to the development of solutions for the problems of interoperability between heterogeneous systems. Likewise, we propose a new method for the automatic generation of the DRO and its interrelationships with HLO, based on pattern classification techniques. The experiments show that once the DRO is generated, the classifier can classify all data correctly. Thus, these data are semantically enriched. Moreover, this article shows how the topological relationships can enrich the semantics in the generated Ontology and increase the effectiveness of spatial analysis.  相似文献   

17.
Existing methods of spatial data clustering have focused on point data, whose similarity can be easily defined. Due to the complex shapes and alignments of polygons, the similarity between non‐overlapping polygons is important to cluster polygons. This study attempts to present an efficient method to discover clustering patterns of polygons by incorporating spatial cognition principles and multilevel graph partition. Based on spatial cognition on spatial similarity of polygons, four new similarity criteria (i.e. the distance, connectivity, size and shape) are developed to measure the similarity between polygons, and used to visually distinguish those polygons belonging to the same clusters from those to different clusters. The clustering method with multilevel graph‐partition first coarsens the graph of polygons at multiple levels, using the four defined similarities to find clusters with maximum similarity among polygons in the same clusters, then refines the obtained clusters by keeping minimum similarity between different clusters. The presented method is a general algorithm for discovering clustering patterns of polygons and can satisfy various demands by changing the weights of distance, connectivity, size and shape in spatial similarity. The presented method is tested by clustering residential areas and buildings, and the results demonstrate its usefulness and universality.  相似文献   

18.
针对目前只能对单一运动特征(速度、方向等)进行轨迹相似性分析的不足,提出了基于多重运动特征的轨迹相似性度量,该度量对于分析和理解移动对象的运动行为和规律具有重要意义。将其应用于基于多重运动特征的运动序列模式发现。该相似性度量借鉴数据立方体的思想,将多重运动特征时间序列进行量化和符号化表示,在多重运动特征值域空间中计算两字符间的距离作为字符间替换代价,最终以加权编辑距离作为相似性度量。将该相似性度量与谱聚类方法相结合进行运动序列模式发现。实验以飓风数据为例,通过气象文献中飓风的发生与运动规律验证了模型的有效性。  相似文献   

19.
This article studies the analysis of moving object data collected by location‐aware devices, such as GPS, using graph databases. Such raw trajectories can be transformed into so‐called semantic trajectories, which are sequences of stops that occur at “places of interest.” Trajectory data analysis can be enriched if spatial and non‐spatial contextual data associated with the moving objects are taken into account, and aggregation of trajectory data can reveal hidden patterns within such data. When trajectory data are stored in relational databases, there is an “impedance mismatch” between the representation and storage models. Graphs in which the nodes and edges are annotated with properties are gaining increasing interest to model a variety of networks. Therefore, this article proposes the use of graph databases (Neo4j in this case) to represent and store trajectory data, which can thus be analyzed at different aggregation levels using graph query languages (Cypher, for Neo4j). Through a real‐world public data case study, the article shows that trajectory queries are expressed more naturally on the graph‐based representation than over the relational alternative, and perform better in many typical cases.  相似文献   

20.
针对当前在精细识别道路拥堵时空范围方面研究的不足,提出一种利用GPS轨迹的二次聚类方法,通过快速识别大批量在时间、空间上差异较小且速度相近的轨迹段,反映出道路交通状态及时空变化趋势,并根据速度阈值确定拥堵状态及精细时空范围。首先将轨迹按采样间隔划分成若干条子轨迹,针对子轨迹段提出相似队列的概念,并设计了基于密度的空间聚类的相似队列提取方法,通过初次聚类合并相似子轨迹段,再利用改进的欧氏空间相似度度量函数计算相似队列间的时空距离,最后以相似队列为基本单元,基于模糊C均值聚类的方法进行二次聚类,根据聚类的结果进行交通流状态的识别和划分。以广州市主干路真实出租车GPS轨迹数据为例,对该方法进行验证。实验结果表明,该二次聚类方法能够较为精细地反映城市道路的拥堵时空范围,便于管理者精准疏散城市道路拥堵,相比直接聚类方法可以有效提升大批量轨迹数据的计算效率。  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号