首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
Existing spatial clustering methods primarily focus on points distributed in planar space. However, occurrence locations and background processes of most human mobility events within cities are constrained by the road network space. Here we describe a density-based clustering approach for objectively detecting clusters in network-constrained point events. First, the network-constrained Delaunay triangulation is constructed to facilitate the measurement of network distances between points. Then, a combination of network kernel density estimation and potential entropy is executed to determine the optimal neighbourhood size. Furthermore, all network-constrained events are tested under a null hypothesis to statistically identify core points with significantly high densities. Finally, spatial clusters can be formed by expanding from the identified core points. Experimental comparisons performed on the origin and destination points of taxis in Beijing demonstrate that the proposed method can ascertain network-constrained clusters precisely and significantly. The resulting time-dependent patterns of clusters will be informative for taxi route selections in the future.  相似文献   

2.
NET‐DBSCAN, a method for clustering the nodes of a linear network, whose edges may be temporarily inaccessible, is introduced. The new method extends the idea of a well‐known spatial clustering method, named density‐based spatial clustering of applications with noise (DBSCAN). The new algorithm is described in detail and through a series of examples. A prototype system, which implements the algorithm, developed in Java and tested through a series of synthetic networks, is also presented. Finally, the application of NET‐DBSCAN method to support real‐world situations is briefly discussed.  相似文献   

3.
Recently, points of interest (POIs) recommendation has evolved into a hot research topic with real-world applications. In this paper, we propose a novel semantics-enhanced density-based clustering algorithm SEM-DTBJ-Cluster, to extract semantic POIs from GPS trajectories. We then take into account three different factors (popularity, temporal and geographical features) that can influence the recommendation score of a POI. We characterize the impacts caused by popularity, temporal and geographical information, by using different scoring functions based on three developed recommendation models. Finally, we combine the three scoring functions together and obtain a unified framework PTG-Recommend for recommending candidate POIs for a mobile user. To the best of our knowledge, this work is the first that considers popularity, temporal and geographical information together. Experimental results on two real-world data sets strongly demonstrate that our framework is robust and effective, and outperforms the baseline recommendation methods in terms of precision and recall.  相似文献   

4.
在京津冀协同发展战略背景下,以建设京津冀世界级城市群为引领,遵循城市发展规律,优化城市空间布局,明确京津冀城市群等级结构及其空间特征具有重要意义。本文以京津冀城市群156个区县为研究对象,从经济中心性、交通中心性、信息中心性、人口中心性4个角度,利用4种空间聚类方法进行5个等级的聚类分析,并基于克氏中心地理论对京津冀城市群等级划分结果进行空间结构分析。结果显示,自组织特征映射神经网络算法(SOM)较适合京津冀城市群的等级划分;京津冀城市群正从以北京城区为单核心的圈层空间结构向3条带型空间结构转变,其中京津都市发展走廊发育成熟,沿海都市发展带也初具规模,而包括雄安新区在内的京石都市发展带正在孕育。  相似文献   

5.
Abstract

According to recent research, one of the most promising strategies for intraurban job growth lies promoting localized clusters that produce goods and services which are primarily sold within a single city, metropolitan area, or urban region. However, in order to design urban policies to create or reinforce local clusters, the first challenge is to measure in a reliable way the clustering tendencies of different kinds of economic units in intraurban space. The aim is to compare the similarities and differences in results obtained from two methods designed to measure global clustering tendencies (the planar and network K-functions) in terms of characterization, scale, and intensity of intraurban localization patterns for tertiary economic units in a Latin American metropolis. It is concluded that the network K-function is a more appropriate method for measuring agglomeration patterns, scale, and intensity at the intra-urban level.  相似文献   

6.
ABSTRACT

Urban black holes and volcanoes are typical traffic anomalies that are useful for optimizing urban planning and maintaining public safety. It is still challenging to detect arbitrarily shaped urban black holes and volcanoes considering the network constraints with less prior knowledge. This study models urban black holes and volcanoes as bivariate spatial clusters and develops a network-constrained bivariate clustering method for detecting statistically significant urban black holes and volcanoes with irregular shapes. First, an edge-expansion strategy is proposed to construct the network-constrained neighborhoods without the time-consuming calculation of the network distance between each pair of objects. Then, a network-constrained spatial scan statistic is constructed to detect urban black holes and volcanoes, and a multidirectional optimization method is developed to identify arbitrarily shaped urban black holes and volcanoes. Finally, the statistical significance of multiscale urban black holes and volcanoes is evaluated using Monte Carlo simulation. The proposed method is compared with three state-of-the-art methods using both simulated data and Beijing taxicab spatial trajectory data. The comparison shows that the proposed method can detect urban black holes and volcanoes more accurately and completely and is useful for detecting spatiotemporal variations of traffic anomalies.  相似文献   

7.
基于POI数据的郑东新区服务业空间聚类研究   总被引:8,自引:2,他引:6  
李江苏  梁燕  王晓蕊 《地理研究》2018,37(1):145-157
探讨城市新区的服务业空间格局,对城市新区规划和服务业空间布局的优化具有重要指导意义。采用POI数据对郑东新区服务业的总体、分行业空间布局进行聚类,结果显示:① 在总体上,聚类呈现“414”的空间体系,各聚类所在区域的通达性较好;服务业在功能区内部聚集和跨越功能区聚集并存;噪声点分布零散,局部出现了服务业聚集的潜力区域;空间临近效应、行政力量带动和市场导向作用导致服务业空间极化特征明显。② 从分行业来看,部分行业的重要空间节点分布存在一定差异,CBD核心区和商住物流区成为各行业空间节点的分布区域;部分行业的空间节点与功能区的产业定位存在吻合与错位特征。最后,从规划视角提出了郑东新区不同功能区产业结构优化的方向。  相似文献   

8.

This paper describes the application of an unsupervised clustering method, fuzzy c-means (FCM), to generate mineral prospectivity models for Cu?±?Au?±?Fe mineralization in the Feizabad District of NE Iran. Various evidence layers relevant to indicators or potential controls on mineralization, including geochemical data, geological–structural maps and remote sensing data, were used. The FCM clustering approach was employed to reduce the dimensions of nine key attribute vectors derived from different exploration criteria. Multifractal inverse distance weighting interpolation coupled with factor analysis was used to generate enhanced multi-element geochemical signatures of areas with Cu?±?Au?±?Fe mineralization. The GIS-based fuzzy membership function MSLarge was used to transform values of the different evidence layers, including geological–structural controls as well as alteration, into a [0–1] range. Four FCM-based validation indices, including Bezdek’s partition coefficient (VPc) and partition entropy (VPe) indices, the Fukuyama and Sugeno (VFS) index and the Xie and Beni (VXB) index, were employed to derive the optimum number of clusters and subsequently generate prospectivity maps. Normalized density indices were applied for quantitative evaluation of the classes of the FCM prospectivity maps. The quantitative evaluation of the results demonstrates that the higher favorability classes derived from VFS and VXB (Nd?=?9.19) appear more reliable than those derived from VPc and VPe (Nd?=?6.12) in detecting existing mineral deposits and defining new zones of potential Cu?±?Au?±?Fe mineralization in the study area.

  相似文献   

9.
In a spatio-temporal data set, identifying spatio-temporal clusters is difficult because of the coupling of time and space and the interference of noise. Previous methods employ either the window scanning technique or the spatio-temporal distance technique to identify spatio-temporal clusters. Although easily implemented, they suffer from the subjectivity in the choice of parameters for classification. In this article, we use the windowed kth nearest (WKN) distance (the geographic distance between an event and its kth geographical nearest neighbour among those events from which to the event the temporal distances are no larger than the half of a specified time window width [TWW]) to differentiate clusters from noise in spatio-temporal data. The windowed nearest neighbour (WNN) method is composed of four steps. The first is to construct a sequence of TWW factors, with which the WKN distances of events can be computed at different temporal scales. Second, the appropriate values of TWW (i.e. the appropriate temporal scales, at which the number of false positives may reach the lowest value when classifying the events) are indicated by the local maximum values of densities of identified clustered events, which are calculated over varying TWW by using the expectation-maximization algorithm. Third, the thresholds of the WKN distance for classification are then derived with the determined TWW. In the fourth step, clustered events identified at the determined TWW are connected into clusters according to their density connectivity in geographic–temporal space. Results of simulated data and a seismic case study showed that the WNN method is efficient in identifying spatio-temporal clusters. The novelty of WNN is that it can not only identify spatio-temporal clusters with arbitrary shapes and different spatio-temporal densities but also significantly reduce the subjectivity in the classification process.  相似文献   

10.
Abstract

Mapping variable stream buffers in a vector environment in which buffer width values are delineated often yields inaccurate results. Possible vector solutions are either ineffective or inefficient, An alternate raster approach is presented here in which a buffer effectiveness-achievement function (b-function) is introduced to map desirable buffer zones at an individual cell level based upon areal differentiations in physical and ecological conditions. The implementation of b-function is made feasible by a GIS procedure devised in this article. This tested method can be extended to a variety of variable buffer studies, such as visual buffers, noise buffers, greenways, and urban natural buffers.

‘A robe can never be made of the fur from one fox's axillae’ (A Chinese idiom).  相似文献   

11.
Social media applications are widely deployed in mobile platforms equipped with built-in GPS tracking devices, and these devices have led to an unprecedented collection of geolocated data (geo-tags). Geo-tags, along with place names, offer new opportunities to explore the trajectory and mobility patterns of social media users. However, trajectory data captured by social media are sparsely and irregularly spaced and therefore have varying degrees of resolution in both space and time. Previous studies on next location prediction are mostly applicable for detecting the upcoming location of a moving object using dense GPS trajectories where locations are recorded at regular time intervals (e.g., 1 minute). Additionally, point features are commonly used to represent the locations of visits, but using point features cannot capture the variability of human mobility. This article introduces a new methodology to predict an individual’s next location based on sparse footprints accumulated over a long time period using social networks, and uses polygons to represent the location corresponding to the physical activity area of individuals. First, the density-based spatial clustering algorithm is employed to discover the most representative activity zones that an individual frequently visits on a daily basis, and a polygon-based region is then derived for each representative activity zone. A sparse mobility Markov chain model considering both the movements and online behaviors of the social media user is trained and used to predict the user’s next location. Initial experiments with a group of Washington DC Twitter users demonstrate that the proposed methodology successfully discovers the activity regions and predicts the user’s next location with accuracy approaching 78.94%.  相似文献   

12.
The discovery of spatial clusters formed by proximal spatial units with similar non-spatial attribute values plays an important role in spatial data analysis. Although several spatial contiguity-constrained clustering methods are currently available, almost all of them discover clusters in a geographical dataset, even though the dataset has no natural clustering structure. Statistically evaluating the significance of the degree of homogeneity within a single spatial cluster is difficult. To overcome this limitation, this study develops a permutation test approach Specifically, the homogeneity of a spatial cluster is measured based on the local variance and cluster member permutation, and two-stage permutation tests are developed to determine the significance of the degree of homogeneity within each spatial cluster. The proposed permutation tests can be integrated into the existing spatial clustering algorithms to detect homogeneous spatial clusters. The proposed tests are compared with four existing tests (i.e., Park’s test, the contiguity-constrained nonparametric analysis of variance (COCOPAN) method, spatial scan statistic, and q-statistic) using two simulated and two meteorological datasets. The comparison shows that the proposed two-stage permutation tests are more effective to identify homogeneous spatial clusters and to determine homogeneous clustering structures in practical applications.  相似文献   

13.
As they increase in popularity, social media are regarded as important sources of information on geographical phenomena. Studies have also shown that people rely on social media to communicate during disasters and emergency situation, and that the exchanged messages can be used to get an insight into the situation. Spatial data mining techniques are one way to extract relevant information from social media. In this article, our aim is to contribute to this field by investigating how graph clustering can be applied to support the detection of geo-located communities in Twitter in disaster situations. For this purpose, we have enhanced the fast-greedy optimization of modularity (FGM) clustering algorithm with semantic similarity so that it can deal with the complex social graphs extracted from Twitter. Then, we have coupled the enhanced FGM with the varied density-based spatial clustering of applications with noise spatial clustering algorithm to obtain spatial clusters at different temporal snapshots. The method was experimented with a case study on typhoon Haiyan in the Philippines, and Twitter’s different interaction modes were compared to create the graph of users and to detect communities. The experiments show that communities that are relevant to identify areas where disaster-related incidents were reported can be extracted, and that the enhanced algorithm outperforms the generic one in this task.  相似文献   

14.
ABSTRACT

The spatial scan statistic method has been widely used for detecting disease clusters. Its results may be affected by scales, including the aggregation level of the input data and the population threshold used in the detection. Previous studies offered inconsistent findings, and few had considered both types of scales at the same time. Using 24 simulated datasets and two real disease datasets, we investigated the method’s sensitivity to the two types of scales. We aggregated the individual-level data into areal units of three levels, including county, town, and a 900 m grid. We detected clusters with three population thresholds, including 10%, 25%, and 50%. We used two measurements, distance between cluster centres and the Jaccard index, to quantify the consistency of clusters detected with different scale settings. We find: (1) the method is not greatly sensitive to the data aggregation level when the cluster is strong and in a place with high population density; (2) the method’s sensitivity to the population threshold is determined by the actual size of the true cluster; and (3) a regular grid with fine resolution is advantageous over the subjectively defined areal units. The process and findings may have broader meanings to similar spatial analyses.  相似文献   

15.
ABSTRACT

Spatial point tracks are of concern for an increasing number of analysts studying spatial behaviour patterns and environmental effects. Take an epidemiologist studying the behaviour of cyclists and how their health is affected by the city’s air quality. The accuracy of such analyses critically depends on the positional accuracy of the tracked points. This poses a serious privacy risk. Tracks easily reveal a person’s identity since the places visited function as fingerprints. Current obfuscation-based privacy protection methods, however, mostly rely on point quality reduction, such as spatial cloaking, grid masking or random noise, and thus render an obfuscated track less useful for exposure assessment. We introduce simulated crowding as a point quality preserving obfuscation principle that is based on adding fake points. We suggest two crowding strategies based on extending and masking a track to defend against inference attacks. We test them across various attack strategies and compare them to state-of-the-art obfuscation techniques both in terms of information loss and attack resilience. Results indicate that simulated crowding provides high resilience against home attacks under constantly low information loss.  相似文献   

16.
The identification of disease clusters in space or space–time is of vital importance for public health policy and action. In the case of methicillin‐resistant Staphylococcus aureus (MRSA), it is particularly important to distinguish between community and health care‐associated infections, and to identify reservoirs of infection. 832 cases of MRSA in the West Midlands (UK) were tested for clustering and evidence of community transmission, after being geo‐located to the centroids of UK unit postcodes (postal areas roughly equivalent to Zip+4 zip code areas). An age‐stratified analysis was also carried out at the coarser spatial resolution of UK Census Output Areas. Stochastic simulation and kernel density estimation were combined to identify significant local clusters of MRSA (p<0.025), which were supported by SaTScan spatial and spatio‐temporal scan. In order to investigate local sampling effort, a spatial ‘random labelling’ approach was used, with MRSA as cases and MSSA (methicillin‐sensitive S. aureus) as controls. Heavy sampling in general was a response to MRSA outbreaks, which in turn appeared to be associated with medical care environments. The significance of clusters identified by kernel estimation was independently supported by information on the locations and client groups of nursing homes, and by preliminary molecular typing of isolates. In the absence of occupational/lifestyle data on patients, the assumption was made that an individual's location and consequent risk is adequately represented by their residential postcode. The problems of this assumption are discussed, with recommendations for future data collection.  相似文献   

17.
Identifying zones and movement patterns of people is crucial to understanding adjacent regions and the relationship in urban areas. Most previous studies addressed zones or movement patterns separately without analysing simultaneously the two issues. In this article, we propose an integrated approach to discover directly both zones and movement patterns among the zones, referred to as movement patterns between zones (MZPs), from historical boarding behaviours of passengers in subway networks by using an agglomerative clustering method. In addition, evaluation measures of MZPs are suggested in terms of coverage and accuracy. The effectiveness of the proposed approach is finally demonstrated through a real-world data set obtained from smart cards on a subway network in Seoul, Korea.  相似文献   

18.
ABSTRACT

An increasing number of social media users are becoming used to disseminate activities through geotagged posts. The massive available geotagged posts enable collections of users’ footprints over time and offer effective opportunities for mobility prediction. Using geotagged posts for spatio-temporal prediction of future location, however, is challenging. Previous studies either focus on next-place prediction or rely on dense data sources such as GPS data. Introduced in this article is a novel method for future location prediction of individuals based on geotagged social media data. This method employs the hierarchical density-based clustering algorithm with adaptive parameter selection to identify the regions frequently visited by a social media user. A multi-feature weighted Bayesian model is then developed to forecast users’ spatio-temporal locations by combining multiple factors affecting human mobility patterns. Further, an updating strategy is designed to efficiently adjust, over time, the proposed model to the dynamics in users’ mobility patterns. Based on two real-life datasets, the proposed approach outperforms a state-of-the-art method in prediction accuracy by up to 5.34% and 3.30%. Tests show prediction reliability is high with quality predictions, but low in the identification of erroneous locations.  相似文献   

19.
Clustering of temporal event processes   总被引:1,自引:0,他引:1  
A temporal point process is a sequence of points, each representing the occurrence time of an event. Each temporal point process is related to the behavior of an entity. As a result, clustering of temporal point processes can help differentiate between entities, thereby revealing patterns of behaviors. This study proposes a hierarchical cluster method for clustering temporal point processes based on the discrete Fréchet (DF) distance. The DF cluster method is divided into four steps: (1) constructing a DF similarity matrix between temporal point processes; (2) constructing a complete linkage hierarchical tree based on the DF similarity matrix; (3) clustering the point processes with a threshold determined by locating the local maxima on the curve of the pseudo-F statistic (an index which measures the separability between clusters and the compactness in clusters); and (4) identifying inner patterns for each cluster formed by a series of dense intervals, each of which contains at least one event of all processes of the cluster. The contributions of the article are: (1) the proposed DF cluster method can cluster temporal point processes into different groups and (2) more importantly, it can identify the inner pattern of each cluster. Two synthetic data sets were created to illustrate the DF distance between temporal point process clusters (the first data set) and validate the proposed DF cluster method (the second data set), respectively. An experiment and a comparison with a method based on dynamic time warping show that DF cluster successfully identifies the preconfigured patterns in the second synthetic data set. The cluster method was then applied to a population migration history data set for the Northern Plains of the United States, revealing some interesting population migration patterns.  相似文献   

20.
In recent years, the evolution and improvement of LiDAR (Light Detection and Ranging) hardware has increased the quality and quantity of the gathered data, making the storage, processing and management thereof particularly challenging. In this work we present a novel, multi-resolution, out-of-core technique, used for web-based visualization and implemented through a non-redundant, data point organization method, which we call Hierarchically Layered Tiles (HLT), and a tree-like structure called Tile Grid Partitioning Tree (TGPT). The design of these elements is mainly focused on attaining very low levels of memory consumption, disk storage usage and network traffic on both, client and server-side, while delivering high-performance interactive visualization of massive LiDAR point clouds (up to 28 billion points) on multiplatform environments (mobile devices or desktop computers). HLT and TGPT were incorporated and tested in ViLMA (Visualization for LiDAR data using a Multi-resolution Approach), our own web-based visualization software specially designed to work with massive LiDAR point clouds.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号