首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
Individuals and other entities move through space as a function of local characteristics of place, their internal behavioral models, and the topological structure of the underlying space. When a collection of locations (i.e. geotagged photos or other geotagged social media information) from a large number of individuals is assembled, it becomes possible to understand the interrelationship between the individuals and the space they occupy. This research systematically considers this interrelationship through an examination of the effect of the intersection of behavioral and spatial characteristics on individuals moving on street networks. The research illustrates how social media data, in combination with a biased random walker, can be used to understand and model the interaction of spatial structure and social‐environmental factors on influencing individuals' use of their environment. The biased walker offers a flexible approach to incorporate consideration of both social‐environmental and structural factors into a model and we demonstrate this through a case study wherein we are able to use the random walker to model the characteristics of Flickr users in New York City.  相似文献   

2.
The use of social media data in geographic studies has become common, yet the question of social media's validity in such contexts is often overlooked. Social media data suffers from a variety of biases and limitations; nevertheless, with a proper understanding of the drawbacks, these data can be powerful. As cities seek to become “smarter,” they can potentially use social media data to creatively address the needs of their most vulnerable groups, such as ethnic minorities. However, questions remain unanswered regarding who uses these social networking platforms, how people use these platforms, and how representative social media data is of users' everyday lives. Using several forms of regression, I explore the relationships between a conventional data source (the U.S. Census) and a subset of Twitter data potentially representative of minority groups: tweets created by users with an account language other than English. A considerable amount of non‐stationarity is uncovered, which should serve as a warning against sweeping statements regarding the demographics of users and where people prefer to post. Further, I find that precisely located Twitter data informs us more about the digital status of places and less about users' day‐to‐day travel patterns.  相似文献   

3.
Online representations of places are becoming pivotal in informing our understanding of urban life. Content production on online platforms is grounded in the geography of their users and their digital infrastructure. These constraints shape place representation, that is, the amount, quality, and type of digital information available in a geographic area. In this article we study the place representation of user‐generated content (UGC) in Los Angeles County, relating the spatial distribution of the data to its geo‐demographic context. Adopting a comparative and multi‐platform approach, this quantitative analysis investigates the spatial relationship between four diverse UGC datasets and their context at the census tract level (about 685,000 geo‐located tweets, 9,700 Wikipedia pages, 4 million OpenStreetMap objects, and 180,000 Foursquare venues). The context includes the ethnicity, age, income, education, and deprivation of residents, as well as public infrastructure. An exploratory spatial analysis and regression‐based models indicate that the four UGC platforms possess distinct geographies of place representation. To a moderate extent, the presence of Twitter, OpenStreetMap, and Foursquare data is influenced by population density, ethnicity, education, and income. However, each platform responds to different socio‐economic factors and clusters emerge in disparate hotspots. Unexpectedly, Twitter data tend to be located in denser, more deprived areas, and the geography of Wikipedia appears peculiar and harder to explain. These trends are compared with previous findings for the area of Greater London.  相似文献   

4.
5.
ABSTRACT

Data availability is a persistent constraint in social policy analysis. Web 2.0 technologies could provide valuable new data sources, but first, their potentials and limitations need to be investigated. This paper reports on a method using Twitter data for deriving indications of active citizenship, taken as an example of social indicators. Active citizenship is a dimension of social capital, empowering communities and reducing possibilities of social exclusion. However, classical measurements of active citizenship are generally costly and time-consuming. This paper looks at one of such classic indicators, namely, responses to the survey question ‘contacts to politicians’. It compares official survey results in Spain with findings from an analysis of Twitter data. Each method presents its own strengths and weakness, thus best results may be achieved by the combination of both. Official surveys have the clear advantage of being statistically robust and representative of a total population. Instead, Twitter data offer more timely and less costly information, with higher spatial and temporal resolution. This paper presents our full methodological workflow for analysing and comparing these two data sources. The research results advance the debate on how social media data could be mined for policy analysis.  相似文献   

6.
ABSTRACT

Understanding the characteristics of tourist movement is essential for tourist behavior studies since the characteristics underpin how the tourist industry management selects strategies for attraction planning to commercial product development. However, conventional tourism research methods are not either scalable or cost-efficient to discover underlying movement patterns due to the massive datasets. With advances in information and communication technology, social media platforms provide big data sets generated by millions of people from different countries, all of which can be harvested cost efficiently. This paper introduces a graph-based method to detect tourist movement patterns from Twitter data. First, collected tweets with geo-tags are cleaned to filter those not published by tourists. Second, a DBSCAN-based clustering method is adapted to construct tourist graphs consisting of the tourist attraction vertices and edges. Third, network analytical methods (e.g. betweenness centrality, Markov clustering algorithm) are applied to detect tourist movement patterns, including popular attractions, centric attractions, and popular tour routes. New York City in the United States is selected to demonstrate the utility of the proposed methodology. The detected tourist movement patterns assist business and government activities whose mission is tour product planning, transportation, and development of both shopping and accommodation centers.  相似文献   

7.
This paper examines the spatial and temporal distribution of all COVID-19 cases from January to June 2020 against the underlying distribution of population in the United States. It is found that, as time passes, COVID-19 cases become a power law with cutoff, resembling the underlying spatial distribution of populations. The power law implies that many states and counties have a low number of cases, while only a few highly populated states and counties have a high number of cases. To further differentiate patterns between the underlying populations and COVID-19 cases, we derived their inherent hierarchy or spatial heterogeneity characterized by the ht-index. We found that the ht-index of COVID-19 cases persistently approaches that of the populations; that is, 5 and 7 at the state and county levels, respectively. Mapping the ht-index of COVID-19 cases against that of populations shows that the pandemic is largely shaped by the underlying population with the R-square value between infection and population up to 0.82.  相似文献   

8.
ABSTRACT

Allergic rhinitis (hay fever) resulting from seasonal pollen affects 15–30% of the population in the United States, and can exacerbate several related conditions, including asthma, atopic eczema, and allergic conjunctivitis. Timely monitoring, accurate prediction, and visualization of pollen levels are critical for public health prevention purposes, such as limiting outdoor exposure or physical activity. The low density of pollen detecting stations and complex movement of pollen represent a challenge for accurate prediction and modeling. In this paper, we reconstruct the dynamics of pollen variation across the Eastern United States for 2016 using space–time interpolation. Pollen levels were extracted according to a stratified spatial sampling design, augmented by additional samples in densely populated areas. These measurements were then used to estimate the space–time cross-correlation, inferring optimal spatial and temporal ranges to calibrate the space–time interpolation. Given the computational requirements of the interpolation algorithm, we implement a spatiotemporal domain decomposition algorithm, and use parallel computing to reduce the computational burden. We visualize our results in a 3D environment to identify the seasonal dynamics of pollen levels. Our approach is also portable to analyze other large space–time explicit datasets, such as air pollution, ash clouds, and precipitation.  相似文献   

9.
Cities are increasingly promoting policies that increase and conserve urban forests based largely on biophysical and land use-cover metrics. This study demonstrates how socioeconomic factors need to be considered in geospatial analyses when formulating urban greening policies. Using remote sensing, geographical information systems, spatial field and census data, and policy analyses, we analyzed the effectiveness of urban forest cover policies that included socioeconomic factors when quantifying urban forest cover. We found that urban forest cover was heterogeneous across the study area and non-white residents younger than 19 and greater than 45 years old living in rentals were more likely to reside in areas with less urban forest cover than any other age cohort. Our analyses also indicated that urban forest cover was temporally variable and demographic factors unique to Miami-Dade County bring to light the complexity of establishing homogenous, county-wide "tree canopy" and urban greening policy goals. We present a localized socioeconomic and ecologically based geospatial approach for formulating urban forest cover goals.  相似文献   

10.
With the rapid growth and popularity of mobile devices and location‐aware technologies, online social networks such as Twitter have become an important data source for scientists to conduct geo‐social network research. Non‐personal accounts, spam users and junk tweets, however, pose severe problems to the extraction of meaningful information and the validation of any research findings on tweets or twitter users. Therefore, the detection of such users is a critical and fundamental step for twitter‐related geographic research. In this study, we develop a methodological framework to: (1) extract user characteristics based on geographic, graph‐based and content‐based features of tweets; (2) construct a training dataset by manually inspecting and labeling a large sample of twitter users; and (3) derive reliable rules and knowledge for detecting non‐personal users with supervised classification methods. The extracted geographic characteristics of a user include maximum speed, mean speed, the number of different counties that the user has been to, and others. Content‐based characteristics for a user include the number of tweets per month, the percentage of tweets with URLs or Hashtags, and the percentage of tweets with emotions, detected with sentiment analysis. The extracted rules are theoretically interesting and practically useful. Specifically, the results show that geographic features, such as the average speed and frequency of county changes, can serve as important indicators of non‐personal users. For non‐spatial characteristics, the percentage of tweets with a high human factor index, the percentage of tweets with URLs, and the percentage of tweets with mentioned/replied users are the top three features in detecting non‐personal users.  相似文献   

11.
Abstract

This paper covers the development of a GIS instructional module centered on the reintroduction of the Mexican Gray Wolf in the Southwest, United States. This module is used in an undergraduate geography course on the United States. The paper also reports on how forty‐one students applied the module in trying to find an appropriate location to reintroduce the wolf.  相似文献   

12.
When travelling, people are accustomed to taking and uploading photos on social media websites, which has led to the accumulation of huge numbers of geotagged photos. Combined with multisource information (e.g. weather, transportation, or textual information), these geotagged photos could help us in constructing user preference profiles at a high level of detail. Therefore, using these geotagged photos, we built a personalised recommendation system to provide attraction recommendations that match a user's preferences. Specifically, we retrieved a geotagged photo collection from the public API for Flickr (Flickr.com) and fetched a large amount of other contextual information to rebuild a user's travel history. We then created a model-based recommendation method with a two-stage architecture that consists of candidate generation (the matching process) and candidate ranking. In the matching process, we used a support vector machine model that was modified for multiclass classification to generate the candidate list. In addition, we used a gradient boosting regression tree to score each candidate and rerank the list. Finally, we evaluated our recommendation results with respect to accuracy and ranking ability. Compared with widely used memory-based methods, our proposed method performs significantly better in the cold-start situation and when mining ‘long-tail’ data.  相似文献   

13.
The implementation of social network applications on mobile platforms has significantly elevated the activity of mobile social networking. Mobile social networking offers a channel for recording an individual’s spatiotemporal behaviors when location-detecting capabilities of devices are enabled. It also facilitates the study of time geography on an individual level, which has previously suffered from a scarcity of georeferenced movement data. In this paper, we report on the use of georeferenced tweets to display and analyze the spatiotemporal patterns of daily user trajectories. For georeferenced tweets having both location information in longitude and latitude values and recorded creation time, we apply a space–time cube approach for visualization. Compared to the traditional methodologies for time geography studies such as the travel diary-based approach, the analytics using social media data present challenges broadly associated with those of Big Data, including the characteristics of high velocity, large volume, and heterogeneity. For this study, a batch processing system has been developed for extracting spatiotemporal information from each tweet and then creating trajectories of each individual mobile Twitter user. Using social media data in time geographic research has the benefits of study area flexibility, continuous observation and non-involvement with contributors. For example, during every 30-minute cycle, we collected tweets created by about 50,000 Twitter users living in a geographic region covering New York City to Washington, DC. Each tweet can indicate the exact location of its creator when the tweet was posted. Thus, the linked tweets show a Twitter users’ movement trajectory in space and time. This study explores using data intensive computing for processing Twitter data to generate spatiotemporal information that can recreate the space–time trajectories of their creators.  相似文献   

14.
《The Cartographic journal》2013,50(3):262-267
Abstract

The paper discusses changes that have occurred over the last 15 years in how maps are sold, where they are sold and who is buying them. The emphasis is on the situation in the United States of America, but developments in Europe and the United Kingdom are also included.  相似文献   

15.
This article investigates how workout trajectories from a mobile sports tracking application can be used to provide automatic route suggestions for bicyclists. We apply a Hidden Markov Model (HMM)‐based method for matching cycling tracks to a “bicycle network” extracted from crowdsourced OpenStreetMap (OSM) data, and evaluate its effective differences in terms of optimal routing compared with a simple geometric point‐to‐curve method. OSM has quickly established itself as a popular resource for bicycle routing; however, its high‐level of detail presents challenges for its applicability to popularity‐based routing. We propose a solution where bikeways are prioritized in map‐matching, achieving good performance; the HMM‐based method matched correctly on average 94% of the route length. In addition, we show that the extremely biased nature of the trajectory dataset, which is typical of volunteered user‐generated data, can be of high importance in terms of popularity‐based routing. Most computed routes diverged depending on whether the number of users or number of tracks was used as an indicator of popularity, which may imply varying preferences among different types of cyclists. Revising the number of tracks by diversity of users to surmount local biases in the data had a more limited effect on routing.  相似文献   

16.
Rapid flood mapping is critical for local authorities and emergency responders to identify areas in need of immediate attention. However, traditional data collection practices such as remote sensing and field surveying often fail to offer timely information during or right after a flooding event. Social media such as Twitter have emerged as a new data source for disaster management and flood mapping. Using the 2015 South Carolina floods as the study case, this paper introduces a novel approach to mapping the flood in near real time by leveraging Twitter data in geospatial processes. Specifically, in this study, we first analyzed the spatiotemporal patterns of flood-related tweets using quantitative methods to better understand how Twitter activity is related to flood phenomena. Then, a kernel-based flood mapping model was developed to map the flooding possibility for the study area based on the water height points derived from tweets and stream gauges. The identified patterns of Twitter activity were used to assign the weights of flood model parameters. The feasibility and accuracy of the model was evaluated by comparing the model output with official inundation maps. Results show that the proposed approach could provide a consistent and comparable estimation of the flood situation in near real time, which is essential for improving the situational awareness during a flooding event to support decision-making.  相似文献   

17.
ABSTRACT

Understanding and detecting the intended meaning in social media is challenging because social media messages contain varieties of noise and chaos that are irrelevant to the themes of interests. For example, conventional supervised classification approaches would produce inconsistent solutions to detecting and clarifying whether any given Twitter message is really about a wildfire event. Consequently, a renovated workflow was designed and implemented. The workflow consists of four sequential procedures: (1) Apply the latent semantic analysis and cosine similarity calculation to examine the similarity between Twitter messages; (2) Apply Affinity Propagation to identify exemplars of Twitter messages; (3) Apply the cosine similarity calculation again to automatically match the exemplars to known training results, and (4) Apply accumulative exemplars to classify Twitter messages using a support vector machine approach. The overall correction ratio was over 90% when a series of ongoing and historical wildfire events were examined.  相似文献   

18.
ABSTRACT

There is an increasing availability of geospatial data describing patterns of human settlement and population such as various global remote-sensing based built-up land layers, fine-grained census-based population estimates, and publicly available cadastral and building footprint data. This development constitutes new integrative modeling opportunities to characterize the continuum of urban, peri-urban, and rural settlements and populations. However, little research has been done regarding the agreement between such data products in measuring human presence which is measured by different proxy variables (i.e. presence of built-up structures derived from different remote sensors, census-derived population counts, or cadastral land parcels). In this work, we quantitatively evaluate and cross-compare the ability of such data to model the urban continuum, using a unique, integrated validation database of cadastral and building footprint data, U.S. census data, and three different versions of the Global Human Settlement Layer (GHSL) derived from remotely sensed data. We identify advantages and shortcomings of these data types across different geographic settings in the U.S., which will inform future data users on implications of data accuracy and suitability for a given application, even in data-poor regions of the world.  相似文献   

19.
互联网的广泛应用产生了越来越多与地理空间位置关联的文本信息。现有地理信息系统一般通过外部链接来浏览这些数据,需要频繁的缩放、漫游和点击操作,而其他方法又难以有效表达出空间位置关系。提出了一种基于标签云的位置关联文本信息可视化方法———标签云地图,给出了标签云地图的设计思路和实现流程,并以腾讯微博的真实数据集为例建立了原型,重点研究了点状和面状地理要素的Cartogram生成算法,关键字和词频的提取算法,面向不同尺度和不同时间的标签云显示规则的标签位置生成算法。实验表明,该方法能够帮助用户从大量的位置关联文本信息中快速感知并把握信息的总体特征和发展趋势。  相似文献   

20.
User interaction in social networks, such as Twitter and Facebook, is increasingly becoming a source of useful information on daily events. The online monitoring of short messages posted in such networks often provides insight on the repercussions of events of several different natures, such as (in the recent past) the earthquake and tsunami in Japan, the royal wedding in Britain and the death of Osama bin Laden. Studying the origins and the propagation of messages regarding such topics helps social scientists in their quest for improving the current understanding of human relationships and interactions. However, the actual location associated to a tweet or to a Facebook message can be rather uncertain. Some tweets are posted with an automatically determined location (from an IP address), or with a user‐informed location, both in text form, usually the name of a city. We observe that most Twitter users opt not to publish their location, and many do so in a cryptic way, mentioning non‐existing places or providing less specific place names (such as “Brazil”). In this article, we focus on the problem of enriching the location of tweets using alternative data, particularly the social relationships between Twitter users. Our strategy involves recursively expanding the network of locatable users using following‐follower relationships. Verification is achieved using cross‐validation techniques, in which the location of a fraction of the users with known locations is used to determine the location of the others, thus allowing us to compare the actual location to the inferred one and verify the quality of the estimation. With an estimate of the precision of the method, it can then be applied to locationless tweets. Our intention is to infer the location of as many users as possible, in order to increase the number of tweets that can be used in spatial analyses of social phenomena. The article demonstrates the feasibility of our approach using a dataset comprising tweets that mention keywords related to dengue fever, increasing by 45% the number of locatable tweets.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号