首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
The principal rationale for applying geographically weighted regression (GWR) techniques is to investigate the potential spatial non-stationarity of the relationship between the dependent and independent variables—i.e., that the same stimulus would provoke different responses in different locations. The calibration of GWR employs a geographically weighted local least squares regression approach. To obtain meaningful inference, it assumes that the regression residual follows a normal or asymptotically normal distribution. In many classical econometric analyses, the assumption of normality is often readily relaxed, although it has been observed that such relaxation might lead to unreliable inference of the estimated coefficients' statistical significance. No studies, however, have examined the behavior of residual non-normality and its consequences for the modeled relationships in GWR. This study attempts to address this issue for the first time by examining a set of tobacco-outlet-density and demographic variables (i.e., percent African American residents, percent Hispanic residents, and median household income) at the census tract level in New Jersey in a GWR analysis. The regression residual using the raw data is apparently non-normal. When GWR is estimated using the raw data, we find that there is no significant spatial variation of the coefficients between tobacco outlet density and percentage of African American and Hispanics. After transforming the dependent variable and making the residual asymptotically normal, all coefficients exhibit significant variation across space. This finding suggests that relaxation of the normality assumption could potentially conceal the spatial non-stationarity of the modeled relationships in GWR. The empirical evidence of the current study implies that researchers should verify the normality assumption prior to applying GWR techniques in analyses of spatial non-stationarity.  相似文献   

2.
Abstract

Geographically weighted regression (GWR) is a local spatial statistical technique for exploring spatial nonstationarity. Previous approaches to mapping the results of GWR have primarily employed an equal step classification and sequential no-hue colour scheme for choropleth mapping of parameter estimates. This cartographic approach may hinder the exploration of spatial nonstationarity by inadequately illustrating the spatial distribution of the sign, magnitude, and significance of the influence of each explanatory variable on the dependent variable. Approaches for improving mapping of the results of GWR are illustrated using a case study analysis of population density–median home value relationships in Philadelphia, Pennsylvania, USA. These approaches employ data classification schemes informed by the (nonspatial) data distribution, diverging colour schemes, and bivariate choropleth mapping.  相似文献   

3.
互联网记录了人们的日常生活,对带有位置信息的搜索引擎数据进行分析和挖掘可以获得隐藏于其中的地理信息。本文通过分析中国各省流感月度发病数与相关关键词百度搜索指数之间的相关性,选取相关性较高关键词的百度指数作为解释变量,发病数作为因变量,在采用主成分分析法消除变量共线性后,分别使用普通最小二乘回归(OLS)、地理加权回归(GWR)及时空地理加权回归(GTWR)构建流感发病数的空间分布模型。模型的拟合度能够从OLS的0.737、GWR的0.915提高到GTWR的0.959,赤池信息准则(AIC)也表明,GTWR模型明显优于OLS与GWR模型。验证结果显示,GTWR模型能准确识别流感高发地区,将该方法与搜索引擎数据结合能较好地模拟流感空间分布,为空间流行病学的研究提供预测模型和统计解释。  相似文献   

4.
Geographically Weighted Regression (GWR) is a method of spatial statistical analysis used to explore geographical differences in the effect of one or more predictor variables upon a response variable. However, as a form of local analysis, it does not scale well to (especially) large data sets because of the repeated processes of fitting and then comparing multiple regression surfaces. A solution is to make use of developing grid infrastructures, such as that provided by the National Grid Service (NGS) in the UK, treating GWR as an "embarrassing parallel" problem and building on existing software platforms to provide a bridge between an open source implementation of GWR (in R) and the grid system. To demonstrate the approach, we apply it to a case study of participation in Higher Education, using GWR to detect spatial variation in social, cultural and demographic indicators of participation.  相似文献   

5.
This study analyses the relationship between fire incidence and some environmental factors, exploring the spatial non-stationarity of the phenomenon in sub-Saharan Africa. Geographically weighted regression (GWR) was used to study the above relationship. Environment covariates comprise land cover, anthropogenic and climatic variables. GWR was compared to ordinary least squares, and the hypothesis that GWR represents no improvement over the global model was tested. Local regression coefficients were mapped, interpreted and related with fire incidence. GWR revealed local patterns in parameter estimates and also reduced the spatial autocorrelation of model residuals. All the covariates were non-stationary and in terms of goodness of fit, the model replicates the data very well (R 2 = 87%). Vegetation has the most significant relationship with fire incidence, with climate variables being more important than anthropogenic variables in explaining variability of the response. Some coefficient estimates exhibit locally different signs, which would have gone undetected by a global approach. This study provides an improved understanding of spatial fire–environment relationships and shows that GWR is a valuable complement to global spatial analysis methods. When studying fire regimes, effects of spatial non-stationarity need to be incorporated in vegetation-fire modules to have better estimates of burned areas and to improve continental estimates of biomass burning and atmospheric emissions derived from vegetation fires.  相似文献   

6.
Present methodological research on geographically weighted regression (GWR) focuses primarily on extensions of the basic GWR model, while ignoring well-established diagnostics tests commonly used in standard global regression analysis. This paper investigates multicollinearity issues surrounding the local GWR coefficients at a single location and the overall correlation between GWR coefficients associated with two different exogenous variables. Results indicate that the local regression coefficients are potentially collinear even if the underlying exogenous variables in the data generating process are uncorrelated. Based on these findings, applied GWR research should practice caution in substantively interpreting the spatial patterns of local GWR coefficients. An empirical disease-mapping example is used to motivate the GWR multicollinearity problem. Controlled experiments are performed to systematically explore coefficient dependency issues in GWR. These experiments specify global models that use eigenvectors from a spatial link matrix as exogenous variables.This study was supported by grant number 1 R1 CA95982-01, Geographic-Based Research in Cancer Control and Epidermiology, from the National Cancer Institute. The author thank the anonymous reviewers and the editor for their helpful comments.  相似文献   

7.
星地多源数据的区域土壤有机质数字制图   总被引:4,自引:0,他引:4  
周银  刘丽雅  卢艳丽  马自强  夏芳  史舟 《遥感学报》2015,19(6):998-1006
土壤有机质(SOM)是全球碳循环、土壤养分的重要组成部分,精确估算土壤有机质含量具有重要意义。本文以中国东北—华北平原为研究区,收集了1078个土壤样本,以遥感数据(MODIS,TRMM和STRM数据)与土壤地面光谱数据为预测因子,运用基于树形结构的数据挖掘技术构建土壤有机质-环境预测因子模型进行数字土壤制图。通过不同建模样本数建模精度比较,选择300个样本数时的模型为最优模型。建模结果表明土壤光谱和气候因子是研究区SOM变异的主控因子,生物因子次之,而地形因子影响最小。预测结果经检验,RMSE为7.25,R2为0.69,RPD为1.53制图结果与基于第二次全国土壤普查数据的土壤有机质地图具有相似的分布规律,呈现SOM自东北向西南递减的趋势。通过比较分析发现,经过20年左右的土地开发与利用,研究区低SOM和高SOM含量土壤面积减少,而中等SOM含量土壤面积增加。  相似文献   

8.
This study deals with the issue of extreme coefficients in geographically weighted regression (GWR) and their effects on mapping coefficients using three datasets with different spatial resolutions. We found that although GWR yields extreme coefficients regardless of the resolution of the dataset or types of kernel function: (1) GWR tends to generate extreme coefficients for less spatially dense datasets; (2) coefficient maps based on polygon data representing aggregated areal units are more sensitive to extreme coefficients; and (3) coefficient maps using bandwidths generated by a fixed calibration procedure are more vulnerable to the extreme coefficients than adaptive calibration.  相似文献   

9.
针对采用地理加权回归模型(GWR)进行预测时输入变量较多导致计算复杂度高,而输入变量较少引起预测精度降低这一问题,提出了一种基于主成分分析的地理加权回归方法(PCA-GWR)。首先,该方法检验了气溶胶光学厚度(AOD)影响因素之间的共线性;然后,通过非线性主成分分析法(NLPCA)对影响AOD值的若干相关变量进行处理,既消除了相关变量彼此之间的多重共线性,又可以起到降维的作用;最后,利用非线性主成分分析得到较少的几个综合指标,通过地理加权回归模型对AOD值进行分析预测。为验证该方法的有效性,采用京津冀地区的AOD、高程、风速、气温、湿度、气压、坡度、坡向数据,利用Pearson相关系数法选取与AOD浓度具有较高相关性的影响因素作为常规的GWR模型的输入变量,在变量个数相同的前提下,与本文方法进行对比。研究结果表明:应用非线性主成分分析法对相关变量进行预处理后,有效地解决了变量之间的共线性,保留了原始影响因素主要信息,提高了运算效率,且该方法所得的MAE、RMSE、AIC及其拟合优度R2均优于常规的GWR模型。  相似文献   

10.
地理加权回归方法在小样本数据下回归分析精度往往不高。半监督学习是一种利用未标记样本参与训练的机器学习方法,可以有效地提升少量有标记样本的学习性能。基于此本文提出了一种基于半监督学习的地理加权回归方法,其核心思想是利用有标记样本建立回归模型来训练未标记样本,再选择置信度高的结果扩充有标记样本,不断训练,以提高回归性能。本文采用模拟数据和真实数据进行试验,以均方误差提升百分比作为性能评价指标,将SSLGWR与GWR、COREG对比分析。模拟数据试验中,SSLGWR在3种不同配置下性能分别提升了39.66%、11.92%和0.94%。真实数据试验中,SSLGWR在3种不同配置下性能分别提升了8.94%、3.36%和5.87%。SSLGWR结果均显著优于GWR和COGWR。试验证明,半监督学习方法能利用未标记数据提升地理加权回归模型的性能,特别是在有标记样本数量较少时作用显著。  相似文献   

11.
高精度降水场是水文、气象以及环境分析的重要数据支撑,直接影响相关服务的准确性。传统降水分布模拟大多依赖站点空间维的驱动因素,而忽略了降水时序变化特征对其空间分布的影响。使用2015—2017年中国湖北省83个国家气象观测站点和28个省级观测站点近3 a月平均累积降水资料,通过相关性分析,引入站点降水时序理论变差函数模型的拱高值(C)和块金值(C0)作为影响因素,使用地理加权回归(geographically weighted regression, GWR)建立湖北省月平均降水分布模型。结果表明:(1)各站点降水的时序变差函数曲线与降水的季节性基本吻合。站点时序理论变差函数模型中,有25.3%能够在4个月内达到平稳,36.14%在6个月内达到平稳。(2)站点降水时序理论变差函数模型的C和C0与逐年12月平均累积降水在0.01水平(双侧)上显著相关,平均相关系数分别为0.745和0.526,大于地理位置和高程对降水的影响。(3)引入C和C0 有助于提升GWR模型的整体拟合优度和插值精度。对比仅使用经纬度的GWR模型和引入时序理论变差函数特征的GWR模型,3 a平均整体拟合优度从0.852提升至0.912。验证集站点插值精度评价显示,3 a绝对误差、均方根误差和平均绝对百分误差下降幅度均大于60%。因此,引入时序理论变差函数特征的时空GWR模型能够获得较高精度的降水模拟结果,更适合具有丰富历史降水资料地区的降水空间分布估算。  相似文献   

12.
Geographically Weighted Regression (GWR) is a method of spatial statistical analysis allowing the modeled relationship between a response variable and a set of covariates to vary geographically across a study region. Its use of geographical weighting arises from the expectation that observations close together by distance are likely to share similar characteristics. In practice, however, two points can be geographically close but socially distant because the contexts (or neighborhoods) within which they are situated are not alike. Drawing on a previous study of geographically and temporally weighted regression, in this article we develop what we describe as contextualized Geographically Weighted Regression (CGWR), applying it to the field of hedonic house price modeling to examine spatial heterogeneity in the land parcel prices of Beijing, China. Contextual variables are incorporated into the analysis by adjusting the geographical weights matrix to measure proximity not only by distance but also with respect to an attribute space defined by measures of each observation's neighborhood. Comparing CGWR with GWR suggests that adding the contextual information improves the model fit.  相似文献   

13.
邓悦  刘洋  刘纪平  徐胜华  罗安 《测绘通报》2018,(3):32-37,42
近年来,我国大部分地区屡遭洪涝与干旱两种自然灾害侵袭,对重洪涝干旱区域进行空间插值具有重要的意义。针对传统地理加权回归(GWR)模型建模过程中模型识别和参数估计易受观测值异常点影响的问题,本文提出了一种基于吉布斯采样的贝叶斯地理加权回归(GBGWR)方法。运用基于吉布斯采样的马尔可夫链蒙特卡罗贝叶斯方法,估计地理加权回归模型参数,通过平滑函数降低观测值中异常点位数据,最后对湖南省1985-2015年35个观测站点的降水观测数据进行了空间分布模拟。试验结果表明,本文提出的方法相较于GWR模型性能提高了19.8%,相较于BGWR模型性能提高了8.2%,该方法可以有效降低异常值和"弱数据"对回归结果的影响,能够更加真实地模拟湖南省降水量的空间分布。  相似文献   

14.
地理加权回归是常用的空间分析方法,已广泛应用于各个领域,但利用此方法进行回归分析前,往往忽略了对设计矩阵进行局部多重共线性的诊断,从而导致对模型的估计不准确。因此,本文在引入了全局模型的多重共线性诊断方法的基础上,对这些方法进行了改进,改进后构建了加权方差膨胀因子法和加权条件指标方法——分解比法,用于诊断地理加权回归模型设计矩阵的多重共线性问题。实验结果表明,多重共线性不存在于全局模型,而可能存在于局部模型中,构建的两种方法能够有效地诊断地理加权回归模型的多重共线性问题,且加权条件指标方法——分解比法比加权方差膨胀因子法在诊断多重共线性问题上更有优势。  相似文献   

15.
混合地理加权回归模型算法研究   总被引:1,自引:0,他引:1  
以迭代算法为基础,推导出混合地理加权回归模型的常系数(全局参数)和变系数(局域参数)的计算方法,并以上海市住宅小区楼盘销售平均价格为例进行验证。结果表明,混合地理加权回归模型的计算量略大于地理加权回归模型,但对样本数据的拟合更好,局域参数估计更稳健。  相似文献   

16.
以地形地貌特征复杂、观测站点分布稀疏不均匀的四川省为研究区,引入地形因子(坡度和坡向)和植被指数,采用顾及空间关系非平稳性的(混合)地理加权回归克里格模型((mixed)geographically weighted regression Kriging,(m)GWRK)进行月尺度平均气温插值方法及精度分析研究。针对不同季节和不同地区,将(m)GWRK插值结果与基于全局回归的回归克里格(regression Kriging,RK)插值结果进行对比。结果表明,RK、GWRK、mGWRK回归关系的决定系数R2分别为0.795、0.922、0.911,均方根误差分别为0.83℃、0.64℃、0.55℃,表明GWRK、mGWRK对目标变量的解释能力以及插值精度都优于RK;GWRK、mGWRK相对于RK对月平均气温插值的改进具有季节与地区差异,冬半年的改进大于夏半年,在地形地貌变化大的地区改进大于地形地貌变化小的地区。  相似文献   

17.
This research uses the most recent (2003) census data and a Landsat ETM+ image to build a population estimation model for Port-au-Prince, Haiti. The purpose of the study is to establish the linkage of population density with remotely sensed surface reflectance signals of an urban area, and use that to estimate population when census data are not available in a timely fashion. The research begins with deriving subpixel vegetation-impervious surface-soil (VIS) fractions derived from the Landsat ETM+ multispectral bands, and then uses the geographically weighted regression (GWR) model to examine how the variation of population density can be explained by the VIS variables and their derivatives. With comparison to the ordinary least square (OLS) model, the GWR model accounts for spatial non-stationarity in the relationship between population patterns and land characteristics in the study area. The study reveals that three VIS variables are significant in explaining population density: the mean value of houses fraction image, the mean value of vegetation fraction image, and the standard deviation of vegetation fraction image.  相似文献   

18.
Geographically weighted regression (GWR) is an important local method to explore spatial non‐stationarity in data relationships. It has been repeatedly used to examine spatially varying relationships between epidemic diseases and predictors. Malaria, a serious parasitic disease around the world, shows spatial clustering in areas at risk. In this article, we used GWR to explore the local determinants of malaria incidences over a 7‐year period in northern China, a typical mid‐latitude, high‐risk malaria area. Normalized difference vegetation index (NDVI), land surface temperature (LST), temperature difference, elevation, water density index (WDI) and gross domestic product (GDP) were selected as predictors. Results showed that both positively and negatively local effects on malaria incidences appeared for all predictors except for WDI and GDP. The GWR model calibrations successfully depicted spatial variations in the effect sizes and levels of parameters, and also showed substantially improvements in terms of goodness of fits in contrast to the corresponding non‐spatial ordinary least squares (OLS) model fits. For example, the diagnostic information of the OLS fit for the 7‐year average case is R2 = 0.243 and AICc = 837.99, while significant improvement has been made by the GWR calibration with R2 = 0.800 and AICc = 618.54.  相似文献   

19.
Based on remote sensing and GIS, this study models the spatial variations of urban growth patterns with a logistic geographically weighted regression (GWR) technique. Through a case study of Springfield, Missouri, the research employs both global and local logistic regression to model the probability of urban land expansion against a set of spatial and socioeconomic variables. The logistic GWR model significantly improves the global logistic regression model in three ways: (1) the local model has higher PCP (percentage correctly predicted) than the global model; (2) the local model has a smaller residual than the global model; and (3) residuals of the local model have less spatial dependence. More importantly, the local estimates of parameters enable us to investigate spatial variations in the influences of driving factors on urban growth. Based on parameter estimates of logistic GWR and using the inverse distance weighted (IDW) interpolation method, we generate a set of parameter surfaces to reveal the spatial variations of urban land expansion. The geographically weighted local analysis correctly reveals that urban growth in Springfield, Missouri is more a result of infrastructure construction, and an urban sprawl trend is observed from 1992 to 2005.  相似文献   

20.
Mapping the spatial distribution of soil nutrient contents from sample data has received much attention in the recent decade. Accurately mapping soil nutrients purely based on sample data, however, is difficult due to the sparsity and high cost of samples. Land use types usually influence the contents of soil nutrients at the local level and it is desirable to integrate such information into predictive mapping. The area-and-point kriging (AAPK) method, which was proposed recently, may provide an interpolation technique for such purposes. This study mapped the soil total nitrogen (TN) distribution of Hanchuan County, China, using AAPK with sample data (consisting of 402 points) and land use information. Ordinary kriging (OK) and residual kriging (RK) were compared to evaluate the performance of AAPK. Results showed that: (1) land use types had important impacts on the spatial distribution of soil TN; (2) measured data at 135 validation locations had stronger correlation with the data predicted by AAPK than by RK and OK, and the mean error and root mean square error with AAPK were lower than with RK and OK; and (3) AAPK generated smaller error variances than RK and OK did. This suggests that AAPK represents an effective method for increasing the interpolation accuracy of soil TN. It should be pointed out that some of the land use polygons used in this study are very large and complex, which might impact the effectiveness of AAPK in improving the prediction accuracy. Segmenting them into simple smaller areas might be helpful.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号