首页 | 本学科首页   官方微博 | 高级检索  
     检索      

基于随机森林算法的草原地上生物量遥感估算方法研究
引用本文:邢晓语,杨秀春,徐斌,金云翔,郭剑,陈昂,杨东,王平,朱立博.基于随机森林算法的草原地上生物量遥感估算方法研究[J].地球信息科学,2021,23(7):1312-1324.
作者姓名:邢晓语  杨秀春  徐斌  金云翔  郭剑  陈昂  杨东  王平  朱立博
作者单位:1.农业部农业信息技术重点实验室 中国农业科学院农业资源与农业区划研究所,北京 1000812.北京林业大学草业与草原学院 草地资源与生态研究中心,北京 1000833.遥感科学国家重点实验室 北京师范大学地理科学学部,北京 1008754.环境遥感与数字城市北京市重点实验室 北京师范大学地理科学学部,北京 1008755.内蒙古呼伦贝尔市畜牧科学研究所, 海拉尔 021008
基金项目:国家重点研发计划项目(2017YFC0506504);国家自然科学基金项目(41571105)
摘    要:草原是我国面积最大的陆地生态系统,生物量是反映生态系统质量和功能的关键指标,准确地掌握草原生物量对草原资源合理利用、生态修复、畜牧业高质量发展都具有重要的意义和作用。本研究以内蒙古锡林郭勒盟为研究区,利用高分一号遥感卫星影像,结合216个野外样本数据,采用随机森林算法(Random Forest,RF)对草原地上生物量(Aboveground Biomass,AGB)遥感估算进行了适用性分析与应用。在运用随机森林算法的过程中,进行了K-折交叉验证、多元共线性诊断、偏效应等一系列分析,完成了随机森林模型的构建,同时,将建模结果与其它模型进行了对比,最终实现了锡林郭勒盟草原AGB的反演估算。结果表明:① 随机森林算法能够较好地规避生物量建模中自变量多元共线性的问题;② 随机森林模型在草原AGB估算中较其它模型具有更好的适用性,模型精度达85%,RMSE为202.13 kg/hm2;③ 应用构建的随机森林算法估算了研究区2017年草原AGB,从结果来看,其空间分布上呈现为自东向西逐渐递减的趋势;从草地类型上看,山地草甸类AGB单产最高,温性草原类总产量最高。研究结果将对草原生态系统监测评估和草原宏观管理具有一定的参考价值。

关 键 词:草原地上生物量  随机森林  支持向量机  高分一号  多元共线性  偏效应  机器学习  回归模型  
收稿时间:2020-10-15

Remote Sensing Estimation of Grassland Aboveground Biomass based on Random Forest
XING Xiaoyu,YANG Xiuchun,XU Bin,JIN Yunxiang,GUO Jian,CHEN Ang,YANG Dong,WANG Ping,ZHU Libo.Remote Sensing Estimation of Grassland Aboveground Biomass based on Random Forest[J].Geo-information Science,2021,23(7):1312-1324.
Authors:XING Xiaoyu  YANG Xiuchun  XU Bin  JIN Yunxiang  GUO Jian  CHEN Ang  YANG Dong  WANG Ping  ZHU Libo
Abstract:Grassland is the largest terrestrial ecosystem in China. Biomass is a key indicator of ecosystem quality and ecosystem function. It is of great significance for us to accurately estimate the grassland biomass for the effective and rational use of grassland resources, the restoration of damaged grassland ecosystem, and the high-quality development of animal husbandry. In this study, we took Xilinguole league of Inner Mongolia autonomous region as the research area. We used GF-1 satellite images, ground sample data of 216 sites, and Random Forest (RF) algorithm to estimate Grassland Aboveground Biomass (AGB) and explore the applicability of the algorithm in grassland biomass estimation. Moreover, in order to evaluate the applicability of random forest algorithm in aboveground biomass estimation, we carried out a series of analysis when using the algorithm, such as k-fold cross validation, multicollinearity diagnosis, partial effect and so on. Based this, we completed the construction of the random forest model and compared the modeling results with those from other models. Then, we selected the best model to realize the inversion estimation of grassland aboveground biomass in Xilinguole league. The main conclusions are as follows: (1) In the process of biomass model construction in Xilinguole league, random forest algorithm can avoid multicollinearity problem if there are multiple input variables; (2) The random forest model has better applicability than other models in the estimation of grassland biomass. The accuracy of the random forest model is 85% while the RMSE is 202.13 kg/hm2; (3) Using the random forest model, we estimated the grassland aboveground biomass of the whole study area in 2017. The results indicated that the spatial distribution had a decreasing trend from east to west. When grassland types are concerned, the grassland aboveground biomass yield of mountain meadow was the highest among all grassland types while the total yield of temperate grassland was the highest among all grassland types. The results are not only beneficial to the monitoring and evaluation of grassland ecosystem, but also have a certain reference value for grassland macro management.
Keywords:grassland aboveground biomass  random forest  support vector machine  GF-1  multicollinearity  partial effect  machine learning  regression model  
本文献已被 CNKI 万方数据 等数据库收录!
点击此处可从《地球信息科学》浏览原始摘要信息
点击此处可从《地球信息科学》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号