• Journal of Geographical Sciences
  • Vol. 30, Issue 2, 251 (2020)
Tao PEI1、2、*, Ci SONG1、2, Sihui GUO1、2, Hua SHU1、2, Yaxi LIU1、2, Yunyan DU1、2, Ting MA1、2, and Chenghu ZHOU1、2
Author Affiliations
  • 1. State Key Laboratory of Resources and Environmental Information System, Institute of Geographic Sciences and Natural Resources Research, CAS, Beijing 100101, China
  • 2. University of Chinese Academy of Sciences, Beijing 100049, China
  • show less
    DOI: 10.1007/s11442-020-1726-7 Cite this Article
    Tao PEI, Ci SONG, Sihui GUO, Hua SHU, Yaxi LIU, Yunyan DU, Ting MA, Chenghu ZHOU. Big geodata mining: Objective, connotations and research issues[J]. Journal of Geographical Sciences, 2020, 30(2): 251 Copy Citation Text show less
    References

    [1] C Andrieu, Freitas N De, A Doucet et al. An introduction to MCMC for machine learning. Machine learning, 50, 5-43(2003).

    [2] P M Atkinson, A R L Tatnall. Neural networks in remote sensing: Introduction. International Journal of Remote Sensing, 18, 699-709(1997).

    [3] M Batty. The New Science of Cities(2013).

    [4] C M Beale, J J Lennon, J M Yearsley et al. Regression analysis of spatial data. Ecology Letters, 13, 246-264(2010).

    [5] U C Benz, P Hofmann, G Willhauck et al. Multi-resolution, object-oriented fuzzy analysis of remote sensing data for GIS-ready information. ISPRS Journal of Photogrammetry and Remote Sensing, 58, 239-258(2004).

    [6] R G Brereton, G R Lloyd. Support vector machines for classification and regression. Analyst, 135, 230-267(2010).

    [7] C Brunsdon, A S Fotheringham, M E Charlton. Geographically weighted regression: A method for exploring spatial nonstationarity. Geographical Analysis, 28, 281-298(1996).

    [8] C Brunsdon, S Fotheringham, M Charlton. Geographically weighted regression. Journal of the Royal Statistical Society: Series D (The Statistician), 47, 431-443(1998).

    [9] G F Byrne, P F Crapper, K K Mayo. Monitoring land-cover change by principal component analysis of multitemporal Landsat data. Remote Sensing of Environment, 10, 175-184(1980).

    [10] M Castells. Grassrooting the space of flows. Urban Geography, 20, 294-302(1999).

    [11] P S Castro, D Zhang, S Li. Urban traffic modelling and prediction using large scale taxi GPS traces. Proceeding of Pervasive’12 Proceedings of the 10th International Conference on Pervasive Computing, Newcastle, UK, June 18-22, 2012, 57-72(2012).

    [12] J Chen, Y Ban, S Li. China: Open access to Earth land-cover map. Nature, 514, 434(2015).

    [13] Long Chen, Neil Stuart, A Mackaness. Williams. Cluster and hot spot analysis in Lincoln, Nebraska, USA. Geomatics and Spatial Information Technology, 38, 189-192(2015).

    [14] M S Chen, J Han, P S Yu. Data mining: An overview from a database perspective. IEEE Transactions on Knowledge and Data Engineering, 8, 866-883(1996).

    [15] Changxiu Cheng, Peijun Shi, Changqing Song et al. Geographic big-data: A new opportunity for geography complexity study. Acta Geographica Sinica, 73, 1397-1406(2018).

    [16] R Cheng, T Emrich, H P Kriegel et al. Managing uncertainty in spatial and spatio-temporal data. Proceedings of the IEEE 30th International Conference on Data Engineering (ICDE), Chicago, IL, USA, Mar 31-Apr 04, 1302-1305(2014).

    [17] R G Congalton. A review of assessing the accuracy of classifications of remotely sensed data. Remote Sensing of Environment, 37, 35-46(1991).

    [18] J Dai, B Yang, C Guo et al. Personalized route recommendation using big trajectory data. Proceedings of the 2015 IEEE 31st International Conference on Data Engineering (ICDE), Seoul, South Korea, April 13-17, 2015, 543-554(2015).

    [19] 2017 User Development Report of Sina Micro-blog(2017). http://data.weibo. com/report/reportDetail?id=404

    [20] Zhenyu Du, Shangjun Xing, Yumin Song et al. Lead pollution along expressways and its attenuation by green belts in Shandong province. Journal of Soil and Water Conservation, 21, 175-179(2007).

    [21] S Džeroski. Relational Data Mining. Boston, MA: Springer, 887-911(2009).

    [22] M Ester, H P Kriegel, J Sander et al. A density-based algorithm for discovering clusters in large spatial databases with noise. Proceeding KDD’96 Proceedings of the Second International Conference on Knowledge Discovery and Data Mining, Portland, OR, USA, Aug 02-04, 226-231(1996).

    [23] U Fayyad, G Piatetsky-Shapiro, P Smyth. The KDD process for extracting useful knowledge from volumes of data. Communications of the ACM, 39, 27-34(1996).

    [24] M A Friedl, C E Brodley. Decision tree classification of land cover from remotely sensed data. Remote Sensing of Environment, 61, 399-409(1997).

    [25] J Ginsberg, M H Mohebbi, R S Patel et al. Detecting influenza epidemics using search engine query data. Nature, 457, 1012-1015(2009).

    [26] P O Gislason, J A Benediktsson, J R Sveinsson. Random forests for land cover classification. Pattern Recognition Letters, 27, 294-300(2006).

    [27] M F Goodchild. The validity and usefulness of laws in geographic information science and geography. Annals of the Association of American Geographers, 94, 300-303(2004).

    [28] M F Goodchild, M Yuan, T J Cova. Towards a general theory of geographic representation in GIS. International Journal of Geographical Information Science, 21, 239-260(2007).

    [29] J Han, J G Lee, M Kamber. An overview of clustering methods in geographic data analysis. Geographic Data Mining and Knowledge Discovery, 2, 149-170(2009).

    [30] Zhigang Han, Yunfeng Kong, Yaochen Qin. Research on geographic representation: A review. Progress in Geography, 30, 141-148(2011).

    [31] J M Harvey, J W Han. Geographic Data Mining and Knowledge Discovery(2009).

    [32] Y Huang, S Shekhar, H Xiong. Discovering colocation patterns from spatial data sets: A general approach. IEEE Transactions on Knowledge and Data Engineering, 16, 1472-1485(2004).

    [33] S Keola, M Andersson, O Hall. Monitoring economic development from space: Using nighttime light and land cover data to measure economic growth. World Development, 66, 322-334(2015).

    [34] X Kong, Z Xu, G Shen et al. Urban traffic congestion estimation and prediction based on floating car trajectory data. Future Generation Computer Systems: The International Journal of EScience, 61, 97-107(2016).

    [35] K Koperski, J Han. Discovery of spatial association rules in geographic information databases. Proceedings of the 4th International Symposium on Large Spatial Databases (SSD 95), Portland, ME, USA, Aug 06-09, 47-66(1995).

    [36] K Koperski, J Han, N Stefanovic. An efficient two-step method for classification of spatial data. Proceedings of the 8th International Symposium on Spatial Data Handling (SDH’98), Vancouver, BC, Canada, July 11-15, 1998, 45-54(1998).

    [37] D Lazer, R Kennedy, G King et al. The parable of Google Flu: Traps in big data analysis. Science, 343, 1203-1205(2014).

    [38] Y LeCun, Y Bengio, G Hinton. Deep learning. Nature, 521, 436-444(2015).

    [39] Deren Li, Tao Cheng. Knowledge discovery from GIS databases. Acta Geodaetica et Cartographica Sinica, 24, 37-44(1995).

    [40] Deren Li, Shuliang Wang, Deyi Li et al. Theories and technologies of spatial data mining and knowledge discovery. Geomatics and Information Science of Wuhan University, 27, 221-233(2002).

    [41] Deren Li, Shuliang Wang, Wenzhong Shi et al. On spatial data mining and knowledge discovery. Geomatics and Information Science of Wuhan University, 26, 491-499(2001).

    [42] X Li, A G O Yeh. Neural-network-based cellular automata for simulating multiple land use changes using GIS. International Journal of Geographical Information Science, 16, 323-343(2002).

    [43] Yang Liu, Ronggao Liu. Retrieval of global long-term leaf area index from LTDR AVHRR and MODIS observations. Journal of Geo-Information Science, 17, 1304-1312(2015).

    [44] Yu Liu. Revisiting several basic geographical concepts: A social sensing perspective. Acta Geographica Sinica, 71, 564-575(2016).

    [45] Z Liu, T Ma, Y Du et al. Mapping hourly dynamics of urban population using trajectories reconstructed from mobile phone records. Transactions in GIS, 22, 494-513(2018).

    [46] B Marr. Big Data: Using SMART Big Data, Analytics and Metrics to Make Better Decisions and Improve Performance. Chichester, UK: John Wiley & Sons(2015).

    [47] V Mayer-Schonberger, K Cukier. Big Data: A Revolution That Will Transform How We Live, Work, and Think. London: John Murray(2013).

    [48] D P McMillen. Geographically weighted regression: The analysis of spatially varying relationships. American Journal of Agricultural Economics, 86, 554-556(2004).

    [49] H J Miller, M F Goodchild. Data-driven geography. GeoJournal, 80, 449-461(2015).

    [50] T K Moon. The expectation-maximization algorithm. IEEE Signal Processing Magazine, 13, 47-60(1996).

    [51] O Mutanga, E Adam, M A Cho. High density biomass estimation for wetland vegetation using WorldView-2 imagery and random forest regression algorithm. International Journal of Applied Earth Observation and Geoinformation, 18, 399-406(2012).

    [52] New Night Lights Maps Open Up Possible Real-Time Applications(2017). https://www.nasa.gov/feature/goddard/2017/new-night-lights-maps-open-up-possible-real-time-applications

    [53] A Newing, B Anderson, A B Bahaj et al. The role of digital trace data in supporting the collection of population statistics: The case for smart metered electricity consumption data. Population, Space and Place, 22, 849-863(2016).

    [54] D Niemeijer. Developing indicators for environmental policy: Data-driven and theory-driven approaches examined by example. Environmental Science & Policy, 5, 91-103(2002).

    [55] N Niu, X P Liu, H Jin et al. Integrating multi-source big data to infer building functions. International Journal of Geographical Information Science, 31, 1871-1890(2017).

    [56] Global Historical Climate Network Daily: Description(2018). https://www.ncdc.noaa.gov/ghcn-daily-description

    [57] J Novembre, T Johnson, K Bryc et al. Genes mirror geography within Europe. Nature, 456, 98-101(2008).

    [58] M A Oliver, R Webster. Kriging: A method of interpolation for geographical information systems. International Journal of Geographical Information System, 4, 313-332(1990).

    [59] M Pal, P M Mather. Support vector machines for classification in remote sensing. International Journal of Remote Sensing, 26, 1007-1011(2005).

    [60] T Pei, J Gao, T Ma et al. Multi-scale decomposition of point process data. GeoInformatica, 16, 625-652(2012).

    [61] T Pei, S Sobolevsky, C Ratti et al. A new insight into land use classification based on aggregated mobile phone data. International Journal of Geographical Information Science, 28, 1988-2007(2014).

    [62] Tao Pei, Ting Li, Chenghu Zhou. Spatiotemporal point process: A new data model, analysis methodology and viewpoint for geoscientific problem. Journal of Geo-Information Science, 15, 793-800(2013).

    [63] Chengcheng Qian, Ge Cheng. Big data science for ocean: Present and future. Bulletin of Chinese Academy of Sciences, 33, 884-891(2018).

    [64] D Silver, A Huang, C J Maddison et al. Mastering the game of Go with deep neural networks and tree search. Nature, 529, 484-489(2016).

    [65] D Silver, J Schrittwieser, K Simonyan et al. Mastering the game of Go without human knowledge. Nature, 550, 354-359(2017).

    [66] C Song, Z Qu, N Blumm et al. Limits of predictability in human mobility. Science, 327, 1018-1021(2010).

    [67] M L Stein. Interpolation of Spatial Data: Some Theory for Kriging. New York: Springer Science & Business Media(2012).

    [68] Zhongwei Sun, Zi Lu. A geographical perspective to the elementary nature of space of flows. Geography and Geo-Information Science, 21, 109-112(2005).

    [69] W R Tobler. A computer movie simulating urban growth in the Detroit region. Economic Geography, 46, 234-240(1970).

    [70] M M Vazifeh, P Santi, G Resta et al. Addressing the minimum fleet problem in on-demand urban mobility. Nature, 557, 534-538(2018).

    [71] Haiqi Wang, Jinfeng Wang. Research on progress of spatial data mining. Geography and Geo-Information Science, 6-10(2005).

    [72] J Yuan, Y Zheng, X Xie et al. T-Drive: Enhancing driving directions with taxi drivers’ intelligence. IEEE Transactions on Knowledge & Data Engineering, 25, 220-232(2013).

    [73] Y Yuan, G Wei, Y Lu. Evaluating gender representativeness of location-based social media: A case study of Weibo. Annals of GIS, 24, 163-176(2018).

    [74] P A Zandbergen. Positional accuracy of spatial data: Non-normal distributions and a critique of the national standard for spatial data accuracy. Transactions in GIS, 12, 103-130(2008).

    [75] Wenjian Zhang. WMO integrated global observing system (WIGOS). Meteorological Monthly, 36, 1-8(2010).

    [76] B Zhao, S Zhang. Rethinking spatial data quality: Pokémon go as a case study of location spoofing. The Professional Geographer(2018).

    [77] Z L Zhao, S L Shaw, Y Xu et al. Understanding the bias of call detail records in human mobility research. International Journal of Geographical Information Science, 30, 1738-1762(2016).

    [78] Y Zheng, Y Liu, J Yuan et al. Urban computing with taxicabs. Proceedings of the 13th International Conference on Ubiquitous Computing, Beijing, China, September 17-21, 89-98(2011).

    [79] A-X Zhu, F Qi, A Moore et al. Prediction of soil properties using fuzzy membership values. Geoderma, 158, 199-206(2010).

    [80] A-X Zhu, L Yang, B Li et al. Construction of membership functions for predictive soil mapping under fuzzy logic. Geoderma, 155, 164-174(2010).

    Tao PEI, Ci SONG, Sihui GUO, Hua SHU, Yaxi LIU, Yunyan DU, Ting MA, Chenghu ZHOU. Big geodata mining: Objective, connotations and research issues[J]. Journal of Geographical Sciences, 2020, 30(2): 251
    Download Citation