Mining Users' Multiple Periodic Moving Behaviors Based on Location Data
  • English title: Mining Users' Multiple Periodic Moving Behaviors Based on Location Data
  • Authors: FAN Yi-wei; LU Xiao-ling
  • Affiliations: Center of Applied Statistics, Renmin University of China; Department of Statistics, Renmin University of China
  • Keywords: location data; moving behavior; period detection; clustering; dynamic time warping
  • Journal code: SSJS
  • Journal: Mathematics in Practice and Theory
  • Publication date: 2019-07-23
  • Publisher: 数学的实践与认识 (Mathematics in Practice and Theory)
  • Year: 2019
  • Volume: 49
  • Funding: National Natural Science Foundation of China (61472475); Special Guidance Fund for Building World-Class Universities (Disciplines) and Characteristic Development of the Central Universities
  • Language: Chinese
  • Article ID: SSJS201914020
  • Number of pages: 10
  • Issue: 14
  • CN: 11-2018/O1
  • Pages: 183-192
Abstract
Mining behavioral regularities of users from location data is an active research topic in the era of big data. Existing studies focus mainly on a user's presence at a given place at a given time; the dynamic behavior of moving from one place to another has received comparatively little attention. This paper proposes an algorithm that mines users' moving behaviors from location data, discovers a user's multiple periodic moving behaviors, and describes the user's movement patterns in space and time. First, the discrete Fourier transform and the autocorrelation coefficient are used to detect the periods of a user's moving behavior, with the Apriori property exploited to reduce computational complexity. Next, a generative model of the user's moving behavior is proposed and the probability matrix of moving behaviors is estimated. Because the observed data are sparse, behaviors in different time segments are clustered using the dynamic time warping distance with a global constraint, which reveals the user's multiple periodic moving behaviors. Finally, an empirical analysis is carried out on location data collected by the public bicycle system of a city. The results show that the new method effectively mines users' multiple periodic moving behaviors; summarizing these behaviors further yields the main characteristics of periodic movement across the user population.
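Two of the building blocks named in the abstract, period detection with the discrete Fourier transform plus autocorrelation and the dynamic time warping distance with a global constraint, can be illustrated with a minimal Python sketch. This is not the authors' implementation. The function names `detect_period` and `dtw_band`, the parameters `n_candidates` and `window`, and the use of a Sakoe-Chiba band as the global constraint are assumptions made for illustration. The sketch works on plain one-dimensional series rather than the paper's probability matrices, and it omits the Apriori-based pruning, the generative model, and the clustering step.

```python
import numpy as np


def detect_period(x, n_candidates=3):
    """Pick the period (in samples) best supported by the autocorrelation.

    Candidate periods come from the strongest periodogram peaks; each
    candidate is then scored by the sample autocorrelation at that lag.
    """
    x = np.asarray(x, dtype=float)
    x = x - x.mean()
    n = len(x)
    # Periodogram from the DFT; bin 0 (the mean) is skipped.
    power = np.abs(np.fft.rfft(x)) ** 2
    freqs = np.fft.rfftfreq(n)
    top = np.argsort(power[1:])[::-1][:n_candidates] + 1
    # Sample autocorrelation for lags 0..n-1, normalised so acf[0] == 1.
    acf = np.correlate(x, x, mode="full")[n - 1:]
    acf = acf / acf[0]
    best_lag, best_score = None, -np.inf
    for k in top:
        lag = int(round(1.0 / freqs[k]))
        if 2 <= lag < n // 2 and acf[lag] > best_score:
            best_lag, best_score = lag, acf[lag]
    return best_lag


def dtw_band(a, b, window):
    """DTW distance between two 1-D sequences under a Sakoe-Chiba band.

    Only cells with |i - j| <= window are visited (assumed form of the
    global constraint); the band is widened if the lengths differ by more.
    """
    a = np.asarray(a, dtype=float)
    b = np.asarray(b, dtype=float)
    n, m = len(a), len(b)
    w = max(window, abs(n - m))
    cost = np.full((n + 1, m + 1), np.inf)
    cost[0, 0] = 0.0
    for i in range(1, n + 1):
        for j in range(max(1, i - w), min(m, i + w) + 1):
            d = abs(a[i - 1] - b[j - 1])
            cost[i, j] = d + min(cost[i - 1, j], cost[i, j - 1], cost[i - 1, j - 1])
    return cost[n, m]


# Hypothetical daily activity indicator repeating every 7 days for 20 weeks.
weekly = np.tile([1, 1, 0, 0, 0, 0, 0], 20)
period = detect_period(weekly)                         # 7
shifted = dtw_band([0, 1, 0, 0], [0, 0, 1, 0], 1)      # 0.0: the band absorbs a one-step shift
rigid = dtw_band([0, 1, 0, 0], [0, 0, 1, 0], 0)        # 2.0: no warping allowed
```

Checking periodogram peaks against the autocorrelation guards against harmonics of the true period being reported as periods in their own right. The band width trades tolerance for time shifts against the risk of matching unrelated parts of two sparse sequences.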