流式数据多维建模与查询关键技术研究

英文题名：Research on Key Issues of Data Stream Multi-dimensional Modeling and Querying
作者：侯东风
论文级别：博士
学科专业名称：管理科学与工程
中文关键词：流式数据 ; 聚集计算 ; 多维数据模型 ; 流立方体 ; 兴趣视图 ; 多维连续查询 ; Web日志分析
英文关键词：data stream ; aggregation computing ; multidimensional data stream model ; stream cube ; interesting view ; multidimensional continuous query ; weblog analysis
学位年度：2010
导师：张维明
学科代码：1201
学位授予单位：国防科学技术大学
论文提交日期：2010-12-01

摘要

近年来,随着流式数据应用的不断扩展,用户亟待在这些数据中发现不同维度视角、不同数据粒度层次的异常模式、兴趣模式、发展趋势等,为实时决策提供支持,流式数据多维建模与查询技术的研究正适应这一现实需求。流式数据具有不同于传统数据的动态性、无限性、突发性等特征,另外一方面,与传统的分析方法相比较,多维查询具有较高的复杂性,从而数据建模、数据组织与查询计算均面临巨大挑战。
     本文针对上述挑战深入研究了多层次时间窗口模型、流式数据多维模型、流立方体计算方法、多维连续查询计算方法等关键技术。多层次时间窗口模型约束了流式数据的无限性,同时能够表达数据的多时间粒度和动态特征,通过适应性聚集树结构维持当前时间窗口中不同时间粒度的聚集信息,为流式数据聚集计算提供支持;定义流式数据多维模型,利用多层次时间窗口模型描述时间维度,用于描述模型的动态性,并定义了基本操作代数、分析操作代数与维护操作代数,为实现流式数据的多维组织与查询奠定了理论基础。针对流式数据多维组织问题提出了基于兴趣视图子集的流立方体计算方法,采用一种多路聚集树结构物化存储兴趣视图中的数据单元,支持立方体动态更新与多维查询计算。针对多维连续查询计算问题提出了基于查询状态维护的基本计算框架,并建立索引结构提高多维连续查询的执行效率。最后,以Web日志分析为例,说明了流式数据多维查询在实际中的应用。
     本文的主要贡献如下:
     (一)提出多层次时间窗口模型,通过时间粒度体系描述不同时间粒度层次数据的聚集关系,将无限的流式数据集合映射到有限的滑动窗口中,同时能够适应用户查询的多时间粒度需求,为流式数据处理提供基础支撑。提出适应性聚集计算方法维持多层次时间窗口上的聚集信息,建立适应性层次聚集树维持当前时间窗口中的聚集信息,其中的稀疏部分仅维持高层次聚集值,实验结果表明,该方法在非稳定的流式数据聚集计算中具有较为明显的优势。
     (二)提出流式数据多维模型,利用多层次时间窗口模型描述时间维度,约束了时间维度的无限性,同时能够表达维度的多粒度性,通过窗口中流事实的不断变化描述模型的动态特征。为了支持流式数据多维查询应用,定义了流式数据多维实例上的基本操作代数、分析操作代数、维护操作代数。最后,针对时间维度无限性、流事实动态性以及聚集函数复杂性等属性分析了模型的适用范围和约束条件。流式数据多维模型及操作代数的定义反映了多维计算的动态特征,为实现数据组织与查询奠定了理论基础。
     (三)针对流式数据的动态多维组织,提出基于兴趣视图子集的流立方体计算方法,兴趣视图反映了用户的查询需求,且仅占据视图集合的小部分,物化存储其中的数据单元能够减少存储空间消耗,同时能够满足大部分用户需求。在该方法中,采用一种多路聚集树结构维持物化数据单元及聚集关系,用于支持快速数据更新、即席查询与分析,在计算过程中采用多层次时间窗口约束和适应性划分策略进一步减少占用的存储空间,实验结果表明,该方法能够满足用户查询需求,并且具有较高时间和空间效率。
     (四)针对多维连续查询计算问题,提出了基于查询状态维护的查询计算框架,多维连续查询中维持了对连续执行结果产生影响的数据单元,通过更新、移除、输出查询结果等操作支持多维连续查询的动态计算;同时为了提高连续查询计算效率,基于连续查询选择条件建立索引树结构,用于支持多维连续查询状态的快速更新维护。实验结果表明,该方法为实现多维连续查询提供了有效途径,并具有较高的时间和空间效率。
     综上所述,本文针对关于流式数据多维建模、多维组织与查询计算等关键技术进行了突破性研究,提出了相应的理论与方法,对促进流式数据应用的发展具有一定的理论与实践意义。
In recent years, with the extension of data stream application in a wide range of fields, the users want to discovery the trends, unusual and interesting patterns from diverse composite dimensions and different granularities for the real time decision making. The research on data stream multidimensional modeling and querying is conducted to meet this requirement. Compared with traditional data, data stream features variability, infinity and bursty; and on the other hand, compared with traditional analysis methods, multidimensional query is highly sophisticated, which presents huge challenges to s data modeling, storage and querying.
     In response to these challenges, this dissertation aims to address several key problems, including multi-level time window model, multidimensional model of data stream, stream cube computing and multidimensional continuous query. The multi-level time window model bounded the infinity of data stream, and also described the multi-time granularities and variability. The aggregated values of multi-level time window were maintained in the adaptive hierarchy aggregate tree for aggregation computing. The multidimensional model of data stream was defined for organization. The time dimension followed the multi-level time window model which represents the dynamic property of multidimensional model, the basic algebra, analysis algebra and maintenance algebra were defined for lading the theory foundation of multidimensional organizing and querying. The stream cubing method based on interesting view subset was put forward for multidimensional organizing of data stream, in this method, the multi-way aggregation tree is established for maintaining the cells of interesting views. The tree structure can be updated dynamically to meet the multidimensional queries. The framework based on query state maintenance was designed for computing multidimensional continuous queries, and the index structure is built for improving the efficiency of queries execution. Finally, the application of multidimensional query is illuminated by the case of weblog analysis.
     The main contributions of the dissertation are as follows:
     (1) Multi-level time window model was put forward for mapping the infinite data stream to the sliding window, and the relations of different level were described by time granularities system, which can fill the multi-time granularities of queries and provide the foundational support for data stream processing. The adaptive method for computing the aggregation in time window was studied, in this method, the adaptive hierarchy aggregate tree is adopted as the basic structure, in which the sparse parts only the high level value is held. The experiment shows that the method is superior than others in the bursty data stream.
     (2) The multidimensional model of data stream is proposed in the dissertation, the time dimension was described by multi-level time window model, the infinity of time dimension is restricted and the multi-granularities is expressed, and the dynamic of model is depict by the evolving of stream fact. The algebras were described for defining multidimensional queries, including the basic algebra, analysis algebra and maintenance algebra. In the end, we aimed at the infinity of time dimension, dynamic of stream fact and sophistication of aggregate function analyzed the scope and restrictions of model. The definition of multidimensional model of data stream and algebra despite the dynamic of multidimensional computing, and lay the theory foundation for data stream organizing and querying.
     (3) The stream cube computing method based on Interesting View Subset was proposed for dynamic multidimensional organizing of data stream. The interesting view set indicates the requirements of queries, and cover only small parts of all ones. Materializing the cells of interesting views could reduce the consumed memories and also fill the needs of most users. In this method, an multi-way aggregate tree is adopt for maintaining the cells and it's relations, it can be used for quickly updating the cube and the result of ad-hoc querying, in the running phase, the storage space of structure can be reduced by multi-level time window and adaptive partition strategy. The experiments show that the method could satisfy users' requirements and also is efficient in time and space.
     (4) The framework of query computing based on query state maintenance was proposed for multidimensional continuous querying. The state of continuous query hold the cells that may contribute to any future query results, and support dynamically computing of multidimensional continuous query by the operator of update, remove and generate results. We also constructed the index tree based on the select predication of continuous queries for improving the update efficiency of state. The experiment shows that the method was effective in multidimensional continuous query implementation, and also was efficient in time and space.
     In conclusion, this dissertation put emphasis on several key issues of data stream multidimensional modeling and querying, and a series of algorithms and theories were studied. It is significant in theory and practice for the development of data stream application.

引文

[1] Agrawal R, Ailamaki A, Bernstein P A, Brewer E i A, et al. The claremont report on database research[J]. Communications of the ACM, 2009,52 (6): 56-65.
    [2]孟小峰,周龙骧,王珊.数据库技术发展趋势[J].软件学报, 2004,15 (12): 1822-1836.
    [3] Babcock B, Babu S, Datar M, Motwani R, et al. Models and issues in data stream systems [C]. in: Proceedings of the 21st ACM Symposium on Principles of Database Systems.Madison, Wisconsin, USA: ACM,2002. 1-16.
    [4] Chaudhry N A, Shaw K, Abdelguerfi M. Stream data management[M]. New York,USA: Springer Science+Business Media, Inc,2005.
    [5] Das R, Turkoglu I. Creating meaningful data from Web logs for improving the impressiveness of a Website by using path analysis method[J]. Expert Systems with Applications, 2008,36 (3): 6635-6644.
    [6] Phua C, Gayler R, Lee V, Smith-Miles K. On the communal analysis suspicion scoring for identity crime in streaming credit applications[J]. European Journal of Operational Research, 2009,195 (2): 595-612.
    [7] Balazinska M, Deshpande A, Franklin M J, Gibbons P B, et al. Data management in the worldwide sensor Web[J]. IEEE Pervasive Computing, 2007,6 (2): 30-40.
    [8] Wu J, Zhou Y, Aberer K, Tan K-L. Towards integrated and efficient scientific sensor data processing: a database approach [C]. in: Proceedings of the 12th International Conference on Extending Database Technology.Saint Petersburg, Russia: ACM,2009. 922-933.
    [9] Park N H, Oh S H, Lee W S. Anomaly intrusion detection by clustering transactional audit streams in a host computer[J]. Information Sciences, 2010,180 (12): 2375-2389.
    [10] Welsh M. Sensor networks for the sciences[J]. Communications of the ACM, 2010,53 (11): 36-39.
    [11] Goth G. Turning data Into knowledge[J]. Communications of the ACM, 2010,53 (11): 13-15.
    [12] Wu T, Xin D, Han J. ARCube: supporting ranking aggregate queries in partially materialized data cubes [C]. in: Proceedings of the 2008 ACM SIGMOD International Conference on Management of Data.Vancouver, BC, Canada: ACM,2008. 79-91.
    [13] Xi R, Lin N, Chen Y. Compression and aggregation for logistic regression analysis in data cubes[J]. IEEE Transactions on Knowledge and Data Engineering, 2009,21 (4): 479-492.
    [14] Kumar N, Gangopadhyay A, Bapna S, Karabatis G, et al. Measuring interestingness of discovered skewed patterns in data cubes[J]. Decision Support Systems,2008,46 (1): 429-439.
    [15] Cariou V, CubilléJ, Derquenne C, Goutier S, et al. Built-in indicators to discover interesting drill paths in a cube [C]. in: Proceedings of the 10th International Conference on Data Warehousing and Knowledge Discovery.Turin, Italy: Springer,2008. 33-44.
    [16] Pedersen T B, Jensen C S, Dyreson C E. A foundation for capturing and querying complex multidimensional data[J]. Information Systems, 2001,26 (5): 383-423.
    [17] Gray J, Chaudhuri S, Bosworth A, Layman A, et al. Data cube: a relational aggregation operator generalizing group-by, cross-tab, and sub-totals[J]. Data Mining and Knowledge Discovery, 1997,1 (1): 29-53.
    [18]林子雨,杨冬青,王腾蛟,宋国杰.实视图选择研究[J].软件学报, 2009,20 (2): 193-213
    [19] Cuzzocrea A. A top-down approach for compressing data cubes under the simultaneous evaluation of multiple hierarchical range queries[J]. Journal of Intelligent Information Systems, 2010,34 (3): 305-343.
    [20] Kozielski S, Wrembel R. New trends in data warehousing and data analysis[M]. New York, USA: Springer Science+Business Media, LLC,2009.
    [21] Gehrke J. Data stream processing - when you only get one look[J]. Communications of the ACM, 2009,52 (10): 96-96.
    [22] Thiele M, Fischer U, WolfgangLehner. Partition-based workload scheduling in living data warehouse environments[J]. Information Systems, 2009,34 382-399.
    [23] Han J, Li Z, Tang L A. Mining moving object, trajectory and traffic data [C]. in: Proccdings of the 15th International Conference on Database System for Advanced Applications.Tsukuba,Japan: Springer,2010. 485-486.
    [24] Stacey M, McGregor C. Temporal abstraction in intelligent clinical data analysis: A survey[J]. Artificial Intelligence in Medicine, 2007,39 (1): 1-24.
    [25] Fazzinga B, Flesca S, Furfaro F, Masciari E. Efficient and effective RFID data warehousing [C]. in: Proceedings of the 12th International Database Engineering and Applications Symposium.Cetraro, Calabria,Italy: ACM,2009. 251-258.
    [26] Baeza-Yates R, Ramakrishnan R. Data challenges at Yahoo! [C]. in: Proceedings of the 11th International Conference on Extending Database Technology.Nantes, France: ACM,2008. 652-655.
    [27] Zhang Yu, Fang BinXing, Zhang YongZheng. Identifying heavy hitters in high-speed network monitoring[J]. Science China Information Sciences, 2010,53 (3): 659-676.
    [28] Golab L, ?zsu M T. Issues in data stream management[J]. SIGMOD Record, 2003,32 (2): 5-14.
    [29] Muthukrishnan S. Data streams: algorithms and applications[M]. Hanover,MA,USA: Now Publisher Inc,2005.
    [30] Stonebraker M, ?etintemel U, Zdonik S. The 8 requirements of real-time stream processing[J]. SIGMOD Record, 2005,34 (4): 42-47.
    [31] Babu S, Widom J. Continuous queries over data streams[J]. SIGMOD Record, 2001,30 (3): 109-120.
    [32] Burdick D, Deshpande P M, Jayram T S, Ramakrishnan R, et al. OLAP over uncertain and imprecise data[J]. The VLDB Journal, 2007,16 (1): 123-144.
    [33] Hua M, Pei J. Continuously monitoring top-k uncertain data streams: a probabilistic threshold method[J]. Distributed and Parallel Databases, 2009,26 (1): 29-65.
    [34] Aggarwal C C, Yu P S. A survey of uncertain data algorithms and applications[J]. IEEE Transactions on Knowledge and Data Engineering, 2009,21 (5): 609-623.
    [35] Chang J H, Kumb H-C M. Frequency-based load shedding over a data stream of tuples[J]. Information Sciences, 2009,179 (21): 3733-3744.
    [36] Bowman I T, Salem K. Optimization of query streams using semantic prefetching[J]. ACM Transactions on Database Systems, 2005,30 (4): 1056-1101.
    [37] Hildrum K, Douglis F, Wolf J L, Yu P S. Storage optimization for large-scale distributed stream-processing systems[J]. ACM Transactions on Storage, 2008,3 (4): Article 18.
    [38] The STREAM Group. STREAM: the stanford stream data manager[J]. IEEE Data Engineering Bulletin, 2003,26 (1): 19-26.
    [39] Abadi D J, Carney D, ?etintemel U, Cherniack M, et al. Aurora: a new model and architecture for data stream management[J]. The VLDB Journal, 2003,12 (2): 120-139.
    [40] Abadi D J, Ahmad Y, Balazinska M, ?etintemel U, et al. The design of the borealis stream processing engine [C]. in: Proceedings of the 2nd Biennial Conference on Innovative Data System Research.Asilomar,CA,USA: Online,2005. 277-289.
    [41] Chandrasekaran S, Cooper O, Deshpande A, Franklin M J, et al. TelegraphCQ continuous dataflow processing for an uncertain world [C]. in: Proceedings of the 1st Biennial Conference on Innovative Data System Research.Asilomar,CA,USA: Online,2003.
    [42] Cai Y D, Clutter D, Pape G, Jiawei Han, et al. MAIDS: mining alarming incidents from data streams [C]. in: Proceedings of the 2004 ACM SIGMOD International Conference on Management of Data.Paris, France: ACM,2004. 919-920.
    [43]金澈清,钱卫宁,周傲英.流数据分析与管理综述[J].软件学报, 2004,15 (8): 1172-1181.
    [44] Tryfonopoulos C, Koubarakis M, Drougas Y. Information filtering and query indexing for an information retrieval model[J]. ACM Transactions on Information Systems, 2009,27 (2): Article 10.
    [45] Li X, Barajas J M, Ding Y. Collaborative filtering on streaming data withinterest-drifting[J]. Intelligent Data Analysis, 2007,11 (1): 75-87.
    [46] Cheng R, Kao B C M, Kwan A, Prabhakar S, et al. Filtering data streams for entity-based continuous queries[J]. IEEE Transactions on Knowledge and Data Engineering, 2010,22 (2): 234-248.
    [47] Sharfman I, Schuster A, Keren D. A geometric approach to monitoring threshold functions over distributed data streams[J]. ACM Transactions on Database Systems, 2007,32 (4): Article 23.
    [48] Jin C, Ding B, Yu J X. Making filters smart in distributed data stream environments[J]. Information Sciences, 2009,179 (9): 1348-1361.
    [49] Vitter J S. Random sampling with a reservoir[J]. ACM Transactions on Mathematical Software, 1985,11 (1): 37-57.
    [50] Babcock B, Datar M, Motwani R. Sampling from a moving window over streaming data [C]. in: Proceedings of the 13th ACM-SIAM symposium on Discrete algorithms.San Francisco, California: ACM,2002. 633 - 634.
    [51] Kolonko M, Wasch D. Sequential reservoir sampling with a nonuniform distribution[J]. ACM Transactions on Mathematical Software, 2006,32 (2): 257-273.
    [52] Tao Y, Lian X, Papadias D, Hadjieleftheriou M. Random sampling for continuous streams with arbitrary updates[J]. IEEE Transactions on Knowledge and Data Engineering, 2007,19 (1): 96-110.
    [53] Park B-H, Ostrouchov G, Samatova N F. Sampling streaming data with replacement[J]. Computational Statistics & Data Analysis, 2007,52 (2): 750-762.
    [54] Chuang K-T, Chen H-L, Chen M-S. Feature-preserved sampling over streaming data[J]. ACM Transactions on Knowledge Discovery from Data, 2009,2 (4): Article 15.
    [55] Li X, Han J, Yin Z, Lee J-G, et al. Sampling cube: a framework for statistical OLAP over sampling data [C]. in: Proceedings of the 2008 ACM SIGMOD International Conference on Management of Data.Vancouver, BC, Canada: ACM,2008. 779-790.
    [56] Fischer P M, Esmaili K S, Miller R J. Stream schema: providing and exploiting static metadata for data stream processing [C]. in: Proceedings of the 12th International Conference on Extending Database Technology.Lausanne, Switzerland: ACM,2010. 207-218.
    [57] Tucker P A, Maier D, Sheard T, Fegaras L. Exploiting punctuation semantics in continuous data streams[J]. IEEE Transactions on Knowledge and Data Engineering, 2003,15 (3): 1-14.
    [58] Tucker P A, Maier D, Sheard T, Stephens P. Using punctuation schemes to characterize strategies for querying over data streams [J]. IEEE Transactions on Knowledge and Data Engineering, 2007,19 (9): 1227-1240.
    [59] Patroumpas K, Sellis T. Window specification over data streams [C]. in: Proceedings of the 10th International Conference on Extending Database Technology Workshops.Munich, Germany: Springer,2006. 445-464.
    [60] Arasu A, Babu S, Widom J. The CQL continuous query language: semantic foundations and query execution[J]. The VLDB Journal, 2006,15 (2): 121-142.
    [61] Jiao Y. Maintaining stream statistics over multiscale sliding windows[J]. ACM Transactions on Database Systems, 2006,31 (4): 1305-1334.
    [62] Li J, Maier D, Tufte K, Papadimos V, et al. No Pane, No Gain: Efficient Evaluation of Sliding-Window Aggregates over Data Streams[J]. SIGMOD Record, 2005,34 (1): 39-44.
    [63] Braverman V, Ostrovsky R. Effective computations on sliding windows[J]. SIAM Journal on Computing, 2010,39 (6): 2113-2131.
    [64] Mouratidis K, Bakiras S, Papadias D. Continuous monitoring of top-k queries over sliding windows [C]. in: Proceedings of the 2006 ACM SIGMOD International Conference on Management of Data.Chicago, Illinois, USA: ACM,2006. 635-646.
    [65] Tao Y, Papadias D. Maintaining sliding window skylines on data streams[J]. IEEE Transactions on Knowledge and Data Engineering, 2006,18 (3): 377-391.
    [66] Mouratidis K, Papadias D. Continuous nearest neighbor queries over sliding windows[J]. IEEE Transactions on Knowledge and Data Engineering, 2007,19 (6): 789-803.
    [67] Tanbeer S K, Ahmed C F, Jeong B-S, Lee Y-K. Sliding window-based frequent pattern mining over data streams[J]. Information Sciences, 2009,179 (22): 3843-3865.
    [68] Tsai P S M. Mining top-k frequent closed itemsets over data streams using the sliding window model[J]. Expert Systems with Applications, 2010,37 (10): 6968-6973.
    [69] Tu L, Chen Y. Stream data clustering based on grid density and attraction[J]. ACM Transactions on Knowledge Discovery from Data, 2009,3 (3): Article 12.
    [70] Guha S. On the space-time of optimal, approximate and streaming algorithms for synopsis construction problems[J]. The VLDB Journal, 2008,17 (6): 1509-1535.
    [71] Gilbert A C, Kotidis Y, Muthukrishnan S, Strauss M J. One-pass wavelet decompositions of data streams[J]. IEEE Transactions on Knowledge and Data Engineering, 2003,15 (3): 541-554.
    [72] Karras P, Mamoulis N. One-pass wavelet synopses for maximum-error metrics [C]. in: Proceedings of the 31st International Conference on Very Large Data Bases.Trondheim, Norway: ACM,2005. 421-432.
    [73] Guha S. Space efficiency in synopsis construction algorithms [C]. in: Proceedings of the 31st International Conference on Very Large Data Bases.Trondheim, Norway: ACM,2005. 409-420.
    [74] Guha S, Harb B. Wavelet synopsis for data streams: minimizing non-euclidean error [C]. in: Proceedings of 11th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining.Chicago, Illinois, USA: ACM,2005. 88-97.
    [75] Deligiannakis A, Garofalakis M, Roussopoulos N. Extended wavelets for multiple measures[J]. ACM Transactions on Database Systems, 2007,32 (2): Article 10.
    [76] Guha S, Park H, Shim K. Wavelet synopsis for hierarchical range queries with workloads[J]. The VLDB Journal, 2008,17 (5): 1079-1099.
    [77] Matias Y, Urieli D. Optimal workload-based weighted wavelet synopses[J]. Theoretical Computer Science, 2007,371 (3): 227-246.
    [78] Sacharidis D, Deligiannakis A, Sellis T. Hierarchically compressed wavelet synopses[J]. The VLDB Journal, 2008,18 (1): 203-231.
    [79] Liu K-H, Teng W-G, Chen M-S. Dynamic wavelet synopses management over sliding windows in sensor networks[J]. IEEE Transactions on Knowledge and Data Engineering, 2010,22 (2): 193-206.
    [80] Ioannidis Y. The history of histograms (abridged) [C]. in: Proceedings of 29th International Conference on Very Large Data Bases.Berlin, Germany: Morgan Kaufmann 2003. 19-30.
    [81] Gibbons P B, Matias Y, Poosala V. Fast incremental maintenance of approximate histograms[J]. ACM Transactions on Database Systems, 2002,27 (3): 261-298.
    [82] Guha S, Shim K, Woo J. REHIST: relative error histogram construction algorithms [C]. in: Proceedings of the Thirtieth International Conference on Very Large Data Bases.Toronto, Canada: Morgan Kaufmann,2004. 300-311.
    [83] Guha S, Shim K. A note on linear time algorithms for maximum error histograms[J]. IEEE Transactions on Knowledge and Data Engineering, 2007,19 (7): 993-997.
    [84] Lin X, Zhang Q, Yuan Y, Liu Q. Error minimization in approximate range aggregates[J]. Data & Knowledge Engineering, 2007,62 (1): 156-176.
    [85] Guha S, Koudas N, Shim Y. Approximation and streaming algorithms for histogram construction problems[J]. ACM Transactions on Database Systems, 2006,31 (1): 396-438.
    [86] Datar M, Gionis A, Indyk P, Motwani R. Maintaining stream statistics over sliding windows[J]. SIAM Journal on Computing, 2002,31 (6): 1794-1813.
    [87] Baltrunas L, Mazeika A, B?hlen M. Multi-dimensional histograms with tight bounds for the error [C]. in: Proceedings of the 10th International Database Engineering and Applications Symposium.Delhi, India: IEEE Computer Society,2006. 105-112.
    [88] Furfaro F, Mazzeo G M, SaccàD, Sirangelo C. Compressed hierarchical binary histograms for summarizing multi-dimensional data[J]. Knowledge and Information Systems, 2007,15 (3): 335-380.
    [89] Indyk P. Stable distributions, pseudorandom generators,embeddings, and data stream computation[J]. Journal of the ACM, 2006,53 (3): 307–323.
    [90] Rusu F, Dobra A. Pseudo-random number generation for sketch-based estimations[J]. ACM Transactions on Database Systems, 2007,32 (2): Article 11.
    [91] Cormode G, Muthukrishnan S. An improved data stream summary: thecount-min sketch and its applications[J]. Journal of Algorithms, 2005,55 (1): 58-75.
    [92] Ntarmos N, Triantafillou P, Weikum G. Distributed hash sketches: scalable, efficient, and accurate cardinality estimation for distributed multisets[J]. ACM Transactions on Computer Systems, 2009,27 (1): Article 2.
    [93] Cormode G, Hadjieleftheriou M. Methods for finding frequent items in data streams[J]. The VLDB Journal, 2010,19 (1): 3-20.
    [94] Rusu F, Dobra A. Fast range-summable random variables for efficient aggregate estimation [C]. in: Proceedings of the 2006 ACM SIGMOD International Conference on Management of Data.Chicago, Illinois, USA: ACM,2006. 193-204.
    [95] Metwally A, Agrawal D, Abbadi A E. Efficient computation of frequent and top-k elements in data streams [C]. in: Proceedings of the 10th International Conference on Database Theory.Edinburgh, UK: Springer,2005. 398-412.
    [96] Considine J, Hadjieleftheriou M, Li F, Byers J, et al. Robust approximate aggregation in sensor data management systems[J]. ACM Transactions on Database Systems, 2009,34 (1): Article 6.
    [97] Cormode G, Garofalakis M. Approximate continuous querying over distributed streams[J]. ACM Transactions on Database Systems, 2008,33 (2): Article 9.
    [98] Buccafurri F, Lax G. Approximating sliding windows by cyclic tree-like histograms for efficient range queries[J]. Data & Knowledge Engineering, 2010,69 (9): 979-997.
    [99] Yaxin Y, Guoren W, Dong S, Xinhua Z. Linked-Tree:An Aggregate Query Algorithm Based on Sliding Window over Data Stream[J]. Wuhan University Journal of Natural Sciences, 2006,11 (5): 1114-1119.
    [100] Nagaraj K, Naidu K V M, Rastogi R, Satkin S. Efficient aggregate computation over data streams [C]. in: Proceedings of the 24th International Conference on Data Engineering.Cancún México: IEEE Computer Society,2008. 1382-1384.
    [101] Park N H, Lee W S. Cell trees: an adaptive synopsis structure for clustering multi-dimensional on-line data streams[J]. Data & Knowledge Engineering, 2007,63 (2): 528-549.
    [102] Lee J W, Park N H, Lee W S. Efficiently tracing clusters over high-dimensional on-line data streams[J]. Data & Knowledge Engineering, 2009,68 (3): 362-379.
    [103] Zhang D, Markowetz A, Tsotras V J, Gunopulos D, et al. On computing temporal aggregates with range predicates[J]. ACM Transactions on Database Systems, 2008,33 (2): Article 12, 39 pages.
    [104] Hershberger J, Shrivastava N, Suri S, Tóth C D. Adaptive spatial partitioning for multidimensional data streams[J]. Algorithmica, 2006,46 (1): 97-117.
    [105] Terry D, Goldberg D, Nichols D, Oki B. Continuous queries over append-only databases [C]. in: Proceedings of the 1992 ACM SIGMOD International Conference onManagement of Data.San Diego, California: ACM,1992. 321-330.
    [106] Gurevich Y, Leinders D, Bussche J V d. A theory of stream queries [C]. in: Proceedings of the 11th International Symposium on Database Programming Languages.Vienna, Austria: Springer,2007. 153-168.
    [107] Kr?mer J, Seeger B. Semantics and implementation of continuous sliding window queries over data streams[J]. ACM Transactions on Database Systems, 2009,34 (1): Article 4.
    [108] Hsueh Y-L, Zimmermann R, Ku W-S. Efficient updates for continuous skyline computations [C]. in: Proceedings of the 19th International Conference on Database and Expert Systems Applications.Turin, Italy: Springer,2008. 419-433.
    [109] Gao L, Wang X S. Continuous similarity-based queries on streaming time series[J]. IEEE Transactions on Knowledge and Data Engineering, 2005,17 (10): 1320-1332.
    [110] Golab L, ?zsu M T. Update-Pattern-Aware modeling and processing of continuous queries [C]. in: Proceedings of the 2005 ACM SIGMOD International Conference on Management of Data.Baltimore, Maryland, USA: ACM,2005. 658-669.
    [111] Patroumpas K, Sellis T. Window update patterns in stream operators [C]. in: Proceedings of the 13th East European Conference on Advances in Databases and Information Systems.Riga, Latvia: Springer,2009. 118-132.
    [112] Patroumpas K, Sellis T. Maintaining consistent results of continuous queries under diverse window specifications[J]. Information Systems, 2011,36 (1): 42-61.
    [113] Wei M, Rundensteiner E. Mode aware stream query processing [C]. in: Proceedings of the 21st International Conference on Scientific and Statistical Database Management.New Orleans, LA, USA: Springer,2009. 380-397.
    [114] Ghanem T M, Hammad M A, Mokbel M F, Aref W G, et al. Incremental evaluation of sliding-window queries over data streams[J]. IEEE Transactions on Knowledge and Data Engineering, 2007,19 (1): 57-72.
    [115] Arasu A, Babcock B, Babu S, Mcalister J, et al. Characterizing memory requirements for queries over continuous data streams[J]. ACM Transactions on Database Systems, 2004,29 (1): 162-194.
    [116] Babu S, Srivastava U, Widom J. Exploiting k-constraints to reduce memory overhead in continuous queries over data streams[J]. ACM Transactions on Database Systems, 2004,29 (3): 545-580.
    [117] Cammert M, Kr?mer J, Seeger B, Vaupel S. A cost-based approach to adaptive resource management in data stream systems[J]. IEEE Transactions on Knowledge and Data Engineering, 2008,20 (2): 230-245.
    [118] Botan I, Alonso G, Fischer P M, Kossmann D, et al. Flexible and scalable storage management for data-intensive stream processing [C]. in: Proceedings of the 12th International Conference on Extending Database Technology.Saint Petersburg, Russia:ACM,2009. 934-945.
    [119] Babcock B, Babu S, Datar M, Motwani R, et al. Operator scheduling in data stream systems[J]. The VLDB Journal, 2004,13 (4): 333-353.
    [120] Yang Y, Kr?mer J, Papadias D, Seeger B. HybMig: a hybrid approach to dynamic plan migration for continuous queries[J]. IEEE Transactions on Knowledge and Data Engineering, 2007,19 (3): 398-411.
    [121] Sharaf M A, Chrysanthis P K, Labrinidis A, Pruhs K. Algorithms and metrics for processing multiple heterogeneous continuous queries[J]. ACM Transactions on Database Systems 2008,32 (1): Article 5.
    [122] Zhou Y, Salehi A, Aberer K. Scalable delivery of stream query result [C]. in: Proceedings of the 35th International Conference on Very Large Data Bases.Lyon, France: ACM,2009. 49-60.
    [123] Watanabe Y, Kitagawa H. Query result caching for multiple event-driven continuous queries[J]. Information Systems, 2010,35 (1): 94-110.
    [124] Ghanem T M, Elmagarmid A K, Larson P-A, Aref W G. Supporting views in data stream management systems[J]. ACM Transactions on Database Systems, 2010,35 (1): Article 1.
    [125] Lee H-H, Yun E-W, Lee W-S. Attribute-based evaluation of multiple continuous queries for filtering incoming tuples of a data stream[J]. Information Sciences, 2008,178 (11): 2416-2432.
    [126] Gyssens M, Lakshmanan L V S. A foundation for multi-dimensional databases [C]. in: Proceedings of the 23rd International Conference on Very Large Data Bases.Athens,Greece: Morgan Kaufmann,1997. 106-115.
    [127] Agrawal R, Gupta A, Sarawagi S. Modeling multidimensional databases [C]. in: Proceedings of the 13th International Conference on Data Engineering.Birmingham, U.K.: IEEE Computer Society,1997. 232-243.
    [128] Lechtenb?rger J, Vossen G. Multidimensional normal forms for data warehouse design[J]. Information Systems, 2003,28 (5): 415-434.
    [129] Malinowski E, Zimanyi E. Hierarchies in a Multidimensional Model: From Conceptual Modeling to Logical Representation[J]. Data & Knowledge Engineering, 2006,59 (2): 348-377.
    [130] Mansmann S, Scholl M H. Empowering the OLAP technology to support complex dimension hierarchies[J]. International Journal of Data Warehousing and Mining, 2007,3 (4): 31-50.
    [131] Li Z, Sun J, Yu H. EDH: A Powerful method for the modeling of Irregular Dimensions[J]. International Journal of Computer Science and Network Security, 2006,6 (4): 36-41.
    [132] Eavis T, Taleb A. MapGraph: efficient methods for complex OLAP hierarchies [C]. in: Proceedings of the 16th ACM Conference on Information and KnowledgeManagement.Lisbon, Portugal: ACM,2007. 465-474.
    [133] Rizzi S, AbellóA, Lechtenb?rger J, Trujillo J. Research in data warehouse modeling and design: dead or alive? [C]. in: Proceedings of the 9th ACM International Workshop on Data Warehousing and OLAP.Arlington, Virginia, USA: ACM,2006. 3-10.
    [134] Hurtado C A, Gutierrez C, Mendelzon A O. Capturing summarizability with integrity constraints in OLAP[J]. ACM Transactions on Database Systems, 2005,30 (3): 854-886.
    [135] Mazón J-N, Lechtenb?rger J, Trujillo J. A survey on summarizability issues in multidimensional modeling[J]. Data & Knowledge Engineering, 2009,68 (12): 1452-1469.
    [136] Jensen C S, Kligys A, Pedersen T B, Timko I. Multidimensional data modeling for location-based services[J]. The VLDB Journal, 2004,13 (1): 1-21.
    [137] Horner J, Song I-Y. A taxonomy of inaccurate summaries and their management in OLAP systems [C]. in: Proceedings of the 24th International Conference on Conceptual Modeling.Klagenfurt,Austria: Springer,2005. 433-448.
    [138]陆昌辉.复杂多维数据模型的描述、构建与查询处理方法研究[D].长沙:国防科学技术大学.博士学位论文, 2006.
    [139]刘青宝.模糊、动态多维数据建模理论与方法研究[D].长沙:国防科学技术大学.博士学位论文, 2006.
    [140] AbellóA, Samos J, Saltor F. YAM2: a multidimensional conceptual model extending UML[J]. Information Systems, 2006,31 (6): 541-567.
    [141] Golfarelli M, Rizzi S. A survey on temporal data warehousing[J]. International Journal of Data Warehousing and Mining, 2009,5 (1): 1-17.
    [142] Hurtado C A, Mendelzon A O, Vaisman A A. Updating OLAP dimensions [C]. in: Proceedings of ACM 2nd International Workshop on Data Warehousing and OLAP.Kansas City, Missouri, USA: ACM,1999. 60-66.
    [143] Body M, Miquel M, Bédard Y, Tchounikine A. Handling evolutions in multidimensional structures [C]. in: Proceedings of the 19th IEEE International Conference on Data Engineering.Bangalore,India: IEEE Computer Society 2003. 581-592.
    [144] Golfarelli M, Lechtenb?rger J, Rizzi S, Vossen G. Schema versioning in data warehouses: enabling cross-version querying via schema augmentation[J]. Data & Knowledge Engineering 2006,59 (2): 435-459.
    [145] Moreno F, Fileto R, Arango F. Season queries on a temporal multidimensional model for OLAP[J]. Mathematical and Computer Modelling, 2010,52 (7-8): 1103-1109.
    [146] Vaisman A A, Espil M M, Paradela M. P2P OLAP: Data model, implementation and case study[J]. Information Systems, 2009,34 (2): 231-257.
    [147] Pedersen T B, Gu J, Shoshani A, Jensen C S. Object-extended OLAPquerying[J]. Data & Knowledge Engineering, 2009,68 (5): 453-480.
    [148] Huang C-C, Tseng T-L B, Li M-Z, Gung R R. Models of multi-dimensional analysis for qualitative data and its application[J]. European Journal of Operational Research, 2007,174 (2): 983-1008.
    [149] Chou L P, Zhang X. Computing complex iceberg cubes by multiway aggregation and bounding [C]. in: Proceedings of the 6th International Conference on Data Warehousing and Knowledge Discovery.Zaragoza, Spain: Springer,2004. 108-117.
    [150] Shao Z, Han J, Xin D. MM-Cubing: computing iceberg cubes by factorizing the lattice space [C]. in: Proceedings of the 16th Conference on Statistical and Scientific Database Management.Santorini Island, Greece: IEEE Computer Society,2004. 213-222.
    [151] Beyer K, Ramakrishnan R. Bottom-up computation of sparse and iceberg cubes [C]. in: Proceedings of the 1999 ACM SIGMOD International Conference on Management of Data.Philadelphia,PA: ACM,1999. 359-370.
    [152]骆吉洲,李建中,赵锴.大型压缩数据仓库上的Iceberg Cube算法[J].软件学报, 2006,17 (8): 1743-1752.
    [153] Xin D, Han J, Li X, Shao Z, et al. Computing iceberg cubes by top-down and bottom-up integration:the StarCubing approach[J]. IEEE Transactions on Knowledge and Data Engineering, 2007,19 (1): 111-126.
    [154] Han J, Pei J, Dong G, Wang K. Efficient computation of iceberg cubes with complex measures [C]. in: Proceedings of the 2001 ACM SIGMOD International Conference on Management of Data.Santa Barbara,California USA: ACM,2001. 1-12.
    [155] Wang K, Jiang Y, Yu J X, Dong G, et al. Divide-and-approximate: a novel constraint push strategy for iceberg cube mining[J]. IEEE Transactions on Knowledge and Data Engineering, 2005,17 (3): 354-368.
    [156] Zhang X, Lien P, Chou h, Dong G. Efficient computation of iceberg cubes by bounding aggregate functions[J]. IEEE Transactions on Knowledge and Data Engineering, 2007,19 (7): 903-918.
    [157] Xin D, Shao Z, Han J, Liu H. C-Cubing: efficient computation of closed cubes by aggregation-based checking [C]. in: Proceedings of the 22nd International Conference on Data Engineering.Atlanta,Georgia,USA: IEEE Computer Society,2006. Article 4.
    [158] Cuzzocrea A, Furfaro F, Mazzeo G M. A probabilistic approach for computing approximate iceberg cubes [C]. in: Proceedings of the 19th International Conference on Database and Expert Systems Applications.Turin, Italy: Springer,2008. 348-361.
    [159] Chen Y, Dehne F, Eavis T, Rau-Chaplin A. PnP: sequential, external memory, and parallel iceberg cube computation[J]. Distributed and Parallel Databases, 2008,23 (2): 99-126.
    [160] Han J, Chen Y, Dong G, Pei J, et al. Stream cube: an architecture for multi-dimensional analysis of data streams[J]. Distributed and Parallel Databases,2005,18 (2): 173-197.
    [161] Wang W, Feng J, Lu H, Yu J X. Condensed cube: an effective approach to reducing data cube size [C]. in: Proceedings of the 18th International Conference on Data Engineering.San Jose, California, USA: IEEE Computer Society,2002. 155-165.
    [162] Sismanis Y, Deligiannakis A, Roussopoulos N, Kotidis Y. Dwarf: shrinking the PetaCube [C]. in: Proceedings of the 2002 ACM SIGMOD International Conference on Management of Data.Madison, Wisconsin, USA: ACM,2002. 464-475.
    [163] Leng F, Bao Y, Wang D, Yu G. A clustered dwarf structure to speed up queries on data cubes [C]. in: Proceedings of the 9th International Conference on Data Warehousing and Knowledge Discovery.Regensburg, Germany: Springer,2007. 170-180.
    [164] Lakshmanan L V S, Pei J, Han J. Quotient cube: how to summarize the semantics of a data cube [C]. in: Proccedings of the 28th International Conference on Very Large Data Bases.Hong Kong,China: Morgan Kaufmann,2002. 778-789.
    [165] Lakshmanan L V S, Pei J, Zhao Y. QCTrees:an efficient summary structure for semantic OLAP [C]. in: Proceedings of the 2003 ACM SIGMOD International Conference on Management of Data.San Diego, CA: ACM,2003. 64-75.
    [166] Cui-ping L, Shan W. Efficient incremental maintenance for distributive and non-distributive aggregate functions[J]. Journal of Computer Science & Technology, 2006,21 (1): 52-65.
    [167] Casali A, Cicchetti R, Lakhal L. Extracting semantics from data cubes using cube transversals and closures [C]. in: Proceedings of the 9th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining.Washington, DC, USA: ACM,2003. 69-78.
    [168]师智斌,黄厚宽.基于形式概念分析的约简数据立方体研究[J].计算机研究与发展, 2009,46 (11): 1956-1962.
    [169]向隆刚,龚健雅.一种高度浓缩和语义保持的数据立方[J].计算机研究与发展, 2007,44 (5): 837-844.
    [170] Kuznetsov S D, Kudryavtsev Y A. A mathematical model of the OLAP cubes[J]. Programming and Computer Software, 2009,35 (5): 257-265.
    [171] Li X, Han J, Gonzalez H. High-dimensional OLAP: a minimal cubing approach [C]. in: Proceedings of the 30th International Conference on Very Large Data Bases.Toronto, Ontario, Canada: Morgan Kaufmann,2004. 528-539.
    [172] Harinarayan V, Rajaraman A, Ullman J D. Implementing data cubes efficiently [C]. in: Proceedings of the 1996 ACM SIGMOD International Conference on Management of Data.Montreal, Que, Canada: ACM,1996. 205-216.
    [173] Liang W, Wang H, Orlowska M E. Materialized view selection under the maintenance time constraint[J]. Data & Knowledge Engineering, 2001,37 (2): 203-216.
    [174] Hung M-C, Huang M-L, Yang D-L, Hsueh N-L. Efficient approaches formaterialized views selection in a data warehouse[J]. Information Sciences, 2007,177 (6): 1333–1348.
    [175] Gupta H, Mumick I S. Selection of views to materialize in a data warehouse[J]. IEEE Transactions on Knowledge and Data Engineering, 2005,17 (1): 24-43.
    [176] Lawrence M, Rau-Chaplin A. Dynamic view selection for OLAP[J]. International Journal of Data Warehousing and Mining, 2008,4 (1): 47-61.
    [177] Poosala V, Ganti V. Fast approximate answers to aggregate queries on a data cube [C]. in: Proceedings of the 11th International Conference on Statistical and Scientific Database Management.Cleveland, Ohio, USA: IEEE Computer Society,1999. 24-33.
    [178] Vitter J S, Wangt M, Iyer B. Data cube approximation and histograms via wavelets [C]. in: Proceedings of the 7th ACM International Conference on Information and Knowledge Management.Bethesda MD USA: ACM,1998. 96-104.
    [179] Wu Y-L, Agrawal D, Abbadi A E. Using wavelet decomposition to support progressive and approximate range-sum queries over data cubes [C]. in: Proceedings of the 9th ACM International Conference on Information and Knowledge Management.McLean, VA, USA: ACM,2000. 414-421.
    [180] Barbara D, Xintao Wu. Loglinear-based quasi cubes[J]. Journal of Intelligent Information Systems, 2001,13 (3): 255-276.
    [181] Cuzzocrea A, Wei W. Approximate range–sum query answering on data cubes with probabilistic guarantees[J]. Journal of Intelligent Information Systems, 2007,28 (2): 161-197.
    [182] Pei J, Yuan Y, Lin X, Jin W, et al. Towards multidimensional subspace skyline analysis[J]. ACM Transactions on Database Systems, 2006,31 (4): 1335-1381.
    [183] Nedjar S, Casali A, Cicchetti R, Lakhal L. Emerging cubes: borders, size estimations and lossless reductions[J]. Information Systems 2009,34 (6): 536-550.
    [184] Romero O, AbellóA. On the need of a reference algebra for OLAP [C]. in: Proceedings of the 8th International Conference on Data warehouses and Knowledge Discovery.Krakow,Poland: Springer,2007. 99-110.
    [185] Ravat F, Teste O, Tournier R, Zurfluh G. Algebraic and graphic languages for OLAP manipulations[J]. International Journal of Data Warehousing and Mining, 2008,4 (1): 17-46.
    [186] Kotidis Y, Roussopoulos N. A case for dynamic view management[J]. ACM Transactions on Database Systems, 2001,26 (4): 388-423.
    [187] Park C-S, Kim M H, Lee Y-J. Finding an efficient rewriting of OLAP queries using materialized views in data warehouses[J]. Decision Support Systems, 2002,32 (4): 379-399.
    [188] Yang W S, Chung Y D, Kim M H. The RD-Tree: a structure for processing Partial-MAX/MIN queries in OLAP[J]. Information Sciences, 2002,146 (1-4): 137-149.
    [189] Hammer J, Fu L. CubiST++: evaluating ad-hoc CUBE queries using statisticstrees[J]. Distributed and Parallel Databases, 2003,14 (3): 221-254.
    [190] Theodoratos D, Tsois A. Processing OLAP queries in hierarchically clustered databases[J]. Data & Knowledge Engineering, 2003,45 (2): 205-224.
    [191] Chun S-J, Chung C-W, Lee S-L. Space-efficient cubes for OLAP range-sum queries[J]. Decision Support Systems, 2004,37 (1): 83-102.
    [192] Chatziantonioua D, Ross K A. Partitioned optimization of complex queries[J]. Information Systems, 2007,32 (2): 248-282.
    [193] Smith J R, Li C-S, Jhingran A. A Wavelet Framework for Adapting Data Cube Views for OLAP[J]. IEEE Transactions on Knowledge and Data Engineering, 2004,16 (5): 552-565.
    [194] Yiu M L, Mamoulis N, Hristidis V. Extracting k most important groups from data efficiently[J]. Data & Knowledge Engineering, 2008,66 (2): 289-310.
    [195] Yiu M L, Mamoulis N. Efficient processing of top-k dominating queries on multi-dimensional data [C]. in: Proceedings of the 33rd International Conference on Very Large Data Bases.Vienna, Austria: ACM,2007. 483-494.
    [196] Ramakrishnan R, Chen B-C. Exploratory mining in cube space[J]. Data Mining and Knowledge Discovery, 2007,15 (1): 29-54.
    [197] Li H-G, Yu H, Agrawal D, Abbadi A E. Progressive ranking of range aggregates[J]. Data & Knowledge Engineering, 2007,63 (1): 4-25.
    [198] Chung Y D, Yang W S, Kim M H. An efficient, robust method for processing of partial top-k/bottom-k queries using the RD-Tree in OLAP[J]. Decision Support Systems, 2007,43 (2): 313-321.
    [199] Jiang L, Yang D, Tang S, Ma X, et al. Constructing cube blocks effectively for stream data analysis [C]. in: Proceedings of the 7th International Conference on Web-Age Information Management Workshops.Hong Kong,China: IEEE Computer Society,2006. Article 11.
    [200] Cho M, Pei J, Wang K. Answering ad hoc aggregate queries from data streams using prefix aggregate trees[J]. Knowledge and Information Systems, 2007,12 (3): 301-329.
    [201] Hsieh M-J, Chen M-S, Yu P S. Approximate query processing in cube streams[J]. IEEE Transactions on Knowledge and Data Engineering, 2007,19 (11): 1557-1570.
    [202] Cuzzocrea A, Chakravarthy S. Event-based lossy compression for effective and efficient OLAP over data streams[J]. Data & Knowledge Engineering, 2010,69 (7): 678-708.
    [203] Pitarch Y, Laurent A, Poncelet P. Summarizing multidimensional data streams: a hierarchy-graph-based approach [C]. in: Proceedings of the 14th Pacific-Asia Conference on Advances in Knowledge Discovery and Data Mining.Hyderabad,India: Springer,2010. 335-342.
    [204] Yin X, Pedersen T B. What can hierarchies do for data streams [C]. in:Proceedings of the 1st International Workshop on Business Intelligence for the Real Time Enterprise.Seoul,Korea: Springer,2006. 4-19.
    [205]李爱平,杨庆民,甘亮.基于Dwarf的数据流立方体的研究与实现[J].计算机研究与发展, 2009,46 (Suppl.): 192-197.
    [206] Cuzzocrea A. CAMS: OLAPing multidimensional data streams efficiently [C]. in: Proceedings of the 11th International Conference on Data Warehousing and Knowledge Discovery.Linz, Austria: Springer,2009. 48-62.
    [207] Lee H-H, Lee W-S. Consistent collective evaluation of multiple continuous queries for filtering heterogeneous data streams[J]. Knowledge and Information Systems, 2010,22 (2): 185-210.
    [208]张冬冬,李建中,王伟平,郭龙江.数据流历史数据的存储与聚集查询处理算法[J].软件学报, 2005,16 (12): 2089-2098.
    [209] Bettini C, Dyreson C E, Evans W S, Snodgrass R T, et al. 1998. A glossary of time granularity concepts. In Temporal Databases: Research and Practice, O. Etzion, S. Jajodia, S. Sripada Eds. Springer-Verlag, Berlin, 406-413.
    [210] López I F V, Snodgrass R T, Moon B. Spatiotemporal Aggregate Computation: A Survey[J]. IEEE Transactions on Knowledge and Data Engineering, 2005,17 (2): 271 - 286.
    [211] Fu L, Rajasekaran S. Evaluating holistic aggregators efficiently for very large datasets[J]. The VLDB Journal, 2004,13 (2): 148-161.
    [212]刘青宝,金燕,侯东风,张维明.数据流层次窗口模型及聚集查询算法[J].计算机科学, 2007,34 (5): 194-196.
    [213] Cohen S, Nutt W, Sagiv Y. Deciding equivalences among conjunctive aggregate queries[J]. Journal of the ACM, 2007,54 (2): Article 5.
    [214] Facca F M, Lanzi P L. Mining interesting knowledge from Weblogs: a survey[J]. Data & Knowledge Engineering, 2005,53 (3): 225-241.
    [215] Huang Y-F, Hsu J-M. Mining Web logs to improve hit ratios of prefetching and caching[J]. Knowledge-Based Systems, 2008,21 (1): 62-69.
    [216] Yang Y C. Web user behavioral profiling for user identification[J]. Decision Support Systems, 2010,49 (3): 261-271.
    [217] Yang Y C, Padmanabhan B. Toward user patterns for online security: Observation time and online user identification[J]. Decision Support Systems, 2010,48 (4): 548-558.
    [218] Masseglia F, Poncelet P, Teisseire M, Marascu A. Web usage mining: extracting unexpected periods from Web logs[J]. Data Mining and Knowledge Discovery, 2008,16 (1): 39-65.
    [219] Hawwash B, Nasraoui O. Mining and tracking evolving Web user trends from large Web server logs[J]. Statistical Analysis and Data Mining, 2010,3 (2): 106-125.

地址：北京市海淀区学院路29号邮编：100083

电话：办公室：(+86 10)66554848；文献借阅、咨询服务、科技查新：66554700