基于可扩展对象的海量存储系统研究
详细信息    本馆镜像全文|  推荐本文 |  |   获取CNKI官网全文
摘要
信息存储是人类社会永恒的需求。随着计算机技术的发展和应用的普及,信息存储容量成爆炸性地增长,现有网络存储系统已无法满足人们对于存储的需要。基于对象存储(Object-Based Storage,OBS)技术逢时崛起,利用现有的存储组件、处理技术和网络技术,通过简单方式来获得前所未有的高吞吐量,成为下一代网络存储的主流。它采用包含数据和属性的“对象”作为接口,既有了“块”接口的快速,又有“文件”接口的便于共享,并分离了存储数据的逻辑视图和物理视图,将存储数据的逻辑视图保留在元数据服务器中,而物理数据存放在基于对象存储设备(Object-Based Storage Device,OSD)中。同时,它将传统文件分解为系列数据对象,分发到一个或多个OSD中。虽然对象给存储系统带来了一种新的理念,但现有的与对象相关的存储系统中对象都仅定义为非定长的数据单位,束缚了“对象”这个有着丰富内涵的词汇。
     基于可扩展对象的海量存储系统(Based on Scalable Object Mass Storage System,BSO-MSS)吸取了OBS的优点,在“对象”现有的含义基础上扩充,使它不仅只包括用户数据,还将目录、文件、存储设备管理等纳入对象之中,形成层次结构的对象体系结构,实现对象的分布存储、层次管理的模式,并建立基于存储对象统一访问模式,将块、对象和文件三种存储接口进行融合与统一。这样不仅具有统一逻辑视图、数据共享、主动服务、并行访问、统一存储和易管理等特点,而且有着其他存储结构难以达到的高可扩展性和高性能。
     通过建立系统广义随机Petri网模型,对BSO-MSS进行性能评价,模拟结果显示无论增加存储对象(Storage Object,SO)还是客户端,系统性能都随之增加。并采用测试工具iozone对系统原型与Lustre系统作对比测试,测试结果表明写性能超过Lustre,读性能略比Lustre好,并验证了BSO-MSS的广义随机Petri网模型。
     首次将存储系统与元胞自动机相结合,利用元胞自动机的原理,解析BSO-MSS动力演变规律。构建了一个通用框架的BSO-MSSCA概念模型框架,并在此基础上,分析了两种具体元胞自动机模型。基于存储对象负载分配模型是将SO解析为元胞,模拟了一个简单的负载均衡分配的动态变化,高度概括了BSO-MSS的演变过程。基于数据对象访问行为模型则分析数据对象的访问频率对系统的影响,结合数据对象访问的特征和主动性,通过机械学习适当调整数据对象的访问行为频率,使系统朝着稳定方向发展。通过分析基于存储对象的负载分配模型和基于数据对象的访问行为模型的演变过程,可以看出系统具有主动性、共享性、并行性、相关性等特性,是一个自组织管理的对象存储系统。
     大规模分布式存储系统中,元数据高性能服务、负载均衡以及扩展性已成为一个重要的研究热点。在元数据服务器中,将元数据分解为目录对象和文件对象,目录对象为定位性元数据,提供文件所在位置和访问控制;文件对象为描述性元数据,描述文件的数据特性。每一个元数据服务器(Metadata Server,MDS)负责所有目录对象和自身的文件对象,这样充分利用MDS中Cache,提高Cache的命中率,减少磁盘I/O次数,而且能够动态扩展MDS。同时,以目录对象ID和文件名为关键字的哈希值作为局部元数据查找表(Local Metadata Lookup Table,LMLT)的索引,获得相应的MDS_ID。一旦目录权限改变、更名、移动目录、修改权限等都不会造成元数据的迁移。通过Bloom Filter算法将每个MDS的LMLT压缩成一个摘要,能够实现快速的元数据查找。同时采用主从备三重链式结构的MDS服务,不仅在未提高硬件成本下能够保证系统高可靠性和可用性,而且根据热点访问进行迁移,实现负载均衡。
     SO是BSO-MSS重要组成单位,它与OSD不同之处是本身具有“接口”与“状态”标识,由数据、属性和方法组成,这样对现有的T10 OSD标准进行了扩充。由于数据对象是通常在一维空间中命名,传统文件系统管理大量数据对象的效率是极其低,采用线性哈希查找算法,由负载因子控制分裂和合并,与传统文件系统的树结构查找相比,哈希法查找时间复杂度为O(1)。同时,针对Ext2文件系统中数据访问至少需两次以上的磁盘操作特性,将数据的块地址和长度链接在一起,作为对象的扩展属性,连同数据对象一起存储到磁盘中,这样无论数据对象大小为多少,磁盘访问次数仅为两次。
     在BSO-MSS中,负载与众多因素相关,如请求队列长度、CPU处理能力、内存大小、网络带宽、磁盘带宽和磁盘容量等。负载柔性放置策略不仅考虑网络的影响,而且考虑SO之间存在差异,并设置权重,权重大的SO担负较多的负载。依据SO属性中信息统计出负载特征,以系统响应时间为代价,自适应选择SO数目,采用不同大小的分条进行存储,使BSO-MSS具有更高的性能、可扩展性和自适应负载均衡能力。
Information storage is an eternal need in the human society. With the development of computer technology and the popularization of its application, the current network storage system can no longer satisfy the increasing demands of storage as the information storage capacity increases explosively. Object Based Storage (OBS) is an emerging technology and will become the next wave of network storage technology. It will achieve an unprecedented high throughput through a simple method using the existing component, processing and network technology. Using object-based interface containing data and attributes, OBS captures the features of both the high speed of block-based interface and convenient data sharing of file-based interface. It has separated the logical and physical views of the storage data, with the former unchanged and the latter offloaded to the Object-based Storage Device (OSD). It also divides a file into separate a series of data objects that are striped across an OSD or multiple OSDs. However, although the object brings a new concept for storage system, as a variable-length data unit in the existing object storage systems, its meaning as a rich object is limited.
     The Based on Scalable Object Mass Storage System (BSO-MSS) takes advantage of OBS and extends the concept of object. It includes not only the user data, but also the directory, file and storage device, forming a hierarchical object storage system architecture. It realizes the object distributed storing and hierarchical managing. It builds a compatible object access mode based on the storage object, which harmonizes block-based, file-based and object-based interfaces. In this way, BSO-MSS can provide unified logic view, data sharing, active service, parallel access, unified storage and easy management. And it possesses a high scalability and performance that has never existed before.
     The Generalized Stochastic Petri Net (GSPN) model is set up to evaluate the performance of the BSO-MSS. The simulation result shows that the system performance improves with the increase in the number of both the storage objects and clients. Moreover, iozone is also used to benchmark BSO-MSS and Lustre. The result demonstrates that the writing performance of BSO-MSS is superior to that of Lustre, and its reading performance is only slightly better. It also validates the GSPN model of BSO-MSS.
     The storage system is first combined with Cellar Automata. The theory of Cellar Automata is used to analyze the dynamic behavior of the BSO-MSS, and construct a general model frame of BSO-MSSCA which provides the foundation for the analysis of two cases. Taking a SO as cell, the SO-based load assignation model, simulates a simple dynamic movement of load balance, and highly generalizes the evolving course of BSO-MSS. The DO-based access behavior model analyzes the effect of the access frequency of Data Object (DO) on storage system, and then suitably adjusts the access frequency of data objects in accordance with their characteristics through mechanical study and activity so as to improve the system stableness. Through analyzing SO-based load assignation model and DO-based access behavior mode, we can easily learn that the BSO-MSS is an object storage system of self-management with characteristics of activity, sharing, parallelism and relativity.
     In large distributed storage systems, it is critical to research on high performance metadata service, load balance and scalability. In the metadata server (MDS), metadata is divided into directory object and file object. The former is a locating metadata, providing the file location and access control; the latter is a descriptive metadata, depicting the data of file. Each MDS is responsible for the whole directory objects and its own file objects so as to make full use of the Cache, improving the hit rate of Cache, reducing the number of disk I/O, and extending MDS dynamically. Meanwhile,The hash value of key including directory object ID and file name is used as an index to the local metadata lookup table (LMLT), obtaining the corresponding metadata server id. It will not cause the metadata migration in case that directory permission is changed, directory or file is renamed, directory is moved, or permission is modified. Through Bloom Filter algorithm, the LMLT of each MDS can be compressed into a brief to achieve rapid metadata finding. The metadata service of master/slave/standby chain structure is also used, which not only ensures the high reliability and availability of the system without increasing the hardware costs, but also enables the access migration for hot spots and achieves the load balance.
     The SO is an important unit of BSO-MSS. It differs from OSD in that it is identified with“interface”and“state”, consisting of data, attribute and method. So T10 OSD protocol is modified. As data object has been usually named in the one-dimensional space, the efficiency of traditional file system was very low to manage a large number of data object. After the linear hash Lookup algorithm has been adopted, the load factor controls splitting and merging. Compared with the tree-structure in the tradition file system, the lookup time complexity of the hash algorithm is O(1).Using Ext2 file system for object storage, one needs at least two data access to operate disk. The combination of block address and block length forms the extended attributes of object, stored on the disk along with data object. No matter what size data object is, only two disk accesses are needed
     The workload is related to many factors in BSO-MSS, such as the length of the request queue, CPU processing capability, memory size, network bandwidth, disk bandwidth and disk capacity. Considering the differences between SOs as well as the influence of network, workload flexible layout strategy sets weight to allow SO of greater weight shouldering more workload. Based on the workload characteristics of the attribute information statistically calculating in the SO, the number of SO can be adaptively selected at the expense of system response time, with strips of different sizes to store data. It shows that the BSO-MSS is of higher performance, scalability and better load balance.
引文
[1] M. Mesnier, G. R. Ganger, and E. Riedel. Object-based storage. Communications Magazine, 2003, 41 (8): 84~90
    [2] D. Patterson, G. Gibson and R. Katz. A Case for Redundant arrays of Inexpensive Disks (RAID). In: Proceedings of the 1988 ACM SIGMOD international conference on Management of data. 1988. 109~116
    [3] J. Menon, M. Hartung. The IBM 3990 Disk Cache. In: Proceedings of the IEEE Computer Society International COMPCON Conference. 1988. 146~151
    [4] A. Vasudeva. A Case for Disk Array Storage System. In: Proceedings of Systems Design and Networks Conference, Mass Storage Trends and Systems Integration, ed. Kenneth Majithia. April 1988
    [5] K. Salem, H. Garcia-Molina. Disk striping. In: Proceedings of 2nd IEEE International Conference on Data Engineering. February 1986. 336~342
    [6] L. Cabrera, D. Long. Swift. Using Distributed Disk Striping to Provide High I/O Data Rates. Computing Systems, 1991, 4(4): 405~439
    [7] M. Y. Kim. Synchronized Disk Interleaving. IEEE Transactions on Computers, 1986, 35 (11): 978~988
    [8] M. Y. Kim, A. N. Tantawi. Asynchronous Disk Interleaving: Approximating Access Delays. IEEE Transactions on Computer, 1991, 40(7): 801~810
    [9] P. M. Chen, E. K. Lee, G. A. Gibson et al. RAID: High-Performance, Reliable Secondary Storage. ACM Computing Surveys, 1994, 26(2): 145~185
    [10] R. H. Katz, G. A. Gibson and D. A. Patterson. Disk System Architectures for High Performance Computing. IEEE, 1989, 77(12): 1842~1858
    [11] 张江陵, 冯丹. 海量信息存储. 北京: 科学出版社, 2003
    [12] E. Riedel, G. Gibson. Understanding Customer Dissatisfaction with Underutilized Distributed File Servers. In: Proceedings of the 5th NASA Goddard Space Flight Center Conference on Mass Storage Systems and Technologies, September 1996. 17~19
    [13] X. Molero, F. Silla, V. Santonja, et al. On the Interconnection Topology for StorageArea Networks. In: Proceedings of 15th Parallel and Distributed Processing Symposium. 2001. 1648~1656
    [14] allSAN Research Services. http: //www. allsan. com/marketresearch. php3
    [15] Harry Hulen, Otis Graf, Keith Fitzgerald, Richard W. Watson. Storage Area Networks and the High Performance Storage System. In: Proceedings of the 19th IEEE/10th NASA Goddard Conference on Mass Storage Systems and Technologies, April 15-18, 2002. 225~240
    [16] P. Wang, R. E. Gilligan, Green, et al. IP SAN - From iSCSI to IP-Addressable Ethernet Disks. In: Proceedings of the 20th IEEE/11th NASA Goddard Conference on Mass Storage Systems and Technologies. April 7-10, 2003. 189~193
    [17] A. Singh, S. Gopisetty, L. Duyanovich, et al. Security vs Performance: Tradeoffs using a Trust Framework. In: Proceedings of the 22nd IEEE/13th NASA Goddard Conference on Mass Storage Systems and Technologies. April 11-14, 2005. 270~277
    [18] M. Banikazemi, D. Poff, B. Abali. Storage-Based Intrusion Detection for Storage Area Networks (SANs). In: Proceedings of the 22nd IEEE / 13th NASA Goddard Conference on Mass Storage Systems and Technologies. April 11-14, 2005. 118~127
    [19] Garth A. Gibson and Rodney Van Meter. Network Attached Storage Architecture. Communications of the ACM, 2000, 43 (11): 37~45
    [20] J. Bright, J. Chandy. A Scalable Architecture for Clustered Network Attached Storage. In: Proceedings of the 20th IEEE/11th NASA Goddard Conference on Mass Storage Systems and Technologies. April 7-10, 2003. 196~206
    [21] 邓玉辉. 基于网络磁盘阵列的海量信息存储系统: [博士学位论文]. 武汉: 华中科技大学图书馆, 2004
    [22] Dan Feng, Yuhui Deng, Ke Zhou, et al. Enlarge Bandwidth of Multimedia Server with Network Attached Storage System. In: Proceedings of Network and Parallel Computing, IFIP International Conference. October 18-20, 2004. LNCS 3222. 489~492
    [23] Intel Corporation. Object-Based Storage: The Next Wave of Storage Technologyand Devices. 2003
    [24] Panasas Inc. Object Storage Architecture-Defining a new generation of storage systems built on distributed, intelligent storage devices. Rev. 1. 0-10/19/2003. http: //www. panasas. com/docs
    [25] Mike Mesnier, Gregory R. Ganger, Erik Riedel. Object-based storage: pushing more functionality into storage. IEEE Potentials, 2005, 24 (2): 31-34
    [26] Kai Hwang. Advanced Computer Architecture Parallelism Scalability Programmability, Mc-Graw-Hill, Inc. 1993
    [27] T. Koch, B. Kramer, G. Rohde. On a Rule Based Management Architecture. In: Proceedings of the 2nd International Workshop on Services in Distributed and Networked Environments. 1995. 68~75
    [28] R. Baird. Virtual Storage Architecture Guide (VSAG). In: Proceedings of the 14th IEEE Symposium on Mass Storage Systems. September 11-14, 1995. 312~326
    [29] Magnus Karlsson, Christos Karamanolis and Xiaoyun Zhu. Triage: Performance Isolation and Differentiation for Storage Systems. In: Proceedings of International Workshop of Quality of Service. IEEE. 2004. 67~74
    [30] IEEE Storage System Standards Working Group. Reference Model for Open Storage System Interconnection. http: // www. ssswg. org, 1994
    [31] R. A. Coyne, H. Hulen and R. W. Watson. The High Performance Storage System. In: Proceedings of 1993 ACM/IEEE conference on Supercomputing. November 15-19, 1993. 83~92
    [32] R. W. Watson, R. A. Coyne. The Parallel I/O Architecture of the High Performance Storage System (HPSS). In: Proceedings of the 14th IEEE Symposium on Mass Storage Systems and Technologies. March 15-18, 1999. 150~157
    [33] Richard W. Watson. High Performance Storage System Scalability: Architecture, Implementation and Experience. In: Proceedings of the 22nd IEEE/13th NASA Goddard Conference on Mass Storage Systems and Technologies. April 11-14, 2005. 145~159
    [34] Per Lynse, Gary Lee, Lynn Jones, and Mark Roschke. HPSS at Experiences and analysis. In: Proceedings of the 16th IEEE/7th NASA Goddard Conference onMass Storage Systems and Technologies. March 15-18, 1999. 150~157
    [35] M. Gleicher, Hierarchical Storage Interface. http: //www. hpsscollaboration. org /hpss/HSI/index. html
    [36] SCSI Storage Interfaces. http: //www. t10. org/
    [37] Information technology - SCSI Architecture Model - 4 (SAM-4). T10/1683-D Standard. http: //www. t10. org/ftp/t10/drafts/sam4/sam4r06. pdf, May 2006
    [38] Storage Networking Industry Association. http: //www. snia. org/home
    [39] John B Lohmeyer, George O Penokie, Paul D Aloisi, et al. Information technology-SCSI Object-Based Storage Device Commands (OSD). INCITS T10 Working Draft. http: //www. t10. org/drafts. htm. 2005
    [40] D. Hildebrand, P. Honeyman. Exporting Storage Systems in a Scalable Manner with pNFS. In: Proceedings of the 22th IEEE / 13th NASA Goddard Conference on Mass Storage Systems and Technologies. April 11-14, 2005. 18~27
    [41] IP Storage. http: //www. ietf. org/html. charters/ips-charter. html
    [42] Internet Small Computer Systems Interface (iSCSI). http: //www. ietf. org/rfc/ rfc3720. txt
    [43] Kalman Z. Meth, Julian Satran. Design of the iSCSI Protocol. In: Proceedings of the 20th IEEE/11th NASA Goddard Conference on Mass Storage Systems and Technologies. April 7-10, 2003. 116~122
    [44] Wang P, Gilligan E R, Green H, Raubitschek J. IP SAN–From iSCSI to IP-addressable Ethernet Disks. In: Proceedings of the 20th IEEE/11th NASA Goddard Conference on Mass Storage Systems and Technologies. April 7-10, 2003. 189~193
    [45] Rodney Van Meter, Gregory G, Finn, Steve Hotz. VISA: Netstation’s Virtual Internet SCSI Adapter. In: Proceedings of the 8th International Conference on Architectural Support for Programming Languages and Operating Systems. ACM. 1998. 71~80
    [46] Hui Xiong, Renuga Kanagavelu, Yaolong Zhu, et al. An iSCSI Design and Implementation. In: Proceedings of the 21th IEEE/12th NASA Goddard Conference on Mass Storage Systems and Technologies. April 13-16, 2004. 112101~107
    [47] Weijun Xiao, Yinan Liu, and Qing (Ken) Yang, et al. Implementation and Performance Evaluation of Two Snapshot Methods on iSCSI Target Storages. In: Proceedings of the 23nd IEEE/14th NASA Goddard Conference on Mass Storage Systems and Technologies. May 15-18, 2006. 101~110
    [48] Ming Zhang, Yinan Liu, and Qing (Ken) Yang. Cost-Effective Remote Mirroring Using the iSCSI Protocol. In: Proceedings of the 21th IEEE/12th NASA Goddard Conference on Mass Storage Systems and Technologies. April 13-16, 2004. 101~114
    [49] Gibson, G., Nagle, D., Nat, W., et al. NASD Scalable Storage Systems. In: Proceedings of USENIX’99 Extreme Linux Workshop, 1999. 121~130
    [50] Riedel, E. and Gibson, G. A. Understanding Customer Dissatisfaction with Underutilized Distributed File Servers. In: Proceedings of the 5th NASA Goddard Space Flight Center Conference on Mass Storage Systems and Technologies, September 1996. 1~18
    [51] Gibson, G. A., Nagle, D. F., Amiri, K., Chang, et al. File Server Scaling with Network-Attached Secure Disks. In: Proceedings of the ACM International Conference on Measurement and Modeling of Computer Systems, 1997. 272~284
    [52] Gibson, G. A., Nagle, D. F., Amiri, K., et al. A Cost-effective, High-bandwidth Storage Architecture. In: Proceedings of the 8th Conference on Architectural Support for Programming Languages and Operating Systems. ACM. November 1998. 92~103
    [53] Gobioff, H., Gibson, G. A. and Tygar, D. Security for Network Attached Storage Devices. CMU SCS Technical Report CMU-CS-97-185, 1997
    [54] Erik Riedel, Garth Gibson. Active Disks: Remote Execution for Network-Attached Storage. CMU-CS-97-198, December 1997
    [55] Erik Riedel, Garth A. Gibson, Christos Faloutsos. Active Storage for Large-Scale Data Mining and Multimedia. In: Proceedings of the 24th International Conference on Very Large Databases. 1998. 62~73
    [56] Erik Riedel, Christos Faloutsos, Garth A. Gibson, et al. Active Disks forLarge-Scale Data Processing. IEEE Computer, 2001, 34(6): 68~74
    [57] Gregory R. Ganger, John D. Strunk, Andrew J. Klosterman. Self-* Storage: Brick-based Storage with Automated Administration. Published as Carnegie Mellon University Technical Report, CMU-CS-03-178, August 2003
    [58] John D. Strunk, Gregory R. Ganger. A Human Organization Analogy for Self-* Systems. In: Proceedings of the First Workshop on Algorithms and Architectures for Self-Managing Systems. ACM. 2003. 1~6
    [59] Michael Mesnier, Eno Thereska, Daniel Ellard, et al. File Classification in Self-* Storage Systems. In: Proceedings of the First International Conference on Autonomic Computing. May 2004. 44~51
    [60] Eno Thereska, Michael Abd-El-Malek, Jay J. Wylie, et al. Informed Data Distribution Selection in a Self-predicting Storage System. In: Proceedings of the International Conference on Autonomic Computing. 13-16 June 2006. 187~198
    [61] J. H. Howard. An Overview of the Andrew File System. In: Proceedings of the USENIX Winter Technical Conference. 1988. 23~36
    [62] Coda File System Website. http: //www. coda. cs. cmu. edu
    [63] Andy Hospodor and Ethan L. Miller. Interconnection architectures for petabyte-scale high-performance storage systems. In: Proceedings of the 21st IEEE/12th NASA Goddard Conference on Mass Storage Systems and Technologies. April 13-16, 2004. 273~81
    [64] Feng Wang, Scott A. Brandt, Ethan L. Miller, and Darrell D. E. Long. OBFS: A file system for object-based storage devices. In: Proceedings of the 21st IEEE / 12th NASA Goddard Conference on Mass Storage Systems and Technologies. April 13-16, 2004. 283~300
    [65] Peter J. Braam. The Lustre Storage Architecture. March 2004. http: //www. lustre. org.
    [66] Panasas Inc. Object-based Storage: Enabling Peta-scale Computing. November 2003. http: //www. panasas. com/docs
    [67] J. D. Kubiatowicz, et al. OceanStore: An Architecture for Global-Scale Persistent Storage. In: Proceedings of the ACM International Conference on ArchitecturalSupport for Programming Languages and Operating Systems. November 2000. 190~201
    [68] A. Azagury, V. Dreizin, M. Factor, et al. Toward an Object Store. In: Proceedings of 20th IEEE/11th NASA Goddard Conference on Mass Storage Systems and Technologies. April 7-10, 2003. 165~176
    [69] O. Rodeh and A. Teperman. zFS-A Scalable Distributed File System Using Object Disks. In: Proceedings of 20th IEEE/11th NASA Goddard Conference on Mass Storage Systems and Technologies. April 7-10, 2003. 207~218
    [70] Frank Schmuck and Roger Haskin. GPFS: A Shared-Disk File System for Large Computing Clusters. In: Proceedings of the 1st USENIX Conference on File and Storage Technologies. 2002. 231~244
    [71] J. Menon, D. A. Pease, R. Rees, et al. IBM Storage Tank—A heterogeneous scalable SAN file system. IBM SYSTEMS JOURNAL, 2003, 42 (2): 250~267
    [72] Wei-Khing For, Wei-Ya Xi. Adaptive Extents-Based File System for Object-Based Storage Devices. In: Proceedings of the 23th IEEE/14th NASA Goddard Conference on Mass Storage Systems and Technologies. May 15-18, 2006
    [73] Weiya Xi, Wei-Khing For, Donghong Wang, et al. OSDsim - a Simulation and Design Platform of an Object-based Storage Device. In: Proceedings of the 23th IEEE/14th NASA Goddard Conference on Mass Storage Systems and Technologies. May 15-18, 2006
    [74] Renuga Kanagavelu, Yong Khai Leong. A Bit-Window based Algorithm for Balanced and Efficient Object Placement and Lookup in Large-Scale Object based Storage Cluster. In: Proceedings of the 23th IEEE/14th NASA Goddard Conference on Mass Storage Systems and Technologies. May 15-18, 2006
    [75] 张悠慧, 郑纬民. 基于网络附属对象设备的集群存储体系结构. 软件学报, 2003, 14(2): 293~299
    [76] 张悠慧, 郑纬民. 一种新的网络对象存储设备研究. 电子学报, 2003, 31(5): 679~682
    [77] 刘仲, 周兴铭. 基于动态区间映射的数据对象布局算法. 软件学报, 2005, 16(11): 1886~1893
    [78] 刘仲, 章文嵩, 王召福等. 基于对象存储的集群存储系统设计. 计算机工程与科学, 2005, 27(2): 78-81
    [79] E. Levy and A. Silberschatz. Distributed file systems: Concepts and examples. ACM Computing Surveys, 1990, 22 (4): 321~374
    [80] J. H. Morris, M. Satyanarayanan, M. H. Conner, et al. Andrew: A distributed personal computing environment. Communications of the ACM, 1986, 29(3): 184~201
    [81] M. Satyanarayanan, J. J. Kistler, P. Kumar, et al. Coda: A highly available file system for a distributed workstation environment. IEEE Transactions on Computers, 1990, 39 (4): 447~459
    [82] B. Pawlowski, S. Shepler, C. Beame, et al. The NFS version 4 Protocol. In: Proceedings of the 2nd International System Administration and Networking Conference. May 2000
    [83] P. F. Corbett and D. G. Feitelso. The Vesta parallel file system. ACM Transactions on Computer Systems, 1996, 14(3): 225~264
    [84] Sage A. Weil, Kristal T. Pollack, Scott A. Brandt, et al. Dynamic metadata management for petabyte-scale file systems. In: Proceedings of the 2004 ACM/IEEE Conference on Supercomputing. November 6-12, 2004. 1~4
    [85] Scott A. Brandt, Lan Xue, Ethan L. Miller, et al. Efficient metadata management in large distributed file systems. In: Proceedings of the 20th IEEE / 11th NASA Goddard Conference on Mass Storage Systems and Technologies. April 7-10, 2003. 290~298
    [86] Weijia Li, Wei Xue, Jiwu Shu, et al. Dynamic Hashing: Adaptive Metadata Management for Petabyte-scale File Systems. In: Proceedings of the 23th IEEE/14th NASA Goddard Conference on Mass Storage Systems and Technologies. May 15-18, 2006
    [87] 王召福, 章文嵩, 刘仲. 大规模集群文件系统 LCFS 的元数据管理与访问机制. 计算机工程与科学, 2005, 27(18): 103~105
    [88] 章文嵩, 王召福, 刘仲. 基于对象存储的集群文件系统 CFSlight 设计与实现. 大连理工大学学报, 2003, 43(z1): 171~174
    [89] Yaxin Yu, Guoren Wang, Ge Yu, et al. Data Placement and Query Processing Based on RPE Parallelisms. In: Proceedings of 27th Annual International Computer Software and Applications Conference. 3-6 November 2003. 151~156
    [90] Peter Scheuermann, Gerhard Weikum, Peter Zabback. Data partitioning and load balancing in parallel disk systems. International Journal on Very Large Data Bases, 1998. 7 (1): 48~66
    [91] W. Litwin, M. -A. Neimat, and D. A. Schneider. LH*—A Scalable, Distributed Data Structure. ACM Transactions on Database Systems, 1996, 21 (4): 480~525
    [92] W. Litwin, M. A. Neimat. High-availability LH* schemes with mirroring. In: Proceedings of the Conference on Cooperative Information Systems. 1996. 196~205
    [93] W. Litwin, M. Neimat, G. Levy, et al. LH*S : a high-availability and high-security scalable distributed data structure. In: Proceedings of the 7th International Workshop on Research Issues in Data Engineering. 1997. 141~150
    [94] W. Litwin, T. Schwarz. LH*: a high-availability scalable distributed data structure using Reed Solomoncodes. In: Proceedings of the 2000 ACM SIGMOD International Conference on Management of Data. 2000. 237~248
    [95] W. Litwin, T. Risch. LH*g: a high-availability scalable distributed data structure by record grouping. IEEE Transactions on Knowledge and Data Engineering, 2002, 14 (4): 923~927
    [96] D. M. Choy, R. Fagin, and L. Stockmeyer. Efficiently extendible mappings for balanced data distribution. Algorithmica. 1996. 16: 215~232
    [97] R. J. Honicky, E. L. Miller. A fast algorithm for online placement and reorganization of replicated data. In: Proceedings of Parallel and Distributed Processing Symposium. 2003. 10~20
    [98] R. J. Honicky and E. L. Miller. Replication under scalable hashing: A family of algorithms for scalable decentralized data distribution. In: Proceedings of the 18th International Parallel & Distributed Processing Symposium. 26-30 April 2004. 96~105
    [99] Lee Lin-Wen, P. Scheuermann, R. Vingralek. File assignment in parallel I/Osystems with minimal variance of service time. IEEE Transactions on Computers, 2000, 49 (2): 127~14
    [100] Yeong. W, Howes. T, Kille. S. Lightweight Directory Access Protocol. RFC 1778. 1995
    [101] Dan Feng, Lingfang Zeng, Fang Wang, etc. Adaptive Policy Trigger Mechanism for OBSS. In: Proceedings of the 19th International Conference on Advanced Information Networking and Applications. 28-30 March 2005. 2: 591~595
    [102] 刘群, 冯丹, 覃灵军等. 一种基于 IP 网络的统一对象存储访问模型的研究. 计算机科学, 2006, 33 (8): 86~88
    [103] 林闯. 计算机网络和计算机系统的性能评价. 北京: 清华大学出版社, 2001
    [104] K. Kant, Introduction to Computer System Performance Evaluation. New York: McGraw-Hill. 1992
    [105] 林闯. 随机 Petri 网和系统性能评价. 北京: 清华大学出版社, 2000
    [106] Zhiguang Shan, Chuang Lin, and Fengyuan Ren: Modeling and Performance Analysis of a Multiserver Multiqueue System on the Grid. In: Proceedings of the 9th International Workshop on Future Trends of Distributed Computing Systems. IEEE. 28-30 May 2003. 337~343
    [107] 田立勤, 林闯, 周文江. 随机 Petri 网模型中变迁的串、并联性能等价化简技术. 电子学报, 2002, 30(8): 1134~1137
    [108] Zhangxi Tan, Chuang Lin, Hao Yin, Ye Hong: Approximate Performance Analysis of Web Services Flow Using Stochastic Petri Net. In: Proceedings of the International Workshop on Grid and Cooperative Computing, LNCS 3251, 2004. 193~200
    [109] A. Marsan, G. Balbo, and G. Conte: A class of generalized stochastic Petri nets for the performance evaluation of multiprocessor systems. In: Proceedings of Performance’83. ACM. October 1983. 198~199
    [110] Qun Liu, Dan Feng. An Approximate Analytic Performance Model of Object-Based Storage. In: Proceedings of the International Conference on Computational Science and Its Appications. May 8-11, 2006. LNCS 3980. 1045~1052
    [111] J. Von Neumann. The General and Logical Theory of Automata, In: Taub (1961).Chapter 9. 288~328
    [112] S. Wolfram. Theory and Applications of Cellular Automata. World Scientific, Singapore, 1986.
    [113] 周成虎, 孙战利, 谢一春. 地理元胞自动机研究. 北京: 科学出版社, 2001
    [114] 李才伟. 元胞自动机及复杂系统的时空演化模拟: [博士学位论文]. 武汉: 华中科技大学图书馆, 1997.
    [115] K. Culik II, L. P. Hurd, and S. Yu. Computation theoretic aspects of cellular automata. Physica D. 1990. 357~378
    [116] S. Wolfram. Statistical Mechanics of Cellular Automata. Reviews of Modern Physics, 1983, 55 (3): 601~664
    [117] M. Gardner. Mathematical Games. Scientific American, 1971, 224 (2): 112~116
    [118] 李元香, 康立山, 陈毓屏. 格子气自动机. 北京: 清华大学出版社与广西技术出版社, 1994
    [119] C. G. Longton. Life at the Edge of Chaos, in C. G. Langton, &etc. (ed), Artificial Life Ⅱ. New York: Addison - Wesley, 1992. 41~91
    [120] G. A. Gibson, D. F. Nagle, K. Amiri, et al. A Case for Network-Attached Secure Disks. CMU SCS Technical Report CMU-CS-96-142, September 1996
    [121] 应尚军, 魏一鸣, 范英等. 基于元胞自动机的股票市场投资行为模拟. 系统工程学报, 2001, 16(5): 382~388
    [122] 王安麟. 复杂系统的分析与建模. 上海: 上海交通大学出版社, 2004
    [123] 吴彤. 自组织方法论论纲. 系统辩证学学报, 2001, 9(2): 4~10
    [124] Bastien Chopard, Michel Droz. 物理系统的元胞自动机模拟. 祝玉学, 赵学龙 译. 北京: 清华大学出版社, 2003
    [125] M. K. McKusick, W. N. Joy, S. J. Leffler, et al. A fast file system for UNIX. ACM Transactions on Computer Systems, 1984, 2 (3): 181~197
    [126] Gregory R. Ganger and M. Frans Kaashoek. Embedded inodes and explicit grouping: exploiting disk bandwidth for small files. USENIX Annual Technical Conference. January 1997. 1~17
    [127] J. K. Ousterhout, H. D. Costa, D. Harrison, et al. A Trace-Driven Analysis of the Unix 4. 2 BSD File System. In: Proceedings of the 10th ACM Symposium onOperating Systems Principles. December 1985. 15~24
    [128] Sage Weil, Scott A. Brandt, Ethan L. Miller, et al. Intelligent Metadata Management for a Petabyte-Scale File System. In: Proceedings of the 2nd Intelligent Storage Workshop. May 2004
    [129] Burton Bloom. Space time Trade-offs in Hash Coding with Allowable Errors. Communications of the ACM, 1970, 13 (7): 422~426
    [130] Chang C C, Lee T F, Leu J J. Partition search filter and its performance analysis. Journal of Systems and Software, 1999, 47 (1): 35~ 43
    [131] Yifeng Zhu, Hong Jiang, Jun Wang. Hierarchical Bloom Filter Arrays (HBA): A Novel, Scalable Metadata Management System for Large Cluster-based Storage. In: Proceedings of the 2004 International Conference on Cluster Computing. September 2004. 165~174
    [132] L ittleM C, Speirs N A, Shrivastava S K. Using bloom filters to speed-up name lookup in distributed systems. The Computer Journal, 2002, 45 (6): 645~652
    [133] Sarang D, Praveen K, David E T. Longest p refix matching using bloom filters. In: Proceedings of SIGCOMM 2003. ACM. 2003. 201~212
    [134] Q. Xin, EL Miller, DDE Long, et al. Reliability Mechanisms for Very Large Storage Systems. In Proceedings of the 20th IEEE /11th NASA Goddard Conference on Mass Storage Systems and Technologies. April 7-10, 2003. 146~156
    [135] 张涛, 胡东成. TCRES: 一种实用的容错计算机系统 可靠性评价系统. 计算机研究与发展, 1997, 34 (9): 716~720
    [136] 张虎, 伍卫国, 董小社等. 一种适用于并行文件系统的高可用机制. 西安交通大学学报, 2005, 39(10): 1085~1087
    [137] Baek S H, Kim B W, Joung E J, et al. Reliability and performance of hierarchical RAID with multiple controllers. In: Proceedings of the twentieth annual ACM symposium on Principles of Distributed Computing. 2001. 246~254
    [138] R. Fagin, J. Nievergelt, N. Pippenger, and H. R. Strong. Extendible hashing - a fast access method for dynamic files. ACM Transactions on Database Systems, 1979, 4 (3): 315~344
    [139] Ellis, C. S. Extendible Hashing for Concurrent Operations and Distributed Data. In:Proceedings of the 2nd ACM SIGACT-SIGMOD symposium on Principles of database systems. 1983. 106~116
    [140] W. Litwin. Linear hashing: A new tool for file and table addressing. In: Proceedings of the 6th Conference on Very Large Data Bases. October 1980. 212~223
    [141] Jukka Teuhola: Effective Clustering of Objects Stored by Linear Hashing. In: Proceedings of the 1995 International Conference on Information and Knowledge Management. November 28 - December 2, 1995. 274~280
    [142] Knuth D E. The art of computer programming (2nd edition). Volumes 3: sorting and searching. London: Addison-Wesley Publishing Co. 1998. 263~276
    [143] Daniel P. Bovet, Marco Cesati. 深入理解 Linux 内核. 陈莉君, 冯锐, 牛欣源 译. 北京: 中国电力出版社, 2001
    [144] 王芳. 网络磁盘阵列系统的研究: [博士学位论文]. 武汉: 华中科技大学图书馆, 2001.
    [145] D. Roselli, J. Lorch, and T. Anderson. A comparison of file system workloads. In: Proceedings of the 2000 USENIX Annual Technical Conference, June 2000. 41~54.
    [146] John L. Hennessy, David A. Patterson. 计算机体系结构量化研究方法(英文版 第 3 版). 北京: 机械出版社, 2002. 793~802
    [147] Fang Wang, Shunda Zhang, Dan Feng, et al. A Hybrid Scheme for Object Allocation in a Distributed Object-Storage System. Accepted by the International Conference on Computational Science 2006. May 2006. 396~403
    [148] 罗荣桂. 排队模型及其应用. 武汉: 华中科技大学出版社, 1990
    [149] 冯丹. 外存储系统并行性研究: [博士学位论文]. 武汉: 华中科技大学图书馆, 1997.
    [150] M. Y. Kim and A. N. Tantawi. Asynchronous Disk Interleaving: Approximating Access Delays. IEEE Transactions on Computers, 1991, 40 (7): 801~810
    [151] 周可. 外存储系统数据组织与体系结构: [博士学位论文]. 武汉: 华中科技大学图书馆, 2003.
    [152] Huseyin Simitci, Daniel A. Reed. Adaptive Disk Striping for Parallel Input/Output. In: Proceedings of the 7th NASA Goddard Conference on Mass Storage Systems and Technologies. March 15-18, 1999. 88~102

© 2004-2018 中国地质图书馆版权所有 京ICP备05064691号 京公网安备11010802017129号

地址:北京市海淀区学院路29号 邮编:100083

电话:办公室:(+86 10)66554848;文献借阅、咨询服务、科技查新:66554700