基础教育多媒体网络教学资源检索研究
详细信息    本馆镜像全文|  推荐本文 |  |   获取CNKI官网全文
摘要
教育信息化建设转变了教育思想和观念,对教师和学生都提出了新的要求。教师要具备利用网络获取教学资源、组织教学的能力,学生要有利用网络进行自我学习能力。因特网蕴含了大量信息资源,但可用的教学资源分布零散且质量良莠不齐。虽然现有的WEB搜索引擎功能日益完善,但多数采用基于关键词的方法,对于教学和学习所需要的多媒体资源的检索无能为力。尤其对于计算机能力不强的中小学教师和学生而言,在多媒体资源的查找方面更需要方便快捷的系统加以辅助。
     本课题正是基于以上原因,我们以中小学教材为依据,组织基础教育教学预搜索关键词,搜索网络资源,建立了一个以中小学师生为使用对象,面向基础教育的多媒体网络教学资源索引库。并以asp技术为支持,以多媒体资源索引库为基础,建立了一个面向基础教育的多媒体网络教学资源索引库的检索系统。组织基础教育教学预搜索关键词,是为预搜索系统提供搜索指向,是建立面向基础教育的多媒体资源索引库的前期工作。我们以中小学教材为依据,通过人工收集和整理,从学段、学科和类型三个维度建立了基础教育教学主题词库体系。学段分为小学、初中和高中,其中小学的学科有5门,初中的学科有12门,高中的学科有14门,主题词类型分为图像、动画、视频和音频。
     论文设计并建立了一个以面向基础教育的多媒体资源索引库为基础的检索系统,该检索系统是面向WEB的多媒体资源检索系统,可以根据用户名连接相应的WEB多媒体资源索引库。每个资源库包含了图像、动画、视频、音频四类资源。该系统包括用户登录界面、用户输入界面、检索结果输出界面。检索系统是在分析了资源库中媒体的类型、特征及存储特点的基础上,采用中文自然语言查询的方法,以相似度来衡量查询目标媒体和数据库媒体之间的差距。
     自然语言是表达思想的有效工具,利用自然语言表达多媒体资源的语义是一种简洁、有效的方法。论文对自然语言分词的一般方法做了介绍,引用已有的分词词典建立了自用的分词函数,对查询文本进行分词和词性标注。从查询文本中去除虚词、设定的缺省词汇,提出名词、动词、形容词、成语等我们需要的主题关键词,即可得到对目标媒体的描述,称为主题内容。计算相似度之前,主题内容要依据同义词词典进行扩展。
     媒体资源索引库中包含图像、动画、视频、音频四种类型的媒体,论文采用相似度来衡量查询目标媒体和数据库媒体之间的差距。媒体的特征包括文件属性和内容特征,相似度计算主要是针对媒体的内容特征,对于不同的内容特征使用不同的相似度计算方法。通过比较扩展后的主题内容与数据库中内容描述字段相同词的个数来计算主题内容相似度;主色调颜色词转换为HSI模式,与数据库中以数值方式标注的主色调字段进行色调相似度的计算;图像的主体与主体属性针对数据库中的主体字段计算相似度。所有的内容特征按照其所在层次确定重要性后,计算总相似度。将总相似度大于一定阈值的数据库记录按照总相似度由大到小的顺序,作为检索结果反馈给用户。
     本文在上述工作的基础上,对面向基础教育的多媒体资源索引库的检索系统进行了大量实验,并对实验结束进行了详细的表述。经实验表明,该系统对结构比较简单的、嵌套较少的查询文本能比较准确的进行分词,对数据库中内容特征标注准确、详实的记录,检索结果准确度较高,证明依据内容特征检索的方法是可行的。缺点是随着多媒体资源索引库中记录的增多,当检索条件比较多时,系统运行速度比较慢。论文最后总结了本文的工作,并提出了下一步的研究方向。
The education informalization has brought transformation of the concept of educational thought, making the new requirements to both of teachers and students. The teachers should possess to utilize the network to obtain teaching resources, and to organize teaching, as well the students should have the ability of taking advantage of the network to carry on the self-learning. Although there are abundant information resources in the Internet, the available teaching resources are scatteredly distributed and the quality is very different. While existing WEB search engine function has improved day by day, but the majority adopts methods based on the keyword, which is powerless in the retrieval of multimedia resources. Therefore, it is necessary to find out a more convenient and efficient system to assist in the multimedia resources retrieval, especially for the teachers and students of primary and secondary schools, who are lack of computer capacity relatively.
     On the basis of above reasons, we have established a basic education-oriented index database of multimedia resources, by organizing the pre-search keywords of basic education and searching network resources. Regarding the teachers and students of primary and secondary schools as its main users, the resources index database is exactly based on their textbooks. Furthermore, a multimedia resources index database retrieval system which faces the basic education has been built up, with the support of the ASP technology.
     Organizing the elementary education pre-search keywords, as preparatory work of the establishment of the basic education-oriented multimedia resources index database, is intent to provide searching direction for the pre-search system. Based on the textbooks of primary and secondary schools, through manually collection and sorting, we built up a thematic words system of basic education from three perspectives, i.e., stage, subject and type. From the aspect of stage, we discussed primary school, junior middle school and senior middle school periods, while there are 5, 12, 14 subjects concerned with each stage respectively. Also, the thematic words are divided into types of image, animation, video and audio.
     In this thesis, we design and establish a retrieval system that based on the basic education-oriented multimedia resources index database. This is a web-oriented retrieval system, which can create a connection to the WEB multimedia resources index database according to each username. Each resource database contains four types of resource: image, animation, video and audio. User log-in interface, user input interface, search result output interface are included in this system. Analyzing the type, characteristic and storage feature of the media, the retrieval system adopts a method of Chinese natural language query, to measure the difference between the object media and media in the database by means of similarity.
     The natural language is the effective tool to express thoughts, using which to describe the semantics of multimedia resources will be a simple and effective method. This thesis introduces the general ways of word segmentation on the natural language, builds our own word segmentation algorithm from the existent segmentation dictionaries, to divide the query texts and label its parts of speech (POS) tagging. After obtaining the thematic words such as nouns, verbs, adjectives and idioms by omitting the function words and the default words from the query texts, we can get the description of the object media and call them“theme content”. The theme content should be extended according to the synonym dictionary before calculating the similarity.
     The media resource index database includes medias of four types of image, animation, video and audio, and the thesis adopts the similarity to measure the difference between the object media and media in the database. The media features include text features and content features, between which we mainly refer to the latter while calculating the similarity. As to different content features, different technologies will be used to calculate the similarity. The similarity is obtained by finding the number of same words between the extended theme content and the content description field in the database. The color word of the dominant hue is changed into HSI model, and then the similarity calculation of the tone with the dominant hue field marking in the database by way of numerical value could be carried on. The subject and subject attribute of the picture are calculated similar degree to the subject field in the database. After confirming the importance of the content features according to their level, calculate the total similarity. The records whose total similarity is greater than a certain threshold value will be recorded from great to small order according to total similar degree, and finally feedbacked to users as searching result.
     Based on the work described above, a large number of experiments to the retrieval system that based on the basic education-oriented multimedia resources index database have been carried on, and detailed statements to the experimental result have been presented. The experiment showed that, with the relatively simple and less nested query texts, the retrieval system is able to provide pretty accurate words segmentation, detailed and accurate record to the content feather in the database, and the search results of higher accuracy, proving that the method to search according to the content characteristic is feasible. The shortcoming is that with the increase of recording in the multimedia resource index database, or when there are relatively more searching conditions, the system operation slows down. Finally, the thesis summarizes the main conclusion and puts forward the next research direction.
引文
[1] 张敬涛, 李 馨. 论我国基础教育资源建设策略[J]. 中国电化教育, 2006.10
    [2] 徐险峰. 基于内容的多媒体信息检索技术[J]. 现代情报, 2005.3
    [3] 孙树生, 黄 焱. 基于内容视频信息检索系统的分析研究[J]. 电视技术, 2006.3
    [4] 陆燕,陈福生. 基于内容的视频检索技术[J]. 计算机应用研究,2003.11
    [5] 张宁. 基于内容的多媒体检索的研究现状和应用前景 [J].上饶师范学院学报,2006.6
    [6] 刘浩一. 基于中文自然语言查询的多媒体数据库检索系统 [D].山东师范大学硕士学位论文,2005.6
    [7] 李国辉. 基于内容的音频检索:概念和方法[J].小型微型计算机系统,2000.11
    [8] 汤艳莉. 汉语自然语言检索及其用户提问处理[D]. 北京师范大学硕士学位论文,2003.06
    [9] 刘浩一. 基于中文自然语言查询的多媒体数据库检索系统 [D].山东师范大学硕士学位论文,2005.6
    [10] 樊凌涛,陈健. 图象和视频的检索技术[J]. 计算机工程与应用,2001.09
    [11] 刘俊晓,孟祥增,吴鹏飞.基于内容的视频分析与检索技术及其教学应用[J].中国电化教育,2006.04
    [12]白云晖. 基于内容的音频检索[J]. 广播与电视技术,2007.06
    [13] 吴春辉,钟宝荣. 基于内容的音频检索技术研究[J]. 科技情报开发与经济,2007.06
    [14] 李海霞. 基于自然语言的图像数据库检索技术研究[D]. 山东师范大学硕士学位论文,2004.05 [15 徐金雷,杨晓江. 基础教育资源搜索引擎的排序算法研究[J]. 电化教育研究,2007.02
    [16] 章毓晋.基于内容的视觉信息检索.北京,科学出版社.2003.5
    [17] 李国辉, 李恒峰.基于内容的音频检索:概念和方法[J].小型微型计算机系统,2000.11
    [18] 胡吉明.浅析基于内容的视频信息检索技术[J].图书馆学研究,2006.2
    [19] 韦素云,吉根林.基于加权颜色直方图和颜色对的图像检索系统[N].南京师范大学学报(工程技术版),2005.03
    [20] 王 斌,谢庆生,刘 丹,王 晓.Web 教学资源主题检索系统的设计与实现[J].现代图书情报技术,2006
    [21] 谭德坤,王力红.基于模糊语言方法的信息检索系统的研究[J].计算机仿真,2005.02
    [22] 魏 丽.基于颜色特征的图像检索系统的研究与实现[D]. 重庆大学硕士学位论文,2006.05
    [23] 彭 波.大规模搜索引擎检索系统框架与实现要点[J]. 计算机工程与科学,2006.03
    [24] 刘菁华,夏定元.基于多媒体融合的图像检索系统的实现[J].视频技术应用与工程,2006.02 [25 王芳,董军宇,唐瑞春.基于内容的图像检索的关键技术[J]. 现代计算机,2006.01
    [26] 郑秋梅 孙绪华.基于数据隐藏的图像检索系统[J]. 计算机应用与软件,2006.03
    [27] 许 劲.基于图像可视属性的检索系统的设计与研究[J]. 长沙电力学院学报(自然科学版),2006.02
    [28] 蔡昌许. 基于语义的图像标注与检索系统研究[D]. 武汉大学硕士学位论文,2005.05
    [29] 柳群英,金军.基于知识挖掘技术的智能信息检索系统研究[J]. 图书情报,2006.06
    [30] 曹勇刚,曹羽中,金茂忠,刘超.面向信息检索的自适应中文分词系统[J]. Journal of Software,Vol.17,No.3,March 2006
    [31] 周宝兰,张义兵.数字图书馆的基于内容图像检索系统研究[J].大众科技,2006.06
    [32] 祁宇明,季俊忠.Internet 中图像检索技术的研究[J].科技咨询导报,2007
    [33] 何咏梅,毛云舸.搜索引擎的发展现状与趋势研究[J].吉林省经济管理干部学院学报,2007.08
    [34] 乔东枝.新一代搜索引擎的智能化特征及技术进展[J].现代信息技术,2007.04
    [35] 陈权,曹卓文,杨晓江.一个基础教育网站搜索引擎的设计与实现[J].现代图书情报技术, 2007.6
    [36] 庞银卓.基于内容的图像检索系统[D]. 天津大学电子信息工程学院硕士学位论文, 2004.12
    [37] 杨悦.基于内容的多媒体检索系统[D]. 天津大学电子信息工程学院硕士学位论文,2003.06
    [38] 杨小莉,黄水清.国内常见全文检索系统比较[J].图书与情报,2006.02
    [39] 刘菁华,夏定元.基于多媒体融合的图像检索系统的实现[J]. 视频技术应用与工程,2006.02
    [40] 王志梅,周锦成.基于内容的流媒体课件检索系统的实现[J].计算机系统应用,2006.06
    [41] 王立珍.教育资源库现状分析与建设策略[J].观察与观点,2006
    [42] 林健.教师教育资源建设中的问题与对策[J].成人教育,2006.02
    [43] 闫铁莹.网络环境下学校教育资源建设[J]. 多媒体教学,2006
    [44] 张静,刘延申,卫金磊.论中小学多媒体知识元库的建设[J].现代教育技术,2005
    [45] 殷群.谈网络时代的教育资源建设从知识管理视角[J].信息技术教育,2006.07
    [46] 刘丰.基于运动捕获数据的若干动画技术研究[D].浙江大学硕士学位论文,2004.03
    [47] 刘美凤.教育技术的定位:中国学者的观点[J]. 教育技术学科建设,2003.02
    [48] 冀 颖.论自然语言检索系统中的控制问题[J].晋图学刊,2006.02
    [49] Burt PJ,Adelson EH/The Laplacian Pyramid as a Compact Image Code[j].IEEE Trans.on Communications,1983,31(4)
    [50] Lumini A,Maio D.A Wavelet-based Image Watermarking Scheme[C]//Proc.of Intel.Conf. on Information Tech:Coding and Computing.2000
    [51] Peer Ingwersen,The calculation of web impact factors,Journal of Documentation,Vol.54,No.2 March 1998

© 2004-2018 中国地质图书馆版权所有 京ICP备05064691号 京公网安备11010802017129号

地址:北京市海淀区学院路29号 邮编:100083

电话:办公室:(+86 10)66554848;文献借阅、咨询服务、科技查新:66554700