语音数控机床的研究与实现

英文题名：The Research and Implementation on Speech Numerical Control Machine Tool
作者：史俊
论文级别：硕士
学科专业名称：计算机应用技术
中文关键词：语音识别 ; 数控机床 ; 自动数控编程 ; 模拟加工
英文关键词：speech recognition ; numerical control machine tool ; automatically numerical control programming ; simulating processing
学位年度：2007
导师：富钢
学科代码：081203
学位授予单位：沈阳航空工业学院
论文提交日期：2006-12-31

摘要

语音识别是让机器通过识别和理解过程把语音信号转变为相应的文本或命令的技术，目前应用于很多领域。语音识别技术在机床控制方面也大有用武之地。语音数控机床可以打破人工、地点和设备的限制，突出人性化、智能化。在商贸谈判现场的应用更加突显其优势，使得谈判成功率大幅提高。
     本文从语音识别技术入手，首先论述了语音识别的原理和小波阈值降噪技术，其次简要介绍了数控加工程序编制基础。通过应用微软语音应用程序软件开发工具包Speech API 5.1(SAPI)、OpenGL技术和Visual C++6.0编程工具，二次设计开发了语音数控机床系统。本系统也是数控领域较为新颖的尝试与研究，旨在通过应用语音识别技术识别出用户语音输入的基本命令、加工参数和零件几何信息，系统经过分析处理后自动生成数控加工代码，最后进行动态模拟加工演示。本文针对语音识别、自动加工编程和模拟加工三个方面重点论述了系统的设计方案以及具体实现过程。
     本文对实验室和数控加工车间进行了系统语音识别率的试验。试验结果表明，语音数控机床系统在实验室条件下的识别效果较好，可以满足实际应用的要求，但是在数控加工车间噪声环境下的识别率还有待加强。通过提出改进语音识别效果的方法，指明了本系统未来的研究方向，进一步体现语音识别技术为数控领域所带来的创新革命。
The speech recognition is a technology of turning speech signal to corresponding text or command by machines' recognition and comprehension. So far, it has been widely applied in many fields. Speech recognition technology is also useful in the aspect of machine tool's controlling. Speech numerical control machine tool can break the restrictions of manpower, place and equipments. It also gives prominence to be humanism and intelligentized. It shows its predominance in the scene of trade negotiation, which enhances the successful rate of negotiation greatly.
    This paper started from the speech recognition technology, firstly discussed the principles of speech recognition and wavelet threshold denoising technology, and secondly introduced the basic of numerical control processing programming simply. The system of speech numerical control machine tool was secondly designed by using the Microsoft Speech Application Software Development Kit Speech API 5.1(SAPI), OpenGL technology and Visual C++ 6.0 programming tool. The system is a rather original attempt and research in the field of numerical control, too. It aims at recognizing the basic orders, processing parameters and accessories' geometrical informations which are input by users' speech, using speech recognition technology. The system automatically generates the numerical control processing codes after analyzing and disposing, and dynamically simulates the processing at last. The emphasis of this paper was discussing the system's design plans and concrete implementation by aiming at three aspects, which included speech recognition, automatically processing programming and simulating processing.
    This paper had experimented to test the system's speech recognition rate in the laboratory and numerical control processing workshop. It was indicated from the experiments that the system of speech numerical control machine tool had preferable recognition effect under the laboratory environment, which could satisfy the needs of practical application, however, the recognition rate in the numerical control processing workshop which in the noise needed to strengthen. This paper had put forward the methods of improving the speech recognition
    effect, and designated the future research orientation of this system. It embodies the innovative revolution for the field of numerical control which is brought by speech recognition technology.

引文

1 CTI论坛．语音技术应用的现状和未来[EB/OL]．http://www.ctiforum.com，2004-09
    2 俞铁城．语音识别的发展现状[EB／OL]．http://www.ctiforum.com，2005-03
    3 黄寅飞，郑方，燕鹏举，等．校园导航系统EasyNav的设计与实现[J]．中文信息学报，2001，15(4)：35-40
    4 CTI论坛．国家863计划又一核心技术成功起航[EB／OL]．http://www.ChinaByte.com，2002-05
    5 于晓雯．计算机辅助数控编程系统的研究[D]：[硕士学位论文]．吉林：吉林工业大学，2000
    6 韩明花．数控车床自动编程系统[D]：[硕士学位论文]．沈阳：东北大学，2000
    7 赵海信．基于拟实制造技术的数控自动编程系统的研究[D]：[硕士学位论文]．武汉：武汉理工大学，2001
    8 王炳锡，屈丹，彭煊．实用语音识别基础[M]．北京：国防工业出版社，2005．26-250
    9 胡航．语音信号处理[M]．哈尔滨：哈尔滨工业大学出版社，2000．48-172
    10 韩纪庆，张磊，郑铁然．语音信号处理[M]．北京：清华大学版社，2004．43-13
    11 张雄伟，陈亮，杨吉斌．现代语音处理技术及应用[M]．北京：机械工业出版社，2003．180-226
    12 周振辉．机器人手术网络遥控系统——语音识别和机械手控制[D]：[硕士学位论文]．沈阳：沈阳航空工业学院，2006
    13 赵力．语音信号处理[M]．北京：机械工业出版社，2005．127-140
    14 张众．小型语音识别系统的研究和开发[D]：[硕士学位论文]．天津：天津大学，2004
    15 李祖朋，姚佩阳．一种语音段起止端点检测新方法[J]．电讯技术，2000，3(1)：68-71
    16 林遂芳．噪声环境下语音识别方法的研究[D]：[硕士学位论文]．西安：西安理工大学，2005
    17 孙鹏．噪声下语音识别研究[D]．沈阳：沈阳航空工业学院，2003
    18 朱淑琴．语音识别系统关键技术研究[D]：[硕士学位论文]．西安：西安电子科技大学，2004
    19 靳济芳．VisualC++小波变换技术与工程实践[M]．北京：人民邮电出版社，2004．25-35
    20 谭东昱．基于小波变换的语音增强方法研究及实时实现[D]：[硕士学位论文]．湖南：湖南大学，2004
    21 Donoho D L, Johnstone I M. Adapting to unknown smoothness via wavelet shrinkage[J]. J Amor Stat, 1995, 90(3): 580-602
    22 文莉，刘正士，葛运建．小波去噪的几种方法[J]．合肥工业大学学报(自然科学版)，2002，25(2)：167-172
    23 杨大利，徐明星，吴文虎．噪音环境下的语音识别研究概述[J]．第五届全国现代语音学术会议．北京．2001．8
    24 吴淑珍，冯成林，黄新宇．噪声环境下语音识别方法研究[J]．北京大学学报(自然科学版)，2001，37(3)：365-370
    25 Kadambe S, Faye G F. Application of Wavelet Transform for Pitch Detection of Speech Signals. IEEE Trans On Information, 1992, 38(2): 917-924
    26 Stephane G M, Hwang W L. Singularity Detection and Processing with Waveles. IEEE Trans On IT, 1992, 38(2): 617-643
    27 马明，张杰，王建宇，黄志同．语音识别中隐马尔可夫模型初值的估计[J]．数据采集与处理，1997，12(2)：38-43
    28 张焱，张杰，黄志同．语音识别中隐马尔可夫模型状态数的研究[J]．南京理工大学学报， 1998，22(3)：15-21
    29 严建红．数控机床编程与加工技术[M]．福州：福建科学技术版社，2004．64-104
    30 董玉红，邵俊鹏．机床数控技术[M]．哈尔滨：哈尔滨工业大学出版社，2003．9-40
    31 范炳炎．数控加工程序编制[M]．北京：航空工业出版社，1995．75-82
    32 孙德茂．数控机床铣削加工直接编程技术[M]．北京：机械工业出版社，2005．12-98
    33 王润孝．机床数控原理与系统[M]．西安：西北工业大学出版社，1996．1-6
    34 Microsoft Inc．Speech API帮助文档[DB／DK].\Microsoft Speech SDK 5.1\Docs\Help\sapi.chm
    35 李禹材，左友东，郑秀清等．基于Speech SDK的语音控制应用程序的设计与实现[J]．计算机应用，2004，24(6)：114-116
    36 Michael Morrison，et al．XML揭秘——入门·应用·精通[M]．陆新年，陆新宇等译．北京：清华大学出版社，2001．1-15
    37 Richard S W．OpenGL超级宝典[M]．北京：人们邮电出版社，2005．55-75
    38 和平鸽工作室．OpenGL高级编程与可视化系统开发(高级编程篇)[M]．北京：中国水利水电出版社，2005．1-17
    39 张任寰．虚拟数控车床加工仿真技术的研究[D]：[硕士学位论文]．沈阳：沈阳航空工业学院，2004
    40 李晓燕，张翔，陈立伟．基于VC 6.0和OpenGL机械手三维仿真演示系统[J]．计算机工程与设计，2004，26(6)：982-985
    41 王太勇，王晓斌，王国锋，等．数控切削过程仿真系统的研究[J]．组合机床与自动化加工技术．2004，1(3)：63-66
    42 顾良，刘润生．汉语数码语音识别：发展现状、难点分析与方法比较[J]．电路与系统学报，1997，4(2)：32-39
    43 顾良，刘润生．改进汉语数码语音识别中的语音特征提取性能[J]．电路与系统学报，1997，4(2)：44-52
    44 李虎生．汉语数码串语音识别及说话人自适应[D]：[硕士学位论文]．北京：清华大学，2000
    45 冯成林，吴淑珍．一种噪声环境下语音识别方法(线性预测误差法)研究[J]．北京大学学报(自然科学版)，2000，36(5)：665-671
    46 《电脑编程技巧与维护》杂志社．Visual C／C++编程精选集锦关键技术精解分册[M]．北京：科学出版社．2003．233-241
    47 丁沛．语音识别中的抗噪声技术[D]：[博士学位论文]．北京：清华大学，2003
    48 孙玲．汉语数字语音识别技术与实现方法研究[D]：[硕士学位论文]．武汉：华中科技大学，2004
    49 李虎生，刘加，刘润生．高性能汉语数码串语音识别．电子学报，2001，9(5)：595-599
    50 肖江南．汉语孤立词识别系统开发与研究[D]：[硕士学位论文]．桂林：广西师范大学，2004
    51 刘加．汉语大词汇量连续语音识别系统研究进展[J]．．电子学报．2000，28(1)：85-91
    52 田斌．实用化汉语语音识别理论及关键技术研究[D]：[博士学位论文]．西安：西安电子科技大学，2000
    53 钟金宏．基于音节的汉语连续语音声调识别方法研究[D]：[博士学位论文]．合肥：合肥工业大学，2001
    54 Lamia K, Arnaud M.Towards improving speech detection robustness forspeech recognition in adverse conditions[J]. Speech Communication. 2003, 261-276
    55 Kaisheng Y, Kuldip K P, Satoshi N. Noise adaptive speech recognition based on sequential noise parameter estimation[J]. Speech Communication. 2004, 5-23
    56 Leggetter C J. Improved Acoustic Modeling for HMMs Using Linear Transformations[D]: [Dissertation for PhD]. UK: Cambridge University, 1995
    57 Kaisheng Y, Kuldip K P, Te-Won Lee. Generative factor analyzed HMM for automatic speech recognition[J]. Speech Communication, 2005, 435-454
    58 雷静．语音识别技术的研究及基本实现[D]：[硕士学位论文]．武汉：武汉理工大学，2002
    59 熊飞丽．语音识别技术在语言教学软件中的应用研究[D]：[硕士学位论文]．长沙：国防科学技术大学，2002
    60 Lee C H, Lin C H, Juang B H. A Study on Speaker Adaptation of the Parameters of Continuous Density Hidden Markov Models[J]. IEEE Trans on Acoustis and Speech Signal Processing, 1991, 39(4): 806-814

地址：北京市海淀区学院路29号邮编：100083

电话：办公室：(+86 10)66554848；文献借阅、咨询服务、科技查新：66554700