Continuous phone recognition in the Sigma cognitive architecture

详细信息查看全文

作者：Himanshu Joshi^a ; ^b ; ^{himanshu@ict.usc.edu} ; Paul S. Rosenbloom^a ; ^b ; ^{rosenbloom@ict.usc.edu} ; Volkan Ustun^a ; ^{ustun@ict.usc.edu}
关键词：Cognitive architecture ; Graphical models ; Sigma ; Speech recognition ; Factor graphs ; Dynamic Bayesian networks ; HMM
刊名：Biologically Inspired Cognitive Architectures
出版年：2016
出版时间：October 2016
年：2016
卷：18
期：Complete
页码：23-32
全文大小：740 K
卷排序：18

文摘

Spoken language processing is an important capability of human intelligence that has hitherto been unexplored by cognitive architectures. This reflects on both the symbolic and sub-symbolic nature of the speech problem, and the capabilities provided by cognitive architectures to model the latter and its rich interplay with the former. Sigma has been designed to leverage the state-of-the-art hybrid (discrete + continuous) mixed (symbolic + probabilistic) capability of graphical models to provide in a uniform non-modular fashion effective forms of, and integration across, both cognitive and sub-cognitive behavior. In this article, previous work on speaker dependent isolated word recognition has been extended to demonstrate Sigma’s feasibility to process a stream of fluent audio and recognize phones, in an online and incremental manner with speaker independence. Phone recognition is an important step in integrating spoken language processing into Sigma. This work also extends the acoustic front-end used in the previous work in service of speaker independence. All of the knowledge used in phone recognition was added supraarchitecturally – i.e. on top of the architecture – without requiring the addition of new mechanisms to the architecture.

地址：北京市海淀区学院路29号邮编：100083

电话：办公室：(+86 10)66554848；文献借阅、咨询服务、科技查新：66554700