用户名: 密码: 验证码:
17种分类算法在牛肝菌种类鉴别研究中的应用
详细信息    查看全文 | 推荐本文 |
  • 英文篇名:Application of 17 Classification Algorithms for Authentication Research of Various Boletus
  • 作者:张钰 ; 李杰庆 ; 李涛 ; 刘鸿高 ; 王元忠
  • 英文作者:ZHANG Yu;LI Jie-qing;LI Tao;LIU Hong-gao;WANG Yuan-zhong;College of Agronomy and Biotechnology,Yunnan Agricultural University;Institute of Medicinal Plants,Yunnan Academy of Agricultural Sciences;College of Resources and Environment,Yuxi Normal University;
  • 关键词:牛肝菌 ; FTIR ; 种类鉴别 ; 不同部位 ; 数据融合
  • 英文关键词:Boletaceae;;FTIR;;Species identification;;Different parts;;data fusion
  • 中文刊名:GUAN
  • 英文刊名:Spectroscopy and Spectral Analysis
  • 机构:云南农业大学农学与生物技术学院;云南省农业科学院药用植物研究所;玉溪师范学院资源环境学院;
  • 出版日期:2019-02-15
  • 出版单位:光谱学与光谱分析
  • 年:2019
  • 期:v.39
  • 基金:国家自然科学基金项目(31660591,21667031);; 云南省教育厅科学研究基金项目(2016ZZX106);; 云南省高校食用菌资源开发与利用重点实验室建设项目资助
  • 语种:中文;
  • 页:GUAN201902022
  • 页数:6
  • CN:02
  • ISSN:11-2200/O4
  • 分类号:122-127
摘要
由于部分毒菌与野生食用菌形态和生物学特征相似,农民仅凭经验采集,难免将两者混淆,从而导致严重的食品安全事故。云南省作为国内野生食用菌产量最高、出口量最大的省份,野生食用菌产业发展为云南农村经济发展做出了突出贡献,对不同种类野生食用菌进行快速鉴别,有利于野生食用菌产业的健康发展;分析食用菌亲缘关系,对食用菌育种工作具有积极作用。七种牛肝菌样品,采自云南及周边七个产地,利用FTIR光谱仪分别采集菌柄和菌盖红外指纹图谱,基于低级与中级数据融合策略,将预处理后的菌柄和菌盖FTIR光谱数据进行融合,结合Decision Trees,Discriminant Analysis,Logistic Regression Classifiers,Support Vector Machines,Nearest Neighbor Classifiers和Ensemble Classifiers中的17种算法,分别建立菌柄、菌盖、低级数据融合和中级数据融合模型,每个分类模型连续进行10次运算,通过比较训练集分类正确率平均值,确定牛肝菌种类鉴别最佳分类算法。中级数据融合数据集进行系统聚类分析(HCA),对推测不同种类牛肝菌样品的亲缘关系进行鉴定。结果显示:(1)菌柄、菌盖和低级数据融合模型最佳分类算法均为Linear Discriminant,训练集分类正确率分别为92.8%,96.4%和97.6%。中级数据融合模型最佳分类算法为Subspace Discriminant,训练集分类正确率为100%;(2)菌柄、菌盖、低级数据融合和中级数据融合最佳分类模型,全部样品分类正确率平均值分别为93.61%,95.54%,96.99%和99.88%,中级数据融合模型优于其他三种模型,表明中级数据模型可以将相似度较高的样品区分开,且减少了产地对种类鉴别的影响;(3)中级数据融合模型数据集进行HCA,华丽牛肝菌和美味牛肝菌聚类距离最小,表明这两种牛肝菌化学信息较相似,亲缘关系较近;(4)华丽牛肝菌与皱盖疣柄牛肝菌聚类临界值距离最大,表明样品化学信息差异较大,亲缘关系较远。综上表明,基于中级融合策略将不同部位FTIR光谱数据融合,结合Subspace Discriminant与HCA,可以准确鉴别不同种类牛肝菌和快速推测样品亲缘关系,可作为野生食用菌种类鉴别与亲缘关系推测的一种新方法。
        Many wild nocuous fungi are similar to the edible in morphology and biological characteristic,which easily leads to serious food safety incident because it is difficult for farmers to distinguish them just by experience.The progress of wild edible production makes a great contribution to rural economy of Yunnan province where the yield and export volume are highest in China.Rapid authentication of wild edible fungi variety is beneficial for wild edible industry towards healthy development.Meanwhile,the authentication also contributes to the analysis of the genetic relationship between edible mushroom and their breeding.Seven kinds of fungi were collected from Yunnan and other seven origins around Yunnan.Fingerprint of caps and stipe were obtained with Fourier transforms infrared(FTIR)spectrometer,respectively.Cap model,stipe model,low-level data fusion model and mid-level data fusion were established using prepressed spectra according to low-and mid-level fusion strategy combined with decision trees,discriminant analysis,logistic regression classifiers,support vector machines,nearest neighbor classifiers and ensemble classifiers that every model was computed 10 times.The optimal classification algorithm was selected based on the accuracy of training set.Hierarchical cluster analysis(HCA)was executed using the mid-level fusion dataset to judge genetic relationship between seven fungi.The results indicated:(1)The best algorithm of caps,stipe and low-level fusion is linear discrimination that accuracy is 92.8%,96.4%,and 97.6%,respectively.Subspace discriminant is the most optimal in mid-level fusion that accuracy is 100%.(2)The average accuracy of all samples is 93.61%,95.54%,96.99% and 99.88% based on the best model of stipe,cap,low-level data fusion and mid-level data fusion.The performance of mid-level fusion is better than other three models,which indicated that the model could distinguish the highly-similar samples by reducing the influence caused by their origins.(3)The result of HCA based on mid-level fusion dataset displayed that the distance between Boletus magnificus and B.edulis was very close,which showed their chemical information were similar and genetic relationship was close.(4)The result of HCA based on mid-level fusion dataset displayed that the distance between Boletus magnificus and Leccinum duriusculum was very long,which showed their chemical information were different and genetic relationship was inferior.In a word,mid-level data fusion strategy combining FTIR spectra of different parts,subspace discriminant and HCA could effectively distinguish different kinds of edible fungi and judge the genetic relationship,which is a novel method used for variety authentication and genetic relationship judgment of wild edible fungi.
引文
[1] MAO Xiao-lan(卯晓岚).Mycosystema(菌物学报),2006,25(3):345.
    [2] BAU Tolgor,BAO Hai-ying,LI Yu(图力古尔,包海鹰,李玉).Mycosystema(菌物学报),2014,33(3):517.
    [3] MAO Xiao-lan(卯晓岚).The Macrofungi in China(中国大型真菌).Zhengzhou:Henan Science and Technology Press(郑州:河南科学技术出版社),2000.
    [4] WEN Hua-an,YANG Zhu-liang,LI Tai-hui,et al(文华安,杨祝良,李泰辉,等).Science World(科学世界),2013,(10):56.
    [5] ZHOU Zu-fa,LU Na,SONG Ji-ling,et al(周祖法,陆娜,宋吉玲,等).Journal of Fungal Research(菌物研究),2017,(3):188.
    [6] Juma I,Mshandete A,Tibuhwa D,et al.Tanzania Journal of Science,2016,42(1):109.
    [7] Zhao R L,Li G J,Sánchez-Ramírez S,et al.Fungal Diversity,2017,84(1):43.
    [8] Avin F A,Bhassu S,Shin T Y,et al.Journal of Animal&Plant Sciences,2014,24(1):89.
    [9] Fan X Z,Zhou Y,Xiao Y,et al.Microbiological Research,2014,169(5):453.
    [10] Lu T,Bau T.Biotechnology&Biotechnological Equipment,2017,31(7):1.
    [11] Yadav M K,Chandra R,Singh H B,et al.International Journal of Current Microbiology and Applied Sciences,2017,6(5):1260.
    [12] YANG Tian-wei,ZHANG Ji,LI Tao,et al(杨天伟,张霁,李涛,等).Spectroscopy and Spectral Analysis(光谱学与光谱分析),2016,36(11):3510.
    [13] Qi L M,Zhang J,Zhao Y L,et al.Analytical Letters,2017,50(9):1497.
    [14] Ouyang Q,Zhao J W,Chen Q S.Analytica Chimica Acta,2014,841(23):68.
    [15] Márquez C,López M I,Ruisánchez I,et al.Talanta,2016,161:80.
    [16] Reis N,Botelho B G,Franca A S,et al.Food Analytical Methods,2017,10(8):2700.
    [17] SUN Su-qin(孙素琴).Analysis of Traditional Chinese Medicine by Infrared Spectroscopy(中药红外光谱分析与鉴定).Beijing:Chemical Industry Press(北京:化学工业出版社),2010.
    [18] He X S,Xi B D,Wei Z M,et al.Chemosphere,2011,82(4):541.
    [19] Silva S D,Feliciano R P,Boas L V,et al.Food Chemistry,2014,150:489.
    [20] Sergios Theodoridis.Pattern Recognition(模式识别).Translated by LI Jing-jiao(李晶皎,译).Beijing:Publishing House of Electronics Industry(北京:电子工业出版社),2006.
    [21] Zhang L,Li L D,Yang A Q,et al.Pattern Recognition,2017,69:199.
    [22] Zhang Y,Zhou G X,Jin J,et al.Neurocomputing,2017,225:103.
    [23] Fang J W,Wang L P,Wang Y,et al.Molecular Bio Systems,2017,13(8):1575.
    [24] Lu W,Dong X,Qiu L L,et al.Journal of Hazardous Materials,2017,326:130.
    [25] Li Y,Zhang J,Li T,et al.Spectrochimica Acta Part A:Molecular and Biomolecular Spectroscopy,2017,177:20.
    [26] Vapnik V N.The Nature of Statistical Learning Theory Springer,1995.
    [27] Balcázar J,Dai Y,Osamu Watanabe.Algorithmic Learning Theory Washington,DC,2001.119.
    [28] Lee Y J,Mangasarian O L.RSVM:Reduced Support Vector Machines,Proc of the First SLAM International on Data Mining,Chicago,2001.
    [29] SrbuC,Na爧cu-Briciu R D,Kot-Wasik A,et al.Food Chemistry,2012,130(4):994.

© 2004-2018 中国地质图书馆版权所有 京ICP备05064691号 京公网安备11010802017129号

地址:北京市海淀区学院路29号 邮编:100083

电话:办公室:(+86 10)66554848;文献借阅、咨询服务、科技查新:66554700