摘要
对肿瘤信息基因的提取及基因表达谱数据的处理,是基因表达谱研究中至关重要的一步。基于图谱理论提出一种新的DLBCL信息基因提取方法。首先,对每一个基因在不同条件下的表达情况构图,使其便于利用图论知识挖掘规律;进而进行奇异值分解(SVD)获取图的谱信息,刻画出该基因的表达规律,根据图谱与理想模板的余弦夹角及距离的运算选取信息基因子集;最后,实验了两组公开数据集,实验结果验证了方法的可行性。
The extraction of tumor information genes and the processing of gene expression profile data is an important step in the study of gene expression profile. Based on map theory,a new DLBCL information gene extraction method is proposed. Firstly,the composition of the expression of each gene under different conditions makes it easy to exploit the law of graph theory. Then singular value decomposition( SVD) is carried out to obtain the spectral information of the graph,and the expression rule of this gene is described. A subset of information genes is selected according to the calculation of cosine angle and distance between the graph and the ideal template. Finally,two sets of open data sets were tested and the feasibility of the method was verified.
引文
[1]谭青青,王年,苏亮亮,等.基于双正交非负矩阵三因式的肿瘤识别[J].安徽大学学报(自然科学版),2015,39(06):29-36.
[2]王年,庄振华,唐俊,等.基于Fiedler向量的基因表达谱数据分类方法[J].中国生物工程杂志,2010,30(12):82-86.
[3]尹婷婷.基因表达谱识别算法研究[D].南京林业大学,2015.
[4]郭志鹏.肿瘤基因表达谱的数据挖掘与识别分类[D].北京理工大学,2015.
[5]甘斌.基于稀疏性理论的肿瘤基因表达谱分类[D].曲阜师范大学,2015.
[6] Priyank Shah,etc. Inhibits Proliferation,Induces Apoptosis and Alters Gene Expression Profiles in Breast Cancer Cells[J].Pharmacological Reports,2016,68(3).
[7] Tobias Heckmann,etc. Graph theory—Recent Developments of its Application in Geomorphology[J]. Geomorphology,2015,243.