用户名: 密码: 验证码:
Spark平台下的海表面温度并行预测算法
详细信息    查看全文 | 推荐本文 |
  • 英文篇名:Surface temperature parallel prediction algorithm under Spark platform
  • 作者:贺琪 ; 查铖 ; 孙苗 ; 姜晓轶 ; 戚福明 ; 黄冬梅 ; 宋巍
  • 英文作者:HE Qi;ZHA Cheng;SUN Miao;JIANG Xiao-yi;QI Fu-ming;HUANG Dong-mei;SONG Wei;Department of Information Technology, Shanghai Ocean University;National Marine Data and Information Service;Key Laboratory of Digital Ocean,State Oceanic Administration;Shanghai Bitu Information Technology CompanyLimited;Shanghai University of Electric Power;
  • 关键词:时间序列 ; 相似性度量 ; 海表面温度预测 ; Spark
  • 英文关键词:time series;;similarity measurement;;sea surface temperature prediction;;Spark
  • 中文刊名:海洋通报
  • 英文刊名:Marine Science Bulletin
  • 机构:上海海洋大学信息学院;国家海洋信息中心;国家海洋局数字海洋科学技术重点实验室;上海必途信息技术有限公司;上海电力大学;
  • 出版日期:2019-06-15
  • 出版单位:海洋通报
  • 年:2019
  • 期:03
  • 基金:海洋大数据分析预报技术研发基金(2016YFC1401902);; 国家海洋局数字海洋科学技术重点实验室开放基金(B201801029);; 上海市高校特聘教授(东方学者)项目(TP2016038);; 上海市科委部分地方院校能力建设项目(17050501900)
  • 语种:中文;
  • 页:43-52
  • 页数:10
  • CN:12-1076/P
  • ISSN:1001-6392
  • 分类号:P731.11
摘要
面对海量的海表面温度数据,如何使用大数据处理平台和新的处理技术来实时处理、分析并预测海表面温度数据,是一个亟待解决的问题。本文基于现阶段的时间序列方法和专家意见,首先,将类比合成方法引入到海表面温度预测应用中;其次,基于Spark平台提出了一种改进的快速DTW算法SparkDTW;最后,为了充分利用通过时间序列挖掘得到的信息,将SparkDTW与SVM相结合,提出了SparkDTW+SVM混合模型,为海表面温度预测的应用研究提供了较好的理论基础和技术支持。实验结果表明,SparkDTW算法预测精度优于SVM,提高了海表面温度预测效率,验证了将类比合成方法应用在海表面温度预测的可行性;SparkDTW+SVM在精度方面要优于SparkDTW和SVM,表明SVM模型能充分利用时间序列挖掘的信息,验证了SparkDTW+SVM在海表面温度预测的有效性。
        Faced with massive sea surface temperature data, how to use Spark platform and new processing technology to analyze, predict and process sea surface temperature data in real time has become a hot topic. Based on the current time series method and expert opinion, this paper first introduced the analog complexing method into the sea surface temperature prediction application.Then, based on the Spark platform, an improved fast DTW algorithm SparkDTW was proposed. Finally,in order to make full use of the information obtained through time series mining, SparkDTW and SVM were combined, and the SparkDTW +SVM hybrid model was proposed, which provides a good theoretical basis and technical support for the application research of global sea surface temperature prediction. The experimental results show that the prediction accuracy of SparkDTW algorithm is better than SVM, which improves the sea surface temperature prediction efficiency and verifies the feasibility of applying analogy complexing method to sea surface temperature prediction; Spark DTW +SVM is superior to SparkDTW and SVM in accuracy, which indicates that SVM model can make full use of time series mining information to verify the effectiveness of SparkDTW+SVM in sea surface temperature prediction.
引文
Agrawal R,Faloutsos C,Swami A,1993.Efficient similarity search in sequence databases.ProcFodo,730:69-84.
    Berndt D J,1994.Using dynamic time warping to find patterns in time series.Kdd Workshop.359-370.
    Casaroli G,Villa T,Bassani T,et al,2017.Numerical Prediction of the Mechanical Failure of the Intervertebral Disc under Complex Loading Conditions..Materials,10(1):31
    Jaramillo J,Velasquez J D,Franco C J,2017.Research in Financial Time Series Forecasting with SVM:Contributions from Literature.IEEELatin America Transactions,15(1):145-153.
    Li M,Liu X,Ding F,2017.The maximum likelihood least squares based iterative estimation algorithm for bilinear systems with autoregressive moving average noise.Journal of the Franklin Institute,354(12).
    Qiu X,Ren Y,Suganthan P N,et al,2017.Empirical Mode Decomposition based Ensemble Deep Learning for Load Demand Time Series Forecasting.Applied Soft Computing,54(C):246-255.
    Vapnik V N,1995.The Nature of Statistical Learning Theory.The nature of statistical learning theory.Springer:988-999.
    Wu J,Deng L,Jeon G,2017.Image Autoregressive Interpolation Model using GPU-Parallel Optimization.IEEE Transactions on Industrials Informatics,PP(99):1-1
    Yan H,Moradkhani H,Zarekarizi M,2017.A Probabilistic Drought Forecasting Framework:A Combined Dynamical and Statistical Approach.Journal of Hydrology,548:291-304.
    Zaharia M,Chowdhury M,Das T,et al,2012.Resilient distributed datasets:a fault-tolerant abstraction for in-memory cluster computing.Usenix Conference on Networked Systems Design and Implementation.USENIX Association:2-2.
    陈亮,王震,王刚,2017.深度学习框架下LSTM网络在短期电力负荷预测中的应用.电力信息与通信技术,15(5):8-11
    刘娜,王辉,凌铁军,等,2018.一个基于MOM的全球海洋数值同化预报系统.海洋通报,37(2):21-30.
    李其杰,李嘉康,赵颖,等,2017.EEMD、CEEMD算法与SVM在SST时间序列研究中的应用.数学的实践与认识,(7):221-228.
    李琰,范文静,骆敬新,等,2018.2017年中国近海海温和气温气候特征分析.海洋通报,37(3):296-302.
    孙建乐,2014.基于时间序列相似性的股价趋势预测研究.重庆:重庆交通大学.
    张建华,2003.海温预报知识讲座第一讲海水温度预报概况.海洋预报,(4):81-85.
    张美英,何杰,2011.时间序列预测模型研究综述.数学的实践与认识,41(18):000189-195.
    赵祥鸿,暴景阳,黄辰虎,等,2017.基于经验模态分解削弱多波束残余误差的方法.海洋通报,36(6):662-667.

© 2004-2018 中国地质图书馆版权所有 京ICP备05064691号 京公网安备11010802017129号

地址:北京市海淀区学院路29号 邮编:100083

电话:办公室:(+86 10)66554848;文献借阅、咨询服务、科技查新:66554700