人工智能时代国家战略行为的模式变迁—

人工智能时代国家战略行为的模式变迁——走向数据与算法的竞争

英文篇名：Changing Models of Strategic Behavior in the Age of Artificial Intelligence
作者：封帅 ; 周亦奇
英文作者：FENG Shuai;ZHOU Yiqi;
关键词：人工智能 ; 深度学习 ; 国家战略与决策 ; 模式变迁 ; 数据 ; 算法
英文关键词：Artificial Intelligence;;deep learning;;strategic behavior;;data;;algorithm
中文刊名：GJZW
英文刊名：Global Review
机构：上海国际问题研究院;复旦大学国际关系与公共事务学院;
出版日期：2018-07-15
出版单位：国际展望
年：2018
期：v.10;No.55
基金：国家社会科学基金青年项目“人工智能时代的国际关系研究”(18CGJ013)的阶段性成果
语种：中文;
页：GJZW201804003
页数：28
CN：04
ISSN：31-1041/D
分类号：38-63+157-158

摘要

深度学习算法的突破使人工智能技术进入高速发展阶段,技术进步带来的破壁效应推动了国际战略领域的深刻变革。依托持续完善的算法、不断提升的计算能力和海量的数据资源,人工智能技术能够为国际行为体提供更系统的战略评估和精准的战略决策,保证更高效的战略动员与执行。随着新技术逐渐渗入战略行为的各个环节,战略领域将先后出现"先发优势"和"数据—算法竞争"两种新的战略行为模式,最终推动人类的战略行为形态从人与人的竞争逐步走向数据与算法的竞争。当然,在战略行为模式重构过程中,将不可避免地遭遇黑箱状态、鸿沟效应等潜在风险。因此我们需要努力降低其中的可能威胁,推动人类社会走向一个更加稳定而繁荣的新时代。
The breakthrough of deep learning algorithm has ushered in a new era of rapid development for artificial intelligence, and technological progress has facilitated the profound changes in the international strategic field. The function of in-depth learning is highly consistent with the inherent characteristics of the strategic field, which makes artificial intelligence technology obtain an effective way to fully intervene in strategic behavior. Relying on the continuous improvement of algorithms, the continuous improvement of computing power, and the continuous accumulation of data resources, artificial intelligence technology has been able to provide a more systematic strategic assessment and more accurate strategic decision-making for the government to ensure that the strategy implementation process is more efficient. The pattern of human strategic behavior will be changed gradually, and two new patterns of strategic behavior will appear successively in the strategic field, namely, "first-mover advantage" and "data-algorithmic competition." Finally, it precipitate the transition from human competition to competition between data and algorithm. In the process of strategic behavior pattern reconstruction, it will inevitably encounter new risks and challenges. International relations researchers need to explore all kinds of unknown situations that may appear in the process of pattern change, try to reduce the possible threats in the process of change, and promote human society to a more stable and prosperous new era.

引文

(1)关于本轮以深度学习为代表的人工智能高速发展周期的基本技术背景可参考:Geoffrey E.Hinton,Simon Osindero,and Yee-Whye Teh,“A Fast Learning Algorithm for Deep Belief Nets,”Neural Computation,Vol.18,No.7,July 2006,pp.1527-1554;Pedro Domingos,The Master Algorithm:How the Quest for the Ultimate Learning Machine will Remake Our World,New York:Basic Books,2015;Yoshua Bengio,Pascal Lamblin,Dan Popovici,Hugo Larochelle,and Greedy Layer Wise,“Training of Deep Networks,”in J.Platt,et al.eds.,Advances in Neural Information Processing Systems(NIPS 2006),Cambridge:MIT Press,2007,pp.153-160;Marc’Aurelio Ranzato,Christopher Poultney,Sumit Chopra,and Yann Le Cun,“Efficient Learning of Sparse Representations with an Energy-Based Model,”in J.Platt,et al.eds.,Advances in Neural Information Processing Systems(NIPS 2006),pp.1137-1144;[美]伊恩·古德费洛、[加]约书亚·本吉奥、[加]亚伦·库维尔:《深度学习》,赵申剑等译,人民邮电出版社2017年版。
    (2)参见Stephen J.Cimbala,ed.,Artificial Intelligence and National Security,Lexington,MA:Lexington Books,1987;Allan M.Din,ed.,Arms and Artificial Intelligence:Weapon and Arms Control Applications of Advanced Computing,New York:Oxford University Press,1988;Valerie M.Hudson,ed.,Artificial Intelligence and International Politics,San Francisco:Westview Press,1991;Ennals Richard,Artificial Intelligence and Human Institutions,London:Springer-Verlag,1991;George F.Luger,Artificial Intelligence,Structures and Strategies for Complex Problem Solving,Harlow,England:Addison-Wesley,2005;等等。
    (1)参见Erik Brynjolfsson,and,Andrew Mcfee,The Second Machine Age:Work,Progress,and Prosperity in a Time of Brilliant Technologies,New York:Norton,2014;Martin Ford,Rise of the Robots:Technology and the Threat of a Jobless Future,New York:Basic Books,2015;Jerry Kaplan,Humans Need not Apply:A Guide to Wealth and Work in the Age of Artificial Intelligence,New Haven and London:Yale University Press,2015;Kareem Ayoub,and Kenneth Payne,“Strategy in the Age of Artificial Intelligence,”The Journal of Strategic Studies,Vol.39,No.5-6,2016,pp.793-819;Vincent Boulanin,Maaike Verbruggen,Mapping the Development of Autonomy in Weapon Systems,Stockholm International Peace Research Institute,2017;等等。
    (2)国内相关问题的主要研究成果包括:封帅:《人工智能时代的国际关系:走向变革且不平等的世界》,《外交评论》2018年第1期,第128-156页;董青岭:《机器学习与冲突预测——国际关系研究的一个跨学科视角》,《世界经济与政治》2017年第7期,第100-117页;庞珣:《定量预测的风险来源与处理方法——以“高烈度政治动荡”预测研究项目的再分析为例》,《国际政治科学》2017年第1期,第1-32页。
    (3)目前,在中国国际关系领域,以人工智能或机器学习为主要研究内容的研究团队数量仍比较有限,主要包括清华大学的庞珣团队、对外经贸大学的董青岭团队、华东政法大学的高奇琦团队、暨南大学的陈定定团队、中国国际问题研究院团队以及由笔者和部分同事组成的上海国际问题研究院团队。总体来说,中国学者对该问题的研究仍处于起步阶段,学术共同体尚未完全形成。
    (4)唐世平、王凯主编:《历史中的战略行为:一个战略思维教程》,北京大学出版社2015年版,第3页。需要说明的是,本文为了更准确说明战略领域的整体变化情况,在对国内外各种战略分析模式进行对比之后,选择了唐世平老师提出的“战略行为”概念作为基本分析框架。我们认为这种分析框架能够更好地概括整个战略活动的各个环节,也能够更加全面地展示战略领域的基本状态。
    (1)事实上,不同时代的学者对“战略”这一概念均有多重界定,各种概念虽存在外延上的差别,但其核心逻辑基本一致,对本文的研究结论影响较小。对于战略概念的不同界定和应用,可参考李际均:《论战略》,解放军出版社2002年版;叶自成:《中国大战略》,中国社会科学出版社2003年版;钮先钟:《战略研究》,广西师范大学出版社2003年版;杨洁勉:《大合作:变化中的世界和中国国际战略》,天津人民出版社2005年版;时殷弘:《战略问题三十篇——中国对外战略思考》,中国人民大学出版社2008年版;周丕启:《大战略分析》,上海人民出版社2009年版;门洪华:《中国国际战略导论》,清华大学出版社2009年版。还包括Henry Kissinger,Diplomacy,New York:Simon and Schuster,1994;Zbignew Brezinski,The Grand Chessboard:American Primacy and Its Geostrategic Imperatives,New York:Basic Books,1997;John Lewis Gaddis,Strategies of Containment:A Critical Appraisal of American National Security Policy During the Cold War,New York:Oxford University Press,2005;等等。
    (2)唐世平、王凯主编:《历史中的战略行为:一个战略思维教程》,第11-13页。
    (1)参见[德]马克斯·韦伯:《经济与社会》(上卷),林荣远译,商务印书馆1997年版,第278-286页。
    (1)封帅:《人工智能时代的国际关系:走向变革且不平等的世界》,第129页。
    (2)关于人工智能技术进步的进程,可参考Ray Kurzweil,The Singularity is Near:When Humans Transcend Biology,New York:Viking Press,2005;Ian Goodfellow,Yoshua Bengio,Aaron Courville,Deep Learning,Cambridge:MIT Press,2016;尼克:《人工智能简史》,人民邮电出版社2017年版。
    (3)[美]伊恩·古德费洛、[加]约书亚·本吉奥、[加]亚伦·库维尔:《深度学习》,第2页。
    (1)需要强调的是,由深度学习算法所引领的人工智能技术突破推动了领域性人工智能技术(Artificial Narrow Intelligence)的进步,相关成果只能在特定的垂直领域或应用场景中发挥作用,因此,也被称之为“弱人工智能”技术。目前,该技术距离能够完全模拟人类认知活动,并且实现跨领域、跨场景发挥作用的通用人工智能(General AI)(即“强人工智能”)仍有很大的距离。现阶段我们关于人工智能在国际关系领域的所有讨论都是基于“弱人工智能”时代的技术水平而进行的,这也是科学研究和科学幻想的界限。二者的区别可参见Executive Office of the President,National Science and Technology Council,“Committee of Technology,Preparing for the Future of Artificial Intelligence,”October 2016,p.7,https://obamawhitehouse.archives.gov/sites/default/files/whitehouse_files/microsites/ostp/NSTC/pr eparing_for_the_future_of_ai.pdf。
    (2)针对“弱人工智能”时代技术的主要功能,很多研究者都进行了非常有价值的分析与概括,笔者认为麦肯锡公司(Mc Kinsey&Company)的“四功能说”是其中最准确、最全面的概括。相关研究报告参见Mc Kinsey Global Institute,“Artificial Intelligence:the Next Digital Frontier?”Discussion Paper,June 2017,https://www.mckinsey.com/~/media/Mc Kinsey/Industries/Advanced%20Electronics/Our%20Insights/How%20artificial%20intelligence%20can%20deliver%20real%20value%20to%20companies/MGI-Artificial-Intelligence-Discussion-paper.ashx.以及中文版研究报告:《中国人工智能的未来之路》,2017年3月,http://www.mckinsey.com.cn/wp-content/uploads/2017/03/CDF_Mc Kinsey_AI_CN_final.pdf。
    (3)例如,科大讯飞的语音识别技术、谷歌的智能识图技术都是这一功能的典型成果。
    (1)例如,淘宝等电商平台针对使用者的消费偏好进行的广告投放行为,以及在脸书平台上出现的针对不同用户的政治类广告投放,都是这一功能的体现。
    (2)这一功能的应用范围很广,从优步、滴滴的出行路线规划,电影票的动态定价系统,到Wealthfront的资产管理计划推荐,都是人工智能技术的决策功能的体现。而由谷歌旗下的Deepmind公司开发的Alpha Go系统则是人工智能决策功能最优秀的代表之一,它实际上将围棋这一人类最为深奥和复杂的智力活动分解为多轮决策过程,并最终提供了出色的决策结果。
    (3)例如,Waymo、百度等企业开发自动驾驶技术就是集成解决方案最为典型的案例,其建立基于人工智能的综合解决平台,集成多项领域性技术,满足更为复杂的社会需求。
    (1)See Robert Jervis,System Effects:Complexity in Political and Social Life,Princeton:Princeton University Press,1997.
    (2)前者类似于围棋、国际象棋等博弈游戏,双方在信息完全开放的情况下进行博弈。随着Alpha Go等人工智能系统在围棋上占据压倒性优势后,人类在所有完全信息博弈游戏中已经处于下风。后者则是以德州扑克等游戏为代表,博弈的任何参与者不能完全掌握其他行动者的信息。在德州扑克竞赛中,Libratus等人工智能系统同样通过“强化学习”的方式在比赛中轻易战胜顶级扑克玩家。现实中的战略博弈更接近非完全信息博弈活动。相关技术性成果可参考NIPS2017最佳论文:Noam Brown and Tuomas Sandholm,“Safe and Nested Subgame Solving for Imperfect-Information Games,”https://arxiv.org/pdf/1705.02955.pdf。
    (1)目前以人工智能技术获取信息资料的渠道已经非常丰富,形成了自然语言处理数据集、计算机视觉、图像识别数据集等数十个数据集系统。相关链接汇总参见https://www.jianshu.com/p/8c7c1af7d593。
    (2)相关研究案例可参见Tamar Mitts,“From Isolation to Radicalization:Anti-Muslim Hostility and Support for ISIS in the West,”Working Paper,March 31,2017,https://papers.ssrn.com/sol3/papers.cfm?abstract_id=2795660。
    (1)Yanchuan Sim,Brice.D.LAcree,Justin H.Gross,and Noah A.Smith,“Measuring Ideological Proportions in Political Speeches,”Proceedings of 2013 Conference on Empirical Methods in Natural Language Processing,Seattle Washington,October 18-21,2013,pp.91-101.
    (2)Robert Aexlrod,“The Dissemination of Culture:A Model with Local Convergence and Global Polarization,”Journal of Conflict Resolution,Vol.41,No.2,1997,pp.203-226.
    (3)Lars-Erik Cederman,“Emergent Polarity:Analyzing State-Formation and Power Politics,”International Studies Quarterly,Vol.38,No.5,1994,pp.501–533.
    (4)例如,美国Five Thirty Eight网站对竞选和体育比赛的预测活动。参见http://fivethirtyeight.com/。
    (5)例如,著名学者罗伯特·阿克塞尔罗德(Robert Axelord)所提出的合作进化战略,就是以ABM模型进行的模拟为基础。参见[美]罗伯特·阿克塞尔罗德:《合作的进化》,吴坚忠译,上海人民出版社2007年版。
    (6)该领域的研究可参考董青岭:《机器学习与冲突预测——国际关系研究的一个跨学科视角》,第100-117页;Michael D.Ward,et al.,“Learning from the Past and Stepping into the Future:Toward a New Generation of Conflict Prediction,”International Studies Review,Vol.15,No.4,2013,pp.473-490;Skyler J.Cranmer and Bruce A.Desmarais,“What Can We Learn from Predictive Modeling?”Political Analysis,Vol.25,No.2,2017,pp.145-166;H?vard Hegre,Joakim Karlsen,H?vard Mokleiv Nyg?rd,H?vard Strand,and Henrik Urdal,“Predicting Armed Conflict,2010-2050,”International Studies Quarterly,Vol.57,No.2,2012,pp.250-270。
    (1)Neil Johnson,Spencer Carran,Joel Botner,Kyle Fontaine,Nathan Laxague,Philip Nuetzel,Jessica Turnley,and Brian Tivnan,“Pattern in Escalations in Insurgent and Terrorist Activity,”Science,Vol.333,No.6038,2011,pp.81-84;and H?vard Hegre,H?vard Mokleiv Nyg?rd,and Ranveig Flaten R?der,“Evaluating the Scope and Intensity of the Conflict Trap:A Dynamic Simulation Approach,”Journal of Peace Research,Vol.54,No.2,2017,pp.243-261.
    (2)参见Tang Shiping,“Fear in International Politics:Two Positions,”International Studies Review,Vol.10,No.3,2008,pp.451-471。
    (1)Jorge Galindo and Pablo Tamayo,“Credit Risk Assessment Using Statistical and Machine Learning:Basic Methodology and Risk Modeling Applications,”Computational Economic,Vol.15,No.1-2,April 2000,pp.107-143.
    (2)客观而言,众筹式决策与精英式决策孰优孰劣,目前尚无定论。有研究比较了这两种方式在预测冲突发生时的准确度,发现两者都不能令人完全满意,而其中长期在新闻媒体上撰写评论文章的所谓专家给出的判断最为糟糕。Naomi Bosler and Gerald Schneider,“The Oracle or the Crowd?Experts Versus the Stock Market in Forecasting Ceasefire Success in the Levant,”Journal of Peace Research,Vol.54,No.4,2011,pp.231-242。
    (3)Michael Colaresi and Zuhaib Mahmood,“Do the Robot:Lessons from Machine Learning to Improve Conflict Forecasting,”Journal of Peace Research,Vol.54,No.2,2017,pp.193-214.
    (4)Kareem Ayoub and Kenneth Payne,“Strategy in the Age of Artificial Intelligence,”The Journal of Strategic Studies,Vol.39,No.5-6,2016,p.803.
    (1)关于Alpha Go的“进化”,可以比较Deepmind团队在两次围棋人机大战之后发表的论文,2016年,Alpha Go主要采用MCTS蒙特卡洛搜索树进行对弈演算,从人类棋谱中学习。但2017年,Alpha Go Zero通过算法调整,采用上限置信区间算法(UCT),完全抛开了人类棋谱,进行自对弈强化学习,利用训练好的神经网络,运用博弈树形搜索将胜率在原来基础上提高了近一倍。这些使系统提升的方法很可能被运用到战略博弈中。参见David Silver et al.,“Mastering the Game of Go with Deep Neural Networks and Tree Search,”Nature,Vol.529,No.7587,2016,pp.484-489;David Silver et al.,“Mastering the Game of Go without Human Knowledge,”Nature,Vol.550,No.7676,2017,pp.354-359。
    (2)此外,人类在战略决策过程中还会受到很多无法预知的情绪因素的影响,例如,一些决策者对面子、荣誉等因素的渴望会直接影响其决策方向;而另一些决策者因性格上的谨慎、怯懦而倾向于夸大风险,使整个决策趋于保守。这些现象都是人类心理的正常状态,但也是非理性战略决策的重要根源。可参考Richard Ned Lebow,Between Peace and War:The Nature of International Crisis,Baltimore:Johns Hopkins University Press,1984。
    (3)关于身体因素对战略决策的影响,可参见[法]皮埃尔·阿考斯、[瑞]皮埃尔·朗契尼克:《病夫治国》,郭宏安译,江苏人民出版社2005年版。
    (1)参见David Silver et al.,“Mastering the Game of Go without Human Knowledge,”Nature,pp.354-359。
    (2)当然,具有优势并不等于机器智能在决策过程中不存在缺陷,机器决策系统也存在一些明显的缺陷,例如,机器决策无法理解模糊性目标,无法虑及政治体系内各种部门利益妥协的问题,纯粹的概率分析容易忽视人类追求小概率冒险行动的可能,等等。
    (1)Vincent Boulanin and Maaike Verbruggen,Mapping the Development of Autonomy in Weapon Systems,Stockholm International Peace Research Institute,2017,pp.27-29.
    (2)战略动员本身就是一项集体行动,在任何传统战略模式下,都会出现非常典型的“搭便车”行为和“代理人”行为。而在智能化武器系统逐渐成熟的条件下,国内战略动员在很大程度上将转变为对机器工具资源的调配使用,集体行动问题将会减少。关于这一问题可参见[美]曼瑟尔·奥尔森:《集体行动的逻辑》,陈郁等译,上海人民出版社2014年版。
    (1)关于国际体系进化的路径问题,唐世平提出的以“变异—选择—遗传”为基本模式的社会演化理论是一种非常具有说服力的研究思路。参见Tang Shiping,The Social Evolution of International Politics,Oxford:Oxford University Press,2013。
    (2)参见:[美]伊恩·古德费洛、[加]约书亚·本吉奥、[加]亚伦·库维尔,《深度学习》,第3页。
    (3)对于不同算法的内生性特点,可参考Pedro Domingos,The Master Algorithm:How the Quest for the Ultimate Learning Machine will Remake Our World,New York:Basic Books,2015。
    (1)虽然数据来源不稳定、小概率事件的发生等使人类博弈者在个别场景能够获得优势,但在与人工智能系统的长期博弈过程中,人类取得具有决定性意义的战略优势几乎已经不可能。这也应该成为我们思考未来国际体系变迁的一个背景条件。
    (2)关于人工智能技术可能导致国际社会不平等加剧的问题,可参见封帅:《人工智能时代的国际关系:走向变革且不平等的世界》,第141-142页。

地址：北京市海淀区学院路29号邮编：100083

电话：办公室：(+86 10)66554848；文献借阅、咨询服务、科技查新：66554700