大卫·席尔瓦 (计算机科学家)

大卫·席尔瓦
David Silver
大卫·席尔瓦; David Silver
国籍	英国
母校	剑桥大学（BSc）; 阿尔伯塔大学（PhD）
知名于	AlphaGo; AlphaZero; AlphaStar（英语：AlphaStar (software)）
奖项	ACM计算机奖（英语：ACM Prize in Computing）（2019）; 皇家学会院士（2021）
	科学生涯
研究领域	计算机科学
机构	DeepMind

大卫·席尔瓦 FRS （英语：David Silver，1976年—）是一名英国计算机科学家和商人。他领导DeepMind的强化学习研究小组，是AlphaGo、AlphaZero的首席研究员和AlphaStar（英语：AlphaStar (software)）的共同负责人。

教育

席尔瓦于1997年毕业于剑桥大学，获得阿迪生-韦斯利奖，并在那里与杰米斯·哈萨比斯结识^[1]。席尔瓦于2004年回到学术界，在阿尔伯塔大学攻读强化学习的博士学位，在那里他共同提出了用于第一个硕士级9×9围棋项目的算法，并于2009年毕业^[2]^[3]。他版本的程序MoGo是截至2009年的最强围棋程式之一^[4]。

职业生涯

大学毕业后，席尔瓦共同创立了电子游戏公司Elixir Studios（英语：Elixir Studios），并担任其首席技术官和首席程序员，获得多个技术和创新奖项^[1]^[5]。

席尔瓦在2011年被授予皇家学会大学研究奖学金，随后成为伦敦大学学院的讲师，现在是教授^[6]。他关于强化学习的讲座可以在YouTube上找到^[7]。席尔瓦从DeepMind成立之初就为其提供咨询，于2013年全职加入。

席尔瓦近期的研究重点是将强化学习与深度学习互相结合，包括一个直接从像素学习玩雅达利游戏的程式^[8]。席尔瓦领导了AlphaGo项目，最终使其成为第一个在全尺寸围棋游戏中击败顶级职业棋手的程式^[9]。随后AlphaGo获得荣誉的9段职业认证，并获得了康城狮子奖的创新奖^[10]。之后他领导了AlphaZero的开发工作，利用同样的人工智能从头开始学习下围棋（只通过自己下棋而不是从人类游戏中学习），然后以同样的方式学习下国际象棋和日本将棋，达到比其他任何计算机程序更高的等级。

席尔瓦是DeepMind发表文章最多的员工之一，引用次数超过130,000次，h指数为78^[11]。

他因在电脑游戏方面取得的突破性进展而被授予2019年ACM计算机奖（英语：ACM Prize in Computing）^[12]

2021年，席尔瓦因其对深度Q-学习和AlphaGo的贡献而被选为英国皇家学会院士^[13]。

参考资料

^ ^1.0 ^1.1 Shead, Sam. David Silver: The unsung hero and intellectual powerhouse at Google DeepMind. Business Insider. [2020-09-26]. （原始内容存档于2022-11-16）.
^ David, Silver. Reinforcement Learning and Simulation-Based Search in Computer Go. ERA. 2009. doi:10.7939/R39D8T （英语）.
^ Sylvain Gelly, David Silver. Achieving Master Level Play in 9 × 9 Computer Go (PDF). Proceedings of the Twenty-Third AAAI Conference on Artificial Intelligence. 2008 [2023-02-23]. （原始内容存档 (PDF)于2022-04-03）.
^ Stuart J. Russell, Peter Norvig. Artificial Intelligence: A Modern Approach 3rd. Prentice Hall. 2009.
^ What the AI Behind AlphaGo Can Teach Us About Being Human. Wired.com. [17 May 2016]. （原始内容存档于2016-05-29）.
^ CSML | David Silver. www.csml.ucl.ac.uk. [2017-05-27]. （原始内容存档于2021-04-24）（美国英语）.
^ RL Course by David Silver - Lecture 1: Introduction to Reinforcement Learning. May 13, 2015 [2023-02-23]. （原始内容存档于2023-02-25） –通过YouTube.
^ Mnih, Volodymyr; Kavukcuoglu, Koray; Silver, David; Rusu, Andrei A.; Veness, Joel; Bellemare, Marc G.; Graves, Alex; Riedmiller, Martin; Fidjeland, Andreas K. Human-level control through deep reinforcement learning. Nature. 2015-02-26, 518 (7540): 529–533. Bibcode:2015Natur.518..529M. ISSN 0028-0836. PMID 25719670. S2CID 205242740. doi:10.1038/nature14236 （英语）.
^ Silver, David; Huang, Aja; Maddison, Chris J.; Guez, Arthur; Sifre, Laurent; Driessche, George van den; Schrittwieser, Julian; Antonoglou, Ioannis; Panneershelvam, Veda; Lanctot, Marc; Dieleman, Sander; Grewe, Dominik; Nham, John; Kalchbrenner, Nal; Sutskever, Ilya; Lillicrap, Timothy; Leach, Madeleine; Kavukcuoglu, Koray; Graepel, Thore; Hassabis, Demis. Mastering the game of Go with deep neural networks and tree search. Nature. 28 January 2016, 529 (7587): 484–489. Bibcode:2016Natur.529..484S. ISSN 0028-0836. PMID 26819042. S2CID 515925. doi:10.1038/nature16961.
^ Google DeepMind AlphaGo in U.K. Wins Innovation Grand Prix. [2017-05-27]. （原始内容存档于2016-07-31）（英语）.
^ David Silver – Google Scholar Citations. [2022-02-01]. （原始内容存档于2023-03-25）.
^ Ormond, Jim. ACM Prize in Computing Awarded to AlphaGo Developer: David Silver Recognized for Breakthrough Advances in Computer Game-Playing. acm.org. [2020-04-02]. （原始内容存档于2023-03-07）.
^ Royal Society elects outstanding new Fellows and Foreign Members. royalsociety.org. [2021-06-08]. （原始内容存档于2021-05-06）.

[Unsung_Hero-1] 1.0 ^1.1 Shead, Sam. David Silver: The unsung hero and intellectual powerhouse at Google DeepMind. Business Insider. [2020-09-26]. （原始内容存档于2022-11-16）.

[2] David, Silver. Reinforcement Learning and Simulation-Based Search in Computer Go. ERA. 2009. doi:10.7939/R39D8T （英语）.

[3] Sylvain Gelly, David Silver. Achieving Master Level Play in 9 × 9 Computer Go (PDF). Proceedings of the Twenty-Third AAAI Conference on Artificial Intelligence. 2008 [2023-02-23]. （原始内容存档 (PDF)于2022-04-03）.

[4] Stuart J. Russell, Peter Norvig. Artificial Intelligence: A Modern Approach 3rd. Prentice Hall. 2009.

[MyUser_Wired.com_May_17_2016c-5] What the AI Behind AlphaGo Can Teach Us About Being Human. Wired.com. [17 May 2016]. （原始内容存档于2016-05-29）.

[6] CSML | David Silver. www.csml.ucl.ac.uk. [2017-05-27]. （原始内容存档于2021-04-24）（美国英语）.

[7] RL Course by David Silver - Lecture 1: Introduction to Reinforcement Learning. May 13, 2015 [2023-02-23]. （原始内容存档于2023-02-25） –通过YouTube.

[8] Mnih, Volodymyr; Kavukcuoglu, Koray; Silver, David; Rusu, Andrei A.; Veness, Joel; Bellemare, Marc G.; Graves, Alex; Riedmiller, Martin; Fidjeland, Andreas K. Human-level control through deep reinforcement learning. Nature. 2015-02-26, 518 (7540): 529–533. Bibcode:2015Natur.518..529M. ISSN 0028-0836. PMID 25719670. S2CID 205242740. doi:10.1038/nature14236 （英语）.

[9] Silver, David; Huang, Aja; Maddison, Chris J.; Guez, Arthur; Sifre, Laurent; Driessche, George van den; Schrittwieser, Julian; Antonoglou, Ioannis; Panneershelvam, Veda; Lanctot, Marc; Dieleman, Sander; Grewe, Dominik; Nham, John; Kalchbrenner, Nal; Sutskever, Ilya; Lillicrap, Timothy; Leach, Madeleine; Kavukcuoglu, Koray; Graepel, Thore; Hassabis, Demis. Mastering the game of Go with deep neural networks and tree search. Nature. 28 January 2016, 529 (7587): 484–489. Bibcode:2016Natur.529..484S. ISSN 0028-0836. PMID 26819042. S2CID 515925. doi:10.1038/nature16961.

[10] Google DeepMind AlphaGo in U.K. Wins Innovation Grand Prix. [2017-05-27]. （原始内容存档于2016-07-31）（英语）.

[MyUser_Https:_May_17_2016c-11] David Silver – Google Scholar Citations. [2022-02-01]. （原始内容存档于2023-03-25）.

[12] Ormond, Jim. ACM Prize in Computing Awarded to AlphaGo Developer: David Silver Recognized for Breakthrough Advances in Computer Game-Playing. acm.org. [2020-04-02]. （原始内容存档于2023-03-07）.

[13] Royal Society elects outstanding new Fellows and Foreign Members. royalsociety.org. [2021-06-08]. （原始内容存档于2021-05-06）.

[1]

[2]

[3]

[4]

[5]

[6]

[7]

[8]

[9]

[10]

[11]

[12]

[13]