OpenAI Product Presentation Deck slide image

OpenAI Product Presentation Deck

Before RL can't actually solve hard tasks The result AlphaGo (2016) 72-million parameter feedforward neural network, coupled with MCTS, to defeat top humans at Go After RL + MCTS can solve hard problems, given: discrete actions, modest action space, simulator at test time ICDEFGHIJKLMNOPO LEE SEDOL 00:01:00 ALPHAGO 00:00:54
View entire presentation