Hacker News

Playing hard exploration games by watching YouTube

by indescions_2018on 5/30/2018, 12:17:25 PM with 6 comments

by sleepychuon 5/30/2018, 12:54:21 PM
Neat, I don't understand what they mean by having embedded a reward video into the set. Is that a video where copying the behaviour will deliver victory?
by eric_hon 5/30/2018, 4:39:28 PM
here's video of the agent actually playing (linked in the paper): https://www.youtube.com/watch?v=Msy82sIfprI
by jexahon 5/30/2018, 1:14:08 PM
This is really cool. A step in the right direction towards general learning through observation.
by erikbon 5/30/2018, 4:30:55 PM
This is actually quite human. I also watch Let's plays if I struggle with a quest (or game in general).
Also interesting assumption to say "harder = fewer rewards". Probably doesn't always apply but is a good generalization.
by jonbaeron 5/30/2018, 1:35:14 PM
Are audio cues also analyzed here? ie: "We observe that use of the audio signal in CMC results in more emphasis being placed on key items and their location in the inventory"
by navaation 5/30/2018, 12:55:48 PM
This should probably say "ML" or "AI" or whatever, I was slightly disappointed to realize it was not a funny paper about… I don't know to be fair.