What AlphaGo Can Teach Us About How People Learn – WIRED

David Silver of DeepMind, who helped create the program that defeated a Go champion, thinks rewards are central to how machines—and humans—acquire knowledge.

We are, of course, looking at ways to apply MuZero to real-world problems, and there are some encouraging initial results. To give a concrete example, traffic on the internet is dominated by video, and a big open problem is how to compress those videos as efficiently as possible. You can think of this as a reinforcement learning problem because there are these very complicated programs that compress the video, but what you see next is unknown. But when you plug something like MuZero into it, our…
