Real-world RL: DeepMind controls a fusion reactor: …The era of the centaur scientist cometh… DeepMind researchers have trained a reinforcement learning agent to shape the distribution of plasma in a tokamak fusion reactor. This requires training an agent that “can manipulate the magnetic field through a precise control of several coils that are magnetically coupled to the […]

If you (or anyone) still read this, you’re probably aware I’ve been banging on about Centaurs for a little while. I started idly sketching something that could become a shorthand for a ‘centaur’ actor in a system. The kind of visual shorthand that you might often use on whiteboards or in sketches of flows in […]

A couple of weeks ago, when AlphaGo beat a human opponent at Go, Jason Kottke noted: “Generally speaking, until recently machines were predictable and more or less easily understood. That’s central to the definition of a machine, you might say. You build them to do X, Y, & Z and that’s what they do. A car […]