PodcastsÉducationTic-Tac-Toe the Hard Way

Tic-Tac-Toe the Hard Way

People + AI Research
Tic-Tac-Toe the Hard Way
Dernier épisode

10 épisodes

  • Tic-Tac-Toe the Hard Way

    Lessons learned

    22/07/2020 | 33 min
    What have we learned about machine learning and the human decisions that shape it? And is machine learning perhaps changing our minds about how the world outside of machine learning — also known as the world — works?
    For more information about the show, check out pair.withgoogle.com/thehardway/.

    You can reach out to the hosts on Twitter: @dweinberger and @tafsiri.
  • Tic-Tac-Toe the Hard Way

    Head to Head: The Even Bigger ML Smackdown!

    22/07/2020 | 24 min
    Yannick and David’s systems play against each other in 500 games. Who’s going to win? And what can we learn about how the ML may be working by thinking about the results?
    See the agents play each other in Tic-Tac-Two!

    For more information about the show, check out pair.withgoogle.com/thehardway/.

    You can reach out to the hosts on Twitter: @dweinberger and @tafsiri.
  • Tic-Tac-Toe the Hard Way

    Enter tic-tac-two

    22/07/2020 | 21 min
    David’s variant of tic-tac-toe that we’re calling tic-tac-two is only slightly different but turns out to be far more complex. This requires rethinking what the ML system will need in order to learn how to play, and  how to represent that data.
    For more information about the show, check out pair.withgoogle.com/thehardway/.

    You can reach out to the hosts on Twitter: @dweinberger and @tafsiri.
  • Tic-Tac-Toe the Hard Way

    Head to Head: the Big ML Smackdown!

    22/07/2020 | 25 min
    David and Yannick’s tic-tac-toe ML agents face-off against each other in tic-tac-toe!
    See the agents play each other!

    For more information about the show, check out pair.withgoogle.com/thehardway/.

    You can reach out to the hosts on Twitter: @dweinberger and @tafsiri.
  • Tic-Tac-Toe the Hard Way

    Give that model a treat! : Reinforcement learning explained

    22/07/2020 | 26 min
    Switching gears, we focus on how Yannick’s been training his model using reinforcement learning.  He explains the differences from David’s supervised learning approach. We find out how his system performs against a player that makes random tic-tac-toe moves.
    Resources: 
    Deep Learning for JavaScript book
    Playing Atari with Deep Reinforcement Learning
    Two Minute Papers episode on Atari DQN
    For more information about the show, check out pair.withgoogle.com/thehardway/.

    You can reach out to the hosts on Twitter: @dweinberger and @tafsiri.

Plus de podcasts Éducation

À propos de Tic-Tac-Toe the Hard Way

A writer and a software engineer from Google's People + AI Research team explore the human choices that shape machine learning systems by building competing tic-tac-toe agents.
Site web du podcast

Écoutez Tic-Tac-Toe the Hard Way, Vivons heureux avant la fin du monde : des idées pour repenser nos modèles de société ou d'autres podcasts du monde entier - avec l'app de radio.fr

Obtenez l’app radio.fr
 gratuite

  • Ajout de radios et podcasts en favoris
  • Diffusion via Wi-Fi ou Bluetooth
  • Carplay & Android Auto compatibles
  • Et encore plus de fonctionnalités
Applications
Réseaux sociaux
v8.7.2 | © 2007-2026 radio.de GmbH
Generated: 3/14/2026 - 11:25:23 AM