Daily Archives: November 2, 2024


AlphaGo Zero Paper review

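Paper: Mastering the game of Go without human knowledge. https://www.nature.com/articles/nature24270

Key takeaways: The loss function sums the value (evaluation) loss and the policy loss, plus an L2 regularization term on the network weights. The value loss is the squared error between the predicted value v and the game outcome z; the policy loss is the cross-entropy between the network's move probabilities p and the MCTS search probabilities π; a regularization parameter c scales the weight penalty.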
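To make the three terms concrete, here is a minimal sketch of the loss for a single training position, l = (z - v)^2 - π·log p + c·‖θ‖². The function name `alphago_zero_loss` and the flat-list representation of the weights are my own assumptions for illustration, not code from the paper; the default c = 1e-4 is the value I recall from the paper's Methods section, so treat it as an assumption as well.

```python
import math

def alphago_zero_loss(z, v, pi, p, theta, c=1e-4):
    """Loss for one position (hypothetical helper, not from the paper's code).

    z     : game outcome from the current player's view (-1 or +1)
    v     : value predicted by the network
    pi    : MCTS search probabilities over moves
    p     : network move probabilities (same length as pi)
    theta : network weights flattened into a list (assumption for simplicity)
    c     : L2 regularization parameter
    """
    # Value (evaluation) loss: squared error between outcome and prediction.
    value_loss = (z - v) ** 2

    # Policy loss: cross-entropy between search and network probabilities.
    # Terms with pi_i == 0 contribute nothing, so skip them to avoid log(0).
    policy_loss = -sum(pi_i * math.log(p_i) for pi_i, p_i in zip(pi, p) if pi_i > 0)

    # L2 regularization on the weights.
    l2_penalty = c * sum(w * w for w in theta)

    return value_loss + policy_loss + l2_penalty


if __name__ == "__main__":
    loss = alphago_zero_loss(
        z=1.0, v=0.5, pi=[0.5, 0.5], p=[0.5, 0.5], theta=[1.0, 2.0], c=0.1
    )
    print(loss)
```

In a real training loop this would be averaged over a mini-batch and computed with an autodiff framework; the scalar version above is only meant to show how the three terms combine.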