AlphaGo Zero Paper Review
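Paper: Mastering the game of Go without human knowledge.
https://www.nature.com/articles/nature24270
Key takeaways: The loss function sums the value (evaluation) loss, a mean-squared error between the predicted value and the game outcome, and the policy loss, a cross-entropy between the predicted move probabilities and the MCTS search probabilities, together with an L2 weight regularization term scaled by a parameter c.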
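To make the shape of the loss concrete, here is a minimal sketch for a single training position, written as l = (z - v)^2 - pi^T log p + c * ||theta||^2. The function name alphago_zero_loss, the argument names, and the default value of c are illustrative assumptions, not taken from any released code; a real implementation would average over a mini-batch and use a tensor library.

```python
import math


def alphago_zero_loss(z, v, pi, p, theta, c=1e-4):
    """Illustrative per-position loss: value loss + policy loss + L2 penalty.

    z     : game outcome from the current player's view (e.g. -1 or +1)
    v     : value predicted by the network
    pi    : search probabilities over moves (assumed to sum to 1)
    p     : move probabilities predicted by the network
            (assumed strictly positive wherever pi is positive)
    theta : flat list of network weights (hypothetical stand-in)
    c     : regularization strength (default chosen for illustration)
    """
    # Value (evaluation) loss: squared error between outcome and prediction.
    value_loss = (z - v) ** 2
    # Policy loss: cross-entropy between search probabilities and the policy.
    policy_loss = -sum(t * math.log(q) for t, q in zip(pi, p) if t > 0)
    # L2 weight regularization.
    l2_penalty = c * sum(w * w for w in theta)
    return value_loss + policy_loss + l2_penalty


if __name__ == "__main__":
    loss = alphago_zero_loss(
        z=1.0, v=0.5, pi=[0.5, 0.5], p=[0.5, 0.5], theta=[1.0, 2.0], c=0.1
    )
    print(loss)
```

The three terms are simply added, so the value and policy heads are trained jointly from the same gradient step, while c controls how strongly large weights are penalized.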