Neurogammon

Neurogammon: Neurogammon is a computer backgammon program written by Gerald Tesauro at IBM's Thomas J. Watson Research Center. It was the first viable computer backgammon program implemented as a neural net, and set a new standard in computer backgammon play. It won the 1st Computer Olympiad in London in 1989, handily defeating all opponents.^[1] Its level of play was that of an intermediate-level human player.^[2]

Neurogammon contains seven separate neural networks, each with a single hidden layer. One network makes doubling-cube decisions; the other six choose moves at different stages of the game. The networks were trained by backpropagation from transcripts of 400 games in which the author played himself. The author's move was taught as the best move in each position.

In 1992, Tesauro completed TD-Gammon, which combined a form of unsupervised learning with the human-designed input features of Neurogammon, and played at the level of a world-class human tournament player.

References

^ Tesauro, Gerald (1989). "Neurogammon Wins Computer Olympiad" (PDF). Neural Computation 1: 321–323. doi:10.1162/neco.1989.1.3.321. http://www.mitpressjournals.org/doi/pdf/10.1162/neco.1989.1.3.321. Retrieved 2010-02-20.

^ Tesauro, Gerald (March 1995). "Temporal Difference Learning and TD-Gammon". Communications of the ACM 38 (3). http://www.research.ibm.com/massive/tdl.html. Retrieved 2010-02-08.

Categories:
Backgammon

Игры ⚽ Нужно сделать НИР?

1st Computer Olympiad — The 1st Computer Olympiad took place at the Park Lane Hotel in London, UK from 9 August, 1989 to 15 August, 1989. In this Computer Olympiad, computer programs competed against each other at a variety of games, including Awari, Backgammon, Bridge … Wikipedia

Academic Dictionaries and Encyclopedias