Self-taught computer program finds super poker strategy

ASSOCIATED PRESS
This undated photo provided by the University of Alberta shows University of Alberta researcher Michael Bowling, left, and team member Michael Johanson, right. A computer program that taught itself to play poker has created nearly the best possible strategy for one version of the game, showing the value of techniques that may prove useful for real-world challenges according to a paper released Thursday, Jan. 8, 2015, by the journal Science. (AP Photo/University of Alberta, John Ulan)

NEW YORK >> A computer program that taught itself to play poker has created nearly the best possible strategy for one version of the game, showing the value of techniques that may prove useful to help decision-making in medicine and other areas.

The program considered 24 trillion simulated poker hands per second for two months, probably playing more poker than all humanity has ever experienced, says Michael Bowling, who led the project.

The resulting strategy still won’t win every game because of bad luck in the cards. But over the long run — thousands of games — it won’t lose money; “We can go against the best (players) in the world and the humans are going to be the ones that lose money,” said Bowling, of the University of Alberta in Edmonton, Canada.

In any case, Bowling doubts the poker strategy will let anybody make a fortune on the game. It applies only to heads-up limit Texas hold ’em, a game which has waned in popularity over the past seven years or so, he said. Even online, the stakes tend to be small and “you’d be winning a few dollars, not raking in millions.”

He and colleagues report their work in a paper released Thursday by the journal Science.

Poker has long been a proving ground for the mathematical approach to decision-making called game theory, and Bowling’s paper introduces some techniques that could become useful in other situations. He is investigating the possibility of helping doctors determine proper insulin doses for diabetic patients, for example. Game theory has also been used to schedule security patrols, and it has implications for other areas like developing strategies for negotiations, auctions, cybersecurity, designing drugs and fighting disease pandemics.

Don't miss out on what's happening!

Stay in touch with top news, as it happens, conveniently in your email inbox. It's FREE!

By clicking to sign up, you agree to Star-Advertiser's and Google's Terms of Service and Privacy Policy. This form is protected by reCAPTCHA.

In the two-player game, each contestant creates a poker hand from two cards he is dealt face-down plus five other cards placed on the table face-up. Players place bets before the face-up cards are laid out, and then again as each card is revealed. The size of the wagers is fixed.

Bowling said the computer’s strategy is far too complicated for anybody to memorize, with about 1,000 times the amount of information in the English-language Wikipedia. But his university has created a website where people can ask it for advice and even play against it.

While scientists have created poker-playing programs for years, Bowling’s result stands out because it comes so close to “solving” its version of the game, which essentially means creating the optimal strategy. Poker is hard to solve because it involves imperfect information, where a player doesn’t know everything that has happened in the game he is playing — specifically, what cards the opponent has been dealt.

Tuomas Sandholm of Carnegie Mellon University in Pittsburgh, who didn’t participate in the new work, called Bowling’s results a landmark. He said it’s the first time that an imperfect-information game that is competitively played by people has been essentially solved.

___

Online:

Science: http://www.sciencemag.org

Poker program website: http://poker.srv.ualberta.ca

Comments are closed.