Monday, June 30, 2008

A tournament of social computers

It is where they don their armor and charge, in the name of 10,000 euros.
"They" are pseudo-code strategies for a strange game that the organizers claim will shed some light on the computational properties of social behavior.

The rules of Project Cultaptation's Social Learning Strategies Tournament are complex, but the game is simple. Your strategy has to invade a population of 100 agents over thousands of simulation runs, based on its accumulated score. You get points by pulling levers of an n-armed bandit machine. But you can only pull a lever that you know about. And that knowledge can come from two sources: either you try your random luck, or you look at someone pulling levers next to you. Seems like a no-brainer, but the devil is in the details of this clever setup. Observing someone may be error-prone. The scoring may change over time, maybe faster than the expected lifetime of the agents.

I'm sorry to say that the deadline for entries to the tournament is over; it was today. Playing with the simulation has been truly fun. Of course, I really doubt that my entry will win -- I hope it gets past the round-robin selection phase! But I'd like to make a prediction about the winner.

I think the winning strategy will be very simple -- 10 lines of elementary pseudo-code maximum -- and stochastic -- with in-built randomness. I might have my reasons for that, but the chief ones come from history (the strategy that won the Prisoner's Dilemma tournament back in the 1980s was a very simple tit-for-tat) and perhaps misplaced reasoning-by-analogy (there's randomness in the environment, therefore a good strategy should include some of that too).

And I think the results won't tell us much about the fundamental nature of social learning, that is, gleaming truths about the way the world works from observing the behavior of others.

Why not? For instance, because in the tournament, my actions don't influence anything about the world beyond my own ability to score and therefore reproduce (ha ha). Or because there is no competition for resources -- all agents can choose the same action and they will all receive the same scores, same as it would be if only one agent chose it. Or even because there is no timeline, no sequences in this game, no planning a few steps ahead even.

Still, I can't wait to see the results, and they will be fascinating for all the simplifications of the game. And if I'm wrong, if the winning strategy is complex, or deterministic, or biologically plausible, it'll be pretty exciting to reason why that should be the case, and how the tournament actually captured some fundamental property of learning in groups.