Notebook

Solving the pay for prediction challenge¶

This document is a collaboratively developed IPython Notebook to solve the "Pay for prediction" challenge.

Hit Esc to see an overview of the pages¶

Outline¶

Overview of the challenge
The simulator
The solutions
Factoring the problem

Overview¶

[from the competition website]

Strategy competition¶

What is the best strategy to reduce the risk of disaster as a humanitarian worker? Red Cross Red Crescent workers must balance their time and resources between long-term programming such as hygiene promotion, preparing right before a disaster happens, and responding to disasters when they happen. Fortunately, there are many tools to help with this, including science-based forecasts that can help people anticipate a disaster. While forecasts can be confusing, making a strategy for how to act based on a forecast can help humanitarian workers best manage their time and resources.

The Red Cross Red Crescent Climate Centre is launching a competition to find the best strategy for how to manage disaster risk in the game Paying for Predictions. This game demonstrates the many responsibilities of a humanitarian worker, from long-term preparedness to short-term anticipation of a disaster. In this global competition, we would like you to submit a strategy which you think will make a player win this game more often than other strategies.

How does this work?¶

You submit your strategy by Thursday, February 28th, 2013.

A computer programme will use your strategy as if you were a person playing this game against other people over and over again. After many games are played, the programme will show which strategy wins most often.

We will announce the winners in March 2013.

Competition rules¶

Origin of the rules

Game License: http://creativecommons.org/licenses/by-nc/3.0/

The basic parameters of how the game will be simulated by computers for this competition:

10 teams of 3 people are playing this game (30 people total)
Players are unable to communicate with each other, but they are able to see what other players

on their team are doing. They are not able to see what players are doing who are not on their team.

There are 10 rounds in this game.

Game setup¶

Each player receives 10 beans (resources), and one six-sided die which represent the local rainfall of his/her area. Each team receives a cup and one six-sided die which represents the regional rainfall of their zone.

WINNING¶

The individual WINNER is the person with the most beans.
The team WINNER is the team with fewest total humanitarian crises. If there is a tie the team with most total beans combined is the team winner.

Simulator¶

The simulator is setup with some default strategies. These are for illustrative purposes.

In [1]:

import numpy as np

n_teams = 10
n_persons_per_team = 3
n_beans = 10
n_rounds = 10
n_die_change = 7
target_rain = 10
penalty = 4

def initialize():
    beans = n_beans * np.ones((n_teams, n_persons_per_team))
    forecast_teams = np.ones((n_teams)) # receive regional forecast
    drr_teams = np.ones((n_teams))  # have disaster risk reduction
    return beans, forecast_teams, drr_teams

In [2]:

def get_forecast_bids(beans):
    """ Defines how each person or team will bid for regional forecast

    Example
    -------

    return np.random.randint(0, np.max(beans) * .4, size=beans.shape)
    """
    return np.ones(beans.shape)

def get_drr_bids(beans):
    """ Defines how each person or team will bid for disaster risk reduction

    Example
    -------

    bids = np.zeros(beans.shape)
    for i in range(beans.shape[0]):
        for j in range(beans.shape[1]):
            bids[i, j] = np.random.randint(0, beans[i, j] * .2)
    """
    bids = np.ones(beans.shape)
    return bids

Gameplay: Stage 1 perform bids¶

In [3]:

beans, forecast_teams, drr_teams = initialize()

# perform forecast bids
forecast_bids = get_forecast_bids(beans)
forecast_team_bids = np.sum(forecast_bids, axis=1)
sort_index = np.argsort(forecast_team_bids)
forecast_teams[sort_index[:n_teams/2]] = 0

# Winning teams pay their beans
beans = beans - (forecast_teams[:, None] * forecast_bids)
print beans.T
print forecast_teams

[[ 10.  10.  10.  10.  10.   9.   9.   9.   9.   9.]
 [ 10.  10.  10.  10.  10.   9.   9.   9.   9.   9.]
 [ 10.  10.  10.  10.  10.   9.   9.   9.   9.   9.]]
[ 0.  0.  0.  0.  0.  1.  1.  1.  1.  1.]

In [4]:

# perform drr bids
drr_bids = get_drr_bids(beans)
drr_team_bids = np.sum(drr_bids, axis=1)
sort_index = np.argsort(drr_team_bids)
drr_teams[sort_index[:-2]] = 0

# Winning teams pay their beans
beans = beans - (drr_teams[:, None] * drr_bids)
print beans.T
print forecast_teams
print drr_teams

[[ 10.  10.  10.  10.  10.   9.   9.   9.   8.   8.]
 [ 10.  10.  10.  10.  10.   9.   9.   9.   8.   8.]
 [ 10.  10.  10.  10.  10.   9.   9.   9.   8.   8.]]
[ 0.  0.  0.  0.  0.  1.  1.  1.  1.  1.]
[ 0.  0.  0.  0.  0.  0.  0.  0.  1.  1.]

Round¶

In [5]:

def get_insurance_payments(regional_predictions, drr_teams, beans, round_idx):
    # determine the likelihood of a flood
    likelihood = (7 - (target_rain - regional_predictions)) / 6
    # if likelihood > .2 pay, or if you didn't have a prediction pay one bean
    payments = ((likelihood > .2) + (regional_predictions < 1))[:, None] * np.ones(beans.shape)
    return (payments * (beans > 0)).astype(int)

In [6]:

def generate_rainfall(n_sides):
    regional_rainfall = np.random.randint(1, n_sides, size=(n_teams))
    local_rainfall = np.random.randint(1, 7, size=(n_teams, n_persons_per_team))
    total_rainfall = local_rainfall + regional_rainfall[:, None]
    flooded = (total_rainfall >= target_rain).astype(np.int)
    return regional_rainfall, flooded

def adjust_beans(beans, payments, flooded, round_idx, drr_teams):
    if round_idx < 3:
        drr_penalty = 4
    else:
        drr_penalty = 2
    penalized = np.maximum(flooded - payments, 0)
    beans_to_remove = drr_penalty * penalized * (drr_teams[:, None] == 1) + penalty * penalized * (drr_teams[:, None] == 0)
    already_in_crisis = beans * (beans < 0)
    beans[already_in_crisis < 0] = 0
    beans = beans - payments - beans_to_remove
    beans_joining_crisis = beans < 0 
    beans = beans * (beans > 0)
    in_crisis = already_in_crisis - beans_joining_crisis
    return beans + in_crisis

Simulate¶

In [7]:

print beans.T
for turn in range(n_rounds):
    n_sides = 6
    if turn == 6: # 7th round
        n_sides = 8
    regional_rainfall, flooded = generate_rainfall(n_sides=n_sides)
    payments = get_insurance_payments(regional_rainfall * forecast_teams, drr_teams, beans, turn + 1)
    beans = adjust_beans(beans.copy(), payments, flooded, turn + 1, drr_teams)
    if turn % 2 == 0:
        print turn + 1, flooded.T - payments.T
print beans.T

[[ 10.  10.  10.  10.  10.   9.   9.   9.   8.   8.]
 [ 10.  10.  10.  10.  10.   9.   9.   9.   8.   8.]
 [ 10.  10.  10.  10.  10.   9.   9.   9.   8.   8.]]
1 [[-1 -1 -1  0  0  0  0  0  0  0]
 [-1 -1 -1 -1 -1  0 -1 -1  0  0]
 [-1 -1 -1 -1 -1  0 -1 -1  0  0]]
3 [[-1 -1 -1  0 -1  0  0  0  0  0]
 [ 0  0 -1  0 -1 -1  0  0 -1  0]
 [ 0 -1 -1 -1  0 -1  0  0 -1  0]]
5 [[-1 -1 -1 -1 -1 -1  0  0  0  0]
 [ 0 -1 -1  0 -1 -1  0  0  0  0]
 [-1 -1 -1 -1 -1 -1  0  0  0  0]]
7 [[ 0 -1 -1 -1 -1  0 -1  0  0  0]
 [-1 -1 -1 -1 -1  0  0  0  1  0]
 [ 0 -1 -1 -1 -1  0 -1  0  0  0]]
9 [[-1 -1 -1  0 -1 -1  0  0  0  0]
 [-1 -1 -1  0 -1 -1  0  0  0 -1]
 [-1 -1 -1 -1 -1 -1  0  0  0 -1]]
[[ 0.  0.  0.  0.  0.  5.  6.  7.  3.  4.]
 [ 0.  0.  0.  0.  0.  5.  6.  7.  3.  4.]
 [ 0.  0.  0.  0.  0.  5.  6.  7.  5.  4.]]

Solutions¶

Example from challenge guidelines

# This is a complete example submission for the "Paying for Predictions" game.
# Blank lines or lines beginning with "#" are ignored.
# Bids:
Bid 1 for the forecast.
Bid 3 beans for DRR.
# Conditions:
If I won the forecast and the dice is less than 5 and the beans remaining are more than 7, then take 
early action.
If I have forecast and the dice rolls more than the number of rounds remaining, then take early 
action.
If I don't have DRR and the dice rolls more than 5 and the rounds played are more than 6, then take 
early action.
Else, take no early action.

Solution from the strategies chosen for the simulation¶

# This is a complete example submission for the "Paying for Predictions" game.
# Blank lines or lines beginning with "#" are ignored.
# Bids:
Bid 1 for the forecast.
Bid 1 beans for DRR.
# Conditions:
If I have forecast and the rolled dice is more than 4 then take early action.
If I don't have forecast then take early action.
By default take no action.

Factoring the problem¶

The problem can be factored into five independent strategies: one for each of the four possible bidding outcomes and one for placing the initial bid.

Strategy 1: winning no bids¶

In each round, the only choice is whether to buy insurance for one bean.

For rounds 1-6, the expected cost of no insurance is $4\left(\frac{7}{36}\right) = \frac{7}{9} < 1$, so we should never buy insurance.

For rounds 7-10, the expected cost of no insurance is $4\left(\frac{15}{48}\right) = \frac{5}{4} > 1$, so we should always buy insurance.

The expected final bean count (neglecting running out of beans) is $10-6\left(\frac{7}{9}\right)-4 = \frac{4}{3}$.

Strategy 2: DRR only¶

For rounds 1-2, DRR is not available, so strategy 1 applies.

For rounds 3-6, the expected cost of no insurance is $2\left(\frac{7}{36}\right) = \frac{7}{18} < 1$, so we should never buy insurance.

For rounds 7-10, the expected cost of no insurance is $2\left(\frac{15}{48}\right) = \frac{5}{8} < 1$, so we should never buy insurance.

I.e., the net strategy is to never buy insurance.

The expected final bean count (neglecting running out of beans) is $10-2\left(\frac{7}{9}\right)-4\left(\frac{7}{18}\right)-4\left(\frac{5}{8}\right) \approx 4.39$.

This suggests that DDR is worth slightly more than three beans.

Strategy 3: Forecast only¶

Given a regional forecast $\leq 4$, the expected cost of no insurance is $\leq \frac{2}{3}$, so we should never buy insurance for these cases.

Given a regional forecast $\geq 5$, the expected cost of no insurance is $\geq \frac{4}{3}$, so we should always buy insurance for these cases.

The expected final bean count is: $10-6\left(\frac{1}{6}\frac{2}{3}+\frac{1}{3}\right)-4\left(\frac{1}{8}\frac{2}{3}+\frac{1}{2}\right) = 5$.

This suggests that the forecast is worth slightly less than 4 beans.

Strategy 4: DRR and Forecast¶

Again, rounds 1-2 match strategy 3.

For the DRR rounds:

Given a regional forecast $\leq 5$, the expected cost of no insurance is $\leq \frac{2}{3}$, so we should never buy insurance.
Given a regional forecast $\geq 7$, the expected cost of no insurance is $\geq \frac{4}{3}$, so we should always buy insurance.
Given a regional forecast of 6, the expected cost of no insurance is 1, so it shouldn't matter if we buy insurance. Since there is no way to aquire beans, it is probably best to not buy insurance for this case. This also implies that we should never buy insurance for rounds 3-6, independent of forecast.

The expected final bean count is: $10 - 2\left(\frac{1}{6}\frac{2}{3}+\frac{1}{3}\right)-4\left(\frac{1}{6}\frac{1}{3}+\frac{1}{6}\frac{2}{3}+\frac{1}{6}\right)-4\left(\frac{1}{8}\frac{1}{3}+\frac{1}{8}\frac{2}{3}+\frac{3}{8}\right) \approx 5.77$

This suggests that, given the forecast, DRR is worth less than one bean.

Strategy 5: Initial bid¶

Because there is no tracking of opponents across games, there is no reason to change the opening bid between games (as there would be in, e.g., the tit-for-tat strategy in the Prisoner's Dilemma). Therefore, the initial bid strategy is a choice among the $11^2 = 121$ possible opening bids. Although this is the only stage of the game with any interaction with the other players, I think that it is still not an interactive choice. That is, independently for each of the two bids, it should always be optimal to bid the "fair market value" of the resource (i.e., the difference in expected final beans with or without the resource). Winning the resource by overbidding will tend to put our final score below an optimal play without the resource. Underbidding may be a reasonable strategy as losing the bidding is free. I think that the influence of bids from teammates can be disregarded.

From the expectation values above, neither resource should ever be worth more than three beans; so, the interesting space to explore has $4^2 = 16$ choices.

Solution¶

# This is a complete example submission for the "Paying for Predictions" game.
# Blank lines or lines beginning with "#" are ignored.
# Bids:
Bid 2 for the forecast.
Bid 2 beans for DRR.
# Conditions:
If I have neither DRR nor forecast and round greater than 6 then take early action.
If I have forecast and the rolled dice is more than 4 then take early action.
If I have DRR and forecast and round greater than 6 and dice greater than 6 then take early action.
By default take no action.

In [8]:

!/software/challenges/nbconvert/nbconvert.py -f reveal /software/challenges/payforpredictions/payforpredictions.ipynb