
Rabie R. answered 06/27/20
PhD in Mathematics with 5+ years of Teaching Experience
Hi James,
Calculate the expected reward in all scenarios:
A) you get 1.1 with 5/6 probability and 0 with 1/6 probability. Expected reward is 1.1*5/6 + 0*1/6 = 0.916666
B) you get 9 with 1/6 probability and 0 with 5/6 probability. Expected reward is 9*1/6 + 0*5/6 = 1.5
C) you get 2.6 with 1/2 probability and 0 with 1/2 probability. Expected reward is 2.6*1/2 + 0*1/2 = 1.3
D) you get 3.7 with 1/3 probability and 0 with 2/3 probability. Expected reward is 3.7*1/3 + 0*2/3 = 1.233333
To maximize expected reward, I would go for B. Second option would be C.
This strategy is based on maximizing expected reward, and doesn't take into account the variance of the reward.
James H.
Thank you so much! Very helpful06/27/20