How about this:
Two people give the odds for the result of a coin flip of non-weighted coins.
Person A: Heads = 50%, Tails = 50%
Person B: Heads = 75%, Tails = 25%
The result of the coin flip ends up being Heads. Which person had the more accurate model? Did Person A get something wrong?
But in this hypothetical scenario of explicitly unweighted coins, Person A was entirely correct in the odds they gave. There's nothing to improve.