OPTIMISTIC SAMPLING STRATEGY FOR DATA-EFFICIENT REINFORCEMENT LEARNING