Reinforcement learning with blackjack