Borrowing From the Future: Addressing Double Sampling in Model-free Control