Research Review Deep learning as a mixed convex-combinatorial optimization problem Off-policy evaluation for slate recommendation ...