Multi-Task Policy Learning With Minimal Human Supervision