Learning with and without human feedback