How do we model learning at scale?