... Z. Li et al., 2021;Linzen & Leonard, 2018;Marvin & Linzen, 2018;Prasad et al., 2019;Tenney et al., 2018Tenney et al., , 2019). While this by no means is a complete representation of phrase meaning (Bender & Koller, 2020), using an LM as a linearizing transform has been shown to effectively predict natural language responses in both the cortex and cerebellum, with different neuroimaging techniques and stimulus presentation modalities (Abnar et al., 2019;Anderson et al., 2021;Goldstein et al., 2021;Jain et al., 2020;Jain & Huth, 2018;LeBel et al., 2021;Schrimpf et al., 2021;Toneva et al., 2020;Toneva & Wehbe, 2019;Wehbe, Murphy, et al., 2014;Wehbe, Vaswani, et al., 2014). Moreover, these models easily outperform earlier "word embedding" encoding models that use one static feature vector for each word in the stimulus and thus ignore the effects of context (Antonello et al., 2021;Jain & Huth, 2018). ...