One Way or Another: Cortical Language Areas Flexibly Adapt Processing Strategies to Perceptual and Contextual Properties of Speech

Anastasia Klimovich-Gray; Ander Barrena; Eneko Agirre; Nicola Molinaro

doi:10.1093/cercor/bhab071

Anastasia Klimovich-Gray^*, Ander Barrena, Eneko Agirre, Nicola Molinaro

^*Corresponding author for this work

Psychology

Research output: Contribution to journal › Article › peer-review

2 Citations (Scopus)

Abstract

Cortical circuits rely on the temporal regularities of speech to optimize signal parsing for sound-to-meaning mapping. Bottom-up speech analysis is accelerated by top-down predictions about upcoming words. In everyday communications, however, listeners are regularly presented with challenging input - fluctuations of speech rate or semantic content. In this study, we asked how reducing speech temporal regularity affects its processing - parsing, phonological analysis, and ability to generate context-based predictions. To ensure that spoken sentences were natural and approximated semantic constraints of spontaneous speech we built a neural network to select stimuli from large corpora. We analyzed brain activity recorded with magnetoencephalography during sentence listening using evoked responses, speech-to-brain synchronization and representational similarity analysis. For normal speech theta band (6.5-8 Hz) speech-to-brain synchronization was increased and the left fronto-temporal areas generated stronger contextual predictions. The reverse was true for temporally irregular speech - weaker theta synchronization and reduced top-down effects. Interestingly, delta-band (0.5 Hz) speech tracking was greater when contextual/semantic predictions were lower or if speech was temporally jittered. We conclude that speech temporal regularity is relevant for (theta) syllabic tracking and robust semantic predictions while the joint support of temporal and contextual predictability reduces word and phrase-level cortical tracking (delta).

Original language	English
Pages (from-to)	4092-4103
Number of pages	12
Journal	Cerebral Cortex
Volume	31
Issue number	9
Early online date	7 Apr 2021
DOIs	https://doi.org/10.1093/cercor/bhab071
Publication status	Published - 1 Sept 2021

Bibliographical note

Funding Information:
The European Union's Horizon 2020 research and innovation programme (under the Marie Sklodowska-Curie grant agreement No 798971 awarded to A.K.G.); the Spanish Ministry of Science, Innovation and Universities (grant RTI2018-096311-BI00 to N.M.); the Agencia Estatal de Investigación (AEI), the Fondo Europeo de Desarrollo Regional (FEDER); the Basque Government (through the BERC 2018-2021 program), the Spanish State Research Agency through BCBL Severo Ochoa excellence accreditation (SEV-2015-0490), DeepText project (KK-2020/00088) and Ixa excellence research group (IT1343-19); the UPV/EHU (a postdoctoral grant ESPDOC18/101 to A.B.); the NVIDIA Corporation (to A.B. with the donation of a Titan V GPU used for this research).

Publisher Copyright:
© 2021 The Author(s). Published by Oxford University Press.

Keywords

coherence
MEG
neural network
phonological processing
representational similarity analysis
semantic predictions

Access to Document

10.1093/cercor/bhab071
Licence: Unspecified

Cite this

@article{0e1c15c2061b44c19ea038cdc3f0c4c5,

title = "One Way or Another: Cortical Language Areas Flexibly Adapt Processing Strategies to Perceptual and Contextual Properties of Speech",

abstract = "Cortical circuits rely on the temporal regularities of speech to optimize signal parsing for sound-to-meaning mapping. Bottom-up speech analysis is accelerated by top-down predictions about upcoming words. In everyday communications, however, listeners are regularly presented with challenging input - fluctuations of speech rate or semantic content. In this study, we asked how reducing speech temporal regularity affects its processing - parsing, phonological analysis, and ability to generate context-based predictions. To ensure that spoken sentences were natural and approximated semantic constraints of spontaneous speech we built a neural network to select stimuli from large corpora. We analyzed brain activity recorded with magnetoencephalography during sentence listening using evoked responses, speech-to-brain synchronization and representational similarity analysis. For normal speech theta band (6.5-8 Hz) speech-to-brain synchronization was increased and the left fronto-temporal areas generated stronger contextual predictions. The reverse was true for temporally irregular speech - weaker theta synchronization and reduced top-down effects. Interestingly, delta-band (0.5 Hz) speech tracking was greater when contextual/semantic predictions were lower or if speech was temporally jittered. We conclude that speech temporal regularity is relevant for (theta) syllabic tracking and robust semantic predictions while the joint support of temporal and contextual predictability reduces word and phrase-level cortical tracking (delta). ",

keywords = "coherence, MEG, neural network, phonological processing, representational similarity analysis, semantic predictions",

author = "Anastasia Klimovich-Gray and Ander Barrena and Eneko Agirre and Nicola Molinaro",

note = "Funding Information: The European Union's Horizon 2020 research and innovation programme (under the Marie Sklodowska-Curie grant agreement No 798971 awarded to A.K.G.); the Spanish Ministry of Science, Innovation and Universities (grant RTI2018-096311-BI00 to N.M.); the Agencia Estatal de Investigaci{\'o}n (AEI), the Fondo Europeo de Desarrollo Regional (FEDER); the Basque Government (through the BERC 2018-2021 program), the Spanish State Research Agency through BCBL Severo Ochoa excellence accreditation (SEV-2015-0490), DeepText project (KK-2020/00088) and Ixa excellence research group (IT1343-19); the UPV/EHU (a postdoctoral grant ESPDOC18/101 to A.B.); the NVIDIA Corporation (to A.B. with the donation of a Titan V GPU used for this research). Publisher Copyright: {\textcopyright} 2021 The Author(s). Published by Oxford University Press.",

year = "2021",

month = sep,

day = "1",

doi = "10.1093/cercor/bhab071",

language = "English",

volume = "31",

pages = "4092--4103",

journal = "Cerebral Cortex",

issn = "1047-3211",

publisher = "Oxford University Press",

number = "9",

}

TY - JOUR

T1 - One Way or Another

T2 - Cortical Language Areas Flexibly Adapt Processing Strategies to Perceptual and Contextual Properties of Speech

AU - Klimovich-Gray, Anastasia

AU - Barrena, Ander

AU - Agirre, Eneko

AU - Molinaro, Nicola

N1 - Funding Information: The European Union's Horizon 2020 research and innovation programme (under the Marie Sklodowska-Curie grant agreement No 798971 awarded to A.K.G.); the Spanish Ministry of Science, Innovation and Universities (grant RTI2018-096311-BI00 to N.M.); the Agencia Estatal de Investigación (AEI), the Fondo Europeo de Desarrollo Regional (FEDER); the Basque Government (through the BERC 2018-2021 program), the Spanish State Research Agency through BCBL Severo Ochoa excellence accreditation (SEV-2015-0490), DeepText project (KK-2020/00088) and Ixa excellence research group (IT1343-19); the UPV/EHU (a postdoctoral grant ESPDOC18/101 to A.B.); the NVIDIA Corporation (to A.B. with the donation of a Titan V GPU used for this research). Publisher Copyright: © 2021 The Author(s). Published by Oxford University Press.

PY - 2021/9/1

Y1 - 2021/9/1

N2 - Cortical circuits rely on the temporal regularities of speech to optimize signal parsing for sound-to-meaning mapping. Bottom-up speech analysis is accelerated by top-down predictions about upcoming words. In everyday communications, however, listeners are regularly presented with challenging input - fluctuations of speech rate or semantic content. In this study, we asked how reducing speech temporal regularity affects its processing - parsing, phonological analysis, and ability to generate context-based predictions. To ensure that spoken sentences were natural and approximated semantic constraints of spontaneous speech we built a neural network to select stimuli from large corpora. We analyzed brain activity recorded with magnetoencephalography during sentence listening using evoked responses, speech-to-brain synchronization and representational similarity analysis. For normal speech theta band (6.5-8 Hz) speech-to-brain synchronization was increased and the left fronto-temporal areas generated stronger contextual predictions. The reverse was true for temporally irregular speech - weaker theta synchronization and reduced top-down effects. Interestingly, delta-band (0.5 Hz) speech tracking was greater when contextual/semantic predictions were lower or if speech was temporally jittered. We conclude that speech temporal regularity is relevant for (theta) syllabic tracking and robust semantic predictions while the joint support of temporal and contextual predictability reduces word and phrase-level cortical tracking (delta).

AB - Cortical circuits rely on the temporal regularities of speech to optimize signal parsing for sound-to-meaning mapping. Bottom-up speech analysis is accelerated by top-down predictions about upcoming words. In everyday communications, however, listeners are regularly presented with challenging input - fluctuations of speech rate or semantic content. In this study, we asked how reducing speech temporal regularity affects its processing - parsing, phonological analysis, and ability to generate context-based predictions. To ensure that spoken sentences were natural and approximated semantic constraints of spontaneous speech we built a neural network to select stimuli from large corpora. We analyzed brain activity recorded with magnetoencephalography during sentence listening using evoked responses, speech-to-brain synchronization and representational similarity analysis. For normal speech theta band (6.5-8 Hz) speech-to-brain synchronization was increased and the left fronto-temporal areas generated stronger contextual predictions. The reverse was true for temporally irregular speech - weaker theta synchronization and reduced top-down effects. Interestingly, delta-band (0.5 Hz) speech tracking was greater when contextual/semantic predictions were lower or if speech was temporally jittered. We conclude that speech temporal regularity is relevant for (theta) syllabic tracking and robust semantic predictions while the joint support of temporal and contextual predictability reduces word and phrase-level cortical tracking (delta).

KW - coherence

KW - MEG

KW - neural network

KW - phonological processing

KW - representational similarity analysis

KW - semantic predictions

UR - http://www.scopus.com/inward/record.url?scp=85113272236&partnerID=8YFLogxK

U2 - 10.1093/cercor/bhab071

DO - 10.1093/cercor/bhab071

M3 - Article

C2 - 33825884

AN - SCOPUS:85113272236

SN - 1047-3211

VL - 31

SP - 4092

EP - 4103

JO - Cerebral Cortex

JF - Cerebral Cortex

IS - 9

ER -
