-
[show abstract]
[hide abstract]
ABSTRACT: Cortical activity underlying speech perception has been studied mostly by using isolated vowels with constant formant frequencies. Speech, however, is characterized by formant transitions whereby formant frequencies change as a function of time. We used magnetoencephalography (MEG) to investigate cortical activity elicited by isolated vowels and diphthongs containing formant transitions. Ten subjects were presented with two isolated vowels /a/ and /u/ and diphthongs /au/ and /ua/. Stimulus duration was 200 ms, and the diphthongs started and ended with a 50-ms constant-formant period and included a 100-ms linear transition period. Apart from studying the auditory N100m response, we examined subsequent brain activity in a 500-ms poststimulus time window, as the transitions were expected to elicit activity also in later stages of cognitive processing. All the stimuli elicited prominent N100m responses. Thereafter, both the isolated vowels and diphthongs elicited sustained brain activity lasting up to 500 ms. The present observations indicate that identification of the speech sounds as well as changes in their identity are reflected in the auditory N100m. Notably, the stimuli appeared to elicit left-hemispheric activity resembling the N400, typically obtained by using more complicated speech stimuli such as words and sentences.
Neurology & Clinical Neurophysiology 02/2004; 2004:91.
-
[show abstract]
[hide abstract]
ABSTRACT: We studied the cortical processing of spatial stimuli by magnetoencephalographic (MEG) measurements using broadband noise bursts presented from eight sound source directions in the horizontal plane. The stimuli were individually created for each subject by using three-dimensional (3D) sound techniques. The subjects carried out a behavioral task where their accuracy for localizing the 3D stimuli was established. We found that the auditory N100m response was sensitive to the sound source direction, exhibiting contralaterally more preponderant responses in both the left and the right hemisphere. Generally, responses were more prominent in the right hemisphere. The behavioral performance of the subjects correlated positively with N100m amplitude organization, showing that the dynamics of auditory cortex predict behavioral sound detection.
Neurology & Clinical Neurophysiology 02/2004; 2004:50.
-
[show abstract]
[hide abstract]
ABSTRACT: We argue that the mismatch negativity (MMN), elicited by rare auditory events, is generated by the same neural mechanisms as the N100, elicited by any audible stimulus. To date, the MMN has been considered to be a unique index of auditory sensory memory and a component which is functionally and spatially separate from the N100 response because: (i) MMN and N100 appear to have different generator locations; (ii) the MMN occurs too late to be an N100; (iii) the MMN, as opposed to the N100, is elicited by stimulus omissions. By utilizing neural modeling and EEG/MEG results, we show that the above reasoning relies on unwarranted assumptions and propose that the MMN is, essentially, an amplitude- and latency-modulated N100 response. This study offers a physiologically constrained and theoretically plausible framework whereby brain dynamics in terms of stimulus feature maps and their reorganization may be used to describe various memory- and learning-related effects of human auditory cognition.
Neurology & Clinical Neurophysiology 02/2004; 2004:20.
-
[show abstract]
[hide abstract]
ABSTRACT: We studied the cortical activation underlying perception of variations in speech fundamental frequency (F0) as indexed by the amplitude, latency and source location of the auditory N100m response registered with magnetoencephalography (MEG). Ten subjects were presented with Finnish vowels with either a constant or an ascending/descending F0. We found that the human auditory cortex is sensitive to these time-varying changes in the F0 of speech: vowels with a constant F0 elicited more prominent N100m responses than did vowels with ascending or descending changes in F0. These results suggest that the speech-related behavior of the N100m arises out of cortical sensitivity to variations in the F0 and its harmonics which underlie the perception of pitch and intonation. The present observations are interpreted in terms of the interrelatedness of speech production and perception.
Neurology & Clinical Neurophysiology 02/2004; 2004:49.
-
[show abstract]
[hide abstract]
ABSTRACT: The generation mechanism of stimulus-evoked electro- and magnetoencephalographic (EEG & MEG) responses has remained controversial. One view holds that evoked responses are independent components, additive to ongoing brain activity. The other view holds that evoked responses are generated via stimulus-induced phase reorganization of ongoing brain activity. This issue has been commonly addressed with signal processing techniques that assume a high level of stationarity (i.e., unchanging properties over time) of the measured signal. Here we used signal analysis methods suitable for analyzing non-stationary signals. We found that auditory stimulation leads to a large power increase of the poststimulus signal compared to prestimulus level. Linear superposition of the (time-domain) averaged response and the unaveraged prestimulus signal accounted for 90% of the power increase. Further, we found that auditory stimulation does not lead to a phase-coherent state of ongoing oscillations. Taken together our results show that auditory evoked responses are directly additive to ongoing oscillations and only 10% of the observed power increases are explained by non-phase-locked brain activity. When examining evoked brain activity with methods providing simultaneous frequency and time information, emphasizing temporal accuracy is likely to provide more accurate descriptions of non-stationary processes of the human brain.
Neurology & Clinical Neurophysiology 02/2004; 2004:45.
-
[show abstract]
[hide abstract]
ABSTRACT: Brain processes phase-locked to stimuli can be readily observed with electro- and magnetoencephalography (EEG & MEG, respectively) using stimulus-triggered averaging of the measured signal. The detection of non-phase-locked brain processes depends on the method used for analyzing the unaveraged data. Here we introduce a technique, partition-referenced moment (PRM) power spectrum, which uses established spectral estimation algorithms but yields a power spectrum with sharp, easily distinguishable peaks in an otherwise level spectrum even when the signal (such as EEG & MEG) is of the one-over-frequency-slope type. Employing this method and wavelet transforms, we show that transient auditory brain responses are followed by dispersed small-magnitude power reductions. Power reductions occurred around 400-600 ms and were specific to ongoing 10 Hz-oscillations. The PRM-method also indicated ongoing oscillations in the 15-30 Hz frequency range where power reductions occurred at around 200-400 ms. Thus, the presented methods enable the straightforward detection of ongoing brain oscillations and their association with event-related power changes.
Neurology & Clinical Neurophysiology 02/2004; 2004:104.
-
[show abstract]
[hide abstract]
ABSTRACT: Voiced speech is created by the fluctuating vocal folds generating the glottal pulseform. This excitation signal is the source of the speech fundamental frequency and its harmonic integer multiples. Periodic glottal excitation is required for the elicitation of speech-specific cortical processes indexed by the auditory N100m response. Here, we studied the cortical processing underlying the perception of the vowels /a/ and /u/ produced using normal and aperiodic phonation. The behavior of the N100m, registered with magnetoencephalography (MEG), was studied in 10 subjects. The amplitude and latency of the N100m as well as the center of gravity of the activated cortical areas varied as a function of stimulus periodicity. Further, the presence of glottal excitation had differential effects on the latency of the N100m elicited by the vowels /a/ and /u/. Thus, changes affecting the perceptual quality of speech signals without changing their phonetic content modify the dynamics of human auditory cortex.
Neurology & Clinical Neurophysiology 02/2004; 2004:103.
-
[show abstract]
[hide abstract]
ABSTRACT: We consider the neural dynamics underlying auditory streaming, the perceptual grouping of transient auditory events, by using neural modeling and magnetoencephalographic (MEG) measurements in humans. We demonstrate that spatial variations in the strength of feedback inhibition leads to differential amplitude modulation (AM) tuning resembling that found in animal models. In our model, neurons respond selectively to stimuli presented at different onset-to-onset interstimulus intervals (ISIs), and their summed activity (corresponding to the MEG signal) exhibits both transient and sustained responses (SRs) at fast ISIs. In MEG measurements utilizing 2-s trains of 50-ms stimuli presented at 0-1950 ms ISIs, we observed the transient N100m and SRs predicted by the model, with a prominent SR emerging for discrete stimuli at ISIs below 200 ms. Our results explain why, at fast stimulus rates, the amplitude of the auditory N100m appears to be strongly attenuated even though auditory cortex continues to respond vigorously to the stimuli. The results suggest that the longer and shorter forms of auditory sensory memory may be reflected in the N100m and the SR, respectively. As the emergence of the SR coincides with the stimuli being perceived as auditory streams, our study suggests that auditory sensory memory as indexed by transient and sustained cortical activity might underlie auditory scene analysis.
Neurology & Clinical Neurophysiology 02/2004; 2004:19.
-
[show abstract]
[hide abstract]
ABSTRACT: Introduction In their natural auditory environment, humans are faced with a highly complex array of stimulus sources of varying location, frequency, and intensity, usually overlapping in time. This threedimensional auditory space is to be analysed and represented by the human brain for the organism to function adequately. Slightly simplifying, the human auditory cortex can be described as a twodimensional array consisting of topographic representations for stimulus frequency and intensity. The question then arises how the human brain achieves the compression of the threedimensional auditory space onto a two-dimensional plane. At present, very little is known about this feature of the human brain. Non-invasive measurements utilizing electro- (EEG) and magnetoencephalography (MEG) have recently been used to study how stimulus features are represented in the human brain. Previous research has revealed how tone frequency and intensity are encoded in tonotopic [1-3] and amplitopic maps [4
12/2001;
-
[show abstract]
[hide abstract]
ABSTRACT: Introduction Selective attention refers to the ability to pick relevant stimuli embedded in a vast amount of information. Previous clinical and basic studies suggest that the dopamine system may be associated with this fundamental function [1,2], however, the exact neurochemical basis remains to be elucidated. The cerebral bases of selective attention, and other higher cognitive functions, have been associated with neural activity synchronized at around 40 Hz [3, 3]. For instance, the transient 40-Hz auditory response, detected with both electroencephalography (EEG) and magnetoencephalography (MEG) [5], is enhanced by selective attention [4] and gradually attenuated after long-term stimulation due to lessened vigilance [6]. Previous neuropharmacological studies, further, suggest that the transient 40-Hz response is attenuated by GABAA-agonist temazepine [7], and modulated by cholinergic muscarine receptors [8]. The dopaminergic modulation of transient 40-Hz auditory responses has not
12/2001;
-
[show abstract]
[hide abstract]
ABSTRACT: An essential feature of the human brain is the ability to extract information from temporally distributed events in the auditory environment. We addressed this temporal encoding ability by modelling how stimulus rate is represented in the auditory cortex. We propose that a cortical representation of stimulus rate can be achieved through the oscillatory properties of nerve cells. Using non-invasive brain measures, we tested the prediction of the model that multiple rebound responses (RRs) occur after the ending of stimulation. MEG recordings revealed successive RRs which originated in the same areas as the N1s elicited by the auditory stimuli at latencies predicted by the model. Our modelling and experimental results therefore provide evidence that the temporal structure of the auditory environment is decomposed in human auditory cortex by cells performing bandpass filtering on periodic input.
Neuroreport 04/2001; 12(3):573-7. · 1.66 Impact Factor
-
[show abstract]
[hide abstract]
ABSTRACT: The effects of stimulus duration on the elicitation and equivalent current dipole (ECD) localization of the auditory N400(m) were studied in two subject groups, either familiar or unfamiliar with Finnish language, using a sentence-processing paradigm with incongruent ending words of either short or long duration. Long-duration words elicited a broad response at around 400 ms, the generator location(s) of which could not be reliably determined using ECD estimation. In contrast, short-duration words elicited a sharp, strong-amplitude response at about 400 ms latency and it's source location could be reliably determined as being in the vicinity of auditory cortex. Subjects unfamiliar with the Finnish language elicited no response at the 400 ms range. Thus, the use of short-duration words appears to be an important prerequisite for the elicitation and localization of N400m. The differential amplitude behaviour of the N400m between the two subject groups further suggests that comprehension of the semantic content of the speech message is also required.
Neuroreport 03/2001; 12(2):249-53. · 1.66 Impact Factor
-
[show abstract]
[hide abstract]
ABSTRACT: Sound location processing in the human auditory cortex was studied with magnetoencephalography (MEG) by producing spatial stimuli using a modern stimulus generation methodology utilizing head-related transfer functions (HRTFs). The stimulus set comprised wideband noise bursts filtered through HRTFs in order to produce natural spatial sounds. Neuromagnetic responses for stimuli representing eight equally spaced sound source directions in the azimuthal plane were measured from 10 subjects. The most prominent response, the cortically generated N1m, was investigated above the left and right hemisphere. We found, firstly, that the HRTF-based stimuli presented from different directions elicited contralaterally prominent N1m responses. Secondly, we found that cortical activity reflecting the processing of spatial sound stimuli was more pronounced in the right than in the left hemisphere.
Neuroreport 06/2000; 11(7):1535-8. · 1.66 Impact Factor
-
[show abstract]
[hide abstract]
ABSTRACT: Learning to speak a new language requires the formation of recognition patterns for the speech sounds specific to the newly acquired language. The present study demonstrates the dynamic nature of cortical memory representations for phonemes in adults by using the mismatch negativity (MMN) event-related potential. We studied Hungarian and Finnish subjects, dividing the Hungarians into a naive (no knowledge of Finnish) and a fluent (in Finnish) group. We found that the MMN for a contrast between two Finnish phonemes was elicited in the fluent Hungarians but not in the naive Hungarians. This result indicates that the fluent Hungarians developed cortical memory representations for the Finnish phoneme system that enabled them to preattentively categorize phonemes specific to this language.
Psychophysiology 10/1999; 36(5):638-42. · 3.29 Impact Factor
-
[show abstract]
[hide abstract]
ABSTRACT: In response to the rapidly increasing interest in using human voice in cognitive brain research, a new method, semisynthetic speech generation (SSG), is presented for generation of speech stimuli.
The method synthesizes speech stimuli as a combination of purely artificial processes and processes that originate from the natural human speech production mechanism. SSG first estimates the source of speech, the glottal flow, from a natural utterance using an inverse filtering technique. The glottal flow obtained is then used as an excitation to an artificial digital filter that models the formant structure of speech.
SSG is superior to commercial voice synthesizers because it yields speech stimuli of a highly natural quality due to the contribution of the man-originating glottal excitation.
The artificial modelling of the vocal tract enables one to adjust the formant frequencies of the stimuli as desired, thus making SSG suitable for cognitive experiments using speech sounds as stimuli.
Clinical Neurophysiology 09/1999; 110(8):1329-33. · 3.41 Impact Factor
-
[show abstract]
[hide abstract]
ABSTRACT: To discern the role of the GABA(A) receptors in the generation and attentive modulation of the transient auditory 40-Hz response, the effects of the benzodiazepine temazepam (10 mg) were studied in 10 healthy social drinkers, using a double-blind placebo-controlled design. Three hundred Hertz standard and 330 Hz rare deviant tones were presented to the left, and 1000 Hz standards and 1100 Hz deviants to the right ear of the subjects. Subjects attended to a designated ear and were to detect deviants therein while ignoring tones to the other. Temazepam significantly suppressed the amplitude of the 40-Hz response, the effect being equal for attended and non-attended tone responses. This suggests involvement of GABA(A) receptors in transient auditory 40-Hz response generation, however, not in the attentive modulation of the 40-Hz response.
Neuroscience Letters 07/1999; 268(2):105-7. · 2.11 Impact Factor
-
[show abstract]
[hide abstract]
ABSTRACT: Transcranial magnetic stimulation (TMS) is accompanied with loud clicks that evoke auditory responses in the brain, confounding several types of TMS studies. We investigated the effects of these clicks with high-resolution EEG by applying TMS pulses at 3 magnitudes, with the coil placed either at 10 or 50 mm over the subjects' vertex and recording event-related potentials (ERPs). The clicks were found to elicit a positively displaced response at 150-250 ms post-TMS. Furthermore, clicks were found to interact with simultaneously presented auditory sinewave stimuli, resulting in an amplitude decrease in the auditory N1 response.
Clinical Neurophysiology 06/1999; 110(5):982-5. · 3.41 Impact Factor
-
Duodecim; lääketieteellinen aikakauskirja 02/1999; 115(5):560-4.
-
[show abstract]
[hide abstract]
ABSTRACT: 1. Recent experimental studies on the neurophysiological basis of auditory selective attention and sensory memory forming the sensory-data basis for tuning the selective-attention system in humans are reviewed. 2. The results demonstrate that the transient 40-Hz response is enhanced by selective attention, attenuated in the course of long-term stimulation, but is not affected by changes in auditory stimuli. 3. Therefore, the 40-Hz response seems to be closely related to selective and sustained attention, whereas it does not seem to be associated with passive attention, as it does not reflect the detection of changes in auditory stimuli. 4. Changes in auditory stimulation are registered by pre-attentive sensory memory, indexed by the mismatch negativity (MMN), a change-specific component of the event-related potentials (ERPs). By this time, the transient 40-Hz response has already terminated. The magnitude of stimulus change is reflected in MMN latency. These latency changes predict changes in attentive reaction time (RT). 5. Thus, the pre-attentive memory mechanism seems to govern attentive detection of changes in the auditory environment. 6. It is concluded that the transient 40-Hz response is related to active attention and MMN is related to passive attention.
Progress in Neuro-Psychopharmacology and Biological Psychiatry 08/1997; 21(5):751-71. · 3.25 Impact Factor
-
[show abstract]
[hide abstract]
ABSTRACT: Invasive microelectrode measurements have demonstrated binaural interaction effects of summation, occlusion, and suppression. Here we demonstrate these phenomena in humans using non-invasive long-latency cortical event-related potential (ERP) components N1 and mismatch negativity (MMN). Subjects were presented with monaural and binaural stimulus trains consisting of frequent standard stimuli and deviant stimuli deviating from the standard either in frequency, intensity, or duration. The binaural N1 was smaller than the monaural N1, MMN for the intensity change was larger with binaural than monaural stimulation, whereas for the frequency and duration change, the MMN amplitude remained unchanged. Thus, cortical binaural interactions reflected suppression in the N1, summation for the intensity MMN, and occlusion for the frequency and duration MMNs.
Neuroscience Letters 02/1997; 222(1):37-40. · 2.11 Impact Factor