Cách người Mỹ tiếp nhận nguyên âm tiếng Việt phân biệt bởi độ tròn môi
PERCEPTION OF ENGLISH LISTENERS ON
VIETNAMESE VOWELS CONTRASTING IN ROUNDING
Tran Thi Hai Yen*
Thai Nguyen University
Received: 28/07/2020; Revised: 20/09/2020; Accepted: 26/04/2021
Abstract: This study investigated into how English listeners from Midland U.S. perceived Vietnamese
back vowels contrasting in rounding. The words were produced by Northern and Central Vietnamese
speakers. The results showed that Vietnamese vowels [o] and [u] was assimilated to English [oʊ] (77%)
and [u] (69%) respectively. [ɤ] and [ɯ] were not assimilated to any English vowels. This suggested that
Midwest U.S. speakers tended to use vowel height to map between native vs. non-native vowels while
Southern U.S. speakers in the only previous study (Shport, 2019) tended to use vowel rounding. Dialect
effects were found for Northern Vietnamese [ɯ] was assimilated to [u] (61%) while Central Vietnamese
[ɯ] was perceived the most as [ʊ] (39%). Thus, the perception of non-native sounds can be affected by
the dialects of both speakers and listeners. The study suggests that English teachers should be aware of
this issue to accommodate appropriate teaching strategies in classroom.
Key words: Perception assimilation model, second language acquisition, second language perception,
Vietnamese vowels
1. Introduction
Although naïve listeners’ perception on non-native sounds has been well-studied (Levy, 2005;
Simon, Debaene&Herreweghe, 2015; Shport, 2019), how listeners perceive back vowels contrasting in
rounding is limited. Shport (2019) was the only first to study how English speakers perceived Vietnamese
back vowels contrasting in rounding. Shport (2019) recruited English speakers with Southern U.S. dialect
(SUSE) and a Vietnamese speaker with Central Vietnamese. But both Vietnamese and U.S. English have
many regional dialects with different vowel characteristics. Thus, more research into how listeners’ and
speakers’ dialects may affect this perception is needed. This study is a replication of Shport (2019) with
two contributions regarding this issue. First, the speakers in this study were from both Central and Northern
Vietnam. Kirby (2011) claimed that Northern Vietnamese [ɯ] was more mid-centralized than Central
Vietnamese [ɯ]. Hence, how Northern Vietnamese [ɯ] was perceived compared with Central Vietnamese
[ɯ] was a potential issue needing more in-depth research. Second, the listeners in this study were from the
Midland U.S. (Kansas). English speakers from the Southern U.S. exhibited Southern Vowel Shift: the
fronting of the back vowels /u/ and /oʊ/. Speakers from Midland U.S., on the other hand, did not necessarily
exhibit the same process (Clopper, Pisoni& Jong, 2005). Regarding this, how these two vowels might yield
different assimilation patternsis further explored in the current study.
2. Literature review
2.1. Perceptual Assimilation Model (PAM):
PAM (Best, 1995) predicts that naïve listeners perceive non-native contrasts based on
articulatory/acoustic similarity with the native phones. If the non-native phone is perceived as ‘similar’ to
L1 phones, the non-native phone will be ‘assimilated’ to the first language (L1) phone. As a result, listeners
are unable to perceive the similar non-native phone correctly. PAM predicts fiveassimilation patterns:
(1) Two-Categories (TC): each non-native phone is assimilated to a different L1 phone. For example,
Spanish listeners should be able to discriminate English /ɪ/ and /ʊ/ accurately because these vowels
are assimilated to Spanish /e/ and /o/ respectively. This assimilation pattern predicts excellent
discrimination.
(2) Category-Goodness (CG): two non-native phones are assimilated to the same L1 phone but how
similar (goodness-of-fit ratings) they are to the L1 phone are different. For example, Spanish
listeners should show moderate discrimination of English /ɪ/ and /eɪ/ because although both are
assimilated to Spanish /e/, English /eɪ/ is more similar to Spanish /e/ than English /ɪ/. This
assimilation pattern predicts moderate discrimination.
(3) Single-Category (SC): two non-native phones are assimilated to the same L1 phone and they are
equally different (goodness-of-fit ratings) from the L1 phone. For example, Spanish listeners
should have difficulty discriminating English /ʌ/ and /ɑ/ because both are assimilated to Spanish
/a/ and both are equally bad exemplars of Spanish /a/. This assimilation pattern predicts poor
discrimination.
(4) Uncategorized-Uncategorized (U-U): both non-native phones fall in the phonetic space but none of
them are assimilated to any L1 phone. For example, Thai vowels [ɯ]-[ɤ] are categorized as English
[ʌ] by only less than 70% of listeners so they are not assimilated to any English vowel. This
assimilation pattern predicts from poor to good discrimination.
(5) Uncategorized-Categorized (U-C): one non-native phoneis not assimilated to any L1 phone while
the other is. For example, Spanish listeners should not categorize English /u/ as any Spanish vowel,
and should be able to discriminate between English /u/ and any other English vowel. This
assimilation pattern predicts good discrimination.
Given the assimilation patterns above, the following order of accuracy discrimination is posited from
the easiest to the most difficult: TC = UC >CG>UU> SC. To test the predictions of PAM, Levy (2005),
Simon, Debaene & Herreweghe (2015) and Shport (2019) use two experiments: the first categorization
experiment serves as predicted assimilation patterns and the second discrimination experiment tests naïve
listeners’ discrimination of non-native phones.
Shport (2019) was the first study to examine how Southern U.S. English speakers (SUSE dialect)
perceived the rounding contrast in Vietnamese back vowel pairs [u]-[o], [ɯ]-[u], [ɯ]-[ɤ], and [o]-[ɤ]
according to PAM framework (Figure 1). The Vietnamese vowel pairs [o]-[ɤ], [u]-[ɯ] contrast in rounding;
[u]-[o], [ɯ]-[ɤ] contrast in height.
Figure 1. Vietnamese monothong centroids in a schematic F1-F2 space (Kirby, 2011)
In the first vowel categorization experiment, 49 English speakers listened to the four Vietnamese
vowels and chose one from seven English vowels /u oʊ ʊ ɔ ʌ ɝ ɑ/ that was the most similar to the
Vietnamese vowels they just heard. By calculating the percentage of response from participants,
assimilation patterns could be established. When the 50% threshold was used as categorization criteria, the
results of the first experiment basically suggested that Vietnamese [ɤ], and [u o] were perceived the most
as English [ʌ] (70%) and [oʊ] (58% for [u] and 59% for [o]) respectively. [ɯ] did not yield any dominant
categorization (22% for [u], 32% for [ʊ], 26% for [ʌ]). The results from the first experiment predicted the
following discrimination ability from the easiest to the most difficult: [o-ɤ] = [ɯ-ɤ] = [ɯ-u] > [u-o] as the
assimilation patterns were UC = TC > SC accordingly. These predictions were tested in the second vowel
discrimination experiment. In the discrimination experiment, English speakers made odd-man-out
judgements for words with the Vietnamese vowels [u], [o], [ɯ], [ɤ] presented in triads. In the experiment,
participants heard three Vietnamese words including two words with the same vowel and one with a
different vowel. The words with the same vowels were not the same productions: e.g. [tu1]-[tɯ1]-[tu2].
Then, participants selected the words with the different vowel compared with the other two. The second
experiment basically supported the predictions with the observed ranking of accuracy: [o-ɤ] (90%) = [ɯ-ɤ]
(89%) > [ɯ-u] (83%) = [u-o] (84%). The authors claimed that the unpredicted accuracy in [ɯ-u] contrast
actually could be predicted when the above-chance level (14.3%) categorization criterion was used. Overall,
PAM could predict the speech perception in naïve listeners quite accurately.
1.2.Research questions and hypotheses
This experiment aimed to find out how English speakers from the Midland (Midwest) U.S.
categorized Vietnamese vowels [u o ɯ ɤ] to English vowels [u oʊ ʊ ɔ ʌ ɝ ɑ]. The following research
questions and hypotheses were tested:
- How did English speakers from the Midwest U.S. categorize Vietnamese vowels into English vowels
regarding both dialects? With the Southern Vowel Shift mentioned above, there might be a difference
between this study and Shport (2019).
- How did English speakers from the Midwest U.S. categorize Vietnamese vowels into English vowels for
Central Vietnamese dialect? If the Midwest U.S. speakers used the same acoustic cues (with Southern U.S.
speakers) to map between native and non-native vowels, the assimilation patterns should be similar to
Shport (2019).
- How did English speakers from the Midwest U.S. categorize Vietnamese vowels into English vowels for
Northern Vietnamese dialect? Regarding the claim from Kirby (2011) that [ɯ] was more mid-centralized
for Northern Vietnamese dialect, this vowel may be perceived differently from Central Vietnamese dialect.
3. Methodology
3.1.Participants
For the production of Vietnamese stimuli, there were two female speakers who were native speakers
of Vietnamese. One speaker was 26 years old and the other was 27 years old. These speakers reported
Central and Northern Vietnamese as their dominant dialect.
For the vowel categorization experiment, participants were 21 (5 males, 16 females, mean age of
19.5 years old) native English speakers. They were all naïve to Vietnamese and were students at the
University of Kansas, U.S. Their dialect was defined as Midwest (Kansas) American English. The
difference between the number of speakers and listeners (2 vs. 21) may not cause any potential issue as
discussed in Shport (2019).
3.2. Stimuli
The stimuli in this experiment were similar with the experiment of Shport (2019). For each of the
Vietnamese vowel [u o ɯ ɤ], there were two words (Table 1 and 2). Among thetwo words, one of them
had level tone (e.g. tu) and the other had falling tone (e.g. tù). The words had CV structure in which the
first consonants were always the voiceless dental stop[t] and the vowel was one of the above. Each word
was produced five times by the speakers. The total number of the words for each dialect was forty: four
vowels x twotones x fiverepetitions. The total number of tokens for both dialects was eighty. For the
Vietnamese vowel production, the Northern Vietnamese speaker was recorded in Anechoic chamber using
a Marantz PMD 671 solid state recorder and an ElectroVoice N/D 767a microphone (16-bit resolution).
The Central Vietnamese speaker was recorded in a quiet room using a mobile phone. The speakers produced
the words with normal speaking rate.
3.3. Data analysis
To examine the Vietnamese vowel characteristics, duration, the fundamental (F0), first (F1), second
(F2) and third formant frequencies (F3) of the Vietnamese vowels were measured (Table 1 and 2) and
averaged across five repetitions. The frequencies were measured in the midpoints of the vowels. Although
both English and Vietnamese did not distinguish long vs. short vowels, vowel duration was also measured
because this could bias listeners in perceiving long vowels as tense and short vowels as lax. All the
measurements were done in Praat software (http://www.fon.hum.uva.nl/praat/). Particularly, the wave form
of each vowel was illustrated on Praat. Then, the researcher could measure its duration, F0, F1, F2 and F3
in the midpoint of the vowel. An example can be found in Figure 2.
Midpoint of the
vowel
F3
F0
F2
F1
Figure 2. The waveform of the word token ‘tô’ with marked F1, F2, F3 and F0
To compare the characteristics of English vowels vs. Vietnamese vowel, a vowel space is plotted in
section 4.2. (Figure 3). The data of English vowels were from Hillenbrand, Getty, Clark, and Wheeler
(1995).
To establish the categorization patterns, the response proportion of each English vowel was
calculated. This experiment used 50% threshold to determine assimilation patterns as in Shport (2019).
3.4. Data collection procedures
In the experiment, participants first completed a practice block with 8 Vietnamese words. The words
of the practice block were the same with the test items: CV structure with a voiceless dental stop [t]
preceding one of the vowels [u o ɯ ɤ]. For each of the CV pair, there was one word with the level tone and
one with the falling tone. The practice items were different productions from the test items and were
produced by the speaker from Northern Vietnam. After the practice trials, participants completed the test
block (80 items). In the experiment, participants first listened to a Vietnamese word, then they saw a list of
seven English words on the screen. The words were GOOSE, GOAT, HAWK, PUT, BUS, POT and
NURSE (the same with Shport, 2019). These words corresponded with the seven English vowel /u/, /oʊ/,
/ɔ/, /ʊ/, /ʌ/, /ɑ/ and /ɝ/ respectively. Participants were instructed to click the mouse to the one English word
that had the vowel the most similar with the vowel of the Vietnamese word they just heard. Finally,
participants rated the token for goodness-of-fit of the English word to see how good the Vietnamese vowel
as an example of the English vowel. The rate was from 1 (very bad), 2 (moderately bad), 3 (slightly bad),
4 (average), 5 (slightly good), 6 (moderately good),to 7(very good). After that, participants pressed ‘ok’
button to move on to the next trial. The inter-stimulus interval was one second. Participants had unlimited
time to provide their responses. All the stimuli were presented aurally via headphone on Paradigm. The
stimuli were randomized across participants to avoid order effect. Participants were also instructed to make
the best guess if they could not make a decision.
4. Findings and discussion
4.1. Vowel characteristics of Central and Northern Vietnamese vowels
The specific characteristics of the vowels produced by the two Vietnamese speakers are summarized
in Table 1 and 2 below. The characteristics included mean vowel duration, F0, F1, F2 and F3 measured in
the midpoints of the vowels. Some important generalizations of the characteristics are as follow. First, the
duration of the words with falling tone (451ms) was always longer than the words with level tone (370ms).
This effect was particularly strong in the Central speaker (456ms for words with falling tone and 334ms for
words with level tone) compared with the Northern speaker (445ms for words with falling tone and 406ms
for words with level tone). The mean durations of the words were shaded in the tables. This suggested that
participants may perceive the vowels with falling tone as English tense vowels and the vowels with level
tone as English lax vowels. Tone effect was not discussed in this paper because of its limited scope. Second,
mean F1, F2, F3 in the vowels produced by the Central speaker most of the time were higher than these in
the vowels produced by the Northern speaker. The plotting of F1-F2 in Figure 2 suggested that the
difference in F1 and F2 in the production of two dialects was not very remarkable with the exception of
[ɯ]. Central Vietnamese [ɯ] had the biggest difference in F1(82Hz) and F2(303Hz) compared with
Northern Vietnamese [ɯ].
Table 1. Vowel characteristics in eight Vietnamese words produced by Central Vietnamese speaker
Vowel
Word
Tone
F0 (Hz) Duration (ms) F1 (Hz) F2 (Hz) F3 (Hz)
Tu ‘knock up’ Level
293
218
255
296
220
258
275
226
251
276
218
247
289
444
366
322
470
396
331
436
384
396
483
439
367
459
413
406
491
449
633
453
543
639
527
583
909
975
942
1740
1705
1723
929
946
938
1271
1345
1308
3601
3399
3500
3682
3258
3470
3545
3224
3384
3470
3235
3353
[u]
Tù ‘prison’
Mean
Falling
Tư ‘four’
từ ‘word’
Mean
Level
Falling
[ɯ]
[o]
[ɤ]
Tô ‘bowl’
tồ non word
Mean
Level
Falling
Tơ ‘silk’
tờ ‘sheet’
Mean
Level
Falling
Table 2. Vowel characteristics in eight Vietnamese vowels produced by Northern Vietnamese speaker
Vowel
Word
Tu ‘knock up’
Tù ‘prison’
Mean
Tư ‘four’
từ ‘word’
Mean
Tô ‘bowl’
tồ non word
Mean
Tơ ‘silk’
Tờ ‘sheet’
Mean
Tone
Level
Falling
F0 (Hz)
297
219
258
297
223
260
300
214
257
290
207
249
Duration (ms)
400
F1 (Hz)
342
348
345
348
387
367
582
393
488
567
555
561
F2 (Hz)
916
839
F3 (Hz)
3041
2698
2870
3060
2892
2976
3054
2879
2967
3254
3176
3215
[u]
476
438
414
455
435
408
458
408
877
Level
Falling
1472
1368
1420
992
884
938
1400
1310
1355
[ɯ]
[o]
[ɤ]
Level
Falling
Level
Falling
413
452
432
The mean F1 and F2 of Vietnamese Northern and Central vowels are plotted in Figure 3. The vowels
[u], [ɤ] and [o] produced by two speakers were quite similar in the vowel space. The vowel that was the
most different from the two speakers was [ɯ]. However, this vowel was different in an unexpected way:
Kirby (2011) claimed that Northern [ɯ] was mid-centralized. Yet in this study, the Central speaker
produced more mid-centralized [ɯ] than the Northern speaker. Shport (2019) found that Central
Vietnamese [ɯ] was partly assimilated to English [ʊ] (32%), [ʌ] (26%) and [u] (22%). Thus, if acoustic
similarity between the native vs. non-native vowels could predict the assimilation patterns as PAM claims,
Central Vietnamese [ɯ] in this study may be perceived more as English lower [ʊ] or mid central [ʌ] than
high back [u] because it was more mid-centralized.
Figure 3. Mean F1 and F2 values of Northern and Central Vietnamese speakers
4.2. Assimilation patterns elicited by the speakers in both dialects
A comparison between Vietnamese and English vowels can be found in Figure 4 below. English
vowel characteristics were from Hillenbrand, Getty, Clark, and Wheeler (1995).The vowels within the same
circles were close in the vowel space.
Figure 4. Mean F1 and F2 values of English and Vietnamese vowels
Figure 5 shows the assimilation patterns of the listeners for both Vietnamese dialects.
38%
*77%
45%
*69%
21%
29%
/ɤ/
/u/
/o/
/ɯ/
Figure 5. Assimilation patterns of English speakers for both dialects
The results suggested that under the 50% threshold, listeners robustly assimilated Vietnamese [o] to
[oʊ] (77%) and [u] to [u] (69%); [ɤ] and [ɯ] were not categorized as any English vowel. Accordingly, the
assimilation patterns for [o], [u] was Categorized and [ɤ], [ɯ] was Uncategorized. The predicted
discrimination accuracy for the Vietnamese vowels from the easiest to the most difficult was: [o-ɤ] = [u-o]
= [ɯ-u] > [ɯ-ɤ] (UC = TC > UU). This was different from Shport (2019): [o-ɤ] = [ɯ-ɤ] = [ɯ-u] > [u-o].
One notable thing is that even though [ɤ] and [ɯ] were uncategorized, they were perceived considerably as
[ʌ] (38%) and [u] (45%). Under the above-chance level (14.3%) categorization criterion, these vowels were
still assimilated. Thisindicated a strong tendency of participants’ perception towards these vowels. This
data will be accounted for by vowel characteristics in 4.3. and 4.4.
4.3. Assimilation patterns by Central and Northern Vietnamese dialects
The assimilation patterns for both dialects were also found for Central Vietnamese dialect. The
specific proportion for this dialect can be found in Figure 6.
34 %
27 %
*61 %
*83%
42 %
29 %
39 %
*77 %
*55 %
*77
/o/
/o/
/ɤ/
/ɯ/
/u/
/ɤ/ /u/ /ɯ/
Figure 6. Assimilation patterns of English speakers by dialect
The results suggested that listeners had different assimilation patterns for Northern Vietnamese
dialect compared with overall and Central Vietnamese dialect. The first difference was in the vowel [ɯ].
Central Vietnamese [ɯ] was more mid-centralized than Northern Vietnamese [ɯ] and thus Central
Vietnamese [ɯ] was expected to be perceived more as [ʊ] or [ʌ] than [u]. This effect was indeed observed
clearly: Central Vietnamese [ɯ] was perceived the most as [ʊ] (39%). This proportion for [u] was lower:
29%. For Northern Vietnamese [ɯ] as less mid-centralized, this vowel was assimilated to English [u]: 61%
and the proportion for [ʊ] and [ʌ] was very low: 19% and 4% respectively. This suggested that listeners
tended to perceive Central Vietnamese [ɯ] as the lower [ʊ] and mid central [ʌ] than [u] and the opposite
was found for Northern Vietnamese [ɯ]. The second notable effect was the difference in the Vietnamese
[u]. Northern Vietnamese [u] was higher than Central Vietnamese [u] and this gap was quite big compared
with other vowels. This resulted differences in listeners’ perception: although Vietnamese [u] was
categorized as English [u] in both dialects, the percentage for Central Vietnamese was only 55%, very low
compared to Northern Vietnamese: 83%. This indicated that listeners robustly mapped Northern
Vietnamese [u] to English [u] using vowel height: the higher the vowel, the more it was perceived as English
[u]. Since English [u] has been more fronted, vowel backness may not serve as a reliable cue for the listeners
in this case.
5. Implications and conclusion:
This study aimed to find out how English speakers from the Midwest U.S. perceived Vietnamese
vowels contrasting in rounding [o ɤ u ɯ] based on PAM framework. The experiment was a replication of
Shport (2019) vowel categorization experiment with the English speakers from a different dialect and
stimuli from Central and Vietnamese dialect. Overall, the results suggested the followings:
First, the assimilation patterns of participants in general was [o] to [oʊ] (77%), [u] to [u] (69%); [ɤ]
and [ɯ] were not categorized as any English vowel. Accordingly, the assimilation patterns for [o], [u] was
Categorized and [ɤ], [ɯ] was Uncategorized. The predicted discrimination accuracy for the Vietnamese
vowels from the easiest to the most difficult is: [o-ɤ] = [u-o] = [ɯ-u] > [ɯ-ɤ] (UC = TC > UU). This was
different from Shport (2019): [o-ɤ] = [ɯ-ɤ] = [ɯ-u] > [u-o]. This suggested a difference between English
speakers from Southern U.S. in Shport (2019) and Midwest U.S. in the current experiment in which cue
they used to map between the native and non-native vowels. Specifically, Southern U.S. speakers prioritized
vowel rounding to map between Vietnamese [u] to English [oʊ]. Midwest U.S. speakers prioritized vowel
height to map between Vietnamese [u] to English [u]. Southern Vowel Shift may play a role in this
difference. Future studies are needed to examine further into this issue. This difference is summarized in
Table 3.
Table 3. Assimilation patterns between Southern U.S. and Midwest U.S. speakers
Southern U.S. speakers
Midwest U.S. speakers
(Current study)
[oʊ]
(Shport, 2019)
[oʊ]
[ʌ]
[oʊ]
[o]
[ɤ]
[u]
[ɯ]
Uncategorized
[u]
Uncategorized
Uncategorized
Second, Central Vietnamese [ɯ] was more mid-centralized than Northern Vietnamese [ɯ] and this
was unpredicted based on Kirby (2011). This resulted in a strong dialect effect in participants’ response
(Central Vietnamese [ɯ] was perceived the most as [ʊ] and Northern Vietnamese [ɯ] was assimilated to
[u]) and suggested that acoustic similarity between native and non-native vowels could predict the
assimilation patterns very accurately in this case. Another dialect effect to be found was the difference in
[u]. Although both Central and Northern Vietnamese [u] was categorized as English [u], the proportion for
Northern Vietnamese [u] was very high compared with Central Vietnamese. This was another indication
that English listeners prioritized vowel height to map between non-native and native vowels because
Northern Vietnamese [u] was higher than Central Vietnamese [u].
Third, the study demonstrated that languagestudents may perceive ‘similar’ L2 phones as their L1
phones (e.g. the short English [ɪ] may be assimilated to Vietnamese long [i:]). Given the aforementioned
findings, the study suggests the following implications for language teachers:
First, teachers should be aware of this outcome and help their students distinguish the very subtle
differences between these ‘similar’ sounds. They can use different teaching techniques to help them
perceive them correctly (e.g. listen to minimal pairs, listen and guess the sounds, listen and check the
pronunciation in dictionary, etc.). Teachers need to emphasize the differences and ensure that they can
perceive the sounds correctly through practical techniques, not only through theoretical instructions. Indeed,
it is the first step for correct production.
Second, to help students produce correctly similar sounds, teachers can ask students to do different
speaking tasks such as reading out loud, recording their own voice, practicing minimal pairs, reading after
the recording, etc. Teachers need to keep in mind that correct perception comes before correct production
and so their teaching techniques should follow this sequence. Lastly, teachers need to be patient if learners
keep substituting L2 sounds with ‘similar’ L1 sounds because it is a natural phenomenon. To
perceive/produce these sounds correctly, students need to devote lots of efforts into the process. Also,
teachers should not insist that students need to produce L2 sounds perfectly at the beginning. Teachers
should instruct them to realize their mistakes over time by comparing their production with model native
speakers.
References
Best, C.T. (1995). A direct realist review of cross-language speech perception. In W. Strange (Ed). Speech
perception and linguistic experience. Issues in cross-language research (pp.171-204). Timonium, MD:
York Press.
Clopper, Cynthia G., Pisoni, D.B., & Kenneth, D.J. (2005). Acoustic Characteristics of the vowel systems
of six regional varieties of American English. The Journal of the Acoustical Society of America,118(3),
1661-1176.
Erika, S.L. (2009). On the assimilation-discrimination relationship in American English adults’ French
vowel learning. The Journal of the Acoustical Society of America, 126(5), 2670-2682.
Hillenbrand, J., Getty, L.A., Clark M.J., and Wheeler, K. (1995). Acoustic characteristics of American
English vowels. Journal of the Acoustical Society of America,97(5), 3099-3111.
Jacewicz, E., Fox., R.A., & Salmons, J. (2011). Cross-generational vowel change in American.
Kirby, J.P. (2011). Illustration of the IPA: Vietnamese (Hanoi Vietnamese). Journal of the International
Phonetic Association, 11(3), 381-392.
Labov, W., Ash, S., & Boberg, C. (2006). Atlas of North American English: Phonetics, phonology, and
sound change. New York: Mouton de Gruyter.
Pham, H.T. (2002). Vietnamese tone: Tone is not pitch. (Doctoral dissertation). Retrieved from
accountid=14556.
Shport, I.A. (2019). Perception of Vietnamese back vowels contrasting in rounding by English listeners.
Journal of Phonetics,73, 8-23.
Simon, E., Debaene, M., & Van Herreweghe, M. (2015). The effect of L1 regional variation on the
perception and production of standard L1 and L2 vowels. Folia Linguistica, 49(2), 521–553.
CÁCH NGƯỜI MỸ TIẾP NHẬN NGUYÊN ÂM TIẾNG VIỆT PHÂN
BIỆT BỞI ĐỘ TRÒN MÔI
Tóm tắt: Nghiên cứu sau đây tìm hiểu cách người nghe đến từ Trung Mỹ tiếp nhận các nguyên âm được
phân biệt bởi độ tròn môi trong tiếng Việt. Trong nghiên cứu, người phát âm nói tiếng Trung và Bắc
Việt Nam. Kết quả cho thấy các âm [o] [u] tiếng Việt đã được người nghe đồng hóa với các âm [oʊ]
(77%) [u] (69%). Các âm [ɤ] [ɯ] không bị đồng hóa với nguyên âm tiếng Anh nào. Kết quả gợi ý khi
nghe các âm ngoại lai, người nghe từ Trung Mỹ có xu hướng dùng độ cao để so sánh với âm trong tiếng
mẹ đẻ. Tuy nhiên người nghe từ Nam Mỹ trong nghiên cứu của Shport (2019) lại dùng độ tròn môi.
Ảnh hưởng của phương ngữ cũng được ghi nhận khi âm [ɯ] trong tiếng Bắc được đồng hóa sang [u]
(61%) nhưng âm [ɯ] trong tiếng Nam lại được tiếp nhận nhiều nhất là [ʊ] (39%). Vì vậy, sự tiếp nhận
âm ngoại lai có thể được ảnh hưởng bởi cả phương ngữ của người nghe và người nói. Nghiên cứu cho
thấy giáo viên tiếng Anh nên nhận thức được điều này để có thể dùng những chiến lược thích hợp cho
lớp học.
Từkhóa: Mô hình tiếp nhận đồng hóa, đắc thụ ngôn ngữ hai, tiếp nhận ngôn ngữ hai, nguyên âm trong
tiếng Việt
Bạn đang xem tài liệu "Cách người Mỹ tiếp nhận nguyên âm tiếng Việt phân biệt bởi độ tròn môi", để tải tài liệu gốc về máy hãy click vào nút Download ở trên
File đính kèm:
- cach_nguoi_my_tiep_nhan_nguyen_am_tieng_viet_phan_biet_boi_d.pdf