Comments
Description
Transcript
Document 1930241
UNIVERSITY OF STOCKHOLM INSTITUTE OF LINGUISTICS PERILUS VIII PERIL US mainly contains reports on current experimental work carried out in the Phonetics Laboratory at the Unhfersity of Stockholm. Copies are available from the Institute of Linguistics, University of Stockholm, S-106 9 1 Stockholm, Sweden. This issue of PERILUS was edited by Olle Engstrand, Mats Dufberg and Johan Stark. 11 Institute of Linguistics University of Stockholm S-106 91 Stockholm © 1988 F6rfattarna ISSN 0282-6690 ill THE PHONETICS LABORATORY GROUP Ann-Marie Alme Ulf Andersson Leslie Bailey 1 Robert Bannert Aina Bigestans Peter Branderud Una Cunningham-Andersson Hassan Djamshidpey Mats Dufberg aile Engstrand 2 Garda Ericsson Anders Eriksson Ake Floren 3 Eva Holmberg Diana Krull Francisco Lacerda Ingrid Landber � Bjorn Lindblom Rolf Lindgren James Lubker 6 Bertil Lyberg 5 Robert McAllister Lennart Nord 7 Lennart Nordstrand 8 Liselotte Roug Richard Schulman Johan Stark Hartmut TraunmOlier Eva Oberg 1 Visiting from Department of Linguistics, University of Delaware, Newark, Delaware, USA 2 Also Department of Phoniatrics, University Hospital, Link6ping 3 Also Research Laboratory of Electronics, MIT, Cambridge, MA, USA 4 Also Department of Linguistics, University of Texas at Austin, Austin, Texas, USA 5 Also Department of Communication Science and Disorders, University of Vermont, Burlington, Vermont, USA 6 Also Technology Department, Swedish Telecom, Stockholm 7 Also Department of Speech Communication and Music Acoustics, Royal l nstitute of Technology (KTH), Stockholm 8 Also AB Consonant, Uppsala IV ACKNOWLEDGMENTS The current phonetic experimental research at the Institute of Linguistics is sponsored in part by the following sources: The Swedish Board for Planning and Coordination of Research (FRN) The Swedish Council for Research in the Humanities and Social Sciences (HSFR) The Swedish Natural Sciences Reseach Council (NFR) The Bank of Sweden Tercentenary Foundation (RJ) The Swedish National Board for Technical Development (STU) The Swedish Cancer Society, The Cancer Foundation The Swedish Institute AI/manna arvsfonden v PREVIOUS ISSUES OF PERILUS PERILUS 11978-1979 1. INTRODUCTION Bjorn Lindblom and James Lubker 2. SOME ISSUES IN RESEARCH ON THE PERCEPTION OF STEADY-STATE VOWELS Vowel identification and spectral slope Eva Agelfors and Mary Graslund Why does [Q] change to [0] when Fo is increased? Interplay between harmonic structure and formant frequency in the perception of vowel guality Ake Floren Analysis and prediction of difference limen data for formant frequencies Lennart Nord and Eva Sventelius Vowel identification as a function of increasing fundamental frequency Elisabeth Tenenholtz Essentials of a psychoacoustic model of spectral matching Hartmut TraunmOlier 3. ON THE PERCEPTUAL ROLE OF DYNAMIC FEATURES IN THE SPEECH SIGNAL Interaction between spectral and durational cues in Swedish vowel contrasts Anette Bishop and Gunilla Edlund On the distribution of [h] in the languages of the world: is the rarity of syllable final [h] due to an asymmetry of backward and forward masking? Eva Holmberg and Alan Gibson On the function of formant transitions I. Formant frequency target vs. rate of change in vowel identification II. Perception of steady vs. dynamic vowel sounds in noise Karin Holmgren vi Artificially clipped syllables and the role of formant transitions in consonant perception Hartmut TraunmOlier 4. PROSODY AND TOP DOWN PROCESSING The importance of timing and fundamental frequency contour information in the perception of prosodic categories Bertil Lyberg Speech perception in noise and the evaluation of language proficiency Alan C. Sheats 5. BLOD - A BLOCK DIAGRAM SIMULATOR Peter Branderud PERILUS 111979-1980 Introduction James Lubker A study of anticipatory labial coarticulation in the speech of children Berlin, Ingrid Landberg and Lilian Persson Asa Rapid reproduction of vowel-vowel sequences by children Ake Floren Production of bite-block vowels by children Alan Gibson and Lorrane McPhearson Laryngeal airway resistance as a function of phonation type Eva Holmberg The declination effect in Swedish Diana Krull and Siv Wandeback Compensatory articulation by deaf speakers Richard Schulman Neural and mechanical response time in the speech of cerebral palsied subjects Elisabeth Tenenholtz An acoustic investigation of production of plosives by cleft palate speakers Garda Ericsson vii PERILUS III 1982-1983 Introduction Bjorn Lindblom Elicitation and perceptual judgement of disfluency and stuttering Ann-Marie Alms Intelligibility vs. redundancy - conditions of dependency Sheri Hunnicut The role of vowel context on the perception of place of articulation for stops Diana Krull Vowel categorization by the bilingual listener Richard Schulman Comprehension of foreign accents. (A Cryptic investigation.) Richard Schulman and Maria Wingstedt Syntetiskt tal som hjalpmedel vid korrektion av dovas tal Anne-Marie Oster PERILUS IV 1984-1985 Introduction Bjorn Lindblom Labial coarticulation in stutterers and normal speakers Ann-Marie Alms Movetrack Peter Branderud Some evidence on rhythmic patterns of spoken French Danielle Duez and Yukihoro Nishinuma On the relation between the acoustic properties of Swedish voiced stops and their perceptual processing Diana Krull Descriptive acoustic studies for the synthesis of spoken Swedish Francisco Lacerda Frequency discrimination as a function of stimulus onset characteristics Francisco Lacerda V111 Speaker-listener interaction and phonetic variation Bjorn Lindblom and Rolf Lindgren Articulatory targeting and perceptual consistency of loud speech Richard Schulman The role of the fundamental and the higher formants in the perception of speaker size, vocal effort, and vowel openness Hartmut TraunmOlier PERILUS V 1986-1987 About the computer-lab Peter Branderud Adaptive variability and absolute constancy in speech signals: two themes in the quest for phonetiC invariance Bjorn Lindblom Articulatory dynamics of loud and normal speech Richard Schulman An experiment on the cues to the identification of fricatives Hartmut TraunmOlier and Diana Krull Second formant locus patterns as a measure of consonant-vowel coarticulation Diana Krull Exploring discourse intonation in Swedish Madeleine Wulffson Why two labialization strategies in Setswana? Mats Dufberg Phonetic development in early infancy - a study of four Swedish children during the first 18 months of life Liselotte Roug, Ingrid Landberg and Lars Johan Lundberg A simple computerized response collection system Johan Stark and Mats Dufberg Experiments with technical aids in pronunciation teaching Robert McAllister, Mats Dufberg and Maria Wallius ix PERILUS VI FALL 1987 Effects of peripheral auditory adaptation on the discrimination of speech sounds (Ph.D. thesis) Francisco Lacerda PERILUS VII MAY 1988 Acoustic properties as predictors of perceptual responses: a study of Swedish voiced stops (Ph.D. thesis) Diana Krull CONTENTS OF PERILUS VIII Some remarks on the origin of the "phonetic code" Bjorn Lindblom ...................... . . .. . .... . ... . . ........ ................... . ...... . . . . . . . . . . . . . . . . . . . . Formant undershoot in clear and citation form speech Bjorn Lindblom and Seung-Jae Moon ........... . ............ . .. . . . . . . . . . . .. . . . . . . . . . . . . . . . . On the systematicity of phonetic variation in spontaneous speech aile Engstrand and Diana Krull . . . . . . . . ... . . . ................... . ... . . . .. . . . . . . . . . . . . . . . . . . . . . . . 1 20 34 Discontinuous variation in spontaneous speech aile Engstrand and Diana Krull .48 Paralinguistic variation and invariance in the characteristic frequencies of vowels Hartmut TraunmOlier 54 Analytical expressions for the tonotopic sensory scale Hartmut TraunmOlier 93 .. . . . ... . ........... . ..... . ....... . . . . . . . . . . . . ... . . . . . . . . . . . . . . . . . . . ..... . . . . . . . . . . . ... . . . . .. . . . . . . .. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . ... . ... . ...... . . . . . . . . . . . .. . . . . . .. . . . . . .. . . . .... . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Attitudes to immigrant Swedish A literature review and preparatory experiments Una Cunningham-Andersson and aile Engstrand 1 03 Representing pitch accent in Swedish Leslie M. Bailey 153 - . . .. . ... . . . . . . . . . . . . . . . . . . . . . . . . . . . . .... . .. . . . . . . .. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .. . . .. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Some remarks on the origin of the "phonetic code"* Bjorn Lindblom Departments of Linguistics University of Texas at Austin and Stockholm University *Paper presented Dyslexia: Aspects at on the Symposium Memory on Functions, Interactions, sponsored by Rodinensis Pro Remediatione, in June Wenner-Gren Center, Stockholm Sweden. Hemispheric Developmental Sequencing and the Academia 1988 at the INTRODUCTION: THE ELUSIVE PHONEME Human languages exhibit duality (Hockett 1958) means that they make combinatorial use of which discrete units at two levels of structure: Elements carrying meaning (words, morphemes) are combined to f orm phrases and sentences according to syntactical rules. Phonemes are combined to f orm words and morphemes according to phonological- rules. Apparently no other species codes its communicative signals in this combinatorial way. In all languages the building blocks of spoken words are vowel and consonant phonemes. In animal communication systems, on the other hand, meaningf ul elements cannot be f ormed by systematic use of discrete units since they lack such units. The signals are Gestalts. Inventories are limited and typically consist of no more than 10- 40 holistic patterns (Wilson 1975) . By comparison, human vocabularies can become extremely large owing to the combinatorial power of the phonemic principle. Once acquired this "phonetic code" enables the normal child to begin, by its eighteenth month or so, to learn half a dozen new words a day, so that by six years of age it und erstands seven to eleven thousand phonetic f orms, or about ten percent of an adult's vocabulary (Studdert-Kennedy 1983, 1987) We are thus led to conclude that the property of dual structure is unique to hum an langauges. It 1S linguistically universal. And it is the key to their unique expressiveness. 'Slips of the tongue' (spoonerisms and other speech errors) - e g "our g!leer old gean" f or "our gear old g!leen" provide strong evidence that the adult language user's production of speech is organized in terms of phonemic segments (Fromkin 1980, MacNeilage, - 2 - Studdert- Kenned y and Lindblom 1985). Interestingly, 'slips of the hand' are reported to occur in sign language (Klima and Bellugi 1979: 126-146). Such f acts along with other ind ependent observations confirm the assumption that also sig n langua ge uses the method of combining abstract building blocks, i e hand shapes like phonemes by themselves totally devoid of meaning to f orm the complex signs of their vocabularies. Accordingly, d uality and the principle of phonemic coding cannot be said to be unique to the vocal auditory med ium (Bellugi and Studd ert-Kenned y 1980) Demonstrably the phonetic cod e of f ers an extremely powerf ul method of coding semantic inf ormation. Clearly it would not be possible f or a linguist to d escribe langua ge structure, whether spoken or signed , without the recognition of the phonemic organization of the lexicon. The phoneme represents a major d iscovery of twentieth-century linguistics (Fischer- J¢rgensen 1975). Yet the phoneme remains elusive to those who stud y the physical and behavioral aspects of language use. Neither in articulatory movements nor in the speech signal d o phonemes appear as bead s on a necklace. Their correlates in the signal d o not f orm segments that are sharply d elimited along the time axis. And those correlates cannot be unambiguously identif ied irrespective of contex t. These dif f iculties are known as the segmentation ' and invariance problems (Perkell and Klatt 1986, Fant 1988, Liberman 1988). Large-scale recognition of human speech by computer still awaits the successf ul resolution of these classical theoretical issues. At this point the student of reading would want to intersperse that the alphab et which ref lects the phonemic segmentation of speech, was d eveloped late in the evolutionary history of language, perhaps no earlier than 3500--4 000 years ago. He would also point out that readers tend to vary with respect to their a bility to a nalyze written ma terials phonolog ica lly. Poor readers lack phonemic awareness (Liberman 1987, Lundberg, Olof sson and Wall 1980, Lundberg 1987) They have greater d if f iculties segmenting word s into phonemes than good read ers. According ly we f ind that the research interests of those studying speech and those stUdying reading converge on the baf f ling but admittedly powerf ul notion of the phoneme. How could such a complex structure ha ve evolved? The goal of the present pa per is to shed some lig ht on that question. I shall make my presentation in two steps. We beg in by f irst considering how the phonetic values o f phonemes might ha ve developed. In other w ord s, how do phonetic systems evolve? We then use our tentative . - 3 - answer to that questio n to elucidate the o rigin of the Where did the pho nemic principle co me units themselves? fro m? ARTICULATORY AND PERCEPTUAL PHONE TIC SYSTEMS EVOLVE? CONSTRAINTS I: HOW DO The majo r dimensio ns that linguists traditio nally use to describe vo wels are: (i) the degree of rounding of the lips and the po sitio n o f the to ngue alo ng (ii) a front-back and (iii) a high-low dimensio n (Ladef o ged 1982 ) . The typo lo gical data used in the present paper FIGURE 1 I % 100 50 o u 'U" o O ·:> include vo w el systems fro m o ver 200 languages (Cro thers 1978) w ho se vo w el qualities were specified in relatio n to a maximal universal set with fro nt, central o r back, ro unded o r unro unded and seven po sitio ns o n the high lo w continuum. The mo st f avo red invento ries are listed in Table 1. Figure 1 sho ws supplementary data fro m an independent investigatio n o f 317 languages (Maddieso n - 4 - TABLE l. Mo st favo red vo w el systems o bserved co rpus o f o ver 200 languages (Cro thers 1 978) INVENTORY SIZE 3 4 4 5 5 6 6 7 7 9 VOWE L 1 a 1 a 1 a 1 a 1 a 1 a i a 1 a 1 a 1 a QUALITIE S u u t + u u €. " u E t u € :> :t:: u E ,;) e u e 0 :i: � u � ;) e 0 u E. ::> e 0 3: � in a NO OF LG' S 23 13 9 55 5 29 7 14 11 7 1984) . The o ccurrence o f the mo st frequent symbo ls have been plo tted o n a tw o - dimensio nal pro je ctio n o f the universal set. Bo th so urces o f data co nverge in demo nstrating that only a small subset of the available qualities are bro ught into play. We f urther no te that there is a clear preference fo r 'peripheral vo wels' such as Ii e t a � 0 ul and a relative disfavo ring o f Iy 0 .a y 1M. I. Also high- lo w co ntrasts are mo re co mmo n than f ro nt-back and ro unded-unro unded o ppo sitio ns. These systematic trends represent rather drastic departures fro m the systems w e wo uld generate simply by drawing invento ries at rando m fro m the max imal set o f universal vo wel types. Ho w do we explain such patterns? Here is a brief summary o f a theo ry develo ped to acco unt fo r the o bserve d regularities but w ho se co mpo nents o riginally co me f ro m several independently mo tivated research themes. (Fo r an ex haustive descriptio n o f the research repo rted in the present and the f o llo w ing sectio ns see Lindblo m, MacNeilage and Studdert- Kennedy fo rthco ming). The theo ry can be presented in three parts. It pro vides quantitative definitio ns of the space of "po ssible vo w els", a co nstraint on "pho netic discriminability'" and a criterio n fo r selecting the "o ptimal system". The po int of de parture is a physio lo gically mo tivated, numerical mo del (Lindblo m and Sundberg 1971) w hich takes specificatio ns o f the po sitio n o f the jaw , to ngue, larynx and the lips as its input and w ho se o utput is the shape (area functio n) o f the vo cal tract f o r an arbitrary, but physio lo gically po ssible vo w e l articulatio n. The aco ustic pro perties o f such vo cal tract shape s can be ascertained by me ans o f establishe d metho ds o f aco ustic the o ry (Fant 1960). The audito ry pro pe rtie s are derived by transf o rming the aco ustic descriptio n o f a vo w el w hich is given in terms o f its harmo nic spectrum into an audito ry representatio n. This - 5 - last ste p employs computational models that capture e sse ntial characteristics of the auditory periphe ry as re veale d by psycho acoustic re se arch (Schro eder, Atal the class of vo w els, or and Hall 1979). Acco rdingly, the vowel space, generate d by this model can be de scribe d in articulatory, acoustic or auditory dime nsions. Since the above -me ntioned de f initio ns quantif y ge ne ral aspects of oral physiology, aco ustics and he aring that are in no w ay spe cial to spe e ch w e can view the vow e l space as a te ntative hypo thesis about the a prio ri range of physical sounds unive rsally available for the linguistic selection of vowel co ntrasts. The the ory analyzes phonetic discriminability into an auditory and a sensori-moto r aspect. It can be sho w n that it is po ssible to predict the auditory diffe rence or distance that a listene r assigns to an arbitrary pair o f vo w els (Blado n and Lindblom 1981) from �"S- AUDij=c (f I o Ei (z)-Ej (z)12 dz)1/2 (1) where c is a co nstant and Ei (z) and E" (z) represent "excitatio n patterns" calibrated in psyc1 oaco ustically motivated dimensions. The interval z=O-24. 5, in Bark units, correspo nds to the frequency range o f human hearing (Schroeder, Atal and Hall 1979) . There is also data fro m experiments using the technique of Direct Magnitude Estimation (Stevens 1975) . These experiments compared subjects' judgements o f movement along the dimensions of jaw o pening and front-back positioning o f the tongue. The DME results indicated that subjectively jaw movements appeared mo re exte nsive than tongue movements altho ugh displacements were equal in terms o f physical measures (Lindblo m and Lubker 1985) . On the basis of these findings an articulatory distance metric, ARTi" ' was derived f or the vow el space (Lindblo m 198 � ) Taking the product of the articulatory and the audito ry . matrice s we e xpress phone tic discriminability as Dij = (2) ARTij*AUDij Given the definitio ns of the space and the discriminability measure we are in a po sition to ask: If vow e l systems were se e n as evolutionary adaptatio ns to the idio syncratic shape of the vo w el space and to selectio n pressures favo ring maximally discriminable vow e l co ntrasts what wo uld they be like? This que stio n w as addre sse d in a series of computatio nal expe rime nts 1n w hich optimal system w as de rive d by computing: i- I k (l/Dij)2 .... minimize d (3) L. L �:' '2.. j=1 --- - 6 - TABLE 2. ------ - OBS ER V E D i . . . . --------- ------------- INVENTORY SIZE: 3 u COMPUTED i . . ------ -- ---- . . u . . . u • • a a (23) INVENTORY SIZE: i . . . . e . . . 4 u i . E a • a (13 ) i INVENTORY SIZE: . €. . . • • . 5 u i . E. , . . . • • , . � . • • :> u a a (55) i INVENTORY SIZE: . � . . t . , . 6 u i . £ a a ------ � u (29) --------------------------------------------------- INVENTORY SIZE: i . i � . . . . � . 7 u i . . . ) . . £. tt . . • • . a. • a U y . a (14) i i . e . t . . INVENTORY SIZE: . . . . � a u a 9 i . e ) . tt.. o\:r � . • • • a U • . a • 0- (7) f or all possible combinations generated by k=3 through 9 (inventory size) and n=19 (size of universal set) . - 7 - The results are presented in Table 2. The lef t co lumn restates the inf o rmatio n of Table 1 and is co mpared w ith the results o f the simulatio ns (right co lumn) . In no case do es the pro bability o f selecting a co rrect system by pure chance exceed 10-3. If w e make a gro ss co mpariso n in terms o f the number o f high- lo w (vertical) and f ro nt- back (horizo ntal) co ntrasts there is perfect agreement between the predictio ns and the data. Lo o king at the individual qualities w e f ind that certain discrepancies o ccur in systems with mo re than six vo w els. Ho wever, in mo st cases they are o f f by no mo re than a single step o n the nineteen-po int grid o f the universal set. It appears justif ied to co nclude that the simulatio ns achieve a rather a clo se agreement w ith the typo lo gical data. Such a result suppo rts the idea that vo wel systems can be understo o d as f unctio nal adaptatio ns to articulato ry and perceptual co nstraints. FIGURE 2 30 VOWEL SYSTEMS CONTAINING: BASIC SEGMENTS w (J Z w a:: a:: :::> (J (J o I- Z w (J BASIC . �ND ELABORATED SEGMENTS 20 • BASIC, ELABORATED, COMPLEX AND SEGMENTS 10 a:: w a.. o o 5 10 15 20 25 INVENTORY SIZE Acco rding to the theo ry presented here the preference f o r 'peripheral' vo wels and the disf avo ring of ' interio r' vo w els o riginates in an interactio n between a "demand" f o r discriminability o n the o ne hand - w hich pro duces a dispersio n ef f ect displacing vo wels to w ards the periphery and the idio syncratic pro perties o f the vo wel space o n the o ther w hich - 8 - leaves more room f or high-low contrasts than f or f ront back and rounding gestures. As we broaden our f ield of observation the above conclusions tend to be reinforced. The vowel d ata just examined are limited to systems of so-called 'plain' vowels. Many languages use vowel series that have ad d itional attributes such as apicalization, nasalization, breathy or creak y voice, etc: Ii%., 'e, a, 01 as well as combinations of such ad ditional f eat� re;: lair �:/. We classif ied the vow els of the UPSID database (Maddieson 1984) into those three groups: Basic or plain, Elaborated (with ad d itional features) and Complex (with combinations of elaborated mechanisms) . Plotting the distribution of these types as a f unction of inventory size we obtained the d iagram shown in Figure 2. It shows that small sy stems use Basic segments, medium-sized invoke Basic and Elaborated articulations. Large systems bring all three types into play. FIGURE 3 60 60 • BASIC ARTICULATIONS � 50 50 � 40 40 30 30 20 20 10 10 o o z w � • ELABORATED • COMPLEX } ARTICULATIONS en III o u. o 0: w a:l ::::!: ;:) z o 10 20" 30 40 50 60 o 10 20 30 40 50 60 TOTAL INVENTORY SIZE We classif ied the consonants of UPSID in a similar manner. Segments such as Ip t b d m . . 1 are treated as nd IpJ t' m . 1 represent Elaborated Basic; � gestures; Articulation; that combine Elaborated I. Examining mechanisms are Complex: Itl:h qW �C cl " the occurrence of these consonant categories as a f unction of inventory size we find patterns closely paralleling those f or vow els. Figure 3 presents a representative subset of the UPSID corpus showing data f rom 47 languages of the Af ro-Asiatic and the Ind o Pacif ic language groups. Each d ata point ref ers to a . . . . - 9 - . . given language. The top diagram show s the number of Basic obstruents as a f unction of total system size. The bottom panel indicates the number of Elaborated and Complex obstruents as a f unction of total sy stem size. Once more we see that small sy stems have Basic articulations, medium sy stems have Basic and Elaborated gestures and large sy stems have all three ty pes. Also note the saturation of Basic elements bey ond a certain system size and the law f ul linear grow th of the Elaborated and Complex data. Our analysis indicates that know ing the size of a vow el or consonant inventory w e can make some f airly good predictions about its phonetic contents. The motorically most elaborated and complex phonetic gestures (e g clicks, vowels simultaneously diphthongized and pharyngealized etc) are likely to occur in the largest inventories (as in ! Xu of the Kalahari desert with its 148 segments) whereas small systems (n<10, e g Maori and Haw aiian) would not be expected to contain such sounds but to f avor elementary articulations (p, t, m, e, a etc) . This Size Principle makes sense if we assume that in small inventories Basic articulations achieve suf f icient contrast whereas larger sy stems place greater demands f or intrasy stemic distinctiveness and theref ore cause additional dimensions to be recruited and to be combined to f orm more complex segments. ARTI CULATORY AND PERCEPTUAL CONSTRAI NTS I I : WHERE DOES THE PHONEMI C PRI NCI PLE COME FROM? I n the preceding sections we have argued that the values that vow els and consonants ex hibit have evolved in response to universal, non-linguistic articulatory and perceptual constraints. Let us now see w hether these constraints could have play ed a role also in the emergence of the discrete units themselves. Our discussion w ill be based on a computational ex periment in which we simulate the phonetic grow th of a small vocabulary , a minilexicon. The design of the experiment is closely analogous to the vowel sy stem simulations. The point of departure is again the articulatory model (Lindblom and Sundberg 1971) . We use it to generate a phonetic space consisting in this case of a set of "possible syllables". A possible sy llable is of f ixed duration and is represented as a continuous trajectory in phonetic space moving f rom a complete closure of the vocal tract (w hose location ranges f rom labial through dental, alveolar, retrof lex to palatal, velar and uvular points of articulation) to an open conf iguration. The open conf igurations are those of the phonetic - 10 - previously described cardinal vow el set. Figure 4 gives an example of such a transition w ith a stylized f requency-time f ormant pattern at the top and its representation in a three-dimensional f ormant space below . The phonetic discriminability of an arbitrary pair of trajectories was obtained by generalizing the procedures applied to vowels to the time domain , that FIGURE 4 TIME ( kHz) THIRD FO R M A N T (kHz) 3.0 3.0 2.5 r--t---l 2.5 2.0 2.0 3.0 -- a .2 ., .8 .6 FIRST FORMANT 1.0 SECOND FORMANT (kHz) 1.0 (kHz) is by representing them as a series of discrete spectra in time, calculating Eqs 1-3 f or each time sample and then deriving the discriminability measure as the square root of the sum of the individual samples squared (cf Eq 3). since the reduction phenomena and articulatory simplif ications of on-line speech can in most cases be explained satisf actorily in elementary biomechanical terms by representing articulators by damped spring mass systems (Lindblom 1983) a rank ordering of every possible syllable based on articulatory criteria was also attempted. Such a biomechanical analysis makes us - 11 - expect that ex treme positions (ex treme displacements f rom habitual rest) and ex treme movement rates tend, if possible, to be avoided. That is a f act richly supported by phonetic observations (Lindblom 1983) . There is room f or only a f ew ex amples: Syllables with labial and dental occlusions receive high ranks. They have near-neutral points of closure (cf their high f requency in babbling) whereas a transition with a retracted tongue tip, a retrof lex closure, represents a more ex treme departure f rom neutral. The penalty on extreme movement rates leads to a favoring of homorganic, assimilated sequences. Thus a trajectory consisting of a uvular closure followed by a palatal (high-f ront) open conf iguration gets a lower score than say a palatal (velar) closure f ollowed by a palatal (velar) open conf iguration. Pursuing the analogy with the vowel system simulations f urther we investigated "optimal systems" of syllables by computing k � - I � L. · ·/d· .)2 � minimized (4) (a1 J 1J 1 = j where dij is the discriminability of an arbitrary transition pa1r and aij the articulatory cost of that pair. In words: Find that set of k syllables that simultaneously satisf y the goal of being as easy as possible to say (minimal articblatory �ost) and as easy as possible to hear (max imal discriminability). In the present case k=15 and the total inventory was 133. A procedure of cumulative selection was ad opted. "" L·l:2. --- TABLE 3 ba bu de da du gc ga bi di . gl + + . gt gu Once an initial syllable had been selected Eq 4 was applied repeatedly until a minilex icon of 15 elements had been obtained. In all there were 133 runs (=initial syllables) . The results were pooled which yielded a total of 1995 syllables. The "optimal system" was def ined as the 15 f orms w ith the highest f requency in this pooled set. The results are presented in Table 3. - 12 - The most signif icant aspect of this table emerges when, examining it row by row and column by column, we observe that trajectory onsets and end -points are shared . Row s and columns appear to contain what linguists w ould call "minimal pairs". Why not a more diverse set of closures and open conf igurations? FIGURE 5 Perhaps the easiest way of obtaining an intuitive grasp of the causes of the combinatorial structure of the derived inventory is to invoke a simple geometrical metaphor. Suppose we consid er tw o vertical line segments and the task of d rawing k lines (trajectories) f rom anywhere on the lef t segment to anywhere on the right in such aw ay that the area, A, between any pair of lines w ill be as large as possible. Mathematically, i.-I k (5 ) (l! Aij)2 ..... minimized L L. 1 i-=2 j= Cf Eqs 3 and 4. Figure 5 show s the result f or k=9. We see that trajectory onsets and end-points are shared. The phonetic space of the simulations is clearly much more complex but our point is that the convergence of trajectories in the geometrical exercise is analogous to the convergence of the optimized phonetic transitions. The combinatorial pattern appears to be a consequence of achieving an ef f icient packing (read: optimal d iscrimination) w ithin a bound ed space. It can be shown that the aij matrix d oes not inf luence the d egree combinatorial coding in any major w ay but it -- - 13 - does play an important role determining the phonetic value s of the derive d syllables. Suppose w e presented Table 3 to a linguist as a sample f rom the vocabulary of an unknown language implying that the f orms have dif f e rnet meaning but being very caref ul so as not ot reveal that they had be e n produce d as unanalyze d w holes. Undoubtedl y he would note that the table contains numerous minimal pairs. Assuming that the le xical items are semantically distinct and f ollowing standard linguistic methodology he w ould hypothesize that the language in question uses three consonant phone mes and f ive vowel phonemes the minimally contrastive segments being / b d g/ and / i a :> u/ . t F IGURE 6 T ONGUE TIP JAW :\ BODY : \i ... m t- -- ALV r m - - f-�. t- TONGUE I ALVf t- f- ... � �: � � - ... � - -- -- -- --- f- --------- . � A P _______ TIP H ______ : \i y AL t- fm - BODY - OPEN - TARGET - NEUTRAL - - ---- - -- t- ---- TARGET NEUTRAL NEUTRAL - - JAW TONGUE OPEN TARGET - ---- 0 . - --- - - HABITU AL REST DENOMINATOR - - NEUTR AL _ _ COMMON TONGUE P - TARGET - - ----- -- ---- HABITUAL REST How do w e resolve this paradox? For remember: The syllables are by def inition specified as continuous transitions, as phonetic Ge stalts. Their production is no more segmentally organize d than the early vocalizations of the babbling child. Cf top panels of Figure 6. Neve rtheless, we transcribe such utterances using se gments. Accordingly our use of segments l.n - 14 - Table 3 should be seen as a mere convenience analogous to the conventional way of describing the phonetic behavior of the y oung child. How ever, a "phonemic principle" is never theless implicitly present in the derived lexicon since the existence of minimal pairs implies gestural overlap among motor scores. This is the point w e try to make in Figure 6 which shows the motor scores of tw o syllables, call them Idil and I da/ . The jaw and tongue body time f unctions dif f er whereas the tongue tip curves are identical. This overlap identifies a common denominator component. N ote that its contents is in a one-to-one relation w ith all words linguistically analy zed as beginning w ith the phoneme Id/. Were we to examine the rest of the motor scores of the derived syllables in the same manner we would obtain analogous common denominators f or the remaining "phonemic segments". We conclude that the simulated lexicon exhibits implicit phonemic coding. CONCLUSIONS: THE ELUSIVE PHONEME REVISITED We began by describing the phoneme as a pow erf ul but elusive unit of linguistic structure and by asking how such a complex structure could have evolved. Although our simulations of lexical growth no dOubt drastically underestimate the complexities of real-lif e vocabulary ac q uisition let us nevertheless brief ly examine what we might have learned f rom these preliminary considerations. The main f inding appears to be the demonstration of the beginnings of combinatorial structure in speech like signals. The computational experiments tell us that such combinatorial patterns can arise f rom phonetic constraints that f avor the selection of optimally discriminable stimuli. Recall that we f irst inferred the presence of such constraints f rom our analyses of vow el and consonant systems. In the present context w e have emphasized the role of perceptual aspects but other f actors should also be conside;t:'ed, f or instance learning. Conceivably , articulatory gestures that a child has alrea dy mastered might make new syllbles also containing those gestures easier to acquire than totally novel materials. Work now in progress indicates that such a mechanism w ould reinf orce the trend towards combinatorial coding even f urther and extend it to much larger lexica than the ones considered here. Examining the motor scores of the derived syllables w e f ound gestural components that can be said to be in a one-to-one rel ation w ith phonemic segments. - 15 - Note that these subcomponents are not explicit control units in the production process. They are hidden or only implicitly present in the holistic motor scores of the individual w ords of the lexicon. Making them explicit in the model, that is turning them into explicit control units is possible but requires additional processing. I would like to consider brief ly this notion of implicit phonemic coding in the context of reading. Poor readers are reported to lack phonemic aw areness. They have greater dif f iculties processing w ords phonologically than good readers. Similarly, it has been shown that illiterates score badly on tests involving phonemic segmentation tasks (Morais, Cluyens, Alegria 1984) . Phonemic aw areness does not seem to develop spontaneously during their language acquisition (Morais, Cary, Alegria and Bertelson 1979) . A novel view of the phoneme seems to be emerging out of the present work, namely that the elusive nature of the phoneme - manif est in reading behavior as w ell as in many other aspects of language use - is related to the f act that the phoneme has implicit rather than explicit status in the mental lexica of language users. Finally a f ew remarks on speech development. Psycholinguists assume that unless a child, as it were, "discovers the phoneme" as the building block of the lexicon it w ill not be ablef6 'deve'lop a vocabulary of normal large size. In the computational experiments I have reported the "phoneme" is not discovered. Rather minimal pairs identif y nodes of "gestural overlap". The notion of gestural overlap may have several interesting consequences w orth exploring in f uture research. It may be an important f actor in f urther learning. Moreover, since by def inition overlapping motor scores ref er to shared subcomponents of the motor scores they ought to take up less memory space than holistic patterns w hich require separate individual storage. The simulated phoneme is accordingly not the cause of a large vocabulary. It is the result of the vocabulary growth. It is an emergent consequence of lexical development. The process is automatic. It occurs in a completely self -organizing w ay as it seems to do in the normal child. On this view then the reasons why other species lack phonemic organization may have less to do w ith their ability to produce and receive signals ef f iciently* than w ith absence of an urge to communicate and f orm large vocabularies. * For (1985) a diff erent view cf the - 16 - scenario of Lieberman REFERENCES U and Studdert-Kennedy, M (1980) . Signed and Bioloqical Constraints on Spoken Language: Park, Deerf ield Form, Linguistic Florida:Verlag Chemie. Bellugi, Bladon, R Crothers, Fant, J (1978) . "Typology and Universals of Vow el Systems", In: Greenberg, J H, Ferguson, C A and Moravcsik, E A (eds): Universals of Human Language, Vol 2, 99-152, Stanf ord:Stanf ord University Press. G Fant, G A W and Lindblom , B (1981) . "Modeling the Judgement of Vow el Quality Dif f erences", J Acoust Soc Am �:1414-1422. (1960) . The Acoustic Theorv Production, The Hague:Mouton. (1988). "The Speech Code", of Speech this volume. Phonolooical Fischer-J0rgensen E (1975) . Trends in Theory, Copenhagen:Akademisk Forlag. A (1980). Errors of Linguistic Perf ormance, New York:Academic Press. Fromkin, V Hockett, C (1958). A Course in Modern Linguistics, York:MacMillan. Jakobson, R and Waugh, L (1979) Bloomington Language, University Press. Klima, S and Bellugi, U Language, Cambridge, Press. E The and New Sound Shape 0 f London:Indiana (1979) . =T=h=e��S�i�g�n�s�-=o=f MA:Harvard University Ladef oged, P (1982) . A Course in Phonetics, 2nd ed, York:Harcourt, Brace and Jovanovich. Liberman I Y (1987) . "Language and Literacy: The Obligation of the Schools of Education", 1-9 in Intimacy w ith Language, Proceedings of the Orton Dyslexia Society Symposium, Baltimore: Maryland: The Orton Dyslexia Society. Liberman A M (1988) . "Reading is Hard Just Listening is Easy", this volume. - 17 - New Because Lieberman, and Evolution of P (1985). The Biology L anguage, Cambridge, MA:Harvard University Press. L indblom, B (1983) "Economy of Speech Gestures", 217245 in MacNeilage, P F (ed) The Production of Speech, New York:Springer Verlag. L indblom, B (1986) "Phonetic Universals in Vowel Systems", 13-44 in Ohala, J J and Jaeger, J J (eds): Orlando, Experimental Phonology, FI:Academic Press. Lindblom, B (in press). "A Model of Phonetic variation and Selection and the Evolution of Vowel Systems", to appear in Wang, S-Y (ed): New Change, Language Transmission and York:Blackwell. Lindblom B, and Sundberg, J (1971) . "Acoustical Consequences of Lip, Tongue, Jaw and Larynx Movement", J Acoust Soc Am 50(4) :1166-1179. Lindblom B and Lubker J (1985). "The Speech Homunculus and a Problem of Phonetic Linguistics", 169192 in V A Fromkin (ed): Phonetic Linguistics, Orlando, FI:Academic Press. Lindblom B, MacNeilage P and Studdert-Kennedy M (forthcoming). Evolution of Spoken L anguage, Orlando, FL:Academic Press. Lundberg I, Olofsson, A and Wall, S (1980) . "Reading and Spelling Skills in the First School Years Predicted from Phonemic Awareness Skills in Journal of Kindergarten", Scandinavian Psychology 21:159-173. Lundberg I (1987). "Phonologcal Awareness Facilitates Reading and Spelling Acquisition", 56-63 in Proceedings of the Intimacy with Language, Orton Dyslexia Society Symposium, Baltimore: Maryland:The Orton Dyslexia Society. MacNeilage, P F, Studdert-Kennedy, M and L indblom, B (1985). "Planning and Production of Speech: An Overview", in L auter, J (ed): Planning and Production of Speech by Normally Hearing and Deaf People, ASHA reports. Maddieson, I (1984) Patterns of Cambridge:Cambridge University Press. . - 18 - Sound, Morais, J, Morais, Cary, L , Alegria, J and Bertelson, P (1979) . "Does Awareness of Speech as a Sequence of Phonemes Arise Spontaneously?", Cognition 1:323-331. Cluyens, M and Alegria, J (1984) . "Segmentation Abilities of Dyslex ics and Normal Readers", Perceptual and Motor Skills 58: 2 2 1-2 2 2 . J, Perkell, J and Klatt, D (1986) . Invariance and Variability in Speech Processes, Hillsdale, N J:L EA. J L (1979) . M R, Atal,B S and Hall, "Objective Measure of Certain Speech Signal Degradations Based on Masking Properties of 2 17-22 9 in Perception", Human Auditory Frontiers of L indblom, B and Ohman, S (eds) Research, Speech Communication L ondon:Academic Press. Schroeder, Stevens, S S (1975) . Psychophysics, New York:Wiley. Studdert-Kennedy, M (1983) . "On Learning . Human Neurobiology �:191-195. to Speak", Studdert-Kennedy, M (1987) "The Phoneme as a Perceptuomotor Structure", 67-83 in Allport, A, MacKay, D, Prinz, W and Scheerer, E (eds): Language Production, Perception and London:Academic Press. . Wilson, E 0 (1975). Sociobiology: The New Cambridge, MA: Belknap Press. - 19 - Synthesis, Formant Undershoot in Cl ear and citation Form Speech* Bjorn Lindbl om Departments of Linguistics university of Texas at Austin and stockholm University and Seung-Jae Moon Department of Linguistics University of Texas at Austin *Paper on Distinctive presented at a conference Features, stockholm Univerisity, May 27, 1988. FORMANT UNDERSHOOT IN CLEAR AND CITATION-FORM SPEECH Bjorn Lindblom and Seung-Jae Moon TOPIC This is a progress report on a project aiming at characterizing the acoustic properties of "clear speech". Is clear speech (henceforth CS) identical to more neutral forms of speech except for an improved signal-to-noise ratio? Or does CS also involve reorganization of phonetic gestures and their associated acoustic patterns? EXPERIMENTAL PHILOSOPHY Our motivation for studying CS is similar to the reasoning underlying our previous work on compensatory articulation and bite-block speech. The approach taken in that research was to observe the speech system under rather extreme conditions (abnormal mandible positions), assume that the principles governing its operation under normal and compensatory circumstances are the same, and try to infer from the compensatory behavior what those principles are. It appears fair to say that these bite-block experiments have provided evidence that vowel targets (Lindblom, Lubker and Gay 1979) and the durational correlates of stress patterns (Lindblom, Lubker, Lyberg, Branderud, and Holmgren 1987) are specified as sensori-motor, and ultimately . auditory, goals (cf Perkell 1980). In the case of CS the question arises whether, in order to become more intelligible, talkers tend to make perceptually important attributes of the signal more salient? And whether, by using a CS style, they invoke more extreme acoustic patterns that - in a more direct way than those of citation-form speech (henceforth CF) will provide clues to the true nature of the distinctive features? ELICITING CLEAR SPEECH Several experimental methods can be chosen to address such issues. The ideal type of approach appears to be an indirect one according to which CS speech samples are elicited from a subject by the experimental circumstances rather than by an explicit instruction to speak more clearly. An example would be speaking under noisy conditions (Clark, Lubker and Hunnicutt 1987, Chen, Zue, Picheny, Durlach and Braida 1983, Summers, Pisoni, Bernacki, Pedlow and Stokes 1988) or speaking - 21 - to a listener with reduced comprehension due to a hearing loss or a limited command of the language spoken. The data to be described in the present report were obtained by means of a direct approach, that is by explicitly asking subjects to "overarticulate and to speak as clearly as possible as when communicating with a non-native listener". IS CLEAR SPEECH INDEED MORE INTELLIGIBLE? Presumably when people speak more clearly they do so in an effort to become more intelligible. There are a few previous studies of clear speech showing that people tend indeed to become more intelligible (Dreher and O 'Neill 1957, Picheny, Durlach and Braida 1985, for a recent summary see Summers et al 1988). In the present work we are interested in correlating the acoustic properties of the speech samples with estimates of their intelligibility. We are currently running experiments designed to measure the intelligibility of the CF and CS recorded for our subjects as a function of a systematically varied signal-to-noise-ratio. Are the tokens that are intended to be CS samples indeed more intelligible? What are their acoustic properties? In the present report we shall focus on some preliminary findings from these acoustic analyses. The intelligibility tests will be the subject of a future report. SPEECH SAMPLES Lindblom (1963) examples of so-called In that article it was demonstrated that vowel formant frequencies can be considerably displaced from hypothetical target values depending on the duration of the vowel and the identity of the surrounding consonants. Our present aim is to compare CF and CS speech in terms of that phenomenon. Research reported since Lindblom (1963) has demonstrated that the undershoot model is too simple. It has been shown that speakers are free to vary the degree of undershoot somewhat independently of vowel duration. In fast speech articulatory and acoustic goals can be attained despite short vowel durations (Engstrand 1988, Gay 1978, Kuehn and Moll 1976). In Nord (1986) we find evidence indicating that reductions can occur in spite of sufficient duration. In designing a speech sample we set up the following conditions: duration-dependent contains undershoot. - 22 - (i) The vowels and their cons onantal environments s hould be chos en s o as to maximize large 'locus to-target' d is tances ; (ii) The tes t vowels s hould carry lexical main s tres s ; (iii) Vowel d uration s hould be s y s tematically varied over a s ufficiently large range. The firs t condition was achieved by choos ing Englis h word s in which (a) front vowels occur in a labial-velar environment, (b) back vowels occur in a palatal context and (c) open vowels occur between cons onants with s trict d emand s on a high jaw pos ition and critical cons triction d imens ions , s uch as ls i (Keating, Lindblom, Lubker and Kreiman in preparation) . The s econd and third requirements were met by making us e of the s o-called 'word-length effect' manifested in e.g. s peed, s peed y, s peedily in which the d uration of the firs t, main-s tres sed vowel s hortens as more unstres s ed s y llables are ad d ed to the word . Examples of tes t words containing front vowels in a Iw II frame are given in the table below. wheel will well wail wheeling willing welling wailing Wheelingham Willingham Wellingby Wailingby Word s of this ty pe were read by five male speakers of American Englis h with normal s peech and hearing and with dialects approaching General American English. Each s ubject was first as ked to produce each word in is olation reading from randomized lists at comfortable vocal efforts and speaking tempos. No further ins tructions were given for these s amples that repres ent citation forms or CF-utterances. The lists contained five repetitions of each item. A s econd s et was then recorded in which similar lis ts were us ed but in which the s ubjects were asked to overarticulate and to s peak as clearly as possible as when communicating with a non-native listener. The lis ts contained at leas t five repetitions of each item. Finally the s ubjects read the same test words embedded in carrier phrases: "It's X not y" and "Not x but y" . They were asked to overarticulate the items occurring in the pos itions of the upper-case letters. This list contains at least three repetitions of each paired items. We refer to these two sets as the non contrastive and contrastive CS lists respectively. For each s ubject we also obtained five repetitions of each vowel in a Ih-dl frame. The vowels in this context will be labeled null-context tokens (NC). - 23 - RESULTS A more complete presentation of the results of the present experiments is in preparation. We will here concentrate on the formant patterns of the front vowels in labio-velar environments. The measurements are all from the same speaker. Figure 1 shows the second formant frequency plotted against vowel duration for the vowels [i, t. , E , et ] in a /w 1/ environment. CF measurements are indicated by dots, CS results by open squares. There are fifteen data points for each condition and speaking style (5 repetitions times three word lengths). The arrows on the left side of each rectangle represent the F2-value of the vowel in the /h-d/ null context and the F2-llocus" at the point of maximum constriction for /w/. Vowel duration in the /w 1/ environment was determined by choosing an arbitrary value of F2, 12 00 Hz, and taking the points in time when F2 crosses this value as the vowel segment boundaries. The diagrams show a clear duration-dependence for all four vowels. As the vowels get shorter F2 is shifted more and more away from its nUll-context value towards its position in /w/. This undershoot effect is clearly present in CF as well as CS speech. However, the CF and CS data points seem to form separate clusters. In Lindblom (1963) undershoot was described in terms of exponential curves that made the degree of formant undershoot dependent on the 'locus'-pattern of the adjacent consonant and the duration of the vowel. The significance of this curve-fitting procedure was that since the asymptotes of the F1 and F2 curves derived for a given vowel were independent of consonant context they could therefore be regarded as constant attributes of that vowel. It was argued that, in spite of the all the variability associated with formant undershoot, a given vowel could be described in terms of a unique and invariant attribute: a context independent underlying articulatory and acoustic target that the talker aimed at but failed to attain under certain conditions. Figure 2 shows the analogous data on the third formant frequency as a function of vowel duration. Again the CF measurements are indicated by dots, and CS results by open squares. These diagrams show the same trends as Figure 1: There are clear undershoot effects whose magnitude is related to vowel duration, the locus-target distance and to the speaking style. We tentatively conclude that the data of Figures 1 and 2 are not compatible with the simple undershoot model since, for both F2 and F3 of each of the four vowels, we would need to fit two separate exponentials, one for the CF's and one for the CS. We are forced to - 24 - observe that (a) it is not the case that there is a single asymptote associated with the formant pattern of each vowel and that (b) it is therfore not possible to infer a unique invariant underlying target for each vowel. In Figure 3 formant frequency information on the same subset of data as shown in Figure 1 is presented in the form of F1/F2 charts. The average of at least five repetitions of the vowel in the /h-d/ environment is indicated by a large solid dot in each panel. The averages of a minimum of five CS renderings of the mono-, bi- and trisyllabic words are plotted as open squares and small dots represent the corresponding CF samples. Note that the CS measurements are closer to the null-context data points in all cases. This observation is also brought out clearly by comparing the results for /h-d/ environments with those for the trisyllabic words, that is with the situation in which the shortest vowel durations, and thus the greatest amount of undershoot, would be expected. Figure 4 presents that comparison for average F1 and F2 from the stressed vowels of Wheelingham, Willingham, Wellingby. Null context and CS samples are unquestionably more peripheral than the CF measurements. In the terminology of Koopmans-van Beinum (1980) intervocalic acoustic contrast seems to be reduced in CF vowels as compared with NC and CS tokens. Note the tendency for the F1 values for CS to be somewhat higher than in the other conditions. This might conceivably be due to the loudness increase that was in general associated with the CS tokens. The more peripheral locations of the CS match well the results of Chen (1980) and of Lindblom and Lindgren (1985). SUMMARY AND DISCUSSION OF FINDINGS These preliminary findings from a single speaker suggest that when lexical stress is held constant but vowel duration is varied by means of the word-length effect formant undershoot is observed in American English front vowels embedded in /w 1/ contexts. In the present case undershoot cannot be attributed to variations in stress but is duration-dependent. Also the degree of undershoot varies with speaking style. As mentioned above the variables determining undershoot have been examined by several investigators (Kuehn and Moll 1976, Gay 1978, Nord 1986, Engstrand 1988, Flege 1988). These studies clearly show that a simple duration-dependence does not give a satisfactory account of the facts gathered so far. The present findings complicate the situation further in demonstrating a style-dependence. - 27 - • . o NULL CONTEXT: /h_d/ CITATION FORM CLEAR SPEECH 2 .5 ,------, - N J: .:. 2.0 ............... ............................. . . . . . ... .................................. . . . . . . . . ............... ..... . . . .. . . . . c z o u W 1 0 ....................................................................................... .................................... CI) ' 0 .5 . L----'_---L_--1.._--'-_---L._---L-_---.J o 0. 1 0. 3 0. 2 0.4 FIRST FORMANT 0.5 0.6 0.7 (kHz) FIG 4 - 29 - One factor which phoneticians have so far not really been able to control in their laboratory studies of speech production is the dimension of over- and underarticulation. While instances of "hyperspeech" and "hypospeech" are readily observed informally in everyday situations this aspect is very difficult to manipulate experimentally. When we record speech samples for our experiments we ignore the degree of reduction or elaboration that a given speaker may use perhaps because we tacitly hope that there is some "default mode" of pronunciation which every subject will adopt and which will make data from different talkers comparable. It is tempting to speculate that it may be this uncontrolled hyper-hypo dimension that might account some of the variability observed in vowel reduction studies. Is there a continuum of over- and underarticulation? If so, how do we determine where along this continuum a piece of recorded speech happens to be located? These are questions for further research. Our hope is that the present research on clear speech might shed some light on these issues. ** Although the results shown here are limited to a single speaker it appears evident that clear speech is not merely equal to more neutral speech with a better signal-to-noise-ratio. It does involve reorganization of phonetic gestures. That such a conclusion may be correct is supported by evidence in the literature. For instance, Picheny, Durlach and Braida (1986) and Uchanski, Durlach and Braida (1987) present data indicating that CS does indeed involve reorganization which can in fact be rather extensive. Assuming that speaking imposes sUbstantial modifications of citation-form speech patterns we are led to ask: Why should there be such a thing as clear speech? Why do talkers bother to make CS adjustments? Is it - as advocates of the Motor Theory of Speech Perception would no doubt argue - because, in so doing, they facilitate the listener 's access to the " distal objects of speech perception" , that is the phonetic gestures? Or is it because they would thereby make acoustically stable and salient properties of the signal easier to identify (cf the Quantal Theory of Speech as presented in Stevens (1989» . A third possibility is in line with TAD, the Theory of Adaptive Dispersion (Lindblom and Engstrand 1989) and is based on the assumption that the existence and very nature of clear speech can be understood as a speaker adaptation to the needs of the communicative situation. According ** The present project is closely related to the research on " Uttalstransformer" (speech transforms) now in progress at the Phonetics Laboratory of Stockholm University and conducted by Engstrand, Krull and Lindgren. - 30 - to this theory the purpose of clear speech is to enhance the acoustic contrast among phonemes (cf Koopmans-van Beinum 1980). It is further assumed that (i) speakers are capable of controlling the degree of acoustic contrast among phonemes and that (ii) the very existence and nature of clear speech tells an indirect story about the organization of perceptual processing: The argument is that clear speech makes sense only if we suppose that lexical access is based on - neither recovering phonetic gestures nor invariant acoustic properties but signal information that is sufficiently contrastive. On this view the recognition of segments involves the computation of (dis)similarities between stored patterns and signal derived features. TAD suggests that speaking more clearly is intended to make lexical access converge more unambiguously on unique candidates. Much more data is needed before such questions can be definitively answered but it already seems evident that clear speech offers opportunities to develop new paradigms and vantage points from which current theories can be tested. REFERENCES F Chen, Chen, R (1980): " Acoustic Characteristics and Intelligibility of Clear and Conversational Speech at the Segmental Level" , Master's Thesis, M I T, Cambridge, Mass. F R, Zue, V W, Picheny, M A, Durlach, N I and Braida L D (1983): " Speaking Clearly: Acoustic Characteristics and Intelligibility of stop Consonants" , 1-8 in Working Papers II, Speech Communication Group, M I T. Clark J E, Lubker, J F and Hunnicutt, S (1987): " Some Preliminary Evidence for Phonetic Adjustment Strategies in communication Difficulty" , in Language Topics, vol 2, John Benjamins. Engstrand, O (1988): " Articulatory Correlates of Stress and Speaking Rate in Swedish VCV utterances" , J Acoust Soc Am 83 (5): 1863-1875 . Flege, Gay, J E (1988): " Effects of Speaking Rate on Tongue position and Velocity of Movement" , J Acoust Soc Am 84 (3) :901-9 16. T " Effect of Speaking Rate (1978): Movements" , Formant J Acoust 63 (1) :223-230. - 31 - on Vowel Soc Am Keating, P, Lindblom, B, Lubker, J and Kreiman J (in prep): "Jaw position in English and Swedish VCVs", ms. , Department of Linguistics, UCLA. Koopmans-van Beinum, Reduction, Kuehn, F J (1980): Vowel contrast Amsterdam: Academische Pers B V. D P and Moll, K L (1976): "A Cineradiographic Study of VC and CV Articulatory Velocities", J of Phon 4 :303-320. Liberman A M and Mattingly I Theory Speech of cognition 21:1-36. Lindblom, B (1963): Reduction", On Vowel Department Stockholm. G (1985): "The Motor Perception Revised", "Spectrographic Study of Vowel J Acoust Soc Am 35:1773-1781 and Reduction, technical report, of Speech Communication, RIT, Lindblom B, Lubker J and Gay T (1979): " Formant Frequencies of Some Fixed-Mandible Vowels and a Model of Speech Motor Programming by Predictive Simulation" , J of Phonetics 7, 147-161. Lindblom B and Lindgren R (1985): "Speaker-Listener Interaction and Phonetic Variation" , 77-85 in Experiments in Speech Processes, Perilus IV, Department of Linguistics, stockholm University. Lindblom B, Lubker J, Lyberg B, Branderud P and Holmgren K (1987): "The Concept of Target and Speech Timing" , 161-182 in: Channon, R and Shockey, L (eds): In Honor of Ilse Lehiste, Foris: Dordrecht, Holland. Lindblom B and Engstrand 0 (1989): "In What Sense is Speech Quantal? " , commentary on focus paper by K N Stevens to appear in theme issue J of Phonetics. Nord, L Perkell, (1986): " Acoustic studies of Vowel Reduction in Swedish" , STL-QPSR 4/1986, 19-36 (Dept of Speech Communication, RIT, Stockholm). J (1980): " Phonetic Features and the Physiology of Speech Production" , 337-372 in Butterworth (ed) : Language Production (Vol 1), London :Academic Press. - 32 - Picheny, M A, Durlach, N I and Braida, L D (1985): "Speaking Clearly for the Hard of Hearing I: Intelligibility Differences Between Clear and Conversational Speech", J Speech and Hearing Res, 28, 96-103. Picheny, M A, Durlach, N I and Braida, L D (1986): "Speaking Clearly for the Hard of Hearing II: Acoustic Characteristics of Clear and Conversational Speech", J Speech and Hearing Res, 29 (4), 434-4 4 6. Schulman, Stevens, R (forthcoming): Loud and Normal Acoust Soc Am. "Articulatory Dynamics of Speech", submitted to J K N (1968): "Acoustic Correlates of Place of Articulation for stop and Fricative Consonants", QPR 89, RLE, MIT, 199-2 05. Stevens, K N (1989): "On the Quantal Nature of Speech", to appear in theme issue of J of Phonetics. Summers, Uchanski, W V, Pisoni, D B, Bernacki, R H, Pedlow, R I and Stokes, M A (1988): "Effects of Noise on Speech Production: Acoustic and Perceptual Analyses", J Acoust Soc Am 84 (3): 917-92 8. R M, Durlach, N I and Braida, L D (1987) "Clear Speech" paper presented as part of a seminar on "Hearing-Aid Processed Speech" at the American Speech-Language-Hearing Association Convention in New Orleans, November 1987. - 33 - ON THE SYSTEMATICITY OF PHONETIC VARIATION IN SPONTANEOUS SPEECH Ol le Engstrand & Diana Krul l 1 Introduction Even though the phonetic correlates of a given linguistic unit can vary drastical l y in different speech situations, a l ot of phonetic variation seems to be fairly regular. For example, Lindbl om (1963) was abl e to predict spectral variation in Swedish vowel s given information about their durations and adjacent consonant l oci; vowel durations, in turn, are partly predictabl e in terms of phonol ogical l ength, stress, and position in the utterance (Lindbl om, Lyberg & Holmgren, 1981; Engstrand, 1988); and these factors, final l y, partl y depend on the morphological and syntactic specification of the utter ance. Observations of this kind are important in view of the reason abl e assumption that speech perception depends on both direct impl icit knowledge of on an information and signal property various systematic interactions between acoustic-phonetic and grammatical dimensions (Lindbl om, 1987) . Mainly, however, phonetic systematicity has been observed in speech elicited and produced under highl y constrained laboratory conditions. There are, of course, several good reasons for using a strictly controlled experimental design, permitting systematic manipul ation of a smal l set of variabl es. Also, there is no £ to assume that phonetic variation in elicited priori reason speech is principall y different from phonetic variation in non-elicited speech. There is reason to assume, however, that the systematicity of the variation is l ess transparent in spontaneous speech than in speech produced under the cl assical experimental f ormat. The extent to which the phonetic varia tion in spontaneous speech can be disentangled in terms of its physical and l inguistic determinants should therefore be carefully examined. In this paper, we report some data bearing on the foll owing hypotheses relating to the systematicity of phonetic variation in spontaneous speech. These hypotheses are: 1) There are systematic correl ations between duration and spectral features of speech segments. In particul ar, a) durational changes will affect the formant pattern in the front non-low vowel s Iii and lei such that the formant frequency deflection from the onset or offset values related to a certain consonantal environment decreases as a function of the duration of the vocalic segment; b) the duration of a sequence IVrVI wil l affect the third and fourth formants associated with Irl such that the formant frequency deflection from values related to the vowel onset decreases as a function of the duration of the IVrVI segment. 2) Changes in the grammatical function of a certain phoneme sequence are accompanied by phonetic changes. More specifical l y, when a certain sequence makes up either a preposition or a - 34 - prefix, the alternation between these functions associated w ith spectral or durational effects. will be For the remainde r of this report, we first describe what experimental measures are currently being taken to test these and related hypotheses (section 2). We then report the relevant data that have been collected to date (section 3) . Finally, we discuss the measurement results and draw some tentative conclusions (section 4) . 2 Method We used for this experiment approximately one hour of recorded speech produced by one male native speaker of the Stockholm dialect of Swedish (subj. JS). The greater part of the recording is a lively monologue supported by brief questions and comments. There are three main topics in this monologue: a) about the subject's profession and daily work; b) about travelling by train in Europe; and c) about the political situation in a European country that the subject knows relatively well. In addition, the subject related the contents of a short story that he had been asked to read a few weeks earlier. The recording was carried out in a sound-proof recording room with a Revox PR/99 tape -recorder at a tape speed of 19 cm/s with the microphone (Sennheiser 211/U) at approximate ly 25 cm from the subject's mouth. After the recording, a manuscript of the whole speech material was pre pared. The frequency of occurrence of words and phrases was determined1, and frequently occurring words and phrases of interest were noted. Preliminary analysis of the resulting material then provided the basis from which the above hypo theses were generated. The preliminary analysis procedure involved digitizing the utterance candidates using an Eclipse S/200 computer system. The re sulting sample data file was edited such that all examples of a certain word or phrase could be listened to in a sequence. The material was tran and computer spectrograms were printed scribed phonetically, out for inspection. The single words selected for the final analysis were. for hypothesis 1a: tid /tid/ 'time', vis /vis/ 'manner', 'way '; also in visum 'visa' and visa 'show'; vet /vet/ 'know (s)', and se n /sEn/, which is a variant of sedan 'then '; for hypothesis lb: bara /bara/ 'only'; and for hypothesis 2: for /f0r/, which means 'for' when use d as a preposition. When pre fixed to verb and noun stems, it has a function roughly similar to its English equivalent, e. g. in forbjuda 'forbid'. In these words, the Ce ntral Swedish vowels /i/ and /e/ are phonologically long (or tense ) and display a conside rable amount of diphthongiza tion in stressed position; /E/ is short; /a! is long and slightly rounded in many speakers ' pronunciation. The vowe l /0/ has an open major allphone whe n preceding /r/ and supra de ntals, the latter being derive d from /r/ plus dental. It is long in the pre position for as pronounced in isolation; in prepositional phrases, however. it is normally unstressed and phonetically short. In derivative s such as forbjuda, the - 35 - neutralized. These two situations is distinction long/short feature' vs. 'for the can be typified by the pair for draget fordraget ' the treaty'. A related use occurs in compounds such has primary sequence the where as foredraget ' the speech', main question here is whether the former two uses stress. The different in spontaneous of the sequence for are phonetically speech. For the final measurements, broadband spectrograms were made (Kay Elemetrics Digital Sona-Graph/7800, 300 Hz filter bandwidth, frequency range O-S kHz; Sona-Graph Printer/7900) . The spectrograms were segmented a t major acoustic discontinu ities and measured for segment duration. Onset and offset formant frequencies relating to CV and VC boundaries were measured at the respective first and last glottal pulse clearly within the vocalic segment. In addition, formant frequencies were traced and measured at turning-points, if any, otherwise half way through the duration of the vocalic segment. Measurements of durations were performed to the nearest S ms relative to the marked segment boundaries, and formant frequencies were measured to the nearest 10 Hz relative to the tracing marks. 3 Observations 3. 1 Data and correlations for the front non-low vowels Mean durations and frequency measures relating to the second and third formants are listed in the Appendix, tables A 1-A4. Our primary concern is now to analyze these da ta with respect to the relationships between durations and formant frequen cies. Table 1 below shows correlations between vowel durations and various frequency measures relating to the second and difference between maximum third formants: maximum frequency, frequency and onset frequency, and difference between maximum frequency and offset frequency. In order to avoid the possible influence of a final lengthening effect (Lyberg, 1981; Nord, 1986) , the correlations are based only on test items occurring in non-final position, whereas the data tables in the Appendix include also the finals. Slightly lower correlations result in our data when the calculations include the finals. Effects of final lengthening in spontaneous speech will be studied at a later s tage in our project. Table 1. Correlation between duration and various measures of the frequency of the second formant. Subj. JS. Fzmax F�'? (max-onset) F�? (max-offset) F;,�max F;�< (max-onset) F:� (max-offset) D U R A T I 0 N vis vet 0. 90 0 73 0. S1 0.7 4 0.61 0.96 0.64 0. 39 0. 38 0.74 0 61 0. 29 tid 0.41 0.39 0.39 0.47 0. S2 0. 61 . . - 36 - sen -0.22 -0.49 0.36 -0. 18 -0. 61 0.07 way. The trajec the following The data can be summarized in a formant from onset to offset of a' slowly pronounced tory of at a can lie (or minimum) vowel forms a curve whose maximum axis from its onset and frequency the on distance varying in tid is F:;� flat curve a relatively offset. An example of is no appreciable change in case, there such a (Fig.la). In F2 in duration. vowel changing the formant trajectory with vet, on the other hand, forms a large excursion to its maximum (Fig. ib). In this case, the excursion decreases with vowel duration and there is a good correlation between the duration of the vowel and the difference in frequency between onset/offset of the forman t and its maximum. Thus, the correlation coefficients in Table 1 are predictable to a certain extent given the range of the formant excursions (r=0.66). The data showing the difference between maximum and offset second formant frequency as a function of duration for /e: / in the word vet 'know (s) ' are shown in Fig. 2. It should be noted that a linear f unction approximates these data well, whereas other data of this kind are generally better described by an asymptotic approximation (Lindblom, 1963; Lindblom & Moon, 1988). The reason for this difference is probably that the vowel durations measured here, coming from spontaneous speech, have much smaller durations than typically encountered in citation forms and overarticulated speech (Lindblom & Moon, 1988) . Thus, our phonologically long vowels are on the average just above 100 ms. The corresponding relationships in Lindblom & Moon also appear to be roughly linear around this value. 3 .2 Data and correlations for the /VrV/ sequence Durations and formant frequencies pertaining to the word bara are shown in the Appendix, table A5. Table 2 below shows correlations between durations of the /ara/ portion in the word bara and various measures of the frequency of the third and fourth formants relating to the segment /r/: minimum F::�; and F4, difference between F::�!; minimum and onset, and between F.<} minimum and onset. Table 2. Correlation between duration and various measures of the frequency of the third and fourth formants in bara. Subj. JS. Duration -0.74 -0.71 0. 79 0 85 F::!� min F.<} min F;" (min-onset) F,,:, (min-onset) . - 37 - 800 �-�----� 700 /"'\ 600 I N '-/ IW (f) LL LL 0 x <I L: N LL 500 400 300 200 0 100 0 0 4 60 80 120 100 140 VOWEL LENGTH 160 180 200 (ms) Fig. 2. Distance between F2 maximum and of f set (Hz) plotted as a f unction of vowel duration (ms) f or the word vet 'know(s)' as pronounced in spontaneous speech. The correlation between the variables is r-O.96. Subj . JS. - 39 - We find significant negative correlations for F3min and F4min as we 1 1 as for F;:l' (min-onset) and F"" (mean-onset). The 1 atter relationship is illustrated graphically in Fig. 3. The correlations for F3 and F4 onset are weak. suggesting that the onset values are relatively fixed. The correlations between F·;:� and F4 are as follows: for F3min/F4min. r=0. 71. and for F3(min-onset)/F4 (min-onset). r=0. 79. 3.3 The sequence /f0r/ as preposition vs. prefix The following data are relevant to Hypothesis 2 above. assuming phonetic correlates of a shift in the grammatical function of a sound sequence. Table 3 shows mean durations (ms) and formant frequencies (Hz) for the vowel /0/ in for in various functions. We have included data pertaining to stressed versions of the same sequence for reference. Table 3. Mean durations (ms) and formant frequencies (Hz) for the vowel /0/ in the morpheme for in various functions. Subject: JS. The stars (*) indicate that differences between adjacent mean values in the table are statistically signifi cant at the 1% level of significance. Further explanation in text. P R E F I X Dur Fl F:;.r F:::, F "1 x s 25 39 4 08 31 1414 132 2 2 26 136 2998 191 --- PREPOSITION x n ----------- 21 19 19 19 15 - - ---- * 1 ---- 45 479 1297 2 205 3201 s n - - 34 18 14 36 132 15 134 15 204 15 - - ----- 1 * * S T R E S S x s - --------- 126 530 1040 216 2 318 2 n - 43 11 11 33 60 11 57 11 70 10 -- - - The di fference in F:!.. F:.� and F"'I between Pref and Prep are statistically significant (p<O.Ol). The difference in dura tion. F1 and F2 between Prep and Stress are also statistically significant (p<O. Ol). The formant data suggest a systematic fronting and heightening in the direction "stress">"preposi tion">"prefix". These data can be compared to similar data obtained from earlier measurements. Those measurements. which were made from a radio interview with a m ale Swedish politician. show roughly the same tendency although the durational difference between the "prefix" and "preposition" conditions are more marked. and the difference between Fl means is not statistically signifi cant at p<O. Ol (see table 4). - 40 - GOO 500 0 r-.. I N '\../ Z :L 400 0 - IW (f) z 0 -.:t LL 300 0 200 0 10 0 0 o 8 o -100 L-4--L� ---L-�L�---L-�-L�---���---� 300 0 l OO VOWEL LENGTH (ms) Fig. 3. Distance between F4 onset an d mlnlmum (Hz) plotted as a function of duration (ms) of the /VrV/ sequence in the word bara 'only' as pronounc ed in spontaneous speech. The correlation between the variables is r=O.85. Subj. JS. - 41 - Table 4. Mean durations (ms) and formant frequencies (Hz) for the vowel 101 in for functioning as prefix vs. preposition in spontaneous speech. The stars (*) indicate that differences between adjacent mean values in the table are statistically significant at the 1% level of significance. Male Central Swedish subject (KOF). 1 PREF IX PREPOSITION ---�---:---�---- ----�----:----�---25 70 19 31 247 38 1 296 576 1509 4 89 20 1 29 2 153 16 Comments It was assumed in the introductory section of this paper that the perception and understanding of speech partly depends on an implicit knowledge of systematic interactions between various acoustic-phonetic and grammatical dimensions. Such a knowledge would permit extraction of linguistic information out of the physically variable waveform in spontaneous speech. Obviously, however, this implies that there really exists a fair amount of phonetic systematicity in the first place. The purpose of this study was therefore to gain some preliminary insight into the nature and extent of such a systematicity. As a first step, we formulated two limited hypotheses. The first hypothesis was that durational changes would be related to formant pattern changes in the front non-low vowels Iii and lei, and in the sequence IVrV/. It can be concluded now that some of our results do show fairly strong relation ships which can be interpreted in support of this hypothesis. Since, on the other hand, these data are among the very first of its kind, we must be particularly careful in stressing the limitations and preliminary nature of the results. Thus, we have observed a very small number of utterances produced by one single speaker. It should also be emphasized that what we have found here are merely statistical relationships which themselves in do not say much about the phonetic control strategy in terms of causally dependent and independent variables. There are various reasonable assumptions that can be made concerning this. Everyday experience suggests that spontaneous speech is frequently fast and reduced. A good question to ask, then, is to what extent one of these properties is governed by the other; i. e. , does rate cause reduction or vice versa? Some current terminology such as "allegro" vs. "lento", "fast speech", and the like (Dressler, 1972; Dalby, 1984) would - 42 - appear to imply the primacy of the rate parameter. Phonetic reduction would then, perhaps for physiological and mechanical reasons, result fr om small durations due to increas ed speaking rate. A rapid sequence of incompatible m otor commands to an inert articulator w ould act to reduce the amplitude of its movement. This , of course, is the philosophy underlying Lind blom's early vowel reduction model (Lindblom, 1963) . On the other hand, if the reduction parameter were primary, articula tory movements would be reduced in the first place and, given an unaltered movement velocity, overall rate variation and s egment shortening would result from variations in reduction (Barry, 1984). As a third poss ibility, it is conceivable that duration and reduction are jointly affected by some third factor, e. g. degree of stress. Whereas all these mechanisms appear reasonable, everyday experience also suggests that rate and reduction may be, at least partially, independent. We note in particular that s peech can be slow and reduced as well as fas t and elaborated (e.g. Zwicky, 1972; Shockey, 1973; Lindblom, 198 3 ; Engstrand, 1988; Lindblom & Moon, 1988). For example, s h ort vowel durations elicited by a high speech rate need n ot lead to formant undershoot (Engstrand, 1988), and long durations due to final lengthening are compatible with considerable reduc tion (Nord, 1986). The recent data of Lindblom & Moon (1988) further corroborate these observations. They find that durations and formant frequencies in vowels vary independently when different speech styles are experimentally controlled. Our present data have not yet given evidence for such style effects in spontaneous speech. But as we pointed out above, this study is a preliminary one. In order to approach some more general c onclusions, we are currently building up and analysing a more complete data base. Acknowledgments This work was supported in part by the Swedish National Board The Bank of Sweden for Technical Development (STUF) and Tercentenary Foundation (RJ) . - 43 - Footnote l·Program written by Rolf Lindgren. References Barry, M.C. 1984. Connected speech: processes, models. Cambridge Papers in Phonetics and Linguistics, 3, 1-16. Dalby, J. M. 1984. Phonetic American English. Bloomington, Linguistics Club. structure Indiana: motivations, Experimental of fast speech in Indiana University Dressler, W. 1972. Allegroregeln rechtfertigen Lentoregeln. Sekundare Phoneme des Bretonischen. Innsbrucker Beitrage zur Sprachwissenschaft, 9. Engstrand, O. 1988. Articulatory correlates of stress and speaking rate in Swedish VCV utterances. JASA 83, 1863-1875. Lindblom, B. 1963. JASA 35, 1773-1781. Spectrographic study Lindblom, B. 1983. Economy of speech MacNeilage (ed.): The Production of Speech. Verlag, pp. 217-245. of vowel reduction. gestures. New York: In: P. Springer Lindblom, B. 1987. Adaptive variability and absolute constancy in speech signals: two themes in the quest for phonetic invariance. Proc. XIth ICPhS, Tallinn, Estonia, Aug. 1987, Vol. 3, pp. 9-18. Lindblom, B., B. Lyberg & K. patterns of Swedish phonology: memory processes? Bloomington, Linguistics Club. Holmgren. 1981. Durational Do they reflect short-term Indiana: Indiana University Lindblom, B. & S.-J. Moon. 1988. Formant undershoot in clear and citation form speech. PERILUS 8 (this issue). Lyberg, B. 1981. Temporal properties of spoken Swedish. Mono graphs from the Institute of Linguistics, University of Stockholm (MILUS), 6. Nord, L. 1986. Acoustic studies of vowel reduction in Swedish. STL-QPSR 4/1986, 19-36. Department of Speech Communication and Music Acoustics, Royal Institute of Technology, Stockholm. Shockey, L. 1973. Phonetic and phonological properties of connected speech. Ohio State Working Papers in Linguistics, 17. Columbus, Ohio: Ohio State University, Department of Linguistic. Zwicky, A. 1972. On casual speech. Papers from the 8th Regional Meeting of the Chicago Linguistic Society. Chicago: CLS. - 44 - APPENDIX: Measurement data, tables A1-A5 Table A1. Data pertaining to li:1 in the word tid 'time': Mean values f or F2 and durations (ms); onset, maximum and offset maximum and onset, and F::!�; differences between F�;,� and F:" between F:;;? and F�:� maximum and offset (Hz); n=17. Subj. JS. Std.dev. Mean Duration F::;,� max F:;;.? onset F:;? offset F:;;.? (max-onset) F:;.? (max-offset) F;" max F·:,� onset F�:� of f set F3 (max-onset) F�:� (max-offset) 31 70 59 91 73 94 99 77 105 73 81 108 1772 1662 1627 109 145 2759 2615 2578 146 186 Table A2. Data pertaining to li:1 in the word vis 'way', 'manner': Mean durations (ms); onset, maximum and offset values for F2 and F3; differences between F2 and F3 maximum and onset, and between F:;? and F::�; maximum and offset (Hz); n=20. Subj. JS. Mean Duration F:;.� max Fz onset Fz offset F�:r (max-onset) F::;.� (max-offset) F·", max F::.� onset F::� offset F�� (max-onset) F3 (max-offset) Std.dev. 117 1786 1492 1753 291 37 2642 2270 2553 473 89 - 26 64 95 50 109 50 114 59 94 89 70 45 - Table A3. Data pertaining to /e:/ in the word vet 'know (s)': Mean durations (ms); onset, maximum and offset values for F::.2 and Fo:;,; differences between F:;;,� and F:::r. maximum and onset, and between F:;? and F:::.t maximum and offset (Hz); n=19. Subj. JS. Duration F:;;,� max F::;:, onset F:;;.� offset F:;;.� (max-onset) F:? (max-offset) F max F onset F'.'!:< offset F�· (max-onset) F·.::.\ (max-offset) .. ,· -.' .. � -' -' Mean Std. dev. 116 1692 1494 1365 198 316 2420 2211 2524 218 -10 8 40 220 124 68 136 213 79 43 54 83 67 Table A4. Data pertaining to /e/ in the word sen (=sedan) 'then': Mean durations (ms); onset, maximum and off set values for F�2 and Fo:;r.; differences between F:;;,� and F:�� maximum and onset, and between F:.? and F,'!!: maximum and offset (Hz); n=11. Subj. JS. Mean Duration Fz max F:;-t onset Fz offset F:;;.? (max-onset) F:;;.� (max-offse:t) F';!:< max F::!.t onset F::!.t offset F3 (max-onset) F� (max-offset) -' Std.dev. 69 1417 1446 1315 -29 93 2470 2491 2466 -45 4 - 46 22 75 67 75 70 96 67 40 95 45 53 - Table A5. Data pertaining to various aspects of the word bara /ba: ra/ 'only': Mean durations of the /VrV/ sequence (ms); onset and minimum F::�; and F��. pertaining to /r/, and differences between F::� and F4. minimum and onset, respectively (Hz). Subj. JS. Mean Duration F::;:t. min F4 min F:!.� onset F,,:, onset F·.::< (min-onset) F ... (min-onset) Std. dev. 122 2154 3091 2150 3209 36 139 - 47 57 163 217 79 108 188 159 - n 21 20 19 17 16 17 16 DISCONTINUOUS VAR IATION IN SPONTANEOUS SPEECH Olle Engstrand & Diana Krull 1 Introduction In spontaneous speech, words and phrases are frequently heard to be strongly reduced in relation to their alleged phonologi cal base forms. In certain cases, segments, sy llables and other units seem to lack acoustic traces in the speech signal altogether. It might be assumed that such reduced word forms are frequently phonologized in the sense that the implicit segments do not form part of the speaker's utterance plan, and that they are consequently unrepresented at all levels of the motor execution of speech. Alternation between phonologically elaborated and reduced forms entails phonetic discontinuity; i. e., for a given phonetic property , there is no continuum along which the property is reduced or elaborated. For exam ple, in the alternative word forms n�gonting and n�nting (both meaning 'something'), the first form contains a velar gesture (corresponding to the segment /g/), whereas the second form is assumed to bear no trace of the segment /g/. A criterion for determining whether two phonetically distinct word forms are also phonologically distinct in this sense is that a weak form can be phonetically elaborated without becom ing identical with or even approaching the strong form. For example, even an emphatic pronunciation of n�nting is quite common and acceptable even if it completely lacks a gesture for /g/. Does phonetic discontinuity entail phonological alternation? If the answer to this question is affirmative, all phonetic variants of a common phonological base must be continuous; i. e. , the amplitude of a certain phonetic gesture correspond ing to an underly ing segment display s a continuous transition between levels of reduction or elaboration. If the answer is negative, i.e. . if phonetic discontinuity is compatible with phonological identity . then there appear to be two distinct sources of discontinuous variation, one which is phonological and one which is something else. For the remainder of this paper, we attempt to exemplify and discuss an instance of this second source of discontinuous variation in spontaneous speech. Continuous and discontinuous phonetic variation tend to be more or less common in different parts of the word and phrase vocabulary . An apparent discontinuous variation is frequently observed in the subset of the Swedish vocabulary containing grammatically functional morphemes. words and phrases. In such cases, moreover, it frequently seems as if speakers tend to adhere to a relatively small set of discontinuous realiza tions. Fig. 1 provides an graphical example. The figure display s spectrogram tracings of the first and second formants pertain ing to all occurrences of the grammatical conjunction s� att - 48 - Iso attl (so that, such that) as pronounced in a spontaneous speech situation by a male Stockholmer; see Engstrand and Krull (1988) for details. There are two basic, quite distinct sets of curves pertaining to one reduced (left three columns) and one elaborated (rightmost column) sty le of pronunciation. The reduced version sounds much like the word satt Isattl ( sat I ) I • kHz 2 - --... - -- - - ..... ....-=- 'C7 -----.. 0 2 - � - - - - -..... - ",,-- - - - --... � � - - 0 ""\. c= � - 2 - 0 ::::. -� -- 2 '" _"--Z- .. ,.,,-... 0 Fig. 1. Tracings of the first and second formants from broad band spectrograms illustrating two basic varieties of the pronunciation of the grammatical conjunction sa att. The left three columns represent a reduced level of pronunciation and the rightmost column represents an elaborated level. Subj. JS. Do strongly reduced forms such as these instances of sa att as a rule contain any kind of acoustic traces of each of the segments constituting the corresponding phonological base forms? Mere listening and transcribing is probably insuffi cient to answer this question; when a clear segment realiza tion is heard in the natural speech situation, the effect frequently seems illusory when listened for in an excized portion of the speech flow. Careful measurements are therefore - 49 - necessary . The remainder of this paper will focus on this particular example in terms of a set of measurements and their subsequent perceptual evaluation. 2 Experiments 2.1 Recording and measurements We used for this experiment approximately half an hour of recorded speech produced by a male native speaker of the Stockholm dialect of Swedish (subj. JS). The greater part of the recording is a lively monologue supported by brief ques tions and comments from one of the experimenters. The subject and the recording procedures are described in more detail in Engstrand and Krull (1988). Out of the material, we selected and spectrographed (1) all instances of the g rammatical conjunction sA att (so that) , where £ stands for the phonologically long (tense) back round ed vowel 10/, and £ stands for the phonologically short (lax) open fronted vowel la/; (2) all sequences saC, where ICI is any non-nasal consonant; and (3) all preconsonantal instances of the word sA (so). Onset and nuclear (mid vowel) frequencies of the first four formants (F.t-F4) were identified and measured for all F::, and F4 were practica11 y occurrences of these utterances. identical and are not shown here. Measurement data for F1 and F2 as well as durations are given in Table 1. Onset and Table 1. nuclear frequencies (Hz) and durations for the vocalic segments in sA att, saC and sA. Subj. JS. NO. 1 2 3 4 5 6 7 8 9 10 11 12 PHRASE sA att sA att sA att sA att ONINUC on on nuc nuc FORMANT F1 F2 F1 F2 N 12 12 12 12 MEAN 470 1332 553 1319 S. D. 49 57 42 62 saC saC saC saC on on nuc nuc F1 F2 F1 F2 16 16 13 13 5 16 1348 596 132 5 78 70 85 83 sA sA sA sA on on nuc nuc F1 F2 F1 F2 10 10 9 9 42 0 1395 408 137 3 32 102 40 101 sA att x s 60 16 DURATIONS saC x s 67 15 - 50 - sA x 38 s 19 (ms) The table shows slight differences between the three cases. In at the statistical s ignificance particular, a t-test reveals F1 onset values both for sa vs. sa att and for mean 1% level significant durational Statistically saC. vs. att sa for differences were found between sa vs. sa att and saC. We next turned to the question whether these small differences are at all able to influence listeners' perception in the direction of the respective utterances sa att and satt with these portions of the speech flow excized out of their con texts. 2. 2 A listening test A series of stimuli containing 12 instances of sa att and 12 instances of satt was presented to nine native Swedish spea kers of roughly the same dialect as subj. JS. Since the sequence satt occurred only four times in our recorded mate rial, copies of those tokens were introduced randomly to give an equal number of both stimulus types. The resulting set of stimuli was subsequently duplicated and a new random order was created for the duplicate. This procedures gave a total of 43 2 responses from the nine listeners whose task it was to decide for each stimulus whether it represented sa att or satt. The results are shown in Table 2 in terms of the outcome of a multiple regression analysis using 1-5 predictor variables. The data refer to correctly identified instances of sa att. The values of R for the correlation between number of correct answers and each individual variable are given in the middle column, and t he cumulative effect is shown in the right co lumn. Calculated across all predictor variables, this effect is R=0.89. Table 2. Result of multiple regression analysis: prediction of number of correctly identified instances of the phrase sa att using 1-5 predictor variables. 9 subjects. PREDICTOR F.lon F:;.�on F.l nuc F:;.�nuc Duration MULTIPLE R Cumulative Individual variables -0.60 0.77 0. 77 0. 85 0. 89 -0.60 -0.60 -0. 3 5 0. 11 0. 51 We thus obtain a high cumulative ons from several variables. - 51 correlation with contributi - 3 Corrunents The above relationships, of course, are purely statistical, and do not tell us much about the perceptual contribution of each variable. However, F1 onset is the acoustically strongest effect and may therefore play a decisive role. The contribu tion of a possible spectral difference in the preceding /s/ should also be evaluated. On the other hand, it seems rather unlikely that these small differences would play more than a very subordinate role as a perceptual criterion in a natural speech context. This can be convincingly demonstrated by manipulating for example the utterance " ... sa att dom Uit ju det dar ga da ... " (so that they let it go on, then). listen to In an informal test we have asked our colleagues to a) the authentic version of this utterance and to b) a version where the conjunction sa att Iso att/ had been replaced with the verb form satt /satt/. The latter satt does not make sense in this context. In consequence, both versions are unanimously interpreted as containing sa att. In surrunary then, small effects such as the one demonstrated here are clearly not perceptually decisive in a natural speech context. Such effects may, however, be of interest in certain other respects. First, the example suggests that this speaker does in fact preserve traces of underlying segment structure even in a very strongly reduced form. Apparently then, this form is not phonologized in the sense that certain segments entering the phonological base form are completely absent from the utterance plan. We note, however, that the phonetic varia tion between the reduced and elaborated forms of sa att were discontinuous. Thus, we have a case where phonetic disconti nuity preserves phonological identity. A second point of interest is the following: Since the speaker is very close to completely eliminating a segment, for what reason does he nevertheless preserve it to some extent; i.e., why doesn't he phonologize the short form? This question may be discussed in terms of various constraints on phonetic speaker-listener interaction. Thus, on the one hand it can be assumed that speakers, in accordance with Lindblom's view of adaptive variability (Lindblom 1987), tend to shape the speech wave so as to achieve a good balance between articulatory effort and perceptual clarity. If this balance were optimal, however, the vowel corresponding to /0/ in sa att would be reduced completely since it is highly predictable in context and therefore semantically insignificant to the listener. Reasonably, however, an optimal adaptation would in itself re quire a fair amount of data processing. Thus, rather than fine-tuning the level of reduction/elaboration of segments according to their predictability, it would seem to be a more convenient speaker strategy to manipUlate the basic motor scheme by means of a somewhat more coarse-grained adaptation - 52 - speaker would sim particular, the of gestural amplitude. In (hypo) or a strong weak aiming at either a task by plify his even of traces phonetic with possibly variety, (hyper) underlying segments preserved at both perceptually negligible levels. As a third possibility, it may be hypothesized that complete segment reduction is marked stylistically, so that traces of underlying segments will remain and be perceived even though their strictly semantic value is insignificant. The above experiment demonstrates a case where listeners are able to hear a rather subtle variation. The difficulty of dealing with it from a semantic-perceptual point of view suggests the need for adding the phonostylistic dimension to the issue of seg mental invariance in spontaneous speech. Acknowledgement This work was supported in part by the Swedish National Board for Technical Development (STUF) and The Bank of Sweden Ter centenary Foundation (RJ). Reference Engstrand, O. and D. Krull. 1988. On the systematicity of phonetic variation in spontaneous speech. PERILUS VI II (this volume) . Lindblom, B. 1987. Adaptive variability and absolute constancy in speech signals: two themes in the quest for phonetic in variance. Proc. XIth ICPhS, Tallinn, Estonia, Aug. 1987, Vol. 3 , pp. 9-18. Also in PERILUS V, pp. 2-20. - 53 - VARIATION AND INVARIANCE IN THE Cl1ARACI'ERISTIC PARALINGUISTIC �ES OF VCWELS. Hartmut TraunmUller ABSTRACT . It is affect shown that within speaker variations in vocal effort and F0 phonation and the formant frequencies of vowels in the sense of a compression/expansion of the distances between them, scaling of pitch (tonotopic or logaritlmic). given an linear adequate Between speaker variations in size correspond to a translation of the spectral peaks shaped by FO and the formants if pitch is scaled tonotopically (in Bark) . adult waren tend, however, Vowels produced to be phonetically more explicite and, by hence, more peripheral in "vowel space" than those of men and children. rnmOOOCTION A main line of research within experimental phonetics is concerned with the variation for of the acoustic properties of speech sounds and with the generally listeners. search valid acoustic correlates to phonetiC distinctions made There are several sources of variation in the acoustic ties of speech signals. by proper Sane kinds of variation involve audible variations in phonetiC quality between utterances with the same meaning, such as would be reflected in a phonetiC transcription. (dialectal, signal femt signal, sociolectal, These can be called "linguistic" idiolectal) variations. The conditions of speech transmission between speaker and listener are the source of a kind of variation which affects the acoustic properties of a dif speech but which within wide limits does not affect its phonetiC quality. Another dichotomy is that between, text (coarticulation, on the one hand, variation due to con syntax) or context effects on the perceived phonetic quality of segnents and, on the other hand, variation due to factors other than the linguistic context. In this paper we are only concerned with the latter type of variations - 54 - in the spectral properties of vowels, sex, enotion, and audibility intention of the speaker . These or attitude, personal factors such as due to variations in size, age, affect the production of the speech signal in various ways, and the resulting variation carries biologically important tion, either characteristic or "symptanatic" of the speaker . As listeners, we are man, When we hear j ust a sound we hear imnediately whether it has been produced by a highly aware of paralinguistic information . single vowel informa a wanan, or a child . We perceive the degree of vocal effort used to produce it, and if the speaker is enotionally affected, we can hear whether he is cheerful, grieved, angry, or afraid . For this, it is not necessary to have perceived paralinguistic the phonetic quality of the vowel . camunication, which The systen shares essential features equivalent systens of several other species, of human with the appears to have priority over the facility of linguistic communication specific to humans . While the the acoustic consequences of the age- and sex-related variations shape and structures, overall size of the vocal are communication . quite substantial, The expression expression of phonetic quality, analyse the frequencies of their versus tract, including the of enotions does not either except in extrene cases . restrict We are going of vowels without interfering essentially with the whispered speech) . glottal this does not appear to hinder speech effects of those types of variation which affect the phonetiC quality: in Vocal effort and mode of the to formant expression phonation (voiced Variation of this kind is used by speakers in order to adapt to various denands set by environmental circums tances: We shout in order to be heard by an addressee and we whisper in order not to be heard by others . Moreover, this kind of variation is also essentially involved in the expression of enotions and attitudes . Peterson are and Barney [195 2 ] observed that the formant frequencies of vowels reproduced with high precision by a speaker repeating the same utter ance, while there is much more variation between speakers, in particular if they differ in age or sex . Their set of data includes, considerable linguistic variation in phonetic quality . "normalize" phonetiC however, Also in attempts to formant frequency data on vowels a strict distinction and non-phonetic variation was not always also made [e . g. between Disner, 1980] . For sane restricted applications, e . g . in autanatic speech recogni tion, such a distinction may not be as essential as it is if we aim at an - 55 - understanding Barney's human observation fonnant were of speech perception . most SUbsequent studies a�g at an to Peterson understanding frequency variation from a viewpoint of production of or and vowel perception concerned with inter-speaker variation [Ladefoged and Broadbent 1957, Fant 1966, E\lj isaki and Kawashima, 1968, Pols et al 1969, Nordstran 1975, Kent and Forner 1979, Goldstein 1980, Ananthapadmanabha 1984, Bladon 1977, et ale 1984, TraunmUller 1984, Syrdal 1985, Syrdal et al 1986] . Phoneticians fonnant that appear frequencies individual to have overlooked the intra-speaker variations that should be expected as a consequence of the speakers are capable of producing the same vowels in fact with quite different articulations: It is generally known that we open our rrouth more when shouting than when speaking softly . The articulatory, acoustic and perceptual consequences of variations in vocal effort have only recent ly been subjected to thorough experimental investigation [Rostolland, 1982; 1988] . Schulman, considered such as speech in The effects of variations in vocal effort have not treatises on theories of speech production "action theory" [Fowler et al, and 1980] and "the motor perception" [Libe nnan and Mattingly 1985], been perception theory to which these of effects appear to be crucially relevant . WITHIN-SPEAKER VARIATION The spectral variations we find within speakers concern mainly mode of phonation, intonation, and vocal effort . Studies concerned with the expression of erotions [Williams and Stevens, 1972 ] focused on the effects of real or imitated frequency, erotions on fundamental signal intensity, and on segment durations . Average FO' range of FO-variation, signal intensity, and segment durations have all been shown to vary as a function of the erotio nal state of the speaker . As for variations due to the attidude of the speaker, there has been sane research concerned with "baby-talk", the speech of adults directed to young children . Garnica [1977] reported, i. a., an increased mean FO' an in creased Fo-range (expressed in senitones), and frequent appearance of parts of sentences being whispered . - 56 - We are going to include Fa arrong the as motivated below, vowels, "characteristic rut concerning their phonetic quality, not informative unless the formant frequencies, of Fa is in particular F l' also are Williams and stevens [1972] observed an effect on Fl in addition to known. Fa: frequencies" F1 is higher in stressed vowels in utterances produced under anger, as canpared with the same utterances produced withaut erotion . They p.1blished, however, no quantitative data on this effect . As for attirudinal varia tions, I am not aware of any formant frequency measurements . The formant frequencies of vowels produced with high vocal effort (shouted vowels) are difficult to measure because the shouted voice is characterized very high F0 (see Table 1) and, hence, by large gaps between the harmonics sampling the spectral envelope . Published data are therefore by a scanty in the literarure . Fr!ZSkjesr-Jensen [1966] canpared Fl to F4 in shou ted Danish vowels with those in the corresponding vowels spoken by 20 male and fenale measured . to F4) only F1 and F2 appear to have been acrually Rostolland and Parant [1974] give median formant frequencies (F1 speakers, of rut vowels in disyllabic words produced by three male and three fenale speakers (see Table 1) . The authors consider these values as "debat able" and give no details on the method of formant frequency These data show, however, measurement . quite dramatic increases in Fa (Further details on this in Rostolland [1982]) and in Fl with an increase in vocal effort . Table spoken 1: Characteristic frequencies of vowels in French and shouted by three male and three fenale speakers, Rostolland and Parant [1974] . /CVCV/-words, according Reliability of data on Fl to F4 in is in question . Characteristic frequency mean speaking Shouting Shouting/Speaking FO 200 437 2.2 median Fl 400 750 1.9 median F2 median F 3 1 500 1 600 1.1 2 250 2 3 00 1.0 median F4 3 750 3 100 0.8 - 57 - to shouting [1985] Schulman vowels measurerl formant frequencies (F1 to F 3 ) 10 of spoken and shouterl by three male speakers (unp..1blisherl, Swerlish except for F1 and F2 of one speaker, userl in a figure) . The differences found between. the two speech conditions are similar to those reporterl by Rostolland and Parant [1974], rut the increase in F1 is not quite as large . Zhang [1987] measurerl F1 and F2 of [i], [a], and [u] in Chinese rronosyllables uttererl by six male and six fenale speakers at three levels of vocal effort (to obtain sound levels of 55, 65, and 75 dB) . Both formants and FO showerl a consis tent increase in frequency with an increase in level . [1988] Schulman utterances strictly spoken by describes mandib..la l r and labial four linear Swedes, relationship and in shouterl vowels . speaking and shouting . of He reports between. vowel specific j aw ones . [ibVb] a Jaw depression was increaserl by a in speaker with the Three of the four subj ects producerl the shouterl close vowels [i Y 1:i u] with the same degree of j aw depression as the spoken open [ffi quite depression factor of 1 . 6 to 3 . 1 in the shouterl vowels as canparerl specific spoken articulation vowels a 1:)] . If the tounge shape of a given vowel is unchangerl in relation to jaw, this would imply a correspondingly increaserl value of F1 in the vowels [cf . Lindblan and Sundberg, 1971] . Although no data on the shouterl tongue shape shouterl vowels have been. p..1blisherl, the acoustic shape as seen. in relation to the j aw . which As known subjects producerl a vowel with fran experiments in frequencies of a spoken [i] with the jaw being lockerl in a adequate data 1985] are canpatible with the hypothesis of a grossly invariant [Schulman, tongue in the for the production of an open vowel [Gay et al, bite-block characteristic position 1977], more speakers are capable of canpensating for a low j aw position by adopting a different tongue shape . The acoustic data on shouterl vowels do not show any canpensa tion like this . Since j aw depression is highly correlaterl with F1 we should expect an increase in F1 with an increase in vocal effort, for any vowel . This agrees with the p..1blisherl data, as can be seen. in Figure 1 . Only the Danish [a] of male speakers appears increases more in This showerl Hz to be an exception . close that rut less in % for open vowels than for closerl does not hold strictly for each speaker . a Figure 1 also shows to constant increase in Hz, constant percentage increase in F1 . - 58 - Two of Schulman's while one showerl F1 vowels. speakers a roughly 1200 Figure 1: shouted 1000 N :::c BOO <II o+J ;j 0 .t: GOO "- '-' -0 '" l.L � F1 in the same vowels ( vertically) and pro duced with rroderate vocal effort (horizontally) . Mean vowels by five male 400 produced and Danish, 200 / o / / / / MalE' (D) FE'malE' (0) MalE' (5) ten [J 0 and vowels spokE'n adult speakers of produced by three Swedish. Fre quency data fran B. FnzSkjrer Jensen [1966] and R. Schulman [1985] . for a given speaker there is a close correlation between F1 and While depression, vocal i. tract, openness: e., this Vowels of individual data of male speakers of DB ................................... - .............. . Fl female data jaw between F1 and the physical degree of openness of the does not provide us with a valid measure heard as identical in phonetic quality of have phonetic different degrees of jaw depression and, consequently, different F1s depending on the vocal effort with which they are produced . Listeners as well as lip readers appear to encounter the problem of segregating effects of vocal effort fran cues to phonetiC openness. even We know that listeners find a solution being aware of any problem, phonetiC openness rut if F1 in a vowel depends on both its and on vocal effort, phonetiC openness in vowels? without on which basis do they perceive This is particularly interesting since beside F1 there appears to be no other major acoustic cue to phonetiC openness. Sane insight into the perceptual process of segregating effects effort fran phonetiC vowel openness has been gained fran a few experiments [Traunrrill l. er, 1981: were later repeated vocal perceptual Exp. 2-4] intended to test the generality of findings obtained with synthetic one-formant vowels . which of with closer to These experiments, natural synthetic vowels [TraunrrillJ.er, 1985] demonstrated very convincingly that ( a) F1 is the major cue to the perceived degree of openness in vowels; (b) within wide margins, an F1 increase can be offset by a concanit tant Fa increase in order to keep invariant the phonetiC quality, in parti- - 59 - cular the perceived phonetic opermess , of a vowel ; (c) while invariant , this kind of manip..llation is apt to vowel quality it does affect the perceived degree of vocal effort : The latter increases with increasing F0 According keep & F1 if the upper formants are not touched. to ( b ) , knowledge of F0 is required in order to decide upon the degree of opermess of a phonated vowel . Consequently , we have to include F0 among the characteristic frequencies of vowels . Another motive for this can be seen in the intrinsic pitch differences between vowels , nega tively pitch correlated with Fl . FO being At least in stressed syllables , these There are a differences are clearly above the threshold of discrimination . variety of proposals to explain intrinsic pitch by sane kind of acoustical , aerodynamical , or physiological [Ohala and Eukel , 1987 ; interaction between tongue to larynx Rossi and Autesserre , 1981 ] . Whatever the primary conditioning factor for these pitch differences may be, appear and be maintained intentionally . in sane cases they The existence of intrinsic pitch differences in oesophageal speech [ Gandour and Weinberg , 1 9 80 ] may possibly require such an explanation. The clearly increased F0 in the close German as canpared with the vowels of Eastern Central vowels of German Viennese may also be interpreted in this way [ Iivonen , overall higher F0 in Viennese German tells us , also admit other explanations . '" N ::I: 0 0 ..... <II > 0 � (/) <II s: 0 +' ..... E <II (f) 'V o u.. Bf- I I I I : IJ &1- I I IJ I ......... IJ : 2- : m : : - : - . . .. . .... ... : - IJ ID however , Figure 2: Intrinsic vowels in standard Halle : 2. The that the data may pitch ( FO ) German as ( circles ) and in Vienna of the spoken in ( squares ) , scaled logarithnically , shown as a function .. o see Figure - - m o 0- I : rmDD 4 - -2 - I 1 9 87 ] , 1 2 3 4 Zl 5 <Bark) b 7 B - of Fl ' Open scaled symbols tonotopically short , ( abscissa ) . filled symbols vowels. Data from A. Iivonen [1987 ] . - 60 - long While it had been observed previously that FO affects the perceived tic quality 1953 ; of synthetic vowels with given formant Slawson limits 1968 ] , frequencies TraunmUlle r [1981 ] suggested that , phone [Miller , within certain and with certain minor rrro ifications , perceived openness in vowels is given by the tonotopic distance between F1 and F0 ' measured in Zwicker ' s [1960 ] critical band units ( Bark ) . If this is so , differences in openness might be cued by FO as efficiently as by Fl . not easily vary FO as much as Fl ( in Bark ) , However , a given speaker can and FO is needed to express other linguistic and paralinguistic features . As for variations in vocal effort , the data by Rostolland and Parant [1974 ] appear to be compatible with the hypothesis of an invariant tonotopic between F1 and F0 . The values fran Table 1 result in Dl 2. 3 Bark ( speaking ) and Dl = 2 . 8 Bark ( shouting ) . While Dl appears to be a distance , "Dl" = little too high in shouting , the discrepancy is actually smaller than 0. 5 Bark ( The calculation should , appropriately , be based on critical band rate values 75 instead of frequencies ) . dB ) Zhang syllables ) For a 20 dB increase in level ( fran 55 to an increase in Fl of 70-110 Hz in closed vowels and and about 200 Hz in [ a ] . other syllables ) agree with our hypothesis while that for [ a ] is s anewhat without For the Hz [1987 ] reports an increase in FO of 50 - 110 The values for ( in sane ( in sane closed too large , access to the complete data no reliable conclusions can be conversion of frequency f to critical band rate z, [fran Traunrrill l er , 1 9 8 3 ] will be used in this paper . vowels rut drawn. Equation (1) Within the frequency range fran 0 . 2 to 6 . 8 kHz , the values calculated with this equation deviate less than ± 0 . 05 Bark fran the values listed by Zwicker [1960 ] z = + 26 . 81 / ( 1 1960 / f ) - 0 . 5 3 (1) with f in Hz and z in Bark . The distance Dl in the ten shouted Swedish smaller by 0 . 6 Bark , between vowels is , vowel should , the equivalent in the mean, vowels [Schulman , 1 9 85 ] is than in the spoken versions . The spread however , quite large . The phonetic quality of a shouted of course , not be expected to be exactly the same as that of spoken one . In the shouted vowels we would rather expect sane additional stochastic deviation fran the ideal quality intended by the speaker. - 61 - In SLn1lT1a ry , the data shCM D1 in the shouted vowels to be a little too high and Parant , 1 9 74 ] , or a little too lCM [ Schulman , 1985 ] . [Rostolland Fr!DkjCEr-Jensen ' s [ 1 9 6 6 ] rreasurements can not be evaluated in this respect , because data on Fa were not included. on the whole , the rreager and not very reliable data which we have on shouted vCMelS agree , in approximation , with the findings fran the perceptual experiments and with the hypothesis of an invariant distance between F1 and Fa. The observation of an increased Fa and F1 in angry stressed vCMelS [Williams and Stevens , 1 9 7 2 ] agrees qualitatively . The data [ Fr!DkjCEr-Jensen , 1 9 7 4 ; Schulman , 1 9 85 ; Zhang , tonotopic 1987 ] also indicate a marginally significant positive correlation between AD 1 and D1 . Figure 3 shCMS Az as a function of z for Fa ' F1 , F2 , and F 3 in Schulman ' s data. The figure shCMS an overall negative correlation between 4Z and z for all these characteristic frequencies considered without discrimination . It can be seen that F2 is increased in the shouted versions , in particular in the back rounded vCMelS , in which F2 is lCM . F 3 appears not to be affected consistently by a change in vocal effort . the same slope. kind Those The data of Fr0kj CEr-Jensen shCM of correlation ( for F1 and F2 ) , with a s anewhat steeper of Rostolland and Parant ( only rrean or median values ) also agree qualitatively . . ..... � -0 o E Figure 3 o lID 2 o § 0 0 00 00 o o ............... ...... a. ...., o f co u FO o 0 0 o 00 o .. . ... .. 0 · llJ riB o .. ·0·0 to moderate vocal effort ; data each on three Olo adult Swedish : plotted o ............. .......... -0 ............. . o Critical band rates rate male Differences produced speakers against values fran data fran by of ( vertically ) version ( horizontally ) . quency F3 CB-rotE', vCMel individual R. mode Fre Schulman mociE'Y'atE' E'ffort While the perceptual experiments with synthetic vCMels [Traunml1lle r , were of Fa ' F1 , F2 , and F 3 of the same ten vCMels shouted and produced with o -1 3: not designed to investigate variations in vocal effort , - 62 - they 1 9 81] dernon- that the tonotopic distance D2 ( between F1 and F2 ) or that between F1 and the group of higher formants , is largely irrelevant to the percep tion of those vowels in which D2 is relatively large . This was also ob strated served by zhukov and Chistovich [ 19 85 ] in identifications of noise two-formant vowels . It is at variance with Chiba and Kaj iyarna ' s [ 1 941 ] space-pattern theory . the irrelevancy of the tonotopic distance between F2 and F1 in is no longer unexpected . This distance will decrease with phonetic front vowels After consideration of variations in vocal excited effort , F1 increases roo re than F2 ) . There cannot constitute an invariant cue to any aspect of the phonetic increasing vocal effort ( tonotopically , fore , it quality of these vowels . TraunmUller and Lacerda [ 1 987 ] investigated swedes and Turks , cally . In the perception , mainly by of synthetic two-formant vowels in which F1 was kept closely above FO ' while both together as well as F2 ' were varied systenati accordance with the hypothesis of D1 being the cue to phonetic roos t subj ects heard close vowels , no matter how high F1 was . The openness , other investigated aspect was the distinction between the four close vowels whether this distinction was based on F2 ' alone or on F2 ' in relation to F0 and Fl . The results showed that the phoneme bounda ries between the close vowels could be described quite accurately by the in these languages , equation I2 ' = (2) 15. 3 - (Z O + 3.2) where 1 2 , is a constant specific to a particular phonetic boundary while Z o and Z 2 ' are the critical band rates ( in Bark ) of F0 and F2 ' . Equation ( 2 ) rerroves the effects of uniform spectral canpressions with respect to the distance between an upper reference point at 15. 3 Bark ( = 2 . 8 kHz) and a if F2 ' was low , it was considered in relation to F1 and FO ' but at F2 ' around 2. 8 kHz the phonetic boundaries were given by F2 ' alone . At about the same critical band rate lower one ( 15 . 3 Bark ) the regression line in Figure 3 can be seen to cross the zero line . This suggests that Equation ( 2 ) might apply also to these data , that we located at 3 . 2 Bark above Z O . might Thus , extend its application to any spectral quality such as F1 and the higher formants . - 63 - cues to and phonetic An 1 1 might then be calculated by simply substituting Zl for Z2 ' in Equation ( 2 ) . The assumption that Dl is the maj or cue to openness in vowels agrees closely , but not precisely , with the generalized Equation ( 2 ) . dered open If the lower reference point is consi to be located at 3 . 2 Bark above Z o the agreement is exact for half vowels with Dl = 3 . 2 Bark ( II = 0 ) . If applied to the formants above 2. 8 kHz , Equation ( 2 ) would predict slightly lowered formant frequencies in shouted vowels . This agrees qualitatively with the lowered F4 reported by Rostolland and Parant [ 1 974 ] , but the reported value is too low and probab ly not reliable. . � o .c P- Figure 3 Men 'IIomE?Y\ I -0 a. Sa. PU> .c � v . .... '+- -0 a. "'"' o r en u 2 [J . 1 _ [J D--().. o 0 _ -- . - - . . • . . . EI . . 8 male ( squares ) and 15 adult fenale ( circles ) speakers of Ameri can 0- '1,% . · · - - . - - - - - - - - - - - English Differences 0. ( group mean data ) : ( vertically ) plotted against phonated version -1 ( horizon tally ) . Formant frequency data fran o Fl i.o Kallail 5 10 F3 CB-ro.i.e, and whispered K. 'Phonai.e>d Einanuel [ 1 9 84a , J. Kallail and F. 1984b ] measured the formant Emanuel frequencies They observed increased Fl and F2 in the whispered vowels . At least in part , formant frequencies are due to the open glottis in 197 2 ; these whispering in order to make then sufficiently audible. Kallail and Emanuel do not inform us about the the formant frequencies in the two modes of between in the [cf. Fuj inura and Lindquist , 1 9 7 1 ] . In addition , the whispered vowels , canpared values increased may have been articulated with more vocal effort than the vowels in vowels produced by men and by waren and canpared then with those of Flanagan , W. [ 19 84a ] and [ 19 84b ] . measured in the same vowels when phonated . about of F2 , and F 3 of the same five vowels whispered and phonated by 15 adult -'EJ Critical band rates F1 , 0 o : � 4: Figure 4 , for both male and fenale phonated The investigations by articulations , but production . These speakers . The formant frequencies in whispered and phonated vowels strikingly similar to that between shouted and spoken vowels , only are relation appears i. e . , whis pered vowels appear to have the same formant frequencies as shouted vowels. - 64 - As canpared with phonated vowels , are the auditory spectra of whispered vowels canpressed just like those of shouted vowels . Whispered and shouted vowels are also Sat'leNhat similar in their deemphasis of the lower frequency range . In vowels roth cases Fl is relatively weak . While shouted and whispered are related to moderately phonated vowels by similar transfonna tions , there is no invariant relationship between any two of the three aspects of vowels : articulation , fonnant frequencies , and phonetic quality. In order to result in similar fonnant frequencies , the articulation of whispered vowels must be different fran that of shouted ones , while shouted vowels differ fran moderately phonated ones as well in articulation as in fonnant frequencies . The absence of a fund amental in the whispered vowels poses a problem : If the opermess of phonated vowels is judged on the basis of Fl in relation to F0 ' this is obviously excluded in whispered vowels . An equivalent F0 might perhaps be vowels or , sic estimated on the basis of other properties of the whispered rrore confidently , on the basis of a voiced context . The intrin perceptual cues for opermess appear to be deficient in whispered vowels. As for changes in FO due to factors other than vocal effort , our knowledge about their articulatory consequences reflected in the fonnant frequencies of vowels is quite rudimentary . spoken with Bladon [ 1982 ] measured Fl in English words changing F0 by men and waren . He obseved that F1 was not affected as long as F1 remained arove roughly 3 . 5 F0 . otherwise , F1 in creased with F0 approximately in such a way as to retain an invariant ratio F1 /FO ' which implies an invariant distance between Fl and FO ' expressed not in Bark rut in a logarithmic measure of frequency , such as semitones . Sundberg [ 1975 ] investigated the effects of variations in FO on the fonnant frequencies of vowels in one soprano singer . He found that Fl is kept as long as it is clearly arove FO ' rut Fl is not allowed to fall below FO . This result may , however , not be typical of speech . In singing , as distinct fran the linguistic use of the organ of speech , the phonetiC constant quality of speech sounds may be allowed to be affected by changes in those variables as pitch and volume with which the singer is primarily concerned . The observations and interpretations of the relation between FO and Fl - 65 - by Sundberg [1975] , (S) , TraunmOller [ 19 81 ] , (T) , and Bladon [ 1982 ] , (B) , agree in sane aspects and disagree in others . For close vowels , S and T agree at any Fo. For the lowest F0 ' s S and T agree on that phonetiC quality is essentially independent of Fo. In roth these cases there is a disagree ment with B I S close vowels at low F0 ' s . For other vowels T and B agree quite well , although their data evoke different explanations . For half-open vowels at high FO ' S S disagrees sharply with B and only slightly less with T : S observed no increase in F1 untill F0 was close to it - while B ob served F1 to increase proportionally with FO . The discrepancy may be due to a conflict between the requirements of singing ( S ) and those of speaking ( B , T ) . In back rounded vowels , Sundberg reported F2 to increase proportio nally with F0 . Only if F1 also would increase with F0 ' this would agree with the hypothesis of invariant tonotopical distances between close spec tral peaks ( T ) . BETWEEN-SPEAKER VARIATION Contrary to the paralinguistic within-speaker variation , related between-speaker variation the age- and sex in the characteristic vowels attracted the attention of investigators long ago . was probably first vowels produced by The speaker vowels are , to actually measure higher resonance waren category [1915 ] frequencies in characteristic frequencies of a consequence of the size differences between organs of speech of different speakers . the Crandall of than in those produced by men . differences in the obviously , frequencies To the a first order of approximation , age- and sex-dependency of vowel formant frequencies can be understood as a consequence of a proportional up- or down-scaling of all three sions of the vocal tract . dimen A uniform rescaling of a vocal tract would leave the formant frequency ratios invariant . It has been observed , however , that the female/male formant frequency ratios deviate quite consistently from uniformity [Fant , 1975 ] . Such deviations lead us to the conclusion that the vocal tract proportional . ferences on [NOrdstrem, shapes in men and waren producing the same vowels are not Attanpts to calculate the female-male formant frequency dif the basis of anatomical data were not entirely successful 1977; Goldstein , 1980; TraunmOlle r , 1 9 84 ] . While the differen ces in F3 and F2 could be predicted satisfactorily , there remained problems - 66 - Despite the large between-speaker differences :in vowel formant frequencies , the idea that perceived vowel quali ty essentially might be given absolute present formant day capability frequencies still sometimes appears to speech research. to be by the �licit :in This contrasts with our knowledge that make accurate absolute j udganents of pitch is only the found exceptionally , :in some people said to possess "absolute hear:ing" , while all normal hear:ing people are kncmn to be highly proficient :in j udg:ing relative pitch. This aspect is easily overseen , s:ince most acoustic data on speech sounds orig:inate fran adult speakers , :in particular fran men The measure ment of formant frequencies :in children ' s vowels is j ust as difficult as :in shouted vowels. be acceptable . Consider:ing only adult speakers , it may for some plrposes to say that the same vowels have roughly the same formant frequencies. Figure 5 , however , absolute demonstrates rather clearly that this can no longer be said if we canpare the vowels of adult men with those of k:indergarten children. mants of The vowel spaces circumscribed by the mean for the five Japanese vowels of the two groups of speakers not only fail to co:incide but they do not even overlap. 20 , . . . .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. . Figure Tonotopically scaled F1 ( horizontally ) and F2 ( vertically ) .. .. . , :in 15 the 5: five Japanese vowels duced by children aged 4 to 5 years and by adult male N l.I.. area 10 speakers. shcmn by l:ine connect:ing them. Mean :in : . .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. . . Fl The 1970 ] Vowel circumscribed by these vowels values obta:ined fran five 5 pro data :in ( Bo.r k ) 15 each group. data [ 1970 ] . fran H. Ebrmant FUj isaki et ale 19 87. ] [ FUj isaki the vowels fran six physiologically different speakers were analysed. frequency [ Adapted fran TraunrnUller , used for Figure 5 are fran an :investigation which speakers There were five speakers :in each of the - 67 - et groups al. , of follow:ing male adults , fanale adults , girls (12-14 a ) , boys after maturation groups : of voice ( 12-14 a ) , garten boys before maturation of voice ( 1 2-14 a ) , and kinder The relations between the formant frequencies of children ( 4-5 a ) . the same vowels produced by the different groups of speakers can be means ciated by appre of the Figures 6a-c . Each figure shows , for sane particular groups of speakers , the deviation of the mean log ( f ) of each characteristic peak of each vowel fran an arbitrary standard value . The standard value was defined three as the mean value fran all speaker groups with the first mentioned groups given dubble weight . Figure 6a displays the vowels of men , wanen , and kindergarten children . The developnent in each sex can be appre ciated in Figures 6b and 6c . between adult described quite regression female men lines speakers uniformities and Figure 6b shows that the overall boys just after maturation of the well by a uniform scale factor ( 1 . 14 ) : difference voice The can respective The difference between adult male run in parallel . is less well described by a uniform scale be factor . and Non similar to those observed by Fant [ 19 7 5 ] are present also in these data . More importantly , however , Figures 6b and 6c make it clear that the diffe rences between adults ( including boys after maturation of voice ) and dren can not be described satisfactorily by a uniform scale respective regression lines do not run in parallel . chil factor : The The data are , however , described quite well by linear regression lines whose slope changes cons is tently scribes with maturation . the The power function expressed in Equation ( 3 ) relationship between the formant frequencies Fa different speaker groups according to those regression lines . and Fb de of (3) constants k1 and p are specific to each pair of speaker groups . The factor k 1 represents the scale factor for Fa = 1 frequency unit . For sane The the values of k1 and p are listed in Table 2 . Scale factors for FO optimal uniform scale factors for the formants are also included for pairs , and canparison . In order to give an idea of the consequences of the particular canbinations of k1 and p within the formant frequency range , scale factors at Fa = 300 and 3000 HZ are also listed . - 68 - the resulting \0 (J\ a, If) -9 -4 '....,e a, f;: o 0 4 f- 'r '" � a, > 0 '" ....,a 8 "C "C 12 CD m $I m DI IS J�--- uJDl DB • IS . 2 .5 6 10 (kHz) 5 . 2 .5 Cha.racteristic .1 0 $I frequency CD 0 b CD it . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . freoqueoncy . . . . . . . . . . . . . . . . . IS CharactE'ristic . . . ., I :II II $I Ch i Idr!?n Prop-pubP'!' ty ll. Pos t -pub!?!" ty [J Adul ts lID Logaritlrnically scaled Fa , F1 , F2 , and F 3 of the same five vowels of produced by physiologically different groups of speakers ( group mean data ) : Devia ( a to c ) : frequE'ncy 2 , • :II :It Ch i 1 d'!' t>n Ado 1 t> S C E'n ts Adul t.s waren, and kindergarten children ( age 4-5 years ) . kindergarten children , boys before maturation of voice ( age 12-14 maturation of voice ( age 12-14 years ) , and adult men . ( c) (b) kindergarten children , girls ( age 12-14 years ) , and adult waren . ( a) adult men , adult years ) , Regression line fitted to the formant data shown for each group . data fran H. Fuj isaki et al . [ 19 70 ] . defined in text . boys after Frequency tion f ran a standard value ( vertically ) plotted against actual value ( horizontally ) . Standard Japanese Figu re Cha.racteris tic .1 : ...,..................................................� ..: II !!I-' -1mJ! __ f • :II �IS Child'l"l?n Wom!?n Mpn " o (/) ....,8 a "C "C Cl.. f;: a, o s a, 100 ..., 1 50 200 Table 2 : Scale factors for FO ( kFO ) and for undiscriminated formants ( Fl to F 3 ) corresponding to regression lines in Figure 6 . Best uniform scale ( k ) and factors at 300 Hz, factor with F in Hz ). Last column : 3000 Hz , and 1 Hz ( kl in Equation ( 3 ) Exponent p in Equation ( 3 ) . Between speaker canparison based on Japanese data [ E\..lj isaki et al , six European languages [ Fant , Swedish [ Scirulman , 1975 ] . 1 9 70 ] , and on data fran Within speaker canparisons based on 1985 ] ( shouting ) , and on American English data [ Kallail and Emanuel , 1 9 84a , b ] ( whispering ) . p men boys , mat. , 12-14 a 1 . 29 1 . 14 1 . 14 1 . 13 1 . 14 0 . 999 men boys , :imnat. , 12-14 a 1 . 93 1 . 26 1 . 41 1 . 16 2 . 29 0 . 9 15 men children , 4-5 a 2 . 34 1 . 64 1 . 99 1 . 42 4 . 55 0 . 855 women girls , 12-14 a 1 . 27 1 . 05 1 . 15 1 . 01 1 . 62 0 . 940 women children , 4-5 a 1 . 32 1 . 34 1 . 75 1 . 18 4 . 72 0 . 827 men women 1 . 77 1 . 22 1 . 18 1 . 25 1 . 02 1 . 026 men women 1 . 14 1 . 08 1 . 19 0 . 84 1 . 044 1 . 14 1 . 36 0 . 99 2 . 99 0 . 86 2 voiced whispered ( men ) 1 . 14 1 . 43 0 . 97 3 . 73 0 . 8 31 voiced whispered (women) 1 . 08 1 . 30 0 . 99 2 . 55 0 . 882 normal shouted [Fant , 1975 ] * 2 . 12 ( men ) Figure 7 shows that within-speaker variation in the formant frequencies of vowels can also be described quite well with Equation ( 3 ) . The values of kl and p for the data on shouted and whispered vowels which have been used above are also listed in Table 2 . While a pcMer function ( Equation 3 ) describes the relationships between the formant and/or frequencies of the same vowels produced with different kinds of phonation quite satisfactorily , ( normal , by different shouting , the calculated values would not , 70 - whispering ) in general , with the observed ones if Equation ( 3 ) was also applied to F0 . - speakers agree '-l (fl E '" II) '" c: o ..., � Q, > o II) .3 "1:S c: B "1:S 1 • 2 8 5 Fl SpokE"n o 0- to F3 a F0 ' -4. 0 4. 8. 12. .2 frE'qUE'ncy .5 Fl 2 to F3 5 10 Regression lines also shown ( dashed for wo Differences ( vertically ) plotted against values sarre b 100 150 200 vowels (kHz) 0 c Standard: Phonatlod vE'rsioY"l Formant .1 �- - -- ------ -- -- -'0 0 M�n 'r/omt'n F1 , F2 , Formant frequency data fran K . J. Kallail and F. W . Emanuel [ 19 84a ] and [ 1984b ] . male (b ) Whispering : and F 3 of the sarre five vowels whispered and phonated by 15 adult (squares ) and 15 adult fenale ( circles ) speakers of American English ( group mean data) . Frequency data fran R . Schulman [ 1 985 ] . ( a) Shouting : Fl' F2 ' and F 3 of the sarre ten vowels shouted and produced with m:.:x]erate vocal effort ; individual data on each vowel produced by three adult male speakers of Swedish: men) . (fl E CII ..., g II) '" � Q, > o II) .3 c: � "1:S Logaritimically scaled characteristic frequencies of the 10 (kHz) VE"rs ion 5 c: Q, 0 " Q, 0.. +I .... o +I '" a "1:S § "1:S with different types of phonation : ( a and b ) : 2 100 150 200 fran ordinary lab-speech version ( horizontally ) . produced 7 0 frE'quE'ncy • Standard: Fo r mant • 0 0 00 0 o o� u-� 00 rJDf:b § o - - - -- - - - - - - - - - - - - - - - - - - - - - Figure -4. 0 4. 8. 12. a § c: Q, 0 " <11 0.. .... .... o +I <h "1:S "1:S If we data want to utilize a power function to "normalize" on vowels produced by a single speaker , formant or in order frequency to recognize vowels autanatically , we encounter a problen : We have no a priori knowledge of the appropriate values of k 1 and p . Here , these constants have been derived on the basis of data on several sets of vowels whose phonetic quality was assumed to be identical , applicable if the phonetiC quality of the vowels is in question . that vowel quality is substantially influenced by Fo. further , k1 rut this procedure is not irrmed iately and/or the exponent p mISt, therefore , We know , The factor be regularly dependent on FO . Equation ( 3 ) does not describe this regularity . Since equivalence in phonetic quality is defined by the perceiver , be preferable to adopt a genuinely perceptual point of view in the relation between phonetiC quality and characteristic it may describing frequencies of vowels . We should account for the fact that listeners are able to decide on the phonetiC quality of a vowel sound even without prior expousure to other the vowels produced by the same speaker , and we would like to any describe phonetiC quality of vowels in terms of such parameters which have same value Peterson , whenever the phonetic quality of sane vowels is the same 1952 ] . Syrdal [ 19 85 ] , this goal . the The tonotopic approach followed by TraunmUlle r Syrdal and Gopal [ 1 9 86 ] , DiBenedetto [ 19 87 ] , [ cf . [ 19 81 ] , may lead to E\.lnd amental to this approach is the tonotopic representation of speech sound spectra , hypothesized that characteristic e. the g . , along the basilar membrane . Initially it was tonotopic distances between of a given vowel , might be invariant . the spectral peaks , In gross approxima this was found to hold between speakers . For small and medium inter tion , peak distances than that between F1 and F2 in vowelS ) the hypothesis was confirmed [ TraunmUller , 1981 ] . Syrdal ( smaller [ 19 85 ] distances and subsets tures . A In front particular , critical 3 . 5 Bark to dispartiate the set of American of inter-peak English vowels waren , and children ( data fran Peterson and Barney , [ 1 952 ] ) sPJken by men , into Syrdal and Gopal [ 1 986 ] observed non-open traditionally considered to share certain similar observation was made for the distinctive distinction between fea the vowel-like consonants [ 1 ] and [ r ] [ Syrdal , 1 9 82 ] . The tonotopic speakers , and with approach suggests the same vowels the same type of phonation , produced by to share an invariant pattern it suggests a s�le relation to hold between the different - 72 - different speakers . Accord:ing to this approach, ces the consequences of vocal tract size differen J:etween speakers can J:e descriJ:ed very simply by a uniform tonotopic translation of the spectral envelope of the auditory pattern of excitation . In spirit , and this agrees with the " space pattern theory " proposed by Kaj iyama [ 1941 ] , Chiba rut ncm the description of this pattern :includes not only the formant peaks rut , :in addition , also FO . Figure 8 ( a to c ) displays the same data as those used :in Figure 6 ( a to c ) rut ncm scaled :in Bark . Figure 8 can :in abscissa , Unlike the regression l:ines :in Figure 9 , J:e seen to follcm a course grossly :in parallel those :in with the agreement with the hypothesis of an :invariant tonotopic pat tern . Most of the regression l:ines shown :in Figure 8 fit FO as well as the formants although there are sane minor deviations : For k:indergarten chil dren , FO is J:elcm the regression l:ine and for adult waren it is above . The regression l:ines shown :in Figure 8 run , hcmever , not quite strictly :in parallel with the abscissa , and Equation ( 4 ) holds for their exact descrip tion , J:etween any speaker groups a and b . (4) In this equation , d O is the :intercept at Z a the regresSion l:ine :in a plot of Zb vs Z a . = 0 Bark , and c is the slope of Values for d O and c are listed :in Table 3 . Equation (4) phonated and shouted or whispered vcmels . and can also J:e used to descriJ:e the relation c are also listed :in Table 3 . t:inctly J:elcm 1 . 0 . fran In these cases , the slopes c are In J:etween-group canparisons among consideration , normally The correspond:ing values of this slope does not deviate rrore than ± 0 . 0 3 fran 1 . excluded J:etween dis speakers hcmever , FO were as is the case :in Fant I s set of data , the deviation would J:e larger :in sane instances . - 73 - If, Japanese dO .j>. -....j > aD +' 0 0 � J: S 0 <h � +' -0 B -0 CO \../ B :>l. -2 -1 0 2 3 4 D • " • CD • m • .. CP • a "'i a . . (BaY'k> m m ... 0 ...... . • o a ......, _ . . . .. ... .. ...l.CP a • u: II : o 00 CD FO :. ...� : • It a .� • .. CD i a CD • o o :I • a (BaY' k > . CD m .. �fo.. It a 'i a Ch! ld'l'pn AdolE>sc(>nts Adul ts to F3 CB-Y' Q teo i' • CD 0 It b: ......, I FO IDIIJ C • a a :I C SI • . . . C .. .. tJ. a • It ce m m • II uP m F3 CB-Y' 0 tE' D c in m milb • kindergarten children, boys before maturation of voice ( age 12-14 years ) , maturation of voice ( age 12-14 years ) , and adult men . ( c) wanen. and kindergarten children ( age 4-5 years ) . (b ) kindergarten children, girls ( age 12-14 years ) , and adult wanen, defined boys after E\1j isaki et al. Standard • • .... c: *' ChildrE>n PrE'-pubE'r ty tJ. Pos t-pubE>r ty c Adul ts m �. tJ. . . � .. �8 C Formant frequency data fran H . ( vertically ) plotted against actual value ( horizontally ) . Regression line shown for each group. value ( a) adult men , adult [1970] text. dard 8 ( a to C ) : to F3 CB-Y' 0 teo 5 II l1li 0° 0 • . .. . . Critical band rates of Fa , F1 , F2 , and F 3 of the same five vONels of Japanese produced by different groups of speakers ( group rrean data ) : Deviation fran a stan FO Figure o ;... ...g . G) CD IDOl • • a *' :I *' .. . . . Ch ildr�n 'Wom(>n M�n .. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . - _ Table 3 : Tonotopic translation dO of FO and of undiscriminated formants ( Fl to F 3 ) . Mean translation of FO and translation corresponding to regression lines at 300 Hz, in Figure 8 at 0 Bark ( intercept in �ation ( 4 ) ) , mean uniform translation , and translation at 3000 Hz in Bark . Last colurm : Slope c in �ation ( 4 ) . Same data as in Table 2 . c men boys , mat . , 12-14 a 0 . 45 0 . 43 0 . 52 0 . 66 0 . 90 1 . 0 30 boys , �at . , 12-14 a 1 . 44 1 . 50 1 . 42 1 . 31 1 . 11 0 . 976 men children , 4-5 a 2 . 02 2 . 52 2 . 55 2 . 60 2 . 69 1 . 0 11 women girls , 12-14 a 0 . 08 0 . 36 0 . 31 0 . 22 0 . 11 0 . 9 84 women children , 4-5 a 0 . 82 1 . 66 1 . 58 1 . 44 1 . 27 0 . 97 6 men women 1 . 20 0 . 94 1 . 02 1 . 16 1 . 39 1 . 0 29 men women 0 . 12 0 . 33 0 . 79 1 . 20 1 . 069 1 . 56 1 . 26 0 . 62 0 . 02 0 . 902 voiced whispered (men) 1 . 78 1 . 42 0 . 62 -0 . 07 0 . 882 voiced whispered ( women ) 1 . 40 1 . 13 0 . 40 0 . 01 0 . 9 12 men [ Fant , 1975 ] normal shouted ( men ) * 1 . 72 INVARIANT OJES According to the power-function approach, scribed by �ation ( 3 ) , the same kind of appears to hold for both relation , between- and de within speaker variation . According to the tonotopic approach, these relations are fundamentally different in character : between speaker variation is cribed by a tonotopic translation while the kinds of within speaker tion des varia discussed above are described by a tonotopic compression or expansion of the auditory pattern of excitation . on the basis of the latter approach , we can see several analogies between auditory perception of phonetic quali ty and visual perception of shape , of distance e. g . , of graphic symbols . The effects and orientation with respect to the line - 75 - of sight , on the Figure 9 : Tonotopic distance between F3 and F0 - a negative measure of " elevation" , 11.0 plotted against critical band rate of F 3 a ( negative ) correlate to speaker size . o N 13 . 0 Mean speakers of Japanese : I ('f) 1 4 . 0 Speaker groups , sym ools , and data source as aoove . Data points N 1 5. 0 connected in sense of ontogenetic 19. 0 17.0 1&. 0 have ment 1 5. 0 ( Bar k ) Z3 We values plotted for each group of five seen in each sex . develop Data on Swedish spoken ( N ) and shouted ( S ) vowels also shown . that Equation ( 2 ) neutralizes spectral canpressions with respect to the distance between a reference point located at 15 . 3 Bark and another one at 3 . 2 Bark aoove Z O ' If we interpret the reference point at Bark ( Zh ) not as fixed in absolute terms , rut allow it to follow the 15 . 3 higher formants in natural vowels , which , without tions . Equation ( 5 ) describes the phonetiC pararreters involving and F2 we can write a more general further rrodification , proper . also neutralizes spectral The two reference points may be interpreted as equation , transla F2 ' , F1 ' expected average positions of Z I and of Z 3 ' respectively . Zn - ( Z O In - 3.2) (5) Zh - ( Z O With ZI ' aoove . the + + 3.2) we obtain I I ' a measure of vowel openness , already mentioned Within the two reference points , Equation ( 5 ) agrees exactly with assumption that paralinguistic spectral rrodifications be linear trans lations and/or canpressions of the auditory spectrum , as described by Equation ( 3 ) . The problem of between-speaker variation in the description of vowels often been treated fuj isaki and Nakamura , which can be with various "normalization" 1969 ; Wakita , 1977 ; pardigms Disner , [ Fant , has 1959 ; 1 9 80 ; Suani , 1984 ] , used to " correct " formant frequency data to conform to a standard vocal tract . Such a normalization can be imagined to take place in - 76 - visual pattern of excitation appear to be neutralized by application of an proj ective geanetry . inverse Similar processes appear to be effective in auditory perception of phonetic quality : The effects of variations in vocal effort are analogous to those of variations in tilt with respect to a plane orthogonal to the line of sight , both resulting in a canpression or expan sion of a spatial dimension of the proximal stimulus . The effects of variaTranslations , or rotations tions in vocal tract size have other analogies : within a plane orthogonal to the line of sight , all leaving the relative locations of any characteristic details unaffected . Since variations in vocal effort , intonation , and phonation do not appre ciably affect F 3 and the higher formant frequencies , these provide close to invariant correlates to speaker s ize . The tonotopic position of these formants vocal is , by tract . protrusion . dimension and large , only to a minor extent does this length vary also This constitutes there speakers quality , quality distinctive and involving the possible canpression of spectra . lip physical dis The present analysis of data has further is another variable of personal the with the marginal phonetiC use of this which mainly functions as a variable of personal tinctive between speakers . that given by the speaker specific length of shown within This variable can be expressed by the tonotopic distance between the higher formants and FO . I have tentatively used the term " elevation" [ TraunrnUller , 1987 ] designate this dimension of variation in order to cover the elevation to of the low frequency part of the spectrum irrespective of its appearance as a consequence of increased vocal effort or of whispering , or due to any other reason . 9 shows the mean Z 3 ( indicating tonotopic translation ) , correlated speaker size , and the mean distance Z 3 - Zo ( indicating tonotopic Figure with canpression ) , correlated with vocal effort or " elevation" in general , in the whole set of Japanese vowels as produced by different speaker groups . The figure shouted also shows the corresponding data for the normal and Swedish vowels . Although Figure 9 shows that " elevation" is also , marginal ly , involved reflect after in between-speaker variations , a psychological reality . maturation this dichotany is likely The speech of boys j ust before and of their voice may , e. g. , - 77 - just be perceived to differ " elevation" , or " shoutiness " , as suggested by Figure 9 . to in human speech perception . approach is , however , The solution proposed by the conceptually different . theory of direct perception [ Gibson , of present tonotopic It is more in line with a 1 9 6 6 ] and it involves the calculation psychoacoustic parameters In . Any phonetic II templates II of stored in our brains are asS\.ID1Erl to be free of transmittal and invariant vowels personal , paralinguistic quality , as is the case with the In . Subsequently , a nonnalization can only take place in the perception of phonetic quality if there are differences between the phonetic systems of speaker and liste ner . When own , rut which we are not very familiar with , we probably use serre kind of we listen to serrebody speaking a dialect which is close to our nonnalization or correspondence rules in order to understand . While we have seen that the fonnant frequency differences between speaker groups can be described by the power-function expressed in Equation ( 3 ) by or a tonotopic translation slightly modified by a tonotopic compression or expansion , none of these transfonnations captures the fanale-male differen ces , observed by Fant [ 1 975 ] and further analysed by Ananthabadmanabha [ 1984 ] and TraunmUller [ 19 84 ] , a discrepancy in detail , Nordstran [ 19 7 7 ] , completely . There remains which is of the same kind in the tonotopic ap proach as in the power-function approach . The female-male difference in log ( Fn ) and in Zn in Fant ' s [ 1975 ] set of data is plotted against the male values in Figures l O a and l O b , respective ly . Separate regression lines are shown for each fonnant in addition to an overall regression line . It can be seen that the fenale-male difference in Fl is highly correlated with Fl ( Significant at p < 0 . 01 ) . The same kind of interrelation holds for F2 . As for F 3 ' there is a barely significant ( p < 0 . 1 ) negative correlation . The slope of the regression lines for Fl as well for F2 is much steeper than that of the overall regression line . This makes female vowels stand out as clearly more peripheral than male vowels as in any of our approaches . There is only one potential vowel , in the central region scribed of the vowel space , exactly for which the fenale-male difference is by the overall regression lines in accordance with de Equa tions ( 3 ) or ( 4 ) . Table 4 lists the constants descriptive of the regression lines for Fl and F2 in the Figures l O a and lO b . - 78 - I.D -...J a. D.. S- � a. (.) "0 +I ;; � 0 8 -.:S -.:S 3. 0 � 1 20 1 00 I- .1 Men ( sp o k en ) fr E' q u E' n c )I S t an d ar d : Fo r mant .1 a � E o +I � '" a. � a. > o lL. a. 0 E a. ..... 0 E a. '-/ CD 5 ,... .::I- -1 . 0 1. 2• 0 U Ma l E' 5 10 f o r m an t C B -r a b ( B ar k > 15 20 b these languages . Separate regression lines fitted to the data for each formant ( whole ( b ) Formant frequencies expressed in a tonotopic measure ( Bark ) . ( a) Formant frequencies expressed in a logaritlmic measure such as sanitones . drawn lines ) . Overall regression line fitted to the three formants also shown ( dashed ) . of Figure 10 ( a and b ) : Fanale-male differences ( vertically ) plotted against male values of FI , and F 3 in vowels ( horizontaly ) . Data represent mean formant frequencies fran a sample of F2 , six languages [ Fant , 1 9 75 ] . The twelve vowels chosen are those represented in at least three -3. £. . 140 " . -.:S ..... '" Table 4: Frequency scale factors and exponents ( Equation 3 ) and band rate translation , of intercepts and slopes ( Equation 4 ) for female formant frequency values fran male data . each formant . critical calculation Separate analysis for Listed values based on formant frequencies observed in six European languages [ Fant , 1975 ] . F2 Fl uniform scale factor 1 . 11 6 1 . 140 scale factor kl at 1 HZ ( Equation 3 ) exponent p in Equation ( 3 ) 0 . 362 0 . 37 3 1 . 18 6 1 . 155 + uniform translation in Bark intercept dFO at 0 Bark ( Equation 4 ) slope c in Equation ( 4 ) + 0 . 50 7 F3 1 . 16 8 + 0 . 855 - 0 . 744 - 0 . 936 1 . 280 1 . 170 1 . 018 Using the constants listed in Tables 2 and 3 , the transformations described by Equations ( 3 ) and ( 4 ) can be implemented without identification of formants , the e . g . , by an autanatic analysis and a following resynthesis with transformed parameters , this whereby only F0 has to be rescaled separately . In case the spectra are treated holistically . This would be canpatible with non-formant approaches to the description of vowel spectra , cated by Bladon [ 19 82 ] . as advo Figures l O a and l O b show , however , that a separate treabnent of the formants F1 and F2 ' using the Equations ( 3 ) or ( 4 ) with the constants listed in Table 4 leads to a considerably more exact descrip tion of the female/male differences . In this case , distinct role and has to be identified by number , reliably with an autanatic procedure . and which is difficult to do Since the separate treatement of F2 allows a sufficiently exact calculation of female formant cies if male formant frequencies are known , with each formant plays its advantage , tics , to frequen this can , however , be utilized together with a description of voice source generate F1 characteris female speech by formant synthesis on the basis of a description of male speech . Should we interprete male vowels as pale versions of female vowels as exagerated versions of male vowels , - 80 - female vowels , or is our or analysis insufficient so far? In order to answe r this question , we would have check whether the discrepancy is conditioned by sane factor which is to reco verable fran the acoustic signal and not exclusively linked to speaker sex . perceived degree of opermess in vowels has been shown to be a function The of F1 and FO . If other factors are negligible , the female-male discrepancy in the acoustic data is likely to reflect a discrepancy in phonetic quality unless it can be shown to be conditioned by F0 ' regardless of speaker sex . As for Fl ' there are sane data which show that than peripheral wanen ' s vowels are more men ' s even if they are produced with the same FO [ Bladon 1982 : Figure 4 ] . At FO approx . 160 Hz, wanen ' s vowels have distinctly more extrene F1 s , as canpared with the sarne intended vowels produced by men . This discrepancy i s as large a s that in Fant ' s set of data . Thus , the wide female energes range in F1 can not be explained as a function of FO . This also clearly fran the Japanese data . In Figure 1 1 , the speaker group specific dispersion in Z 1 ( standard deviation between mean values for each vowel ) is plotted against Z o for each speaker group . This figure shows that speakers with an FO similar to that of wanen ( girls , and boys j ust before maturation of voice ) do not share the wide F1 range in that group , confir ming that it can not be explained as a function of F0 ' although the F1 range is similarly expanded in kindergarten children . '" 2 . 0 � e CD v 1.9 .... 1.8 1= o tI> s- , . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .. "" Po tI> Figure 1 1 : Dispersion in F1 between the mean Japanese vowels of six physiologically dif .- 1 . 7 -0 § E s- o .... -oJ tI> l- .... L1.. 1 . e. ferent of speakers , expressed as critical band rate standard deviation , ( ver 1.5 1.4 groups tically ) . . . . . . .. .. . . . . . _ _ . .. . . . . . . . . _ _ . . . . . . . . .. Symbols points F un d a ml? n t a l p i t c n ( Bar k ) plotted against FO ( horizontally ) . and source of data as above . Data connected in sense development in each sex . - 81 - of ontogenetic In Figure 12 , the dispersions of Z l ' Z2 ' and Z 3 ' are plotted against the mean Z 3 ' for each group . This figure shows a correlation between. Z 3 and the dispersion for each formant , rut the data on waren ' s vowels clearly stick. out fran the general trend , reasons tions to of not only in Z l . There are , thus , believe that the phonetic quality of waren ' s and men ' s the same vowel phonenes is not precisely the same . difference similar to that between. more and less explicit Such a difference in behavior might , or sociological factors . difference likely There is a pronounciations . in principle , be due to physiological to represent a "universal " of linguistic behavior , On to have a physiological cause . the other hand , speaker by speaker , it appears a more detailed revealed an especially discrepancy in the case of lui which the data suggest pronounced rendi Since the present study suggests this sex-related analysis of the Japanese data , been convincing to have produced with lip rounding by two of the five adult waren rut by none of the other 25 speakers . These cases are supportive of the argument that a sociological factor is also involved . � t (!) '" s: o If) S<II s:L If) . .... 3.0 2.5 2. 0 Figure between. -0 ! . 5 . ... <II .oJ d sI (!) U .5 � 0 8 E LL kers , different F3 of six groups of spea expressed as critical band rate stan critical Symbols 14 points 15 f o r man t C B -r a t � ( Bar k ) band rate of F3 ( horizontally ) . and source of data as connected in sense above . of Data ontogenetic development in each sex . difference similar to that between. waren and men has also been shown exist between. more and less stressed vowels [Koopnans - van Christov , the e. and dard deviation , ( vertically ) plotted against Th ir d A Dispersion in F1 , F2 , the mean Japanese vowels physiologically ! .0 .oJ 12 : 1987 ] . In this case there may , Beinum, to 1980 ; in addition , be a difference of kind we found between. vowels produced with more and less vocal effort , g. , in the angrily stressed vowels of Williams and Stevens Degree of peripheralness has also been shown to be a sufficient - 82 - [ 19 72 ] . perceptual cue to stress [Rietveld et al , 1 9 87 ] . Not considering the data for waren , Figure 1 2 shows a sizable increase in the range of Z 3 variation with increasing age . The between-vowel variation in Z 3 aroong kindergarten children is barely a third of that in adults . still younger children , below 2 years of age , have a relatively small range of variation in F2 [ Buhr , 1980 ; Bickley , 1 9 8 3 ] . The ability to use varia tions in F2 , reflecting a front-back rrovanent of the tongue appears to be aquired later than the ability to vary Fl ' reflecting j aw-rrovanent . As for variation in F3 , the present analysis appears to show that speakers do not attain full control of it before adolescence . It has been observed previously that the simple tonotopic approach is slightly in conflict with observational data insofar as a change in FO has alrros t negligible effects on the perceived openness of vowels as long as FO < Hz. 150 This has been interpreted as an " end of scale effect" It has been argued that an FO decrease below 150 Hz might not affect the tonotopic representation of spectra because of audito [ TraunmUller , ry 1981 ] . integration Lublinskaya question , over roughly 3 . 5 Bark , [ 197 9 ] . first reported by Chistovich This idea has subsequently been supported , and interpreted differently [ TraunmUller , 1981 , p1t 1982 ; and into Bladon , 1983 ; Schwarz and Escudier , 1 9 87 ] . Syrdal and Gopal [ 19 8 6 ] found the end of scale Barney ' s effect to be negligible for their analysis of Peterson and [ 1952 ] data . increased Now it is necessary to reexamine this hypothesis , peripheralness simply be done by , of waren ' s vowels also into actually , taking the account . This can leaving overall variations in peripheralness out of the account , i . e . , by considering the vowel space centroids of each speaker instead of the individual vowels . each To this end , the mean values of characteristic log ( f ) or z have been calculated for each of our spea kers of Japanese and they are plotted in Figures 13a and 1 3b as a function of log ( FO ) and Z O ' respectively . Figure 1 3 shows that in none of our approaches do the formant increase frequencies strictly linearly with FO . In the power-function approach, we would not even expect this , rut in the tonotopic approach , the discrepancy , quite clearly visible in the case of Fl , needs an explanation . In part , this discrepancy is in line with the previous observation that vowel quali ty is not greatly affected by F0 if it is within ( or below ) - 83 - the typical adult male Fo -range . Figure 1 3b shows , however , also that there is a dis crepancy in the opposite direction above the typical adult female Fo -range . This part of the discrepancy is not accounted for by spectral integration as suggested previously . In the frequency range 250 < < FO Hz, 350 appears to influence phonetic quality to a larger extent than predicted the hypothesis that perceived phonetic openness is given by the FO by tonotopic distance between Fl and FO . In this respect , the Japanese data agree with the American English data of Peterson and Barney [ cf . TraunmUller , 1984 ] , and also 1981 ] . to with the result of a p..1rely perceptual experiment [ TraunmUl.ler , In this particular range of FO ' perceived phonetic openness appears given by FI /FO ( or log ( Fl ) - log ( FO ) ' as suggested by Bladon be [ 19 82 ] . 4.0 . . 3.5 J: : 3 .-.. ..., b> 0 2 3.0 :1 2.5 . . S . . . . . . . . . . . . . . . . . . 20 . ., �s 1 �2 2 2J�/ 'Z 1 1 1 1 (1 1 11 11 Figure 13 log & ;1 1 2. 4 (FO) ( a and b ) : . . . . ..., N 2. 5 : : . iI 2 �22 1 1 � 111 I 11 :: Z : : ' ' , , ' ' ' ' , , ' 0 a . � 2221 10 5 . . ,� � ssSs '" 0 2. 2 . . 2 2. 0 2. 0 . _ _ : 15 a3 sF § � 2 2 11 1 . . . . . . . . . . . . z ' . . . . . . , . . . . 2 3 (FO) 4 vowel-space centroids for 30 speakers of b Japanese . Three formants plotted against fundamental . Source of data as above . ( a ) : Characteristic frequencies expressed in a logari thnic measure . (b) : Characteristic frequencies expressed in a tonotopic measure . culated low-frequency end of spectral pattern ( Equation 6 ) shown by Cal curved line . This phenanenon may perhaps be understood as follows : ( below 150 Hz) , characterizing If F0 is very low the ear considers the envelope of the spectral pattern vowels to begin right at the beginning of - 84 - the tonotopic If F0 is very high ( above 300 Hz) , the ear is not capable to bridge the large spectral gap below FO and , therefore , considers scale ( at 0 Bark ) . envelope to begin at FO . In the range of 150 Hz < FO < 350 Hz, then , quite a drastic transition between these two interpretations will result . the A similar limitation to spectral envelope detection , concerning the maximal spacing of partials acceptable for the auditory extraction of F1 has been observed previously [ Traunnilller , 1981 ] . The present suggestion differs fran the previous one in that the problem is now seen as one envelope detection instead of spectral integration . of spectral This explanation does not preclude the calculation of the tonotopic distance between F 3 and F0 ' suggested to be invariant between speakers , given equality in phonation and vocal effort . accordance A low-freqyency endpoint , Z OO ' of the spectral envelope , in with the present explanation , can be calculated with Equation (6) : Z oo = The Zo l C l + (6) exp [ 2 . 5 ( 2 . 5 - Z O ) ] } constants in equation ( 6 ) were arrived at by attenpting to keep con stant the spectreal distance between Z OO and the mean Z I of our speakers of Japanese . This was done by hand-craft , i . e . , better approximations may exist . The calculated Z OO is shown in Figure 1 3b . The success of the present attenpt to remove between-speaker variation fran the description of vowels can be appreciated by canparing a plotting of the 1970 ] in Z I vs Z 2 space ( Fi gure 14 ) with a plotting of the same data in terms of the two parameters I I individual Japanese vowels [ E\.lj isaki et al . , and size 12 , suggested and " backness " to be invariant with respect to variations in "elevation" ( Figure 15 ) . I I reflects as well as "lip protrusion" , oral "depth" in TraunmUller [ 1 9 87 ] . substituted for Z o in Equation ( 5 ) . vowel openne ss , speaker and 12 with a cover term referred to as In calculating I I and 1 2 , Z OO has been The upper reference value needed in Equation ( 5 ) can be estimated either by using for each speaker his mean F 3 - not inherent in the particular segment - or to use the F3 of each particular vowel . In the latter case , we discard sane phonetiC information . perception , it If there is a similar procedure in human speech is likely to be based on the formants above very little phonetiC information in vowels . - 85 - F3 , carrying Such data are , however , not at our disposal and for Figure 15 , F 3 has been chosen as the upper reference point . The vCMels can be seen to aggregate in five distinct clusters . A few tokens still appear to be wrongly placed . Probably these represent errors carrnitted s anewhere in the chain of data aquisition . 20 - 15 N u.. - I . .. .. .. .. .. .. .. .. : I .. .. .. . . . . . . . . . . .. I . . . . . . .. I . . .. .. . . . . .. .. .� : : - - Figure 14 : 10 - Tonotopically scaled F1 ( horizontally ) and F2 ( vertically ) : - in the five Japanese duced 5 1- :. . . .. . . .. .. .. .. .. .. . I o I 5 Fl ( B ar k ) . . . . . . I 10 : I 15 - vCMels pro by physiologically different speakers . Formant frequency as above , fran H . data , E\.lj isaki et al . [ 19 70 ] . 1.2 1.0 N if .8 i i , \. Figure 15 : i "Jl.'f� : i iii i Phonetic , paralinguis . it tically : invariant parameters I1 ( related with vCMel openness , hori ..... . E. zontally ) .4 culated " oral and depth" , I2 ( related vertically ) , with cal- according to Equations ( 5 ) and ( 6 ) in the five Japanese vCMels .2 produced by physiologically -.4 -.2 o I1 .2 .4 . f, rent speakers . Formant data same as in Figure 14 . - 86 - diffe frequency SUMMARY We have seen that there are two types of spectral envelope transformations which whithin wide lllnits do not affect the phonetic quality of vowels : Translations and linear canpressions/expansions . variations in speaker size and in vocal effort , whispered these vowels two without can be described or elllninated fran a types of transformation applied to discrllnination essentially contains apply . as well as the spectral difference between voiced and to between vowel quality . the description spectra the several spectral holistically , peaks We have also seen that the with contributing speech signal information concerning the appropriate type of transformation to In voiced speech segments , this information is carried by F0 and by the formants above F2 . The higher formant frequencies correlate ( negative ly ) with speaker size , and the tonotopic distance between the higher for mants and F0 correlates ( negatively ) with vocal effort . The difference between vowels produced by adult waren , those produced by other speakers , can , however , as canpared not be described with holis tically , but it requires the formants to be treated separately . It has been shown that vowels produced by adult waren tend to be phonetically more explicite than those of other speakers . The investigation of the perceptual role of spectral transformations of the abovenentioned kind presupposes an adequate scaling of pitch . While sane arguments in favour of a tonotopic approach on the basis of a critical band rate scale variation have been p.lt forward , we have seen that the paralinguistic in vowel formant frequencies can also be described on the basis of a logarithnic scale of frequency . This research has been supported by a grant fran HSFR , Council for Research in the Humanities and Social Sciences . - 87 - the Swedish REFERENCES Ananthapadmanabha , T. Pitch as a normalization factor in V. : vowel perception . Acoustics Letters � : 1-4 ( 19 84 ) . Bickley , C . : Acoustic evidence for phonological developnent of vowels MIT RLE Speech Group Working Papers , in young children . !: 111-124 vol . ( 1983 ) . Bladon , of speech ; A . : Arguments against formants in the auditory representation in Carlson , Granstran , The Representation of Speech in the peripheral Auditory Systan , pp . 95-10 2 , Fig . 4 ( Elsevier Bianed . , Amsterdam 1982 ) . Bladon , A. : 'IWo-formant Irodels of vowel perception : shortcanings and enhancements . Speech Cbmm . �: 305-31 3 ( 19 8 3 ) . Bladon , R . A . W . ; Henton , C . G . ; Pickering , J . B . : Towards an auditory theory of speaker-sex normalization . Language and Ccmnunication !: 59-69 ( 1 984 ) . Buhr , The anergence of vowels in an infant . J . Speech Hear . Res . R. : 23 : 7 3-94 ( 1 980 ) . Chiba , Ts . ; Kaj iyama , M. : The Vowel : Its Nature and Structure , pp . 174-226 , ( Tokyo-Kaiseikan Publ . Oomp . , Tokyo , 1941 ) . Chistovich , L . A . ; Lublinskaya , V . V . : The " centre of gravity" effect in vowel spectra and critical distance between the formants : psychoacousti cal study of the perception of vowel-like stimuli . Hearing Res . 1 : 1 85-1 95 ( 1981 ) . Christov , Bulgarian ph . : a:rnputer aided analysis of stressed and vowels fran 30 male and 30 fanale speakers . Proc . unstressed XIth ICPhS , Vol . 3 , pp . 121-124 ( 19 87 ) . Crandall , I . B . : The sounds of speech . Bell Syst . Techn . J . 4 : 586-626 ( 1 915 ) . Di Benedetto , M . -G . : On vowel height : acoustic and perceptual repre sentation by the fund amental and the first formant frequency . Proc . XIth ICPhS , Vol . 5 , pp . 198-201 ( 1 9 87 ) . Disner , S. F. : Evaluation of vowel normalization procedures . J. Acoust . Soc . Am . 67 : 253-261 ( 1 9 80 ) . Fant , G . : Acoustic analysis and synthesis of speech with applications to Swedish, Ericsson Tecimics 1 ( 1959 ) . - 88 - Fant , G . : A note on vocal tract size factors and non-unifonn F-pattern scalings . Q. stockh . , No . Prog . 4, status Rep . , Speech Transm . Lab . , R . Inst . Technol . , 22- 30 ( 19 6 6 ) . (Also in Fant , Speech Sounds and Fea- pp . tureS , MIT Press 1 9 7 3 ) . Fant , G. : Non-unifonn vowel nonnalization . Q. Prog . Status Rep . , Speech Transm . Lab . , R . Inst . Technol . , Stockh . , No . 2/3 pp . 1-19 ( 1 975 ) . Flanagan , J . L . : Speech Analysis , Synthesis and Perception , pp . 6 3-65 ( 2nd ed . , Springer , Berlin , 197 2 ) . Fowler , C. A. ; Rubin , P . ; Renez , R . E . ; Turvey , M . T . : Implications for speech production of a general theory of action ; in Butterworth , Lan guage Production , vol . 1 , pp . 37 3-420 ( ACademic , London 1 9 80 ) . Frc6kjaar-Jensen, B . : Changes in fonnant frequencies and fonnant levels at high voice effort . ARIPUC No . 1, Inst . Phonet . , Univ . Copenhagen , pp . 47-55 ( 19 6 6 ) . 0. ; EUj imura , Lindqvist , J . : Sweep-tone measurements of vocal-tract characteristics . J . Acoust . Soc . Am . 49 : 541-558 ( 19 7 1 ) . EUj isaki , Japanese H. et . al . : Ebnnant frequencies of sustained vowels obtained by analysis-by-synthesis of spectral envelopes . in ( Unp.1b lished data , Univ . of Tokio , 1970 ) . EUj isaki , H . ; Kawashima , T . : The roles of pitch and higher fonnants in the perception of vowels . IEEE Trans . Audio Electroacoust . , Vol . AU-16 , No . 1 , pp . 7 3-77 ( 196 8 ) . EUj isaki , Ann . H. ; Nakamura , N . : Nonnalization and recognition of vowels . Rep . , Eng . Res . Inst . , Tokyo Univ . , Vol . 28 : 61-66 ( 1 9 6 9 ) . Gandour , J . ; Weinberg , B . : on the relationship between vowel hight and fund amental frequency : Evidence fran esophageal speech . Phonetica E: 344- 354 ( 1980 ) . 0. : Garnica , young children , Sane prosodic and paralinguistic features of speech to in Snow , Ferguson , Talking to Children : Language Inp.1t and Aquisition , pp . 6 3-83 ( cambridge Univ . Press , cambridge , 1 9 7 7 ) . Gay , T. ; Lindblan, B . ; Lubker , J . : Production of bite-block vowels : Acoustic equivalence by selective canpensation . J. Acoust . Soc . Am . 69 : 183-1 9 3 ( 197 7 ) . Gibson , J . J . : The Senses Considered as perceptual Systens ( Houghton Mifflin , Boston , MA , 1 9 6 6 ) . Goldstein , U. G . : Articulatory Model for the Vocal Tracts of Growing Children . Ph . D . thesis , Mass . Inst . Technol . ( 1 9 80 ) . - 89 - Iivonen , A . : Zur regionalen Variation der betonten vokale im gehobenen Deutsch , in Kahlas-Tarkka , Neophilologica Fennica ( Menoires de la Societe Neophilologique de Helsinki , TOne XLV ) , pp . 87-119 , Helsinki ( 19 87 ) . Kallail , isolated K . J . ; Elnanuel , F . W . : Formant-frequency differences between whispered and phonated vcmel samples produced by adult female subj ects . J . Speech Hear . Res . 27 , 245-251 ( 19 84a ) . Kallail , whispered K. J. ; Elnanuel , F . W . : An acoustic canparison of isolated and phonated vcmel samples produced by adult male subj ects . J. Phonet . 12 : 175-186 ( 19 84b ) . Kent , R. D. ; Forner , L. L. : Developnental study of vcmel formant frequencies in an imitation task . J . Acoust . Soc . Am . 65 : 208-217 ( 1 97 9 ) . Koopnans - van Beinum , F . J . : Vcmel a:mtrast Reduction . Dissertation , University of Amsterdam (Acadanische Pers B . V . , Amsterdam 1 9 80 ) . Ladefoged , P. ; Broadbent , D . E . : Information conveyed by vcmels . J . Acoust . Soc . Am . 29 : 9 8-104 ( 1 957 ) . L, A . M . ; Mattingly , I . G . : The rrotor theory of speech percep tion revised . Cognition 21 : 1-36 ( 19 85 ) . Lindblan B . ; Sundberg , J. : Acoustical consequences of lip , tongue , j aw , and larynx rrovanent . J . Acoust . Soc . Am . 50 : 1166-1179 ( 1 9 71 ) . Miller , Am . R. L . : Auditory tests with synthetic vcmels . J . Acoust . Soc . 25 : 114-121 ( 1 95 3 ) . Nordstrem , P . -E . : Fenale and infant vocal tracts simulated fran male area functions . J . Phonet . �: 81-92 ( 19 7 7 ) . Ohala , in Channon , J . J . ; EUkel , B . W . : Explaining the intrinsic pitch of vcmels ; Shockley , Festschrift for Ilse Lehiste , pp . 207-215 . ( Foris , Dordrecht , 1987 ) . Peterson , G. E. : Information bearing elanents of speech . J . Acoust . Soc . Am . 24 : 629-637 ( 1952 ) . Peterson , G . E . ; Barney , H . L . : Control methods used in a study of the vcmels . J . Acoust . Soc . Am . 24 : 175-184 ( 1952 ) . Pols , L . C . W . ; van der Kamp , L . J . ; Planp , R . : Perceptual and physi cal space of vcmel sounds . J . Acoust . Soc . Am . �: 458-467 ( 1 9 6 9 ) . Rietveld , A . C . M . ; Koopnans - van Beinum , F . J . : Vcmel reduction and stress . Speech Comm . �: 217-229 ( 1 9 87 ) . Rossi , M. ; Autesserre , D . : Movanents of the hyoid and the larynx and the intrinsic frequency of vcmels . Phonetica � : 2 3 3-249 ( 19 81 ) . Rostolland , D . : Acoustic features of shouted voice . Acustica 50 : 118- 125 ( 1982 ) . - 90 - Rostolland , D . ; Parant , X . : Physical analysis of shouted voice . Contr . to 8th int o congr . acoustics , London ( 19 74 ) . Schulman , R . : Articulatory targeting and perceptual constancy of loud speech . PERILUS No . 4 ( Inst . linguist . , Univ . Stockh . ) pp . 86-91 . ( 19 85 ) . Schulman , R . : Articulatory dynamics of laud and normal speech . Subnit ted to J . Acoust . Soc . Am . ( 19 88 ) . Schwarz , of J . L . ; Escudier , P . : A direct daronstration of the existence a large-scale integrated spectral representation in Subnitted to Speech Oomm Slawson , A. W. : . vowel perception . ( 1 9 88 ) . vowel quality and rrusical timbre as functions of spectrum envelope and fund amental frequency . J . Acoust . Soc . Am . 4 3 : 87-101 ( 19 6 8 ) . Sundberg , J . : Formant technique in a professional singer . Acustica 32 : 89-96 ( 1975 ) . Suani , K . : Whole spectrum vooel normalization . Speech Oomm l : 199-209 . ( 1984 ) . Syrdal , A. K. : Frequency analyses of syllable initial and final li quids spoken by American English talkers . ( Abstract ) J . Soc . Am . Acoust . 7 1 : S105 ( A ) ( 19 82 ) . Syrdal , A. K. : Aspects o f a model o f the auditory representation of American English vowels . Speech Oomm Syrdal , A. ! 121-135 ( 19 85 ) . . K . ; Gopal , H . S . : A perceptual model of vooel recognition based on the auditory representation of American English vooels . J . Acoust . Soc . Am . 7 9 : 1086-1100 ( 19 86 ) . TraunmUller , H. : Perceptual dimension of opermess in vooels . J. Acoust . Soc . Am . 6 9 : 65-1475 ( 19 81 ) . TraunmUller , H. : Die spek.trale Intergation bei der Wahmehmung der Klangfarbe von vokalen . Acustica 54 : 2 37-246 ( 19 82 ) . TraunmUller , scale . Related H. : sensory D . thesis , On Vooels : perception of Spectral Features , Part of Ph . Aspects Analytical expresSions for the tonotopical of Production , and Sociophonetic Dimensions . Univ . Stockholm ( 1 9 8 3 ) . TraunmUller , age- and H. : Articulatory and perceptual factors controlling the sex-conditioned variability in formant frequencies of vooels " , Speech Oomm l: 49-61 ( 19 84 ) . . TraunmUller , in the H. : perception The role of the fund amental and the higher formants of speaker size , vocal effort , and vooel opermess . PERILUS , No . 4 , ( Inst . Linguist . , Univ . Stockholm ) , pp . 9 2-102 ( 1 9 85 ) . - 9 1 - Traunrrill.ler , Schouten , H. : Sane psychophysics of aspects of the sound of Speech perception , pp . speech sounds ; 29 3 - 305 in ( Nij hoff , Dordrecht 1987 ) . Traunrrill.ler , H. ; Lacerda , F. : perceptual relativity in two-formant vowels . Speech Ccmn §.: 14 3-157 ( 19 87 ) . . Wakita , application H. : Normalization of vowels by vocal tract length to vowel identification . IEEE Trans . Acoust . and Speech its Signal Proc . , ASSP-25 , pp . 183-192 ( 19 77 ) . Williams , C . E . ; Stevens , K . N . : EInotions and speech : Sane acoustical correlates . J . Acoust . Soc . Am . 52 : 1 2 3 8-1250 ( 1 97 2 ) . Zhang , J. : The intrinsic fund amental frequency of vowels effect of speech rrodes on formants . Proc . XIth ICPhS , and the Vol 3 , pp . 390-393 ( 1 987 ) . Zhukov , S. Ya . ; Chistovich , L. A . : perceptual parameters of front vowels . Speech Ccmn !: 267-27 3 ( 19 85 ) . . Zwicker , E. : Zur unterteilung des horbaren Frequenzbereiches in Fre quenzgruppen . Acustica 10 : 185 ( 1 9 60 ) . - 92 - ANALYTICAL EXPRESSIONS EDR THE 'IONOIDPIC SENSORY SCALE Harbrut TraunmUlle r 1. Introduction Two processes are generally assumed to contriblte to auditory frequency resolution. First, the hearing systan is capable of performing an "oscillographic" analysis of the set of neural signals originating in the This cochlea. process is limited to frequencies which can be solved in the pattern of neural responses. While single neurons are not to fire rrore frequently than 500 times per second even likely stimulus intensites, re at high frequencies between 0.5 and 1.5 kHz can still be handled in the tanporal danain, albeit less efficiently, on the basis of the signals fran a large number of neurons. tions The capability and limita of a frequency analysis in the tanporal danain vividly by are deronstrated cochlear implant patients whose sole auditory inp..1t is an undifferentiated electrical stimulation of the auditory nerve. The sOillld second process covers the whole auditory frequency range. Any entering analysis, can be a normal functioning cochlea is subj ect to a spectral resulting in a frequency-to-place transformation. The cochlea regarded as a bank of filters whose outp..1ts are topically, ordered tono with the filters closest to the base responding maximally to the highest frequencies, and those closest to the apex responding maxi mally to the looest frequencies. The tonotopic order is further knCMn to be maintained in the structure of the neural network at higher levels in the hearing systan. Auditory "equivalent quency frequency selectivity can be described in terms rectangular bandwidth" (ERB) as a function of stimulation [1]. resolution of of the center-fre The two processes contriblting to frequency are not separated in the definition and measurement of the ERB. Consequently, the ERB, or ERB- rate should not be taken as a measure of the tonotopic scale. A quantity related to the ERB, though not identical with it, is the classical "critical bandwidth" (CB) [8]. Measurement of the CB typically Different summation rules have involves loudness summa tion exper�ts. been fOillld frequency to hold for auditory stimuli, depending on whether canponents are separated by rrore or less than the CB. - 93 - their The CB the and ERE have been found to be center-frequencies discrepancy, above shown 500 Hz. in Fig. 1. proportional and equivalent For lower frequencies, there for is a the ERE decreases with In this range, decreasing center-frequency, while the CB remains close to constant. The discrepancy can be explained by the reasonable assumption that SlID1Tlation analysis within the temporal danain is irrelevant to loudness down frequencies where loudness variations becane audible as to The discrepancy indicates, contritutes on the other hand, that temporal substantially to frequency resolution for low the Consequently, resolution, tut CB may be taken as a measure of such. analysis frequencies. should not be taken as a measure CB-rate the of frequency the tonotopic sensory scale. 2), the CB has been chosen In the familiar CB-rate scale (see Fig. to as a natural unit of the tonotopic serve values by Zwicker [7 ] in form of a table. applied tion. rrost of these applications, the instead of a table. been have speech percep it would be rrore convenient between z and f specified in form relation z Standard The CB-rate scale has been extensively in research on psychoacoustics and For have scale. the relation between frequency f and CB-rate for proposed sensory of an to equation Several equations which approx�ate the tatulated values have also been p..1blished [2, 4, 5, 9]. In the following, the error functions of these equations will be canpared. Recent studies of speech sounds suggest that the tonotopic ces distan (CB-rate differences) between praninent peaks in their spectra fund amental cally, it forman ts to the perception of their phonetic quality. has been suggested that the spectral peaks are More specifi shaped by the and the fundamen tal have the same relative tonotopic locations in linguistically identical vowels uttered by speakers of different ages and sex [3, 5, 6] . While differences in speaker size appear reflected in a tonotopic translation of the spectral peaks, to be differences in vocal effort appear to be reflected in a linear tonotopic canpression / expansion [6]. In order to test these hypotheses, both in theory and by speech synthesis, CB-rate differences must be accurately calculated. OUr requiremen ts include that the function have a s�le inverse and that it be accurate preferably to within ± 0. 05 Bark in the range of essential vowel formant frequencies of rigorous cla� men, wanen , and children. for accuracy prevents the introduction of any error in addition to that inherent in the table [7 ]. - 94 - This evitable \.0 lJ1 1.5 t 2.0 2.0 by = .... .. + + +++++ #/ .p-+ ....+ ++ + + ++ ++ .. . :4 EtIuivalent 2.5 3.5 4.0 [1] 50 100 and Zwicker 's (curve), according to Glasberg bandwidth (CB), Moore and -6 2 -2 6.23 10 f + 9.339 10 f + 28.52, ..c .,.J "d CO I 'V ,...., N -0 s:: o CO 200 "3 500 1000 2000 rectangular bandwidth (ERE), 1 g( f) 3.0 , 1 . .....' ; +� ... .. . . .. ... ......... 1 .......................... ...................................................... ........ .. +t-+ +++ . . ........ .. .... +... . +++ . .. 1 ______ 10 table [7 ] (marks), as a function of frequency. critical given B 1 +++ according to the fornula 1. 1 Frequency f (kHz) 0.5 1.0 2.0 5.0 � : i i i i 1 0.2 .··············· ..······ ····. · · ··..·················. . . . i; i 1 1 i l i ; : I , r 2.5 Fig. CO 'V b'I ,...., 3.0 3.5 1 0.1 ra I I 10 L� . . . 2.5 .. . " .... Critical-band . .. .. .. ... . . . rate 1 9 ( f) 3.0 . .... .. ... .. 4.0 " z as a function 3.5 .. . .... . ... .. ; i : ! / ____ // ,/' // / 2.0 2. I FrE>quE>ncy f (kHz) 0.5 1.0 2.0 5.0 r·····················································......................................................·········1 1 0.2 of [7 ]. The curve corresponds to eq. (6). frequency f. H+H-rnarks represent data fran Zwicker a G 12 Fig. � U -+oJ 0 u 1 "d s:: 0 .Q 0 � <II -+oJ N 'V '0 CO � ,...., 24 0.1 1 2. Analytical expressions 2.1 Expressions for critical-band rate the relation between f and z is linear for In rough approximation, f < 500 Hz (z .. f/l00) and logarithnic for higher frequencies. Fig. the error functions of two logarithnic approximations to the shows 3 CB scale. One of these, eq. (1), has been suggested by Zwicker and Terhardt [9]. It gives values that agree with the tahllated ones to within ± 0. 25 Bark in the range 0. 6 < f < 7 . 2 kHz . The other approximation, eq. (2), our stricter standards of no more than ± 0. 05 Bark satisfies deviation at the cost of a reduction in the range of validity, to within 1. 0 3. 6 kHz In < f < . z = 14. 2 19 (f/l000) + 8. 7 (1), z = 6. 57 8 In (f) - 36. 99 (2), these and in all the follONing equations, frequency f is to be expressed in Hz and CB-rate z in CB-units (Bark). A mathematical function that is linear at one extreme and loga the sinus-hyperbolicus function, has been ritlmic at the other extreme, used by Tj arov [4], eq. (3), and by Schroeder [2], eq. (4), to calculate CB-rate. The error functions of both equations are shawn in Fig. 4. f = z = f = z = 600 sinh (z/6. 7 ) + 20 (3), 1/2 2 6. 7 In{[ (f-20)1600]) + { [ (f-20)/600] + 1] )} (inverse). 650 sinh (z/ 7 ) (4), 1/2 2 7 In{ (fI650) + [ (f/650) + 1] } (inverse). As canpared with the tahllated values, Tj arov 's equation (3) is accurate to within +0. 03 -0. 28 Bark for f to within enough ± 0.13 Bark for f < < 4. 5 kHz and Schroeder's equation (4) 4.0 kHz These equations . are accurate for sane applications in which frequency canponents above 4 may be neglected, as they are in sane systans of telephonic tion. - 96 - kHz camun ica Approximations achieved in functions . a covering the whole auditory frequency range can various ways by appropriate combinations of be mathematical For the roost part, however, this yields equations which lack The roost aCOlrate of the equations given by simple inverse. zwicker and Terhardt [9], z is 2 13 atn (0. 0007 6 f) + 3. 5 atn (f/7 500) = of this kind. (5) , It agrees with the table to within +0. 20 -0. 25 over the whole range of auditory perception (see Fig. Bark 5). The waiviness of the error function tells us, however, that there is roam for improve ment. The equation also clearly falls short of our standards. If we want to compare the tonotopic distances between two pairs of spectral for example, we peaks, might obtain an error of up to 0. 9 Bark within these limits. An approximation that has a simple inverse and meets our is achieved by considering z to be related to log (f) by standards a logistic function, also known as "growth mrve". Such an approximation, eq. (6), has proposed by Traunmillle r [5]. been Its error function is shown in Fig. 6. z = [26. 81 f/ (1960 + f)] - 0. 53 f = 1960 (z + 0. 53)/ (26. 28 (6), (inverse). z) - The values obtained with eq. (6) deviate fran the tal::ulated ones by less than ± 0. 05 Bark for 0. 2 < f < 6. 7 kHz . At the low-frequency end of the scale, the deviation fran the table [7 ] sums up to -0. 53 Bark for f = 0 Hz (-0. 26 Bark for f = 20 Hz). At least in part, this deviation is due to biased rounding of the bandwidth values in Zwicker 's table [7 ] . For frequencies below 400 Hz, the stan dard width of the critical band was set uniformly equal to 100 Hz. appears to relation z = have been done in order to obtain the f/l00. said simple The original bandwidth data [8] indicate B .. 90 Hz for the lower frequencies in that range. for mnemonically This The values listed in the table f < 100 Hz are partimlarly questionable because they can hardly be to be based on any reliable experimental evidence. Eq. (6) may represent the tonotopic scale well enough down to the lowest frequencies for which it can be determined experimentally. The deviation at the high - 97 - CP 1.0 � 0 � � W '-/ N d u u ....... " N '-/ +-' ...Q d " a.. ....... .1 o 1 o 3. 18 functions of 24 with Olrve without marks: Eg. (2). marks: Eg. (1), given. by Zwicker and Terhardt [9]; Olrve logarithmic (table value) 12 of the CB-rate scale. Error CB-rate z b . . . .. . . . . . . . .. . . . . • . . . . . . .. . . ... . . . . . . .. •• .. . ... •• . . . . • . . . . . . . . . .... . . . . . . .... . .... .... ......................... r. . . . . . . !..................... . . . . . .. .... . ...... . . . . . . . . . . . . . . . . . . . . . . .. .. .. ............... , ......... ... . . . . . .... . approximations Fig. - .4 -.3 • - 2 - • .2 .3 .4 � 0 � � w N u '-/ d u ....... " N '-/ +-' ...Q d ....... a.. " 0 1 • 2 1.0 I _ __ n 2.0 I 4.0 I (kHz) 4. 18 Error functions of (3), given. by Tj arov [4]; 24 Upper curve: Eg. Lower curve: sinus-hyperbolicus CB-r ate z (tab 1 e va 1 ue ) 12 (4) , given. by SchroErler [2]. Eg. 8.0 I . ..... .. .. .... .. .... .. .. .. .. .. ................ .. .. .. .. .. .. .. ...................... . . . . . . ............ .. .. .. .. .. .. .. .. .. . 0.5 I f approximations of the CB-rate scale. Fig. -.4 -.3 - -. 1 • .2 .3 .4 0.0 I Frequency '!) '!) � 0 � � W N '-/ d u ...-i u /"'\ N '-/ +-' � ...-i a.. /"'\ 3 (", 12 18 24 CB-r 0. le z (lab 1 e value ) and Terhardt [9]. of the CB-rate scale, e.g:. ( 5) , given by Zwicker Fig. 5. Error function of an overall approximation -.4 -.3 2 � o � � w '-/ N • • - N '-/ +-' .Q d ...-i a.. /"'\ d u I 8.0 -. 1 I 4 .0 U ...... I 2.0 o I . 1.0 (kHz) /"'\ o I I f 1 .2 . .4 0.5 0.0 Frequency 0 1 I I I I o 6. Error 18 24 with error scale at the right: Eg. (6) Upper curve, shown vertically with corrections (7 ) and (8). displaced, by Traunrrile ll r [5]; curve with error scale at the left: Eg. (6), given Lower "growth scale. of a logistic (table value) 12 function CB-rate z (", !"" .. ,............................................................... " .................. ............. ............... � curve" approximation of the CB-rate Fig. - .4 -.3 -.2 -. 1 • .2 .3 .4 8.0 4.0 2.0 1.0 I _ 0.5 (kHz) I f 0.0 Frequency -. • 1 o 1 frequency end of the scale ranains unaccounted for. Calculating z with eq. achieved bending (6), close agreanen t with the table can be over the whole auditory frequency range by added corrections , the error function straight at both ends of the scale, in the following way: For calculated z < 2.0 Bark: and for calculated z Since > z' = 20.1 Bark: z z' (7 ), 0.15 (2 - z) + = this is an easily inverted procedure, z + 0.22 (z - 20.1) (8). the calculation of f for a given z is not a problan. The error function obtained with these correc tions is also shown in Fig. with the table for f > 6. The values calculated in this way agree 100 Hz to within ± 0.05 Bark. (7 ), Correction however, sirTulates also the abovanen tioned bias at low frequencies. 2.2 Expressions for critical bandwidth Zwicker and Terhardt [9] proposed the equation B to = 25 + 7 5 (1 + -6 2 0.69 1.4 10 f ) (9), calculate critical bandwidth B as a function of center-frequency While eq. (9) obtain CB-rate. is quite accurate, it can not easily be integrated f. to The authors ' equation for CB-rate (5) is not canpatible with eq. (9). Proceeding fran eq. (6), critical bandwidths B can be calculated as B for 2 52548/ (z - 52.56 z = + 690.39) (10) critical bands centered at z obtained by eq. tions. The values calculated by eq. within ± 6% without correc (10) agree with Zwicker 's table to for 0.27 < f < 5.8 kHz . Within function is similar to that obtained by eq. both equations are shown in Fig. 7 . - (6 ) 100 - that range, the error (9). The error functions of 8.0 I 2.0 I 1.0 I 0.5 (kH z ) f Frequency ...... .4 /'"'- r............., , . .", " .. . . . . ." . ..... .. . . .. . " , . . . . .... . " ... . ". . . ,, " . . . .... .. . . . .. " . .. . .. . . .. . ... .. .. � .2 '-/ • 1 /'"'- � 0 ....... d u -. 1 OJ -.2 '-/ .. � 0 � � W - . 3 -. 4 + - : ................... ............ . . . . . . .. " ................ . . . . . . .. .... . . . . . . . . . . . . . . . . . , . o Z (l ab 1 . ef' . . . . . . . . . . .... . . . . . . . . . . . . . . E' .: 24 18 12 I; f'Y.\ -- -- -- - : CB -I' a t €I (curve ". . .3 ..Q d ...... Fig. . . . .... valu E' ) 7 . Error functions for critical bandwidth calculated with eqs. (9) with marks) and (10) (curve without marks), Zwicker I s table values (se also Fig. 1). - 10 1 - as canpared with The HSER, preparation of this paper has been. supported by a grant fran the Swedish Council for Research in the Humanities and Social Sciences. [1] B. C.J , Moore, and • Glasberg, B. R., Suggested formulae for calculating auditory-filter bandwidths and excitation patterns. J. Acoust. Soc. Amer. 7 4 [1983], 7 50-7 53. [2] Schroeder, Recognition of canplex acoustic signals. In T. H. M.R., Bullock. ( ed. ) , Life Sciences Research Report 5 (Dahlem Konferenzen) Abakon verlag, Berlin, 197 7 , pp. 323-328. [ 3] Syrdal, A. K. recognition and based H. Gopal, on the S., auditory A perceptual rrodel of representation of vowel American English vowels. J. Acoust. Soc. Amer. 7 9 [1986], 1086-1100. V. [4] Tj arov, L., A rrodel to describe the results of psychoacoustical experiments on steady-state stirruli. Analiz Rechevykh Signalov Chelovek.an, In G. Nauka, V. Gershuni (ed.), Leningrad, 197 1, pp. 36-49. [5] Traunrrle lLl r, H., en vowels: perception of Spectral Features, Related Aspects of Production and Sociophonetic Dimensions, thesis, Stockholms Universitet, 1983. [6] Traunrrile ll r, H. , paralinguistic variation and invariance in the characteristic frequencies of vowels, paper suhnitted to Phonetica. [7 ] Zwicker, E., Zur unterteilung des horbaren Frequenzbereiches in Frequenzgruppen.. Acustica 10 [1960], 185. [8] Zwicker, loudness [9] Zwicker, band E., Flottorp G., and Stevens S.S., Critical band width in SlIDTTlation. E., J. Acoust. Soc. Amer. 29 [1957 ], 548-557 . and Terhardt E., Analytical expressions for critical rate and critical bandwidth as a function Acoust. Soc. Amer. 68 [1980], 1523-1524. - 102 - of frequency. J. ATTITUDES TO IMMIGRANT SWEDISH A L I TERATURE REV I EW AND P REPARATORY EXPERIMENTS U n a C u n n i ngham-And e r s s o n a nd Olle Engs t r a nd Abs t r ac t Th i s pape r p r ov i de s a br i e f r e view o f prev i ou s work p ub l i s h e d i n the f i elds of l i ngu i s t i c a t t i t ud e s, n o n - na t i ve a c c e nt s a nd n a t i v e - s peak e r a t t i t udes t o n o n- na t i ve a c c e n t s. An e xpe r i me n t al i nve st i ga t i o n o f n a t i v e Swe d i sh s p e ak e r s' a b i l i ty t o ide n t i fy n o n - na t i ve a c c e nts i s report ed: lingu i s t i c all y n a i ve i n f o rma n t s c ould o nly id e nt i f y a f ew a cce nt s, wh i l e t e ache r s o f Swed i sh f or i mm i gr a n t s c o uld i d e n t i fy ma ny mor e a cce n t s . A method f or evalu a ting d e gr ee of a c ce n t edne s s i s developed , a nd i t i s f ou nd t h a t s i mple a c c e n t ed n e s s evalua t i o n s b y n a t i ve spe a k e r s c orrela t e wel l t o evalua t i o n s o bt a i n e d f rom li ngu i s t s u s i ng a c ompl i c a t ed gr ad i ng sy s t em. Me t hods for e l i c i ti ng n a t i ve s p e ak e r a t t i t ud e s t o n o n - n a t i ve a cc e n t s and to the m i n or i ty g r o ups who u s e t hese a ccen t s a re d i s c u s s e d . In a n o t h e r e xp e r i me n t, d e s i g n e d u s i ng t he r e s ul t s o f t h e a c c e n t n a t i ve s p e aker a t t i i de n t i f i c a t i o n expe r i me n t men t i o n e d a bove , t ude s to a s i ngl e n o n- na t i ve s p eaker we re el i cit e d w i th t h e i nf orma n t s alt e r n a t ely bel i ev i ng t h e s peak e r t o b e Kurd i sh a nd G e rma n ( a var i e ty o f "ma t ch e d-gu i s e " t e chn i qu e ) . He w a s judge d t o b e more f r i e ndly a nd l e s s s uc c e s s f ul wh e n belie ve d t o be a Kurd . The rela t i o n s h i p betw e e n accen t e d ness a nd n a t i ve s p e aker att i t ud e s w a s explo red i n the same exper i men t, and both in the c a s e o f two s peake r s bel i e ved t o be Kurd s a n d two s p e ak e r s b e l i eved t o be Germa n s , n a t i ve s p e akers re a c t ed more n e g a t i vely t o t h e s p e ak e r w i t h t h e s t r o n g e r ac c e n t . Ev i de n c e i s pr e s en t e d s ugges t i ng t h a t where n a t i ve s p e ak e rs f a i l t o re c og n i se n o n - na t i v e a c c e n t s , s t ro ng e r a c c e n t s a r e judged t o b e spok e n by i mm i gr a n t s f rom c ou n t r i e s f ur t her away f rom Swe d e n than weaker a c c ents . CONTENTS I n troduc tion 1. 2. Li t erature review 2.1 S t ere o type vie w s of l a ng u a g e 2 . 2 P h o ne t i c f e a ture s o f n o n - n a t i ve a c c e n t s 2 . 3 S t e r e o type f e a t u r e s o f n o n - na t i ve a c c e n t s 2.4 T oler a n c e o f f or e i g n a c c e n t s 2 . 5 A t t i t ude me a s ureme n t 2.6 Ide n t i fi c a t i o n o f a c c e n t o r i g i n 3 . Expe rime n t s 3 . 1 I d e n t i fi c a t i o n o f f or e i g n a c c e n t s 3 . 2 I de n ti f i c a t i o n o f f or eign la ngu a g e s 3 . 3 A c c e nt e dn e s s me a s ur e m e n t 3 . 4 A t t i t ude me a s u reme n t 3 . 5 Rela t i o n s h i p betwe e n a c c e n t ed n e s s a nd a c c e n t 4 . P l anned experime n t s 4 . 1 On t h e n a t ur e o f f or e i g n a c ce n t s 4 . 2 Ma n i pu l a t i o n o f a c c e n t edn e s s - 103 - ident i f i c at i on 1. Introduc t i on C ommu n i c a t i o n b e t we e n guage is i mm i g r a nts a nd o f t e n di f f i c u l t . problems t h a t the v o c a bu l a ry or pho n ol o gy . This c a n i mm i g r a nt t h e n a t i ve s p e ak e r ' s p e r f o rma n c e can be has be due t o purely w i th t he of a l a n l i ngu i s t i c l a nguage' s s y nta x , Th e r e is h owev e r re a s o n t o be l ieve t h a t t o l e ra n c e i n f l ue n c ed s p e ak e r s j udge d i al e c t s o t h e r "broad", "ple a s a n t" or n ative s p e ak e r s of the i mm i g r a n t ' s by o th e r tha n "ugly", f a c t ors. the i r s ome own l i n gu i s t i c Jus t a s n a tive to be " sl o p py" , f oreign a c c e nts a re he l d t o b e more a t t r a c t i ve t h a n o th e r s . It me n t s are bu t als o an e x pre s s i o n o f more or not o nly a e s t h e tic , s e ems l i k ely t h a t t h e s e j udge le s s c o n s c i ou s l y held a tt i t ude s t o the s p e ak e r s o f the a c c e nts o r dial e c t s . The p i c t ure i s mor e tha n f o r spe a k e r s o f c omp l i c a t e d f or spe ak e r s o f u n c e r t a i n ty in t h e g e n e r al. It i nvo l ve d a s well f ac e i s o nly of be genu i ne a s u n f am i l i ar i ty w i th a nd f or e i g n er s a nd immig r a n t s make o ne s el f u nd e r s t o od. ni c a ti o n be twe e n n a t i v e s p e a k e r s n e twork o f may foreign accents in i n t h e mo s t e x t r eme c a s e s t h a t c ommun i c at i o n be twe e n n a t i v e s p e a ke r s a nd i n a b i lity t o s i nce t h e r e o t h e r di a l e cts, c ompr e h e n s i o n probl ems f ore i g n a c cen t s evalu a t i o n s a nd a nd f ails due to a c omple t e It i s p r o b a ble that commu i mmig r a n t s, p r e jud i c e. Th i s i s b a s ed i s a nalo g o u s to on a the sit u a tion whe r e two n a tive spe ake r s e x c h a nging ple a s a ntr i es a b o u t t h e weath e r c a n e a ch , f o rm a o n the b a s i s o f a f a i rly d e t ail ed ge ogra ph i c al o r i g i n , evalua t i o n s that pi c t ure of e du c a t i o n e t c. n a tive s p e ak e r s - 1 04 - the f ew m i n ute s o t h e r' s c o nvers a t i o n , s o c i al s t a t u s , The d i f f er e n c e make r e g a rding i s that the o t h e r n a tive spe akers a r e more l i kely t o b e ba s ed o n t ho s e regard i ng Th e project low i ng i mm i gr a nt s . " At t i t ud e s f o ur f ac t a n d exp e r i e nce t h a n to I mm i gr a n t Swed i sh" f u ndame n t al qu e s t i o n s , addre s s e s the f ol u s i ng e x p er i me n t al pho ne t i c a nd s o c i ol i ngu i s t i c me t hods : * f l e c t ed * a t titudes How are nat i ve s p e akers ' i n the i r a t t i t udes t o What c ons equences do to i mmigrant s' fore i gn a c cents e t hn i c m i nor i t i e s re fore i gn a c cent s? have for the l i nguistic i nt e ract i on betwe e n Swe d e s a nd imm i grants? * Under wh i ch phone t i c c i rcumstan c e s c a n a nat i ve a non-na t i ve own * a reas o na b l e approx i mat i on to h i s l i nguis t i c norm? How can new phonet i c res e arch c o nt r i bute t o and i ncrease underst a nd i ng of the way In t h e f i r s t y e ar o f the proje c t , to t h o s e c overed by the proje c t expe r i me n t s f i rs t 2. pronunc i a t i on as s p e aker a c cept have t hree o f be e n i mm i gra n t s s pe ak Swed i sh? prev i ou s work has p e r f ormed an exp l a nat i on f or in bee n s t udied , f i e ld s relat ed a nd prel i m i n ary t o t e s t hyp o t he s e s a r o und t h e t h e a bove ba sic que s t i o n s . L i t e rature rev i ew 2 . 1 Stere o t ype v i ew s o f l a nguage A g o o d d e al o f res e ar c h i n t o l i ngu i s t i c a t t i t ud e s o n t h e o n e h a nd a nd f ore i gn a c c e n t s o n t h e o t h e r h a s b e e n per f ormed. ever k n own about a t t i t ude s t o Les s i s h ow fore i g n a c c e n t s . It has o f t e n been a s s um e d by s o c i al p s y c h ol o g i s t s t h a t an evalua t i on o f a var i e ty o f l a nguage by a group o f - 1 05 - i n f o rma n t s i s ne c e s - s a r i ly h i ghly c orrela t ed t o spe a k e r s of the language p e r c e p t i o n o f l a nguage a nd s o c i ally is that group's var i ety . L a b ov in fact a t t i t ud e (1966) t owa rd s t h e c l a i ms t h at mo s t p e r c ep t i o n o f s o c i al e x p e r i e n c e a cc e p t ed s t a t eme nt s a b o u t l a n g u a g e . Th i s le ads t o a s t e r e o ty p ed v i ew o f a group wh i ch i s bel i eved t o s p e ak way . L abov s ugge s t s tha t l a ng u a g e i s a symbol o f c i al i d e n t i ty. Mo s t n a t i ve a b o u t la ngua g e i n f orma n t s h a v e a nd c a n d e t e c t luate the s e in s o c i al v a r i a bl e s a r e h ow e v e r c o ns c i ous appe a r s t o f o c u s o nly o n s o c i al aware n e s s t e rms. La bov of certa i n The re a c t i o n s t o ma ny l i ngu i s t i c res p o n s e s b e l ow c omme n t s i t ems a nd have v e ry s tr o n g o p i n i o n s f e a t ur e s a nd c o n s i s t e n t ly ev a i na r t i c ul a t e awa r e ne s s . a p a r t i c ul a r s o t h e pres e n c e o r a bs e n c e s t i gma t i z ed o r s o c i a l ly d i a g n o s t i c i n a g i ve n that t h a t have be c ome p a r t the l evel o f c o n s c i ou s a t t e n t i o n r i s e n to t he level o f o f t h e a c c ep t ed f o l klore a b o u t l a ng u a g e . Rob i n s o n (1972) a t i ve s t ud i e s s ugge s t s t h a t are ba s ed c u e s a r e us ed t o pl a c e e t h n i c i ty , s o c i al i ntrodu c ed l i s t e ners ' ers on two s teps : t h e spe ak e r clas s ) ; i nd i v i du a l a r e de r i ved . (1960) i mp re s s i o n s o f the s e c o ndly, Lamb e r t , f i r s t ly, group (age, the G a rd n e r a nd F i lle n b a um t e ch n i qu e diale cts p e r s o n al i ty o f s ame or a c cents . p a s s ag e f o r comp a r i ng i n all t h e "gu i s e s ", the i r r at i ng s r e fle c t i o n o f are a l a ng u a g e var i e ty c o n c erned , - 1 06 u s i ng are it two not or more t old th a t i s a s sumed t h a t t h e i r s t e r e o type d re a c t i o n t o s i nce - a nd The l i ste n a r e c o rded s p e ak The l i s t e ne r s t h e s ame p e r s o n r e ad s sex , c o n c e r n i ng the r e a c t i o n s t o d i f f e r e n t l a nguage var i e t i e s. e r a ft e r h e a r i ng h i m r e ad th e the a nd spe e ch s t e r e o type s H odg s o n, i n evalu vo i c e a s o c i al " ma tched-gu i s e" a r e r equ i r ed to evalua t e langu a g e s , in spe ake r s f a c t or s s u ch a s a c t u al p e r s o na l i ty , es . qua l i ty, vo i c e ( 1 9 60 ) , Lamb e r t e t a l . f ound e rs , that t empo e t c u s i ng Fre nch- s p e ak i ng s tude n t s i n Mo n t re a l evalu a t ed t h e p re f e rr e d F r e nch gu i s e s f avourab l y t h a n t h e Eng l i s h - s p e ak i ng l i s t e ners d i d . e t al. i nt e r p r e t e d t h i s the Fr e nch-s p e a k i ng va l ue s of al . ( c. f. G i l es tha t s p e ak e r s o f react i ons aware n e s s . r e a c t e d more than to the s ame such as B ourh i s is et a l. f ound l ess r e ad i ng ( 1 9 73 ) a a p a s s age p o s i t i ve s el f - i mage , n o n- n a t i ve mak e a nd Re n ch e r spe ak e r s p e r s o n a l i ty 1 9 73 , i n We l sh i n E ng l i sh, o f Wels h 1 974 ) . as wh e r e i n f ere n c e s i nd i v i duals . o n the d o c ume n t ed ( 1 9 70 ) (c. f . f ound t h a t Fre nch a nd Nor t h Ame r i c a n acce n t s have h i gh e r prestige Br i t i sh reg i o nal acce n t s amo ng Br i t i s h - 1 07 - 1 2 a nd a nd i de n t i ty . judgeme n t s G i l es in l i s t e ne r s p a s s ag e spe a k e r s a r e r equ i r e d t o mak e to sug w i t h cha ng e s t h a t Welsh s t r e ng t h of h e a r i ng a v oi c e r e ad a t e xt i s we l l S t rong re a c t e d In i n L amb e r t e t al . may ch a ng e i s t ak e n t o b e a symb o l pos s i bl e u s e d t h e ma t Taj f el t ho s e o bs e rv e d s p e ak e r s r e a d i ng about t h e c h a r a c t e r i s t i c s o f Br own , the 1 9 75 f o r r ev i ew ) . G i le s a nd ( 1 969 ) spe ak e r s s tud i e s n a t i ve it a nd P owe s l a nd t o t he We l sh s p e ak e r s ' the Welsh l a ngu age Tha t o t h e r s tud i e s have Bourh i s , f avoura b l y t o a t t r i bu t e d th i s I n ma ny t hey p e r c e i ve t o be B l ack E ngl i sh t h a n s p e ak e r s u s i ng the a nd Tu c k e r a nd Lamb e r t e t hn i c wha t r e a c t i o n " where by f ound tha t B l a c k Ame r i c a n s "s t a nd a rd n e twork s ty l e" . ( 1 9 60 ) ado p t ( 1960 ) Tu cker a nd Lamb e r t f avoura b l y t o ge s t "m i nor i ty group Lamb e r t the Engl i sh-s p e ak i ng ma jor i ty . ched-gu i s e me thod 1969 , as a s tude n t s Fo l l ow i ng Lamb e r t e t R. P. , f or a l l gu i s b i l i ngual Fre n c h-Eng l i s h r e ad Engl i sh gu i s e s t o Fre n c h gu i s e s a nd l ess a r e t h e s ame 1 7 y e a r olds , than wh i le G e rma n a nd It a l i a n a c ce n t s were R . P. a nd Fre n c h a c c e n t s we re i n t e rmed i a t e . judg e d In t h e s ame s t udy, h i gh o n a e s t h e t i c wh i l e I t a l i a n a nd North Ame r i c a n were c o n t e nt, i n t ermed i a t e a nd G e rma n wa s low . Arthur , ( 1 983 ) Farrar as t ive l y a nd hav i ng a nd f o und s t e r e o typed s c al e s re l a t i ng Carranza r a t ed more f ou nd f avourably a nd be twe e n t h e amo u n t of soc i a l Mex i c a n- a c c e n t ed i n home a nd s ch o o l s t ud e n t s aware n e s s . on Rya n i n t h e h ome Mo f f i e ( 1977 ) Eng l i sh both f or c o n t e x t s by A nglo-Amer i adol e sce n t s , a l t h ough a Me x i c a n c o nt e x t . i nves t i ga t ed t h e re l a t i o n s h i p a c c e n t ed ne s s a nd i s t i c s o f Spa n i sh-Eng l i sh b i l i ng u a l 100 Eng l i sh wa s n e g a t h a t s t a nd a rd Eng l i sh s p e ak e r s were than a c c e nt wa s more a c cep t a bl e Carranza quo t e d by E i s e n s t e i n u n i vers i ty a b i l i ty a nd Mex i c a n- Ame r i c a n a nd B l a c k Rya n , are A ng l o -Ame r i c a n to s u c c e s s , ( 1 977 ) ( 1 974 ) tha t Spa n i s h - a c c e n ted by s t a tus a nd s o l i da r i ty can , Brad f ord t h e a t t r i bu t ed c h a r a c t e r s pe a k e r s . Th e y c ol l e g e s t ude n t s who l i s t e n ed t o t h e a c c e nt ed f ou nd tha t the s p e e ch d i s t i n gu i shed sma l l d i f f e re n c e s i n a c c e n t ed n e s s whe n r a t i ng a s p e ak e r ' s persona l s p e e ch. a c c e nt a t t r i bu t e s the a nd s t r o n g e r the n e g a t i ve po i nt s c a l e s t o me a s ur e of una c c e nt ed; p l e a s a n t -u n pl e a s a n t; rev i ew ) a Ry a n a nd C ors o s t ud i ed the f r i end; - 1 08 the eve n t u a l ( re p o r t e d a t t i t udes of in that Rya n et h e av i e r the al . us ed 7- f o l l ow i ng d i me n s i o ns: o c c up a t i o n; a c c e n t e d f l u e n t - u n flue n t . ( 1 978 ) u nd e rgradua t e s t o s p e akers f ou nd s t er e o type . a t t i t ud e s l i ke l i h ood S e ba s t i a n , be i ng Th ey of i n E i s e ns t e i n m i dd l e - c l a s s Ang l o-Am e r i c a n s t a ndard Eng l i s h - ( 1 983 ) ' s a nd Engl i sh w i t h t hr e e d e gr e e s t ed n e s s, of S p a nish- a c c e nt ed ne s s , i n t e l l i g i bi l i ty , t i on s hi p s w i th w i th r e s pe c t of the d e s i ra bi l i ty f i ve the a c c e n ted s p e akers a nd t he ext e n t s t a nd a rd spe ak e r s t hought the a c c e n t e d s p e akers t h e m on f i ve soc i al i ssues . speake r s we re t hought t o be i n beli e f s , a nd l owe r i n socia l 2 Spa ni s h - a c c e n t e d middle- c l a s s or m i dd l e a nd a lowe r Engl i s h The s t ud e n t s r a t ed t h e a nd speech men t s. a t ed d i f fere n c e b e t we e n m i ddle 2.2 It Phonet i c i s, h a nd , of more th i s is ( 1 970 ) the pos s i bl e wh ere con t e n t wa s a s p e ak e r . s t e reot y p e judge f avou r a b l y ev alu impo r t a n t to a c ce n t s d i s t i nguish betwe e n, o n t h e one wh i ch c a n ev i d e ntly be on cle a r i n f orma n t s f rom ma ny s t ud i e s , j ud g e d va rio u s con t e nt - to Br i t i s h ( re f l e c t i ng 1 09 the a c c e nt - Th a t for e x amp l e G i l e s s p e aki ng w i t h t h e a c c e n t i n qu e s tio n ) ( r e f l e c t i ng a t t i t ud e s e l i t he o t her h a nd , non- n a t i ve a c c e n t s s poke n by t h e s e grou p s . is The cla s s s p e ak e r s wa s mu c h s ma l l e r . a c c e n t s d i f f e r e nt l y for s t a t u s the group there for all me a s ur e s . cited u s i ng a c c e n t e d s p e ak e r s a s s timu l i a nd, to tha t s o l id a r i ty , a c ce n t ed s p e a k e r s a t ti t ud e s to e t h ni c m i nori t i e s , a t t i t ud e s s t a nd a rd a nd a l so made soc i al d i s t a n c e fea t ures o f no n - n a tive cours e , 2 i n t rodu ctio n for ea c h k i nd o f s p e akers on s t a t u s , c l as s The se to 8 0 u ndergr adu a tes w i t h Lower cla s s s t a nd a rd s p e ak e r s we r e l ower l e s s s i m i la r i n c re a s e d a c c e n ted n e s s . s p e ak e r s c h a ra c t e r i sti c s a n d tha n c l ass , i n t rodu c t i on s s u ch cl a s s to wh i ch t h e wou l d a gr e e wi t h p r e s e n ted r e cord i ng s o f l ow e r c l a s s re l a- i n a r a ng e o f r e l a t i on sh i p s . evalua t i o n s be c ame mor e n e g a t i ve w i th ( 1 980 ) soc i a l Res ul t s s howed t h a t S p a ni s h - a c c e n t e d l e s s des i ra b l e Ry a n a nd S e b a s t i a n to a c c e n a nd a nd fore i g n a t t i t ud e s to f or esthetic i t s el f ) . The ext e n t to wh i c h d i f f erent t o l e ra t e d w i l l I n o rd e r o f devi at i ons f r om n a t i ve s p e e c h a r e b e d i s c us s e d b e l ow . t o st udy n o n - na t i ve k i nds t h e a t t i t ud e s a c c e nt s we mu s t o f nat i ve Swe d i s h s p e a k e r s t o d e t e rm i n e wh a t t h e p h o ne t i c d i f f e r e n c es be twe e n n a t i v e a nd n o n - n a t i ve Swe d i s h a r e . Th e non-n a t i ve a c c e n t s c a n be fe a t u r e s o n the o n e h a nd , f e a t ures wh i ch of a a nd t h os e p a r t i c u l ar f e a t ure o f t h ought o f l a ng u a g e non-n a t i ve a o n the a s genera l are assoc i a t ed other . i s the i r me t h o d f o r me asur i ng n a t i ve deve l o p e d f l u e n cy i n t e rms o f r a t e a nd l a ck o f pause . He s i l e n c e o f gr e a t e r dura t i o n t h a n 0 . 2 s . age of paus e s i n r e a d i ng a nd free e n c es bus i n e s s had 5 1% n a t i ve s p e a k e rs w i t h 39% w i t h 56% p a us e . Th i s f u t ur e p l a ns . Joh a n s s o n wi l l ( 1 973 ) p e r f o rme d a n o n - n a t i ve a c c e n t s of the s p e a k e r s ' b e t ak e n up but n a t i ve phonemes we re a r r a n g e d a n d the s u b s t i t u t i o n or i g i n . f r e e s p e e ch , l es s e x p e r i p a u s e a nd t h e Br i t i sh A n o t h e r g e n e r a l f e a t ur e l i tt l e i s k n own n on i s var i a c omp a r i ng t h e use d a n a l ys i s o f ph o n o l og i c a l Swe d i sh . 1 10 - 9 s y s t ems Swe d i sh f r e q u e n cy o f m i s p r o n u n c i a t i o n , we r e l i st e d f or e a ch a c c ent c o n s i d e r e d we r e Ame r i c a n Eng l i sh , - of i n t h e d i s c uss i o n of t h e p r oj e c t' s i n o rd e r o f Th e n i n e a c c e n t s t h e r e wa s t o t he l a nguages w i t h t h a t o f types in sys t ema t i c s e gme nt a l i n Swe d i sh , as a c a l cu l at ed the percent c omp a r e d n a t i ve a c c e n t s a b o u t wh i c h re l a t i ve l y b i l i ty. "paus e " l earn e r s o f E ng l i sh w i th d i f f e p a u se c o l l e g e s t ud e n t s Leht o n e n a nd n on-na t i ve d e f i ned amo u n t s o f s t udy whe n t e x t s w e r e r e a d , un i ve rs ity s t u d e n t s f l u e n cy . s pe e ch a nd f o u nd t h a t l i t t l e d i f f e r e n c e b e t we e n F i n n i sh r e nt He w i t h s p e ak e r s Th e mos t o bv i o u s g e n e r a l accents ( 1981 ) f e a t ur e s o f Cze ch , D a n i sh , F i n n i sh , Cro a t i a n . Gre ek , Bannert (1 980 ) c u l t i es t h a t s p e a k e rs pronunc i a t i on o f ty p e s wh i ch of o c c u r f or to , f e atures o f l a nguage ) accent He a l s o p r e s e n t e d p r o n u n c i a t i o n e rr o r s ev e r a l n o n - n a t i ve in, o b s e rv e d a c c e nt s re c o g n i z e d . expe c t ed f e a t ur e s , s p e ak e r s It is w i t h o u t s t i mu l a t i o n ana l y s i s o f s i gna t ure of f r om a and prosod i c wh i ch t h e p a r t i c u l a r is wh a t L a bov feature s upon i n t h e a c c e nt part of in a wh i ch c o n c a s e tha t i n que s t i on , the g e nu i ne s i mu l a t e Eng l i sh i n Swe d i sh . showe d t ha t a n Eng l i sh a c c e nt a c c ent . I t wa s S u b s e qu e nt f e a t ur e s o r f ound a c c e nt s d i d n o t h a v e t h e Swe d i sh t o n a l r e t ro f l e x r e du c e d . - but n a t i ve s p eak e r' s p i l o t e xp e r i me n t , c ou l d a n Eng l i sh a c c e n t a ( 1966 ) a ny p a r t i c u l ar n o n - na t i ve a c c e nt. Swe d i sh / r / was p r o n o u n c e d as the i r i nd e e d a ny o t h e r t h e s i mula t e d a c c e n t s g ave t h e s t e re o ty p e s i mu l a t e d Eng l i sh l a ngua g e s , not ne c e s s ar i l y the a nd are ( 1 986 ) , of ( or " s i gn a t ur e " f o c u s e d. C u n n i ngh am-A ndersso n i n the i n o rd e r o f s e gme n t a l f e a t ur es by Th i s o f a nd o p i n i o ns a b o u t n a t i ve d i f f e re n t i n Swe d i sh f e a t ur e s a r e a l ways p r e s e nt they a r e mode l is have accents n o n-nat i ve re f erred t o a s the s o c i a l l y d i agnost i c these l a ngu a g e s i n t e l l i g i b i l i ty . i n c l ud e d s c i ou s a t t e n t i o n a nd p ro s od i c d i f f i d i f f erent are thos e st e r e otype is a nd S e rb o 25 s p e ake rs o f f e a t ures of a nd P o r t ugu e s e of p r o n u n c i a t i o n e r r o r ty p e s hyp o thes i s e d e f f e c t o n 2 . 3 Ste reotype P o l i sh , s t ud i e d s e gme nt a l Swed i sh . a nd a n a r r a ng eme n t P a ra l l e l Hu ng a r i a n , 1 1 1 - a nd uns t r e s s e d tha t the accent , vowe l s we r e S i m i l ar l y , F l e g e a nd Hammond Eng l i sh who we r e f am i l i a r wi th s i mu l a t e a S p a n i sh a c c e nt . a s s o c i ated wi th alt e r e d ( 1 9 82 ) / p , t,k / f ound f i na l t h at t h e sy l l a b l e th i s to be t e nta t i ve d i s t i n c t i ve pho n e t i c d i ff er e n c e s betwe e n l a ng u a g e t i on o f ( 1 966) , mo s t c a s e t h a t n at i ve on l y those s p e ak e rs about f e a t ure s wh i c h ( pr ov i d e d t h e y k n ow wh i ch a c c e n t n a t i ve s p e ak e r s h e a r a n a c c e n t t h ey may n o t be ab l e t o wh i c h f e a t ur e s of n o n - n a t i ve i n n a t i ve s p e a k e r s ? e s t a b l i sh (a) f e a t ur e they it fe a t ur e in p e r c e i ve Is in it is). (b) g e nu i n e n o n C o nv e r s e l y , whe n t o ask o r n e g a t i ve p o s i t i ve a c c e n t wh i l e t h ey p a r t i c u l a r n o n - na t i ve b u t p os i t i ve a t t i t ud e s it a t t i t udes wh e t h e r nat i ve s p e ak Th i s wou l d p r o v i de i s p o ss i b l e f e a t ur e to the same c o nv i n c i ng c orr o b o t o se para t e a t t i f r om a t t i t ud e s t o t h e e t hn i c m i nor i t i e s who s p e a k them . - I t may t h e n i t t h e n me a n i ng f u l a n o n - n a t i ve to a one a c cent n o n - n a t i ve a c c e n t s a c r e a t e p o s i t i ve t o a n o t h e r a nd i n another a c c ent . to i s p e r c e p Fur t h e r e x p e r i me n t s a r e r e q u i r e d t o r a t i o n f o r t h e hy p oth e s i s t h a t t ud e s t o it . a c c e nts n e g a t i ve a t t i t ude s o c c urs l a ng u a g e they a s s o c i a t e w i t h t h a t wh e t h e r n at i ve s p e ak e r s c a n have have a n e ga t i ve a t t i t ud e whe n a r e d e t e c t ab l e f or wh i c h they h a v e n o s t e r e o ty p e , i de n t i fy a t t i tude s e r s have e v i de n c e that n o n l a ng u a g e . l i s t e n i ng a c c e nt one l e n g t h e n i ng w e r e l a ng u a g e s p e r c e p t i on o f s o c i a l l y a c c e p t e d st ateme n t s n a t i ve h e a r to l o ng VOT va l u e s l e arners . A c c ord i ng t o La bov be t h e Eng l i sh c ou l d i n th e d i re c t i o n o f S p a n i sh a nd S p a n i s h - a c c e n t e d Eng l i sh . F l e g e a nd Hammo nd take by n a t i ve s p e ak e r s o f S p a n i sh-a c c e n t e d Th ey and f ou n d t h a t 1 1 2 - 2 . 4 To l e ran c e o f A t t i t udes t o n o n - n at i ve e t hn i c m i n o r i ty mu l i ) f or e i g n a c c e n t s g r av i ty o f l e ar n e r s i n te rms e r ro r. o f the He ( 1 978 ) f o und c i at i on , a nd t h a t e rr o r s mor e s e r i ous than th ose D j ord j ev i c ( 1971 ) in made Yugo s l av judges , q uo t e d by ( W i ng s t e d t c e n t or w i th have a i ncrease ( 1 9 7 8) a nd S chulma n 1 9 8 4 ) I t s e ems l i k e l y t h a t f or s p e akers o f s e n t e n c e s we re exper i e nced to l erant o f of te a c h e r s of D i m i t r i j e v i c a nd f ound t h a t nat i ve e r r o rs in for prosod i c errors . i n genera l p r o nu n c i a t i o n s . of Engl i sh c ou l d b e e rr o r s - t ha n 1 1 3 - non words l e ss l i ke l y J oh a n s s o n mor e s ev e r e l y fam i l i a r w i th n o n wa s h owe v e r t ak e n t o of is Eng l i sh t e nd t o judge t h e The o p p os i t e Th i s 1 9 84 ) c ompr e h e n s i o n o f Eng l i s h who we r e n o t j udges a r e more t o l e r a nt s e gme n t a l c o n s i de r e d t o be f am i l i a r i ty w i th a p a r t i c u l ar d ev i a nt E ng l i sh . judg e d . vowel p r o n u n ( G a s s a nd V a r o n i s o r w i th n o n - n a t i ve s p e e ch t h a t Swe d i sh n a t i ve va r i e t i e s in or a p a r t i c u l a r k i nd or ac i n g e n e r al Swe d i sh a c c e n t e d p r o n u n c i a t i o n than nat ive ma de by Swe d i sh s e gme nt a l o n n a t i ve s p e a k e r s t o l erance f ound of p e r c e i ve d f am i l i a r i ty w i t h a p a r t i c u l a r n o n-n a t i v e p o s i t i ve e f f e c t n o n - nat i ve a c c e n t to t o l erant t h e o p p o s i te wa s t ru e f or e i g n a c c e nt s n a t i ve s p e e c h. J o h a ns s o n ) the Yug os l av s t ud e n t s o f Eng l i sh t h a n we r e Rese a r c h has shown t h a t s p e ak e r by the of pronunc i at i ons o f t ha n e r r o r s i n se n t e n c e s a n d t e x t s . (as wh i l e e rr o r s i s o l a t e d words w e r e s p e a k e rs o f E ng l i s h we r e more pronunc i at i on e x am i n e d t h a t d ev i a n t c o n s o n a nts we r e d e eme d mo re s e r i o u s as st i p e r c e i ve d grav i ty ( a nd g r ammat i ca l ) Engl i sh . f r om a t t i t ud e s t o u s i ng a c c e n t e d s p e ak e r s J oha n s s o n p h o n o l og i cal of ( a s d i s t i n ct groups e l i c i t ed c a n be e l i c i t e d pronunc i a t i on accents t ru e whe n i nd i c a t e tha t mor e p r o s od i c e rr o r s and e x p e r i e n c e d judges, l e ss but s i nc e the exper i e nced c a n be expe c ted to s p e ak e r s do , j udge s w e r e non - n a t i ve ho l d d i f f e r e n t no c l e a r con c l us i on s Ta rd i f a nd d ' A ng l e j a n ( 1 981 ) , s p e ak e r s a nd a s s u c h l i ngu i s t i c va l u e s t h a n n a t i ve c a n b e d rawn. quoted in Ei senste i n ( 1983) found t h a t s e n t e n c e s s poke n by a s peak e r w i t h a s t rong e r Eng l i sh a c c e nt i n h i s pronun c i a t i o n o f Fre nch we r e s p e ak e r s t o b e l e s s a c c e p t a b l e, cons i d e r e d by l e s s g r amma t i c a l t i ng th a n the s ame s e nt e n c e s s p o k e n by a a n at i ve Fr e n c h a nd more i rr i t a non - n a t i ve s p e ak e r w i t h l e s s p r o nou n c e d E ng l i sh a c c e n t . Bre n n a n and Bre n n a n o f E ng l i sh a t t r i bu t e ( 1981a , b) demo n s t r a t e d t h a t l owe r s t a t u s to t hos e s pe ak e r s among 3 2 Me x i c a n-Ame r i c a ns a nd 4 A n g l o-Ame r i c a n s who we r e t i ve j udge s to have s t ronger non-n a t i ve used the de f i n i t ion o f a s a ma n n e r o f the s t a nd a rd . s y nt a c t i c a l ( Th i s a nd de f i n i t ion pronunc i a t i ons a s s o c i a t e d a s we l l accents in E ng l i sh . f rom s t a ndard l ex i ca l i n c l ud e s w i th s o c i a l a s non - n a t i v e p r o n u n c i a t i o n s . ) l op e d a n judge d by other n a a c ce n t g i ve n by G i l e s a nd Powes l a nd pron u n c i a t i o n d i f f e r e n t the grammat i c a l , n at i ve s p e ak e r s i n t e r e s t i ng 2 -s t e p me t h o d o f l e ve l s Th ey ( 1 975 ) s p e e ch w i t h cons i s t e n t w i th t h e r e fore n o n -s t a ndard a nd g eogra ph i c a l Bre nnan d i a l e ct s a nd Bre n n a n d e v e e s t i ma t i ng a c c e n t e d n e s s a s fo l l ows: 1. Thr e e l i ngu i s t s r e l i ab l y j udge t h e a c c e nt e d n e s s o f s pe ak e r s us i ng a s e ve n - p o i n t P r e l i m i na ry A c c e n t e d n e s s s c a l e. 2. Judg e s prov i d e d a s s e s sme nt s ty p e s o f a c c e n t e d p r o n u n c i a t i o n. A c c e nt edne s s of Th i s wa s I nd e x. - 1 1 4 the - r e l a t i ve freque n cy o f then c a l cu l ated 18 t o an Bre n n a n a n d Bre n n a n f ound t h a t the A c c e nt edne s s I nd e x wa s a s i g n i f i c a n t p r e d i c t o r o f the Pr e l i m i n a ry A c c e nt e d n e s s a c c e nt e d ne s s l i s t e n e rs a s s e ssme n t s tra i ned the Rya n a nd l i s t e ne r s i nv e s t i g a t i o n Daws o n ( 1 975 ) Eng l i s h s p e e c h l i sh b i l i ngu a l s . Th e a c c e nt e d n e s s c o rre l a t e d wi th the a l ong by n a i ve n a t i ve the s h owe d tha t c a n g i ve re l i ab l e a c c e nt edne s s o f h i gh l y s ame ma t e r i a l a nd o f i n a p r e v i ou s e x p e r i me n t . An e ar l i e r e x p e r i me n t al Bre n na n , of sca l e , s ame l i n e s by n o n- l i ngu i s t i c a l l y j udgeme n t s o f the degree o f s amp l e s p r o d u c e d by S p a n i s h -Eng judg eme n t s o c c urre n c e of we r e found spec i f i c to b e f e a t ur e s o f a c c e nt e d p r o n u n c i at i o n . The work o f Ry a n , Corso 2 . 1 ( 1 978 ) C a rr a n z a a n d Mo f f i e a nd Ry a n a nd S e ba s t i a n ( 1 9 77 ) , ( 1 980 ) S e ba s t i a n , d i s cussed a l s o d e a l s w i th the r e l a t i o n sh i p b e twe e n d e gr e e n e s s a nd n a t i ve s p e ak e r a t t i t ud e s t o n o n- n a t i ve of Ry a n a nd i n s e c t i on a c c e n t ed s p e e ch . 2 . 5 At t i tude me a s ur eme n t I n mo s t of the s t ud i e s d i s c u s s e d i n t h e p r ev i o u s s e c t i o n n a t i ve or s t a nd a rd s p e ak e r s were r e q u i r e d t o eva l u a t e n o n- n a t i ve o r n o n s t a nd a rd s p e ak e rs. simi l ar i n mo s t e x am i n i ng t h e o f Ed i nburgh s p e ak e r s ' Th e me t h o d o f e l i c i t i n g a t t i t ud e stud i e s o f att i t ude s eva l ua t i o n in of t o d i f fe r e n t wh a t s h e re f e rr e d t o pre s e n t e d th i s k i nd . as s t a f f a n d s t ud e nt s a t k i nd s of of For e x amp l e , j udg eme n t s Roma i ne (1 980 ) t h e U n i ve r s i ty Ed i nburgh a c c e n t s ( s t a ndard s t a n d a rd a nd n o n - s t a nd a rd s p e e c h ) Osgood ' s O s g o od, S uc i sema nt i c a nd Ta n n e n b a um - 1 15 - d i fferent i a l , ( 1 957 ) . is use d f i rst Th i s c o n s i s t e d of 20 s e ve n-po i n t s c a l e s who s e e nd po i nt s re pre s e nt ed oppos i t e chara c t er i s t i c s . The judg e s we r e r e qu i r e d to i nd i c a t e t h e d e gr e e a s s o c i a t e e a c h s p e ak e r w i t h e a c h c h a r a c t e r i s t i c . i s t i c s we r e a s pe c t s o f (a) c h o s e n to e l i c i t the The s e chara c t er c o n c e r n i ng t h e f o l l owing l i s t e ne r s a t t i t ud e t o the s p e ak e r : tra i ts: P e r s o na l i ty h o ne s t / d i s h o ne s t ; n e ga t i ve / p o s i t i ve ; f r i e nd l y / u n f r i e nd l y; t ough / g e n t l e e du c a t e d / u n e d u c a t e d ; chara c t er i s t i c s : St atus (b) e v a l ua t i o n s t o wh i ch t h ey l ower c l ass /upper c l ass (c) Abs o l u t e char a c t e r i s t i c s: (d) Spe e ch or c od e f orma l / i n f orma l ; (e) I d e n t i ty : urban/rura l ; a y o u ng / o l d m i l d/bro a d ; p o l i t e / r ough ; S c ott i sh/not c l e a r/unc l e a r ; s l o p py / c a r e f u l S c o t t i sh ; l i k e a n Ed i n l i k e a n Ed i nburgh p e r s o n S o l i d a r i ty a nd s p e ak / n o t ch a r a c t e r i s t i c s: i n c or r e c t / c or re c t ; burgh p e r s o n / no t (f) m a s c u l i ne / f em i n i ne ; l i ngu i s t i c s e c ur i t y : l i k e t h e way I t h i nk f r i e nd o f m i n e / as l i k e t h e way I wou l d if I s p e ak; I wou l d n' t wa n t l i k e t o s p e ak / n o t l i ke as the way I i f he or she t h i nk I c ou l d b e t o know h i m o r h e r a t al l ; l i ke l i ke t h e way I wou l d t o s peak . Edwa rds ( 1 977 ) used on l y n i ne o p p o s e d p a i rs c over t hr e e b r o a d c a t e g o r i e s o f (a) C ompe t e n c e: (b) Soc i a l cha r a c t e r i s t i c s t o p e r s o n a l i ty : i nd u s t r i ou s / l a zy ; amb i t i o u s / unamb i t i ou s ; of i n t e l l i g e n t / u n i n t e l l i g e nt ; i mp or t a n t / u n i m p or t a n t a t t r a c t i ve ne s s: f r i e nd l y / u n f r i e nd l y ; s e ns e o f h umo u r / humo ur l e s s (c) Pers ona l i n t egr i ty : t ru s two r t hy / u ntrus two r t hy ; - 1 1 6 - h e l pfu l / unhe l p f u l ; g e n e r o u s / no t g e ne r o u s . Edwa rds p o i n t e d by Bourh i s , ( 1 975 ) out that t h e s e t r a i ts have b e e n p r ev i o u s l y u s ed G i l e s a n d Taj f e l ( 1 9 73 ) a nd L ambe r t , G i l es a nd P i c a rd . Tu c k e r a nd Lamb e r t r e n t c a t e gor i e s of ( 1969 ), n a t i ve o f Ame r i c a n Eng l i sh , i n a s t udy o f t h e a t t i t u d e s o f d i f f e Ame r i c a n s t owa rd s p e rf o rme d a p r e l i m i n a ry groups s i mi l a r t o the groups e s t ab l i sh wh i ch they of use a s u s e d by t h e grou p s d e s c r i be " f r i e nd l i ne s s" a nd " s u c c e s s". categor i es i nves t i g a t i o n p l a nned t o ad j e c t i ve s we r e s i n ce d i f f e r e n t d i f f e r e n t var i et i e s Th i s s e ems j ud g e s a r e ( us i ng j udge s ) to concerned t o a l og i c a l step l ike l y t o u s e d i f f erent adje ct i ve s t o de n o t e t h e s ame c h a r a c t e r i s t i c s . W i l l i ams mode l ( 1 9 74 ) a nd C a r r a nza a nd Rya n i nvo l v i ng d i f f erent Lamb e r t two-d i me n s i o n s : f r om s u c c e s s a nd ( 1 9 6 9» . s t a t us Ident i f i c a t i on o f a s a range , ty p i c a l l y a s s ume An ers are ab l e t o va r i e t i e s ( mo r e too i n Tu ck e r a nd s c a l i ng pro c e dure s r a t h e r than a p o i nt. to i n t h e p r e v i o us s e c t i o n l a ng u a g e va r i e t i e s a r e a v e ry a t t i t ude s t o t h e g r o u p s who s p e ak t h e i mp or t a n t ( no t a c c e nt or i g i n t h a t a t t i t ud e s good re f l e c t i on o f j udgme n t a l a nd s o l i da r i ty f r i e nd l i n e s s d i s c u s s ed Th e eva l u a t i ve r e a c t i o n s t ud i e s d i s c u s s e d va r i e ty . d i s c us s a W i l l i ams s ug g es t s t h e u s e o f wh i c h s p e c i f y a t t i t ud e 2 . 6 ( 1 975 ) part or of t h i s a s s ump t i o n l e s s pre c i s e l y ) i nvo l ve d . - 1 17 - is that the i d e n t i fy t h e l a ng u a g e l i s t e n l a ng u a g e P r e v i o us s t ud i es h ave i d e n t i fy a r e c o rd e d 1 978 , Ry a n 1 983 , s c r i bes f i ve n a t i ve (e .g. vo i c e F l ege w i th as 1 9 84 a nd s t a nd a rd or non-n a t i ve ; Roma n c e, if S l av i c ) ; wh a t r e g i o n a l /so c i a l u p p e r c l ass S p a i n , O t h e r s t ud i e s , or l a ng u a g e var i e ty , a t t i t udes a c c e n ts e rs Bre n n a n a nd to we r e l i st e n e rs . i de n t i fy P r i gg e de n o n n o n- s t a ndard , l a n g u a g e g r o u p? l a ng u a g e ? ; (e .g. be t t e r ab l e P a l me r ( 1973 ) to V i e t n ame s e a c c e n t s , a nd Ry a n howe v e r , ( 1 9 75 ) e x am i n e d Ame r i c a n Euro p e a n a nd n a t i ve l ooked a t n a t i ve a t t i t udes i n c l ud i ng Fre n c h , G i l es Eng l i sh , North Ame r i c a n, f ound t ha t i de n t i fy a c c e n t f ound t h a t S p a n i sh -a c c e n t e d Carranza Eastern ( 1 970 ) I nd i a n and G e rma n . have d e a l t f or S p a n i sh - a c c e n ted Eng l i sh . I t a l i an , a nd G i l es me t h o d o l ogy , b u t usu a l l y o n l y r e g a rd i ng e . g. ( 1 9 74 ) , a nd f or e i g n a c c e n t s I ta l i an , bas i c of l a ng u a g e d o e s h e u s e ? s p e e ch , ( 1 981 ) Norweg i a n, i n Eng l i sh , to r e g i o n a l We l sh , a nd Ha n l ey if ( 1 983 ) l owe r c l ass Me x i c a n-Ame r i c a n ) . one Mu l a c , i de nt i f i ca t i on that t o n o n -n a t i ve Bre n n a n i n the wh i ch Ry a n Joh a nsso n 1 981 ) . o f t e n us i ng ma t ch e d gu i se n o n - n a t i ve e . g. i s t h e s p e a k e r' s n a t i ve di a l ect of a b i l i ty t o S c ov e l n o n - n a t i ve , wha t ( c. f . n o n -s t a nd a rd ; w i t h a t t i t udes a nd n a t i ve s p e a k e rs ' n o n - n a t i ve l e ve l s o f s p e c i f i city s p e a k e rs : na t i ve dea l t o l de r l i s t e n o r i g i ns t h a n y o u ng e r n a t i ve Eng l i sh s p e ak e rs c ou l d but n o t Ara b i c , a l t h o ugh they r a t e d the L i ng a l a or a c c e n t g r o u ps d i f f e re n t l y. In t h e p r e s e nt t h e way proje c t we a r e , amo ng o t h e r t h i ngs , i n wh i c h n a t i ve Swe des ' f l e c t the i r un l i k e l y t h a t a t t i t udes to a t t i t udes t o i mm i g r a n t g r o u ps. n a t i ve s p e ak e rs o f - 1 18 Swe d i sh - c o n c e r n e d w i th f o r e i g n a c c e n t s re I n t u i t i ve l y , have t h e i t se ems s ame a t t i t ud e e i ther t o al l m i n or i t i e s i n Swe d e n . to n o n - n a t i ve n o n- n a t i ve var i e t i e s o f C l ear l y , m i n o r i ty groups no n-na t i ve s p e e c h d e p e nd t o a ne s s w i t h of the non-s t a nd ard , s p e ak e r tha n or can I f he te l l is he tha t the Eng l i sh . i nve s t i g a t i o n o f t h e to ( re g a rd i ng t h e ext e nt it is W i th th i s i de n t i fy . the s p e e ch is re a c t d i f f erent l y t o the or a is a m i dd l e-c l a s s s p e ak e r o f p roje c t ' s att i t ude s i mp or t a n t to f i rs t to k n ow a c c e n t s d i f f e r e n t g r o u p s o f n a t i ve s p e ak e r s to pr e c i s e Ame r i c a n f u ndame n t a l t o wh i ch nat i ve Swe d e ' s a t t i t u d e s the i r o bv i o u s l y s p e ak e r s i d e nt i fy t he or i g i n s p e ak e r f r om Cy p r u s , f o r e i g n a c c e nts r e f l e c t groups ) ab l e t o may o f G r e ek que s t i on , n a t i ve can on l y hear that n a t i ve s p e ak e r I n our of c o ns i d e r a b l e e x t e nt o n t h e n o n- na t i ve , i f he a t t i t ud e s e thn i c a s e l i c i t e d u s i ng r e c ord i n g s o f w i t h t h e n a t i ve sp e ak e r t h e n o n - na t i ve a c c e n t. Swe d i sh or t o a l l of e t h n i c m i n or i ty wh i ch n o n - na t i ve Swe d i s h are a b l e i n f o rma t i o n we c a n p r o c e e d to a n swe r t h e q u e s t i o n o f wh e t h e r Swe d e s have d i f f e r e n t a t t i t ud e s to d i f f e rent n o n-nat i ve a c c e nt s. 3. Exper i me n t s 3. 1 I d e n t i f i c a t i o n o f f o r e i gn a c c e n t s Ma t e r i a l : d a t a ba s e 1 988 ) a nd f i rs t h a l f Sun " read A t a p e wa s ( Engs t ra nd pre p a r e d 1 987 , o t h e r r e c orded ( about in s p e a k e r s o f Swe d i sh , n a t i ve S c a n i a n Eng s t r a nd ma t e r i a l , 2 5 s e c o nd s ) Swe d i s h one d i a l ect , f r om t h e mas t e r t a p e s o f by of a nd c o n t a i n i ng r e c ord i ngs of the the t ext " Th e Nor t h W i nd a nd t h e 29 d i f f erent n o n - n a t i ve a nd 5 n a t i ve once w i th a c c e nt . - IRIS C u n n i ngh am-A n d e r s s o n o f wh om r e ad t h e t e x t a nd the 1 19 - twi c e , once in his a s i mu l a t e d Ma l a r va l l ey The adva n t a g e s o f u s i ng hav i ng no n-n a t i ve gat i on o f may o c c ur ma t er i a l c aused a nd s p e e ch; i ng s al l a s p e ak e r s , is r e c o rd i ngs o f i n the t h e r e f ore p r e s uma b l y t h a t t h e s p e ak e r s ' of the i r a c c e nt s u i t ab l y n e u t r al cho s e n d i f f i cu l t in Swe d i sh l i ngu i s t i c t a p e var i e d i n the a s other to wa s e x c l u d e d, c o n s i de r a b l y in the the l ess l a rge s t 2 4 6 8 10 12 14 16 18 20 22 24 26 28 30 32 34 - f ar a l t h o ugh a ccents , ( a nd a nd as poss i b l e s p e ak e r s the a c cents Th e s p eak e r s we r e J ama i c a n woma n Swe d i sh-Turk i sh man J a p a ne s e ma n Swa h i l i ma n K o r e a n ma n Kurd i sh woma n Swe d i s h - Ame r i c a n m a n G r e e k ma n N . Swe d i sh ma n P o l i s h ma n S c a n i a n ma n F i n n i s h - Swe d i sh ma n B e nga l i ma n Ma l ar va l l e y woma n Ru s s i a n wom a n G e rma n ma n C z e c h ma n 1 20 - i mm i - f am i l i a r f o l l ow i n g o r d e r : F i nn i sh ma n S p a n i s h ma n Po l i s h woma n Fre n c h ma n Turk i sh woma n L a p p i s h ma n S p a n i sh woma n Fre n ch woma n S p a n i sh woma n G r e e k woma n P e r s i a n woma n A r a b i c ma n S e r b o c r o a t i a n woma n K i nya rwa nda woma n T i gr i ny a ma n S c a n i a n- a c c e n t e d Ma l a r Yoruba m a n Norwe g i a n woma n I R I S data- I n a dd i t i o n , s t r e ng t h . the ma ny r e c ord- SPEAKERS : 1 3 5 7 9 11 13 15 17 19 21 23 25 27 29 31 33 35 i n f o rma n t s' i n the i d e n t i fy ) as i n c o n- vers i ons o f f rom t h e s u ch t h a t b a ckground. f l u e n cy wa s v e ry p o o r we r e e rr o r s wh i ch t h e s ame o r t h ograph i c f rom t h e ma t e r i a l I R I S d a t a ba s e more i nve s t i - a c c e nt w i thout the t h e n o n- n a t i ve gra n t g r o u ps we re r e p r e s e n t ed a s we l l r e c o rd e d f or e i g n f ocused f i rs t l y , s o t h e i r u t t e r a n c e s a r e ma x i - the t ext r e a d i l y ava i l a b l e c o nt a i n ed on the a l l ows a s e c o nd l y , Th e n o n - na t i ve s p e ak e r s we r e wh o s e c ons i derab l e : sy n t a c t i c a nd s ema n t i c s p e e c h; t h i rd l y , f ou r t h l y , ty p i c a l are l o ng e n ough t o p r ov i de a g o o d s amp l e o f text w e r e ba s e . b y the u s e d by ma l l y c omp a r a b l e; t e nt , f e a t ures o f i n n o n- n a t i ve is t ext s p e a k e rs r e ad a t e xt t h e pho ne t i c c omp l i c a t i o n s such a I n f o rma n t s : c a p a b i l i t i e s r e g ard i ng t h e have d i f f erent to of a nd t h e amo u n t l i ve s, f am i l i a r i t y A f t e r s e ve ra l thre e g r o u p s n o n - n a t i ve i nd i v i du a l n a t i ve to opt imi se 1 8 -y e a r o l d gymn a s i um s t ud e n t s in Upps a l a part s of T i erp (a (a l a rge t h e wor l d ) sma l l s t ude n t s; a nd c o n t r i bu t e a l t hough o t h e r t h e s t i mu l us t a p e, s p e ak e r s o f Swe d i sh we r e u s e d a s p a ne l s : (a) i n f o rma n t i n forma n t ' s e x p e r i e n c e . p i l o t e x p e r i me n t s of a c c e n t s. of t own w i th ma ny - two c l a s s e s of l i s t e ne r n o n -huma n i t i e s s ub j e c t s i mm i gr a n t s f r om d i f f e r e n t 1 7 a nd 1 9 s t ud e n t s; a nd t own 5 0 km f r om Up p s a l a w i t h f ew i mm i grant s ) (c) 15 t e a chers non d e n s i ty o f the c o nt a c t he h a s w i th t h e m mus t w i t h d i f f erent f a c t or s may a f f e c t t h e i de nt i f i c a t i on of a r e a wh e r e t h e the in l a ngu a g e s other of s p e ak e r s n a t i ve s p e ak e r s of l i k e l y that s e ems it I n t u i t i ve l y, accents . na t i ve c a t egor i e s di f f e r e n t Obv i o u s l y, of Swe d i sh a s a s e c o nd (b) - 29 l a nguage in U p p s a l a. Fre e r e s po n s e i de nt i f i c a t i ons . she e t wa s For ha l f d e ve l o p e d about e a ch of Two me thod: the me t h o d s U p p s a l a s t ude n t s the whe r e we r e u s e d a n a n swe r 3 5 vo i c e s t h e l i s t e ne r s h e a rd : (a) I s t h e speak e r a n a t i ve s p e a k e r o f Swe d i sh ? (b) If (c) t h e a nswer t o (a) of the Wh a t is the wa s "n o". t h em t o a n swe r (b), t h e s t ude n t s we r e g i ve n a ma p o f the numb e rs r e p r e s e n t i ng "Fa r E a s t ", Ea s t e r n E u ro p e" e t c . l a nguage we r e a sk e d wh i c h part f rom ? s p e ak e r ' s mo t h e r t o ng u e ? wor l d w i th c o nd ( 17 ) , t h e s ame t hr e e q u e s t i o n s we r e a s k e d wor l d d o e s t h e s p e ak e r c ome To he l p to e l i c i t a c c e n t the - 1 2 1 are a s such a s "No r t h Ame r i c a". Th e t e a ch e r s o f Swe d i s h a s a s e s ame - que s t i ons, except that f or t h em , (b ) wa s changed to : If l a ng u a g e g r o u p d o e s the wh i ch p a r t wor l d d o e s of the p r ov i d e d f or t h e s p e ak e r s i ng in not a l ways a n swe r q ue s t i o n ers, the wa s the degre e of (b) . l a nguage t h i s e x p e r i me n t , S c a nd i n av i a C C. E E . Eur o p e , G A fr i ca & S . Ame r i c a , is S . Eur o p e c orre c t l y It i de n t i f i e d . N . Ame ri c a , D N . A fr i c a , M i dd l e E a s t F F a r Ea s t, S . E. A s i a H I nd i a n s Ub- c o n t i ne n t of e a ch N . Eur o p e , a bove e i gh t zones . pup i l s Swah i l i N o rwe g i a n , the t h e s p e ak e r s ' seen in Greek , a nd Yoruba a c c e n t s i de nt i f i ed or i g i n , Turk i sh , i n t h e a p pr o p r i a t e l a ng u a g e s t h a n we r e f i gure 3 . - 1 22 - the At l e a s t 40% G e rma n , wh i l e Kurd i sh , t e a ch e r s we r e more o f t e n a b l e n a t i ve As can t h e F i n n i s h, B r i t i sh Eng l i s h, a nd Rus s i a n s p e a k e rs' z o n e o f Mor e ove r, i de n t i f i e d c o n s i de r a b l y be t t e r t h a n t h e p u p i l s a t t e a c h e r s a nd the c o u l d a l s o p l a c e S p a n i sh , Aus t r a l i a l i s t e ne r group of n a t i ve Swe d i sh , For t h e pur B o r i g i n o f t h e s p e a k e rs . Arab i c the t e a c h i n t o e i gh t z o n e s : i d e n t i fy i ng t h e g e o gr a ph i c a l p i sh , and d i d i s howe v e r p os s i b l e t o c omp a r e i n t e rms o f t h e the t e a c h e r s we r e both the (c) , Th i s w a s a l s o t ru e o f the wor l d wa s d i v i d e d s h ows how ma ny p e r c e n t be seen , or s t u d e n t s we r e d i s a p p o i nt s poken was S ov i e t e a ch s p e ak e r to , a c c ur a cy w i th wh i ch t h e p a r t o f t h e wor l d wh e r e t h e A 1 wh i ch l a ng u a g e b e l o n g t o ? N o ma p w a s f r om t h e l e s s e r e x t e nt. s p e a k e r' s n a t i ve pos e s o f " n o" , l a ngu a g e b e l o n g the s e n s e t h a t t h ey s e l d om a nswe r e d q u e s t i o n t h ough t o a Ta b l e n a t i ve (a) t e a chers . The r e s u l t s o bt a i ne d R e s u l t s: t h e a n swe r t o part the Fre n ch , t e a ch e rs P o l i sh , of Lap Cze ch , t h e wo r l d . t o c orre c t l y s p e c i fy p u p i l s, as wi l l be Table ZONE A A A A A A A A A B B B C C C C C C C D D D D E E E E E F F G G G G H ACCENT F i nn i sh L a p p i sh Swe d i sh 1 ( b i ! i ng Sw-Tur ) Swe d i sh 2 (No r t h e r n ) Swe d i sh 3 ( S c an i an ) Swe d i sh 4 ( F i nn i sh�Swe d i s h ) Swe d i sh 5 ( Up p l and ) Swe d i sh 6 ( S c an i a n -Up p l a nd ) Norwe g i a n Br i t i sh E ng l i sh Ame r i c a n E ng l i sh G e rman S p an i sh 1 ( S p a i n ) S p a n i sh 2 ( Co l umb i a ) S p a n i sh 3 ( Uruguay ) French 1 Fre n c h 2 Greek 1 Greek 2 Turk i sh Kurd i sh Pers i an Arab i c Po l i sh 1 P o l i sh 2 S e rbo-C r o at Rus s i a n Cze ch J a p ane s e Kor e a n Swah i 1 i K i ny a rwa nda T i gr i ny a Yoruba B e nga l i Gu i d e d r e s po n s e meth o d : r ema i n i ng groups of t e d r e a d i ng s , a nd a f i gure , poss i b l e w i th F i gur e s t o the f orm s i m i l ar n a t i ve l angua g e they we re e n c ou r a g e d t o they we r e unsure . The a nd 2 9 to the the p l ace - 1 23 - ma t r i x f or cross Th ey the s h own s p e ak e r s i n the l i st e d the t op . The i n t h e b o x c o rre s p o nd i ng e a ch s p e ak e r c ro s s e s b ox e s w i t h T i erp pup i l s . s p e ak e r acr o s s they be l i e ved for the nat i v e Swe d i s h a nd a c c e n - l a nguage s i n s t ru c t e d t o p l a c e a n a t i ve a nd 2 s h ow the r e s u l ts s ame t a p e w i th 35 down o n e s i d e a nd t h e numb e r o f pu p i l s we re 1 1 9 U p p s a l a p up i l s we r e te sted u s i ng th e TEACHERS 100 83 100 100 100 100 100 100 100 100 33 83 50 50 100 50 100 67 17 50 83 a 67 83 17 33 83 83 17 a 67 a 33 67 a STUDENTS 94 59 94 1 00 94 100 100 94 94 94 18 53 12 24 29 18 71 18 29 12 12 a 47 6 6 a 47 6 6 a a a 6 17 a so l id i n mo r e f r ame s t o have a nd th a n o n e box if a r e t h o s e wh i ch we re crossed by a t l e a s t 40% o f a r e wr i t t e n i n bo l d t y p e f a c e . t h e numb e r of pu p i l s f or e a ch s p e ak e r . re p r e s e n t pup i l s , s um o f the Th e r e s u l t s n a t i ve ( i n T i erp , t a i ned . e a ch p o s s i b l e s i m i l ar t o l east Swe d i sh , of the As c a n b e s e e n 40% Swe d i s h wa s m i s t ak e n of the f i gu r e of the a c c e nt ; Be nga l i , a nd Swah i l i , pup i l s , s e ems but a t t r i bu t e d Eng l i sh p r i or t o t h e i r a rr i va l wa s s l i gh t l y mor e o f t e n mark e d not i c e that l i ke l y t h a t Swah i l i i n c orre c t F i n l a nd Br i t i sh a nd ( i n the to case o f t h e s p e ak e r s i n Swe d e n ) ; K i ny a rwa nda a l so got qu i te a by Yo rub a t h e Up p s a l a f ew vo t e s - it " A f r i can " ; o n e o f t h e P o l i sh s p e ak e r s wa s m i s t ak e n f or S e rb o c r o a t i n Uppsa l a group ) ; a c orre c t the i de nt i f i ed a nd t h e s p e ak e r wa s s i mp l y i de n t i i de n t i f i e d a s ( ag a i n , the as free i de n t i fy the pup i l s : Be nga l i l e arned c ou l d f o l l ow i ng a c c e n t s we r e h e a rd a s Ame r i c a n o r Br i t i sh be the t h e wro ng a n swe r s ob Swah i l i hav i ng r e p r e s e nt a l s o c o rr e c t l y l e a s t 4 0% o f can p r e s e n t e d to t h e pu p i l s f i g ur e s , f or a F i n n i sh th i s f i gure L a p p i sh a n d F i nn i sh a c c e n t s a n ana lys i s i d e n t i f i c a t i o n s w e r e ma d e by a t the l a nguage . s p e ak e rs wa s f rom t h e l a ng u a g e t ho s e o bt a i ne d u s i ng t h e Norweg i a n , me t h o d p e rm i t s n a t i ve a t t h e t o p a nd b o t t om o f t h e r i gh t one of the Greek Th i s t h e b o x e s r e pr e s e n t wh i c h t h e s p e ak e rs we r e we re v e ry At Th e c o r r e c t a n swe r s in f o r e a ch p o s s i b l e n a t i ve r e s p o n s e me thod . f i ed . in a nd t h e numb e r s a t res pons e s Fr e n ch , numb e r s pup i l s . The numb e r s who c ro s s e d Th e order the i de n t i f i c a t i on o f other P o l i sh s p e ak e r Ame r i c a n Eng l i sh Norwe g i a n s p e ak e r , s p e ak e r n a t i ve t h e i r n a t i ve the Uppsa l a s t ud e n t s t o a n d b y t h e T i e r p p up i l s t o have 1 26 - l a ng u a g e ; ( wh o s p ok e Swe d i sh w i t h o n l y a wa s be l i e ved by - l a nguage a nd the A r a b i c s p e ak e r we r e i n Up p s a l a a s hav i ng Turk i s h a s s l i ght Ame r i c a n a c c e n t ) be a t h e s p e ak e r ' s a ny o f a l a rge number o f n a t i ve but l a ng u a ge s , l a ng u a g e . Th e Ame r i c a n pup i l s T i erp s p e ak e r ' s uvu l a r ARA as not E ng l i s h , as h i s i nt e r p r e t e d t h e K i ny a rwa nda a s i gn that she wa s a n a t i ve s p e ak e r o f Fr e n ch . I n order t o fac i l i tate c omp a r i s o n b e twe e n t h e r e s u l t s a ch i e v e d by t h e t e a c h e r s o f Swe d i s h a s a s e c o n d T i e r p a nd the Up p s a l a , f o rm o f other f o rm o f f i gure s : that ers on l y in the p u p i l s Th e r e s u l t s o f we r e r e a l l y the qu i t e l a ng u a g e we re a b l e ly ( as s h own in or the of or not poor . to f i gure l itt l e two al l . in f o l l ow i ng p o i n t s 3) , t i me s , l a ng u a g e Th e pu p i l s t e a c h e r s we r e wh i l e Th i s a l s o t h e t e a ch c a s e s the t e a ch e r s f am i l i e s . i n the of prev i ous s e c t i on Swe d i sh a numb e r o f a s a s e c o nd a c c e n t s c orre c t ord i n a ry n a t i ve s p e ak e r s , a c c u r a cy . i nexper i e n c e c ou l d on ly Wh i l e t h i s may b e d u e t o the ( t h e i r a ve r a g e re s u l t s . c l asses 12 7 of - a s repre i d e n t i fy 5 - 6 age wa s r e a s o n t o be l i eve that o l d e r - i n the l a ngu age s Wh i l e t e a ch e r s mu c h be t t e r new r e s u l t s w i th a ny h e l p the I n s ome o f gymn a s i um p up i l s years ) , is at s evera l i d e n t i fy q u i t e a nd on a l s o pre s e nt e d the pup i l s . s quare s ; t h e wor l d o r f or e i gn i n f o rma n t s youth f orme d in p o s s i b l e a n swe r s t hems e l ve s . a c c e n t s w i t h a ny d e gr e e o f wou l d a ch i eve Th e n o t g i ve n e x p e r i me n t s d i s c u s s e d s e n t e d by the g r ou p s there are 3. a s we r e mor e c o u l d gue s s gue s s e d o n c e , I de n t i f i c a t i o n o f l a nguage s one t h i nk o f on ly spec i f i ed parts 3. 2 f i gu r e Th e t e a ch e r s we r e p o s s i b l e n a t i ve r e qu i re d t o in resu l ts s t ud e n t s i n m i nd whe n c omp ar i ng t h e t e a c h e r s ' had o n l y t o c r o s s me a n t t e a ch e r s ' a c o n f u s i o n ma t r i x shou l d b e borne the the l a ng u a g e a nd t h e Another gymn a s i um 1 8- 1 9 i n f o rma n t s e x p e r i me n t wa s p e r s t ud e n t s i n Up p s a l a ( p a ra l l e l to see c l as s e s to the other c l as s e s , if t h ey c ou l d r e c og n i s e f or e i gn a c c e n t s . i ng s of l i Th e l a ng u a g e s wa s A tape North used a s a nd the free a ga i n h e r e . response des c r i bed t hr e e ph i c a l zone i n f orma n t s , ma n t s , but not c orr e c t l y . The ( see f rom t h e 1) 2) 20 re ad- ( t h e who l e t e x t ) in 25 n ame d c or r e c t l y , s h own of in i de n t i f i ed the f rom t h e me thod t ab l e n ame d c o r re c t l y by a t by a t a nd who ( ad a p t e d l a ng u a g e s p l a c e d p e r c e nt ag e s Th e r e c o r d e d ma t e r i a l pu p i l s are l a ng u a g e s pre v i ous s e c t i on ) 3) 2 . The l east i n t h e c or r e c t g e og r a l east 40% of l a ngua g e s not the i n f o r- i de n t i f i e d i de n t i f i c a t i o n s a r e s h own b e s i de l a ng u a g e n ame . Tab l e 2 LANGUAGE REGOGNI SED Norwe g i a n 1 0 0% Eng l i sh ( Br i t i sh ) 1 0 0 % Eng l i s h ( Ame r i c a n ) 1 0 0% G e rma n ( No r t h ) 1 0 0% G e rma n ( Au s t r i a n ) 9 0% F i nn i sh 1 0 0% Ru s s i a n 8 5% Fr e n ch 1 0 0% Swe d i s h 1 0 0% D a n i sh 1 0 0% S p a n i s h ( A rge n t i n a ) 7 0% S p a n i sh ( S p a i n ) 8 0 % As speakers ' gu i d e d r e s p o n s e me t h o d s we r e u s e d f r e e re s p o n s e me t h o d c o l umns r e p r e s e n t : the the the Sun il i n the prev i ous s e c t i on ) 40% o f d i f f erent I R I S databas e . and The r e s u l t s l a ng u a g e s u s i ng t h e 29 l a ngu a g e s a ny b e t t e r t h a n s t i mu l us ma t e r i a l . wa s mo s t l y o bt a i n e d f r om t h e Both f or e i gn c o n t a i n i ng W i nd a n d s u p p o s e d l y s i m i l ar ) , can b e s e e n , n ame d c o rr e c t l y t h o s e s h own pup i l s , it ZONE RECOGN I SED A r a b i c 6 5% Swah i l i 6 5 % Cze ch 80% G r e ek 7 5 % P o l i s h 8 5% J a p a ne s e 8 5% Kore a n 9 0% P o r t ugue s e 7 5% Du t c h 8 0% P e r s i a n ma l e 5 5% i s on l y the by i n f i gure the s e mo s t f am i l i a r s t ude n t s . 4 a ch i e v e d by the u s i ng a gu i d e d r e s po n s e a n swe r we r e r e q u i r e d t o c r o s s t h e l a ng u a g e ( s ) - 1 29 - C omp a r e NOT REGOGN I SED P e r s i a n f ema l e L a p p i sh Turk i sh B e ng a l i K i ny a rwa nda T i gr i ny a Yoruba l a nguag e s t h a t the s e r e s u l t s w i th r e ma i n i ng c l a s s f o rm , wh e r e they are of 19 the i n f orma n t s t h o ugh t t h e y we r e FlIiIIE 4. CalfUlilil Mtril f.. 1 .... idlntificlti. Speaker no: Finnish Lappish SlIIedish Norlllegian Danish Bri ti sh English Alerican English - - Polish I'-[ I�i - Kinyarlllanda Voruba Speaker no: - - - - �, r-- l-I- o- - I-- I- - -- r-- -- I - - - :; 1 4 1 2 1 3 - - - - r--' - - - - - - 1 - -- -- -- - =. . - 1 1 - 1-- r--r-- ' - - - ==F== = . - 130 - - 2 1 1 1 1 1 2 2 4 -3 4222 164 7 6 6 It 3 El4 E 1 1 732 6 -- 1 � r�]1 1 1 :; -1 55 31 44 1 3 2 1 E��F- ;- 7 1 - 4 3 ;]9]7 2 ;12 16 20 25 22 4 24 23 28 _ .- 6 -: -- == -- - - - I- -- 1 1 1 3 - I 1 -1 2 8 11 21 1 -26 5 13 -7 9 29 3 17 14 27 15 18 10 6 19 7 -- - 3 1 - r--'1- - - - - roo- - I - - -- - I- - - 1 3 10 2 -- 3 5 2 10 2 6 32 ff 7 1- - 2 4 2 F-[l2 6 2 33 1 573 12 1 1 1 2 : - -2 - Korean Tigrinha l� 6 :; 3 - - Persian SlIIahili - == t2-- 1 Turkish Bengali - -�E: = 7 ��19 17 Czech Japanese - - Russian Arabic - - -- - Portuguese Greek -= - == =- Dutch French 19 _1111 ,..i ll. 8 11 21 1 26 5 13 7 9 29 3 17 14 27 15 18 10 6 19 2 12 16 20 25 22 4 24 23 28 19 17 1 31 19 19 1 Ii9 -F1 Gerlln Spanish - -- = = F = r �r h e a r i ng . guage s Th e s e to s t ude n t s s t ud e n t s , cho o s e f r om c o rr e c t l y Ame r i c a n Eng l i sh , Fre n c h , i de n t i f i e d Au s t r i a n a nd Swe d i s h , D a n i sh , but J a p a n e s e a nd K o re a n . f our A f r i c a n l a ng u a g e s a l so a c h i eved poor re su l t s gu i de d i n f o rma n t s , it At l east o n l y No rwe g i a n , North G e rma n , l a n 40% o f Br i t i s h a nd F i nn i sh , Rus s i a n , as La p p i sh , Dut c h , Czech , The Gre ek , T i gr i ny a wa s f or i de n t i f i e d by n ame , the A f r i c a n is mor e Po l i sh , the on l y one o f g r o u p u s i ng t h e re s po n s e s i nc e resu l t s . poss i b l e a nd P e n i n s u l a r S p a n i sh , t o be Afr i can . not l ist of Arge n t i ne A l t h ough i de n t i f i ed a s the we r e g i ve n a a ch i eve d b e t t e r other group , tha t wh o al l the t h em w e r e f r e e r e s p o n s e me thod l a ng u a g e s . s u i t ab l e i s e v i de n t l y d i f f i c u l t of the Th i s s ug g e s t s f or u s e w i t h n a i ve f or t h em t o t h i nk o f l a ng u a g e name s . 3 . 3 A c c e nt e d n e s s me a s ureme n t As wa s d i s cus s e d a c c e n t s may we l l t o n o n - n a t i ve a c c e n t e dne s s in be s e c t i on a s i gn i f i cant s p e ak e r s . i s be s t a c c e n t s t r e ng t h cus sed It on ma ny n a t ur e o f ( 1981a , b) a c c e nt e dne s s a s j ud g e d w i th a c c e nt edne s s a c c e nt e d ne s s n o n - na t i ve n a t i ve a t t i t u d e s h oweve r a l t og e t h e r c l e ar h ow An i mp r e s s i o n of a f ew mark e d d ev i a t i o n s l es s a s e r i ous c e rt a i n f rom t h e dev i a t i ons . f or e i g n a c c e n t w i l l be Th e d i s i n s e c t i on 4 . A sma l l - s c a l e e x p e r i me n t b a s e d a n d Bre n n a n pred i c tor o f i s not may b e ba s e d o n a the pre c i s e further t h e s t r e ng t h o f t o be me a s ur e d . n a t i ve p r o n u n c i a t i o n o r q ue s t i o n o f 2 .4, c a n be as wa s on the des i gned t o by l i ngu i s t s j ud g e d by n a i ve work r e p o r t e d test wi l l - 1 3 1 - t h e hy p o t he s i s c or r e l a t e i n f o rma n t s . q u a n t i f i e d q u i t e s i mp l y . i n Bre n n a n If F i rs t l y , that p o s i t i ve l y th i s i s so , a n a t t emp t wa s ma d e t o e l i c i t The a c c e n t e dn e s s i n f orma n t s we r e a g r o u p o f no n-huma n i t i e s p o rt e d ) s ub j e c t s a c c ent i n f o rma n t s ma l e s p e ak e r s to who Th e me n t s o f the part i c i pated i n ear l y j ud g e d t h e a c c e n t e d n e s s o f i mme d i a t e l y b e f or e They he ard t h e r e c ord i n g s o f j udge a s e c o nd t i me Me thod : i n f orma n t s . 1 8 -ye ar o l d gymn a s i um p u p i l s o f a l so t h e y h a d h e a rd c a t i o n e x p e r i me n t . we r e 28 f rom n a i ve ( u nre - i d e nt i f i c a t i o n e x p e r i me nt s . Th e Ma t e r i a l : j udgeme n t s i n f orma n t s n i ne o f i n the the i de nt i f i - t h e s p e ak e r s they i n a new o r d e r . we re g i ve n a n a n swe r s h e e t w i t h s t a t e - f o l l ow i ng k i nd : SPEAKER 1 The s p e aker has no 1 f ore i gn a c c e nt Th e i n f o rma n t s we r e c or r e s p o nd e d t o f or e i gn 2 3 i n s t ru c t e d t h e d e gr e e me n t s a t e a c h e nd o f no The s p e aker has a 5 t o c i rc l e s u ch of t h e s p e ak e r s h e a rd ave r a g e of s h own as 1 are s h own t h e a c c e nt e d n e s s wh i c h be s t wou l d in the n a t i ve s p e ak e r w i t h wh i l e a be i n t a b l e 3 b e l ow . f or e a ch o f t h e 9 s p e akers Tab l e 3 . Spe akers Japanese Po l i sh Ame r i c a n Swe d e 1 Swe de 2 Greek Swah i l i Kore a n Lapp i sh t h e n umb e r that a be graded ma x i ma l l y s t rong n o n - n a t i ve a c c e n t r e s u l t s o bt a i n e d a r e s t rong f ore i gn a c ce n t . t o wh i c h t h e y a gr e e d w i t h the s t a t e - the s c a l e , a c c e n t wou l d 4 s p e ak e r w i t h a graded as Th e n a t i ve l e f tmo s t j udgeme n t s g i ve n b y a l l c o l umn , 28 . - 1 32 - - 28 na i ve The l a nguag e s the i n f o rma n t s i n t h e s e c o nd c o l umn . Eva l ua t i on o f a c c ent edne s s Av grades 4 . 25 3 57 1 .5 1 . 07 1 3.5 4 . 32 2 . 93 2 . 86 5 . i n f orma n t s The n e x t f r om a a, b) s t e p wa s t o e l i c i t pane l of of f or the expert pane l of cy o f j udge i ns t a n c e s o f e a c h s p e ak e r . t o n a i ve p o s e s o f the curre nt The a s wa s u s e d f or t h e of dents A pane l e i gh t i n pho ne t i c s a n d the s e 25 20 15 10 5 0 r r r r L.. 10 f i ve m i nu t e f r e q u e nI nd e x i nd e x e s For f or c or r e - t h e pur- l i ngu i s t s ( t h e s ame ma t e r i a l e x p e r i me n t s ) r e c orded j ud g e d b y t h e ( t e a ch e r s a nd l i s tened by t h e n a i ve i n- r e s e ar c h s t u- i nd e p e nd e n t l y t o t h e n o n - n a t i ve p r o n u n c i a t i o n s t h e y h e a rd o n F i gure 5 Re l a t i o n s h i p be twe e n na i ve a nd e xpe rt a c c e n t e d n e s s No . o f d ev i a t i o n s ( expert j udg e s ) have a a set of A c c e n t e d ne s s t h e s t i mu l u s t e x t l i ngu i s t i c s ) re c o rd i ngs a nd mark e d a l l to Th e i r a s i mp l e r me thod wa s u s e d : i de nt i f i c a t i on of a a c c e nt ed n e s s . n i n e ma l e s p e ak e r s wh o s e a c c e n t e d n e s s wa s f orma n t s . in ( 1981 a nd e x p e r t wa s e a ch o f o c c ur r e d f ound t h a t e s t i ma t i o n s ha l f o f not o bt a i n e d a n e x p e r i me n t , f i rs t n a i ve By ave r a g i ng t h e r e l a t i ve Bre n n a n a nd Bre n n a n s p o n d e d we l l be twe e n or e rr o r typ e s type they Bre n n a n a nd Br e n n a n me a s ur eme n t whe t h e r the a c c e n t e d s p e e c h . e a ch e rror Me thod : 18 me a s ur e o f a c c e n t e d ne s s f o r S p a n i sh-a c c e n t e d Eng l i sh . a c c e n t edne s s l i ngu i s t s s amp l e o f c omp a r i s o n a c c e nt edness me thod poss i b l e c omp a r a b l e t r a i n e d pho n e t i c i a ns . p e r f orme d a s i m i l ar me a s ureme n t s a j udgeme n t s * * * * * * 1 A c c e nt edne s s grade s - 1 33 - ( n a i ve i n f orma n t s ) a c o py o f we r e the t e xt . c ou n t e d . Errors n o t e d by a t j udgeme n t s we r e t h e n c omp a r e d w i t h c omp a r i s o n i s s h own c a n be s e e n , re c e i ve d n a i ve 10 be e n in c ou l d b e o b t a i n e d . t h e n a i ve a c c e n t e d n e s s f i gure f or one de v i a t i o n s f rom the there is a j Udgeme n t s . i nt e r p r e t e d a s fact that f or e i gn by t h e t h e Ma l a r va l l ey d i a l e c t the two me t h o d s wh i ch i nd i c a t e s t h a t n a i ve n a t i ve j udge s n a i ve j udg e s . be d i s c u s s e d a nd 2 . 9 f rom t h e the expert o f a c c e nt ednes s l arg e r but a l mo s t i n p a r t b e due North Swe d i sh as h i s t h e o t h e r n o n - n a t i ve s p e ak e r s h a d t h e i r t arge t . i t m i ght Th i s may be Apart j udgeme n t poss i b l e , a c c e nt e dne s s . to f r om t h i s e x c e p c o r r e s p o nd we l l , s i mp l y re l y on Th e s e r e s u l t s a r e c omp a Fur t h e r e x p e r i me n t s u s i ng n a i ve e s t i ma t i o n s o f wi l l Th i s L a p p i sh s p e ak e r i n f o rma nt s , t i b l e w i t h t h o s e o bt a i n e d b y B r e n n a n a nd B r e n n a n a j udgeme n t s . ( t he s p e ak e r h a d of as j udgeme n t s o f p e r f orme d w i t h c ou n t s The L a p p i sh s p e ak e r ' s a c c e n t h a s o bv i ous l y wh i l e mo s t t i on , ana l ys i s Th e s e c l e a r c o rre s p o nd a n c e b e twe e n t h e L a p p i sh t a rg e t d i a l e c t , resu l t , expert a s a n a t i ve d i a l e c t by the e x p e r t to t h e l i ngu i s t s 5. except j udg e s ) a nd n a i ve the The K o r e a n s p e ak e r wa s e x c l ud e d f rom t h e s i n c e o n l y two e x p e r t As l e a s t t wo o f n umb e r o f a c c e n t s . ( 1981 ) . a c c e n t e d n e s s we r e Th e r e s u l t s o b t a i n e d i n s e c t i on 3 . 5 . 3 . 4 A t t i t ud e me a s ur eme n t A s wa s d i s c us s e d t i c a t t i t ud e s in s e c t i on 2 . 5 , make u s e ad j e c t i ve s r e p re s e nt i ng I n f orma n t s a r e ma ny i nve s t i ga t i o n s of l i ngu i s o f a 5 - 7 p o i n t s c a l e who s e e nd p o i n t s a r e d i f ferent persona l requ i red t o - " gr a d e " 1 34 - and other qua l i t i e s . t h e p e r s o na l i t i e s beh i nd the vo i c e s t h ey h e a r a l o ng a numbe r o f t h e s e gra d e s a r e then i nt e rpre ted at t i tudes t o the groups In i nd i c a t i o n o f t h e var i e t i e s our a t t emp t to m i n or i t i e s a nd Th e wa s that to that test the a t t i t ud e s of hy p o t h e s e s to (a) c omp a r a b l e t h e wor l d ( or t h e e th n i c Me thod : t h e i r a t t i t ud e s t o i nt e n t i o n n a t i ve f ore i gn f r om s t r o ng e r a c c e n t s e l i c i t tude s t o of the a e x p e r i me n t s p e ak e r s h a v e d i f f e s p e akers d i f f erent e t hn i c l e s s p o s i t i ve f r om d i f f e r e n t groups ) , a nd (b) n a t i ve s p e ak e r a t t i f or e i g n s p e ak e r s . A var i a t i o n o n the "mat ched s e c t i o n 2 . 5 wa s u s e d t o t e s t f i ve o f me n t s to a c c e n t s w i t h wh i ch t h e s e m i n or i t i e s s p e ak Swe d i sh , p i l o t e x p e r i me n t wa s p e r f o rmed . part s or i n f o rma n t s ' i nve s t i g a t e t h e r e l a t i o n s h i p b e t we e n n a t i ve a t t i t ud e s t o e t h n i c f or e i g n rent s p oke n The s e t h e s p e ak e r s a r e a s s ume d o r k n own t o b e l o ng t o . s p e ak e r s ' the l a ng u a g e as an a d j e c t i ve s c a l e s . t h e ma l e s p e ak e r s pr e s e n t e d i n f o rma nt s , 31 part i c i pa t ed e ar l i er ) , the gu i s e " me t h o d d i s c u s s e d f i rs t hy p o t he s i s . f r om t h e a c c e n t Re c o rd i ngs o f i de nt i f i c a t i on exper i i n s e c t i o n 3 . 1 we r e u s e d f or t h i s e x p e r i me n t . gymna s i um s t ud e n t s i n an ac cent we r e g i ve n f r om T i e r p ( mo s t i de n t i f i c a t i o n e x p e r i me n t i n f o rma t i o n about the Th e o f wh om h a d a f ew mo n t h s s p e ak e r s ' a l t h o ugh i n mo s t cases resu l ts of a c c e nt i d e n t i f i c a t i o n e x p e r i me n t s we r e s t ud i e d c l ose l y at s a f e l y made th i s po i nt about p r e s e n t e d tw i c e to see wh i ch i n f orma t i o n wa s mo t h e r t o ngu e s , the th i s in fa l se e a ch s p e ak e r ' s o r i g i n . t o the i n f orma n t s , r e n t mo t h e r t o ngue . - 1 35 - e a ch fa l se . s t a t eme n t s One o f t i me a s the The cou l d be s p e ak e r s wa s h a v i ng a d i f f e Th e s p e ak e r s we r e : 1 2 3 4 5 6 . . . . . . The s p e ak e r s e x p e r i me n t s W i nd a nd we r e r e ad t h e ( t he i n s t ru c t e d be e n g i ve n s ame t e x t f i rs t h a l f o f the S u n i l ) . Th e c o nt a i n e d pa i r s o f po i nt s c a l e as f i l l i ng in s t a t eme n t s a b o u t the Th e r e we r e t o rs i n f o rma n t 3 1 2 s t a t eme n t p a i r s such as wor t h i n e s s , tic 2 1 f a c t o rs , educa t i on , suc cess , and to fi l l t h e s ame In in . no case e a c h p o i n t we r e p r o b a b i l i ty ) more l i Th e N o r t h d i d the The The they had a n swer f orm to al l , Ta l a r e n ar I Ag u t b i l d a d ( Th e s p e ak e r i s n o t we l l - e du c a t e d ) c ove r i ng persona l i ty f a c - we a l t h , t h e vo i c e k i nd n e s s , a s pure l y f o r e i g n a c c e nt , t ru s t - l i ngu i s - c omp r eh e n s i b i l i ty , is , a n d how i rr i t a t - is . i n f o rma n t s f i nd t h e r e c og n i s e t h a t the f orms d i f f i c u l t l as t vo i c e wa s g r a d e s a s s i gn e d t o e a ch s p e ak e r f or stat i st i ca l ly u s i ng j udg e d t o b e s i g n i f i c a n t l y f r i e nd l y a nd s i g n i f i c a n t l y - f orms and i nd i c a t e t h e d e g r e e i nt e l l i g e n c e a s we l l ana l ys e d Kore a n s p e ak e r wa s of e.g. : in h ow p l e a s a n t f i rst . c ou l d f r i e nd l i ne s s , Ne i t h e r d i d they as the i de n t i f i c a t i o n t h e s p e ak e r s e p a r a t e d by a 5 - 5 i ng t h e s p e ak e r ' s p r o n u n c i a t i o n Re s u l t s : the a n swe r 4 such a s degree o f c l e ar n e s s o f s p e e ch , Kurd Swe d e G e rma n Kurd Swe d e G e rma n a c c e nt s p e ak e r o n c e . wh i c h h e agre e d w i t h t h e s t a t eme n t s Ta l a r e n a r hogu t b i l d a d ( Th e s p e ak e r i s we l l -edu c a t e d ) i n the a a a a a a t h e Swe d i s h v e r s i o n h e a rd t h e s u ch t h a t wa s wa s wa s wa s wa s wa s he he he he he he i n f o rma n t s h e a rd e a ch s p e ak e r tw i c e , to s t art a f t er they to l d to l d tol d to l d to l d to ld we re we r e we r e we r e we r e we r e i n f orma n t s i n f o rma n t s i n f orma n t s i n f orma n t s i n f o rma n t s i n f orma n t s Kor e a n Swe d i s h ( Sk A n e ) Arab i c T i gr i nya Swe d i sh ( N orth ) Kor e a n 1 36 - the t -t e s t . ( at the 5% l ess Th e l eve l success fu l of whe n be l i ev e d t o be a Kurd t h a n whe n b e l i ev e d t o be d i f f e r e n c e s be twe e n t h e t w o gu i s e s we r e n o t a G e rma n . Other s i gn i f i c ant . Th e e f f e c t o f d i f f e r e n t a c c e n t s t r e ng t h wa s mu c h mor e marke d , t h ough may o f it c o urs e be the o t h e r d i f f e r e n c e s b e twe e n t h e d i f f erent the s p e ak e r s i n f o rma n t s t o ngue s ) i ng way s we re to b e we r e Kurd s i nd i c a t e s s p e ak e r s , i nvo l ve d . j ud g e d t o be ( -NS- c a s e that ( w i th i n f o rma n t s r e a c t e d t o s i nce Th e two i n th i s c a s e two s p e ak e r s b e l i e ved by Kor e a n a nd T i gr i ny a a s n a t i ve s i g n i f i c a n t l y d i f f e re n t t h a t t h e r e wa s al- i n the f o l l ow- no s i gn i f i c a n t d i f f e r e n c e be twe e n t h e g r o u p s ) : edu c a t i o n f r i e nd l i ne s s we a l t h k i nd n e s s t ru s two r t h i n e s s suc cess i nt e l l i ge n c e a c c e nt edne s s und e r s t a ndab l e c l a r i ty p l e a s a n t vo i c e i rr i t a t i o n Kore a n a s Kurd S i g n i f i c a n t l y mo r e S i gn i f i c a n t l y mor e S i gn i f i c a n t l y mor e S i g n i f i c a n t l y mor e S i g n i f i c a n t l y mo r e -NS S i g n i f i c a n t l y mor e T i gr i nya as Kurd S i gn i f i c a n t l y mor e S i g n i f i c a n t l y mo r e S i g n i f i c a n t l y mor e S i g n i f i c a n t l y mor e S i gn i f i c a n t l y mor e A s i m i l a r c omp a r i s o n wa s made b e twe e n t h e two s p e ak e r s be l i e ve d by t h e i n f orma n t s t o be G e rma n s ( n a t i ve s p e ak e r s o f K or e a n a nd Ara b i c ) . Kor e a n a s G e rma n G e rman edu c a t i o n f r i e nd l i n e s s we a l th k i ndne s s t r u s twort h i n e s s success i nt e l l i ge n c e a c c e nt edne s s unde r s t a nda b l e c l ar i ty p l e a s a n t vo i c e i rr i t a t i on Arab i c S i gn i f i c a n t l y mor e -NSS i gn i f i c a n t l y mo r e -NS -NSS i gn i f i c a n t l y mo r e -NSS i gn i f i c a n t l y mor e S i g n i f i c a n t l y mor e S i g n i f i c a n t l y mor e -NS-NS - - 13 7 - as Th e e f f e c t o f a c c ent s e ems S t ro ng e r a c c e n t s at c l e ar . l east as f ar a s An c e rn e d . s t r e ng t h a s a p r e d i c t or the o bv i o u s el icit Kore a n a nd i mp r oveme n t of a c c e nt ednes s on e l imi nat e the p r o b l ems s p e ak e r s a s s t i mu l i . asso c i ated Wh i l e t h e s e r e s u l t s a r e v e ry of the d i me n s i o ns a l o ng re l a t e d t o e a ch o t h e r . th i s , a nd t h e in a t t i t ud e s , s p e ak e r s a r e c o n- t h e e xam i n a t i o n a t t i t ud e s . w i th be d i s cu s s e d i n t e r e s t i ng , wh i c h f avo u r a b l e t o th i s e x p e r i me n t wou l d be n a t i ve s p e ak e r Th i s w i l l j udgeme n t s T i gr i ny a i mp l eme n t t h e ma t ch e d gu i s e t e ch n i q u e e f fect l ess o f va l u e the h av i ng two d i f f e r e n t f ur t h e r i n s e c t i on 4 . it is c l e a r t h a t s ome s i x s p e a k e r s we r e j udged a r e C orre l a t i o n t e s t s we r e p e r f o rme d to re l a t i o n s h i p edu c a t i o n f r i e nd l i ne s s we a l t h k i ndne s s trus twort h i ne s s success i n t e l l i ge n c e a c c e nt e d n e s s unders t a nd a b l e c l ar i ty p l e a s a n t vo i c e i rr i t a t i o n As can Wh i l e fr 0 is we that the i nve r s e ) ki tr x x x su in x x 0 x 0 ac un cl pI x 0 0 0 x x x x 0 x x 0 x 0 ir x x x 0 x x x 0 x x x x x 0 x x x x x seen , x 0 x x x 0 x x x x ma ny o f x x x 0 the va r i a b l e s a r e r e l a t e d t o e a c h o t h e r . c ome s a s n o s u r pr i s e t h a t d e g r e e t i ve l y c o rre l a t e d t o d e gr e e o f r e l a t e d t o c l a r i ty o f wa s ( x me a n s x x be it c h e ck f o l l ow i ng r e s u l t s we re o b t a i n e d : t h a t t h e q u a l i t i e s a r e p o s i t i ve l y c or r e l a t e d , ed x the Th i s wou l d S i gn i f i c a nt l y c orre l at ed var i ab l es a t 5% l eve l . Tab l e 4 of to j udge d t o be , of i rr i t a t i o n i s pos i - f or e i g n a c c e nt a nd n e ga t i ve l y c o r - s p e e ch a nd h ow p l e a s a n t t h e s p e ak e r ' s vo i c e t h e c o n n e c t i o n b e twe e n k i nd n e s s o r we a l t h and - 1 38 - x c l ar i ty o f s p e e ch is It l e s s o bv i o u s . is u n c l e a r how t i o n s h i p be twe e n t h e s e va r i a b l e s is It i n f orma n t s ' may be the case that the s p e ak e r s a r e mu c h l ess nua n c e d prev i ou s s i m i l a r work in ge s t , u s i ng f ewer b e twe e n d i me n s i o n s . f r i e nd l i ne s s , h a nd a n d e du c a t i o n , than the a nd tha t t h e i r a t t i t ud e field to k i nd n e s s b y W i l l i ams of " st atus " d a r i ty " ( 1 974 ) a nd persona l s i de a t t i t ud e s t o of the we a l t h , success , k i nd n e s s , wou l d s ug s ummed up n o c or r e l a t i o n on on the other i nd i c a t e the one ( except f or tha t , ( 1 975 ) , a t t i t ud e s as f ou n d t h e d i me ns i o n s i nt e l l i gence ) t r u s tworth i n e s s ) n a t i ve s p e ak e r t h e e th n i c is the d e s i g n a nd t ru s two r t h i n e s s Th i s wo u l d about c a n be there C a r r a n z a a nd Rya n ( e du c a t i o n , ( f r i e nd l i ne s s , e x p e r i me n t s p e ak e r s a nd s u c c e s s e d u c a t i o n and t ru s t wo r t h i n e s s . j udgeme n t s ( s e e s e c t i on 2 . 5 ) that a nd i nt erpre t e d . the the Not i c e we a l th t o be the re l a m i gh t a nd " so l i c ove r t h e ( r e f l e c t i ng the i r m i no r i t i e s wh o s p e ak w i t h t h e a c c e nt in que s t i o n ) . The s p e e c h c h a r a c t e r i s t i c s a c c e n t e d n e s s , s a n t ne s s of vo i c e , c l a r i ty c orre l a t e d to e a ch o t h e r , by a s i ng l e d i me n s i o n a c c e nt i tse l f ) . f r i e nd l i ne s s a nd k i ndne s s ; except rate suc cess a nd eva l u a t i o n a l i rr i t a t i o n we r e p l ea a l ways c a n a l s o be r e p re s e nt e d ( re f l e c t i ng n a t i ve s p e a k e r a t t i t ud e s t o t h e A c c e nt e d n e s s wa s c ompre h e n s i b i l i ty wa s ne s s , s p e e c h a nd s u ch t h a t t he s e every t h i ng e x c e p t we a l th a nd t o e v e ryth i ng of c ompr e h e n s i b i l i ty , c orre l a t e d t o p l e a s a nt success ; f r i e nd l i ne s s , Th i s 1 39 - c o rre l a t e d t o wa s k i nd n e s s s h ows that at s t a t us , c orre l a t e d a nd s u c c e s s ; e v e ryt h i ng e x c e p t d i me n s i o n s a r e r e q u i re d : - was i rr i t a t i o n we a l t h , c orre l a t e d t o i n t e l l i ge n c e . vo i c e e v e ry t h i ng e x c e p t f r i e nd l i l east 3 se pa s o l i d a r i ty a nd s p e e ch Fu r t h e r chara c t e r i s t i c s . e x p e r i me n t s are n e c e s s a ry a t th i s po i nt . 3 . 5 Re l a t i o nsh i p b e twe e n a c c e n t e d n e s s a nd a c c e n t At the s ame t i me a s t h ey t r i e d t o t h e s p e ak e r s reported 3.3, the to g r a d e the the n a i ve on a sca l e and 5 i n the in s e c t i on 3 . 1 , a l s o a sk e d way a s t h ey h e ard r e p re s e n t i ng i n f orma n t s t r i e d ma x i ma l l y i d e n t i fy me thod ( as s p e ak e r ' s s e c t i on 3 . 1 ) we r e v e ry of i nvo l ve d one or mor e t o ng u e . the i n s e c t i on f or e i g n a c c e n t a c c e nt . Th e s e s p e ak e r s u s i ng t h em p u t t i ng c r o s s e s l a ng u ag e s t h e y t h ought I n f o rma n t s a n a t t emp t Tog e t h e r t o a nswe r , w i th the fact that than one l a nguage th i s me a n s t h a t each no f ore i gn we r e p e rm i t t e d t o c r o s s mor e of i n t h e s ame u s i ng t h i s f r e e r e s p o n s e me t h o d a l s o d e s c r i b e d a l ways made u n c e rt a i n . f r om T i e rp we re s p e ak e r s r e p re s e n t i ng t h e or i g i ns mo t h e r o p p o s e d t o the the s t ro n g wh i ch c orre s p o nd i ng t o m i gh t be e a ch i d e n t i f i c a t i o n e x p e r i me n t s i n t h e e x p e r i me nt r e p o r t e d t h e gu i d e d re s p o n s e me t h o d , i n s qu a r e s mo t h e r t o ngu e s o f 2 9 gymn a s i um pu p i l s to 5 wi th 1 a to a c c e nt a c c ent ednes s o f i n f o rma n t s f rom 1 i d e n t i fy t h e i de nt i f i c at i o n s p e ak e r , l a ng u a g e in e v e n wh e n t h e y the i n f orma n t s as t h e n a t i ve t h e r e we r e a l a rge numb e r of wro ng a nswe r s . L o ok i ng a t we rs n o t f i gure 2, pr i n t e d in ove r t h e ma t r i x . i g nore a l l the l e ast 40% of 2) , we are is c l e ar t h a t t h e wro n g a nswe r s bo l d t y p e f a c e ) Wh a t d o e s c o rre c t the the a n swe r s i n f orma n t s l e ft mo s t two-t h i rd s it of t h e ma t r i x , - ans a r e n o t r a ndom l y d i s t r i bu t e d d i s t r i bu t i o n d e p e nd on? I f we a n d t h e o t h e r a n swe r s ma rk e d by a t ( i nd i c a t e d w i t h a nswe rs (al l c l ust ered by t h i c k i n the r e p r e s e n t i ng 1 40 - l i ne s i n f i gure b o t t om a n d r i gh t l a ng u ag e s other than those s p ok e n i n S c a nd i n av i a a r e gue s s e d mor e numb e r o f and No r t h e r n Eur o p e . o f t e n than f a c t ors , o thers . f o r e x amp l e t h a t Th i s may s p e ak e r s o f a r e be l i e v e d t o b e more nume r o u s t h a n o t h e r s , s p e ak Swe d i sh w i t h s t ro n g e r a c c e nt s S ome be l a ngu a g e s c aused certa i n or are by a l a ngua g e s expe c t ed to t h a n s p e ak e r s of other l a n guag e s . I n order t o t e s t the l at t er i de a , the l a nguag e s gu e s s e d a nd t h e a c c e n t e d ne s s ed . C o n s i d e r f o r e x amp l e 2 . F i rs t s ha l l c ome i g nore . the Next t h e r ow 29 a c c o rd i ng s p e ak e r 2 2 , ( gr a d e Th e ( F i n l a nd-Swe d i sh s p e ak e r ) s p e ak e r 1 . 6) , s p e ak e r 3 0 ( gr a d e 2 . 8) 7 x 2.4 Th i s + numb e r gue s s e s + 1 x 2 . 0 is f or F i nn i sh ) , f or s p e ak e r 1 1 who 29 wro ng are f or a l so i gn o r e d . s p e ak e r 16 1 . 6 + 2 x 2 . 8 d i v i ded t o g i ve by the ( gr a d e ( the tota l ave r a g e t h e o t h e r p o s s i b l e mo t h e r t o ngu e s . gue s s e d t o h a v e e a ch ma rk e d w i t h the f i rs t of l a ng u a g e a s 3 S i m i l ar al l l et t ers 1 gue s s for s p e ak e r 2 4 2 . 8) a nd 2 11 f or f or a nd 2 2 : numb e r o f wr o ng a c c e nt edne s s f o rmed f or a l l a c c e nt edne s s wh i c h we 2 x 2 . 8 = 33 . 2 f o r F i nn i sh , ave rage 1 , f i gure 2 g ue s s e s f or s p e akers wr o ng gu e s s e s the 2 .4 . 14 + f or Th e in had a grade pu p i l s a nd gue s s e s are a d d e d t o t h o s e 2 x then 2 s p e ak e r wa s e x am i n f or s peaker the s ame g r o u p o f 2 . 0) . ( gr a d e the c orre c t gue s s e s to 14 of r e p r e s e n t i ng F i n n i sh c ome 7 gu e s s e s 2 . 4 ac cent r e l a t i o n sh i p b e t we e n t h e of the c a l c u l a t i o n s we r e p e r F i gu r e 6 s h ows t h e var i o u s s p e ak e r s wh o we r e mo t h e r t o n g u e . of e a ch The x-ax i s l a ng u a g e s is ( t o be read ve r t i c a l l y ) . Swe d i s h a n d Norwe g i a n we r e n a t ur a l l y g u e s s e d re i gn not i c e ab l e . ac cent wa s bare l y - 1 4 1 - A l so , o n l y whe n the f o F i n n i s h a nd L a p p i sh we r e o n l y gue s s e d f or r e l a t i ve l y s l i gh t i n t e r e s t i ng s i n c e a s t h e Nord i c g r a ph i c a l wh i c h t o be v e ry it certa i n ly wh i ch are accents , is I t wou l d s e em t h a t a c c e nt s s l i gh t s t ronger a nd s t rong . e f f e c t here : are Swe d e n ; t h e y d o n o t be l o ng t o t h e s ame l a ng u a g e s , F i n n i sh a c c e n t f or e i g n a c c e n t s , are a s s umed on the to b e other f rom f a r t h e r a f i e l d . as w i t n e s s e d by the b a r s o n t h e r i gh t f i gu r e ge ogr a ph i c a l in zone s . f r om Swe d e n t h ey are 6 . If more or l e ss on s p ok e n . such that : of zone 2 i n c l ud e s E ng l i s h a nd G e rma n zone 3 i n c l ud e s S p a n i sh . f r om c ou n t r i e s n e ar a s s umed to be l e ng t h o f the a s s i gn e d t o h ow f a r away Swe d i s h a nd Norwe g i a n Fre n c h a nd G r e ek zone 4 i n c l ud e s P o l i s h . zone 5 i n c l ud e s Kurd i sh . Pers i an , zone 6 i n c l ud e s T i gr i ny a , zone 7 i n c l ud e s J a p a ne s e . S e rb o - C ro a t , Swah i l i . t o the ave r age Rus s i a n a nd C z e c h Turk i sh a n d A r a b i c K i nya rwa n d a a n d Yoruba Kor e a n a nd B e nga l i . that s i gn i f i c a n t l y p o s i t i ve l y the zone c or r e l a t e d a c c e ntedness o f numb e r o f t h e l a ng 1% l ev e l ( l es s than the wro n g gu e s s e s l a nguage w i t h r = O . 8 6 1 9 . s e ems t o be f o re i g n a c c e n t . g e o g r a ph i c a l a c c e nt i s a geo i de n t i f i a b l e but of i n c l ud e s F i n n i s h a nd L a p p i sh Th e p o s i t i o n a there t h e ba s i s 1 f or t h a t f or a are zone p r o b a b a l i ty ) poss i b l e l a ng u a g e s i n c l ud e s is f am i l y i n c r e a s i ng 0 is l a ng u a g e h a nd a r e zone a s i mp l e c o rre l a t i o n t e s t s h ows uage the not wh i ch he wi l l d i stance that if an i n f orma n t f a i l s t o re cogn i se p l a c e t h e s p e ak e r ' s mo t h e r t o ng u e at a f r om Swe d e n c o r r e s p o nd i ng t o h ow s t r o ng t h e is . - 1 42 - A n o t h e r p o s s i b l e way t o a c c ou n t a n swe r s in f i gu r e 2 i s to e x c l ud i ng i n f o rma n t s . s h own in all f i gure 7. As a n d Kurd i sh dom i n a t e . i mm i g r a n t ( wh i ch groups i s not s evera l in the we l l . and l a rge t o r e c og n i z e thes e t h e r e f or e b e i ng a b l e p o s s i b l e mo t h e r t o ngu e s f or is e a ch 40% o f t h e l a nguage is b e l i eve t he s e i n t h e Uppsa l a are a t h e r e f or e F i nn i sh . Br i t i sh a nd Ame r i c a n E ng l i sh . Th i s a s p e ak e r . l east i n f o rma nt s a nd e x p e r i me n t . n o t o f t e n wrong l y g u e s s e d . be i ng a b l e that case ) . f or l a n t h e b a r s r e p r e s e n t i ng Turk i sh part i cu l ar l y i n fact the t i me s N o rweg i a n . be be at o f wro ng t i me s e a ch t o ng u e o f by gu e s s e s c a n be s e e n . to numbe r o f ma rke d wro n g I t may d i s t r i bu t i o n the mo t h e r pos i t i ons t h e numb e r o f the l o ok a t t h e guage wa s wr ong l y g u e s s e d to b e Aga i n f or l i ke l y t o o c c ur Lapp i sh . Swe d i s h . Fre n c h a n d J a p a n e s e we r e probab l y due t o the accents i n f o rma n t s ( ex c e p t J a p a ne s e ) fairly t o e x c l ud e t h em f r om t h e r a n g e o f a s p e ak e r who s e a c c e nt t h e y do n o t r e c og n i z e . 4. P l a nned expe r i me n t s and future work 4 . 1 On t h e n a t ure o f Throughout f or e i gn a c c e n t s the d i s c us s i o n o f work d o n e by the i t h a s b e e n a s s umed t h a t u n c o n t rove rs i a l . the t e rm Let us cons i der Spe e ch pronoun c e d w i t h a " f o r e i gn j u s t wh a t n a t i ve s p e ak e r n o rm . s p e e ch t o d ev i a t e f r om * a c c e nt " th i s ask n o rm is w i thout then : Wh e n d o e s dev i a n t s p e e c h b e c ome - It 1 45 - f or e i g n ? is c l e a r a nd i s me a n t by t h i s t e rm . f or e i gn a c c e nt d ev i a t e s s ome u n s pe c i f i e d f i rs t q u e s t i o n we mu s t a u t h o r s a nd o t h e r s . i n s ome way f r om i s howe v e r p o s s i b l e s o u nd i ng f or e i g n . f or The Th i s may be a q u e s t i o n o f how d e v i a n t or two d ev i a n t f e a t ur e is f e a t ur e s added the the s p e e ch s ou n d o n l y s l i gh t l y odd , e ffect i s re l a t e d t o t h i s que s t i on . s t rong a c c e n t may f e a t ur e s , f o rm . Wh a t mak e s a Th e s e a c c e nt is l i ke l y used as the i n a p a r t i c u l ar t h e Swe d i sh gera ted l e ng t h grave numb e r o f d ev i a n t p re s e n t a c c e nt , in c o -o c c ur r e n c e in a mor e ( cf s e c t i on 2 . 3 ) . S t ere otype unas p i r a t e d I t may b e t h e i n a r a ndom of p a r t i c u l ar o f a part i cu l ar i nt i a l case that f ore i gn Cons i der f e a t ur e s c o n s o n a n t s a nd vowe l s , i n o t h e rw i s e wh i l e two o r t h r e e d o n o t o c cur " s i g n a t ur e " i n Swe d i s h . c ontra s t s f e a t ur e s o c c u r r i ng the l a ng u a g e a n d a l ve o l a r r o l l e d I r / . odd , f e a t ur e s f e a ture s that e x amp l e a F i n n i s h a c c e n t of of l a rge i mp re s s i o n o f a We ask pronunc i a t i on I t s e ems f e a t ure s to a Th e f or e i gn n e s s . f or e i g n a c c e n t s t ro ng e r ? dev i a n t ma nne r . due o r t o a sma l l e r numb e r e x agge r a t e d * be P e rh a p s o n e b u t whe n a t h i rd i s o n e o f u nm i s t ak e a b l e A c c e nt s t r e ng t h f or e i g n is . are st ops , f or l a ck exag ve l a r i s e d I I I a ny o n e o f t h e s e n a t i ve Swe d i s h wou l d s ound me r e l y t og e th e r i mme d i a t e l y g i ve an i mp re s s i o n o f a F i nn i sh a c c e nt . Th e q ue s t i o n h e r e * In Wh a t make s a sect i on is: part i cu l ar 2 . 3, two f or e i g n a c c e n t ? q ue s t i o n s wh i ch b e l o ng we r e r a i s e d : - 1 46 - i n t h i s d i s c us s i on * C a n n a t i ve s p e ak e r s have p o s i t i ve a t t i t u d e s t o o n e p e r c e i ve in a p a r t i c u l ar f or e i gn a c c e nt wh i l e t i ve a t t i t ud e t o a n o t h e r ? * f e a t ur e n e g a t i ve it whe n a t t i t ude s t o t h e s ame a t t i t ud e s o c curs f e a t ur e i n one 1 980 ) the t h a n o n e n o n- n a t i ve p l a us i b l e that nu n c i a t i o n o f l ar l y , if e x amp l e , in a n a t i ve is wh i l e a c c e nt f i nd s he is Th i s f e atures s p e e c h redu c t i o n s , s u ch a nd as i n more it s e e ms f i nd t h e r e t ro f l e x p r o n a t i ve s p e ak e r f i nd s vowe l u n a t t r a c t i ve / ne g a t i ve . " e s ko " ) l eads u s t o the f e a t ur e s o f f l ue n cy a c c e nt - s pe c i f i c /e/ to un l i ke l y t o p o or ( a s s h own I n t u i t i ve l y , a prothe t i c l i ke h ow i rr i t a t i ng a r e d i f f e r e n t ge nera l be s i nce o f E ng l i s h - a c c e n t e d Swe d i s h s ame to pronounc e d Arab i c - a c c e nt e d Swe d i sh . p o s i t i ve l i k e l y t o s h ow u p Swe d i sh . the s p e ak e r a part i cu l ar a c c e n t but to test s p e a k e r m i gh t S p a n i s h -a c c e n t e d Swe d i sh , * is s ome va r i e t i e s i n t h e s ame " sko " of n a t i ve a t t r a c t i ve / p o s i t i ve r e du c t i o n f e a t ur e accent a /r/ s ame to i n a no t h e r a c c e n t ? Th e s e q ue s t i o n s s h o u l d n o t b e d i f f i c u l t by B a n n e rt t h ey h a v e a n e g a a nd C a n n a t i ve s p e ak e r s h ave n o n- n a t i ve f e a t ur e they f i nd Simi ( wh e r e by , be it f or p o s i t i ve in n e ga t i ve in f o l l ow i ng q u e s t i o n : f ore i gn accents , a nd n o n - n a t i ve f e a t ur e s such both casua l a s tho s e me n t i o n e d a b ove ? An e x p e r i me n t a i me d a t a nswe r i ng t h e s e q u e s t i o n s pre p a r a t i o n . After care fu l pra c t i c e , o f Swe d i sh ) re c orded a n a t i ve s p e ak e r t e xt w i th f o re i g n a c c e n t f e a t ure s b o t h - 14 7 - a is c u rr e n t l y in t r a i n e d pho n e t i c i a n (a numb e r o f in i so l at i on r e a d i ngs o f a a nd i n var- i ou s c omb i na t i o n s . o c cur The i n ma ny d i f f e r e n t be i ng p r o n o u n c e d requ i red + bu i l d up Th e po i nt o f the wh i ch c omb i na t i o n s of " f o r e i g nn e s s " ; the th i s (b) e x p e r i me nt wh i c h m i n i ma l case ) ; (c) i n f orma n t s . f e a t ur e wh i c h (d) Th e f e a t ur e s wh i ch a n d S chu l ma n (a) i mp r e s s i o n o f c omb i na t i o n s of f e a t ur e s g i ve s o und ( Br i t i sh o r F i n n i sh mo s t up in dev i ant t o n a t i ve i rr i t a t i ng f o r n a t i ve an a c c ent f e a t ure by b e i ng b a s e d o n t h e work o f W i ngs t e d t a c c e nt edne s s i t wa s s t r o ng e r ma t ch e d gu i s e me t h o d . A g e nu i n e f e a ture s ug g e s t e d t h a t f ore i gn Th i s f or e i g n u s i ng c ou l d b e a c c e nt s p e e ch be synthe s i s t e c hn i q u e s , d e gr e e s c o u l d b e p r e s e n t e d t o n a t i ve s p e ak e r s s e c t i on 4 . 1 , and an a c cent i n c l ud e d i f f e r e n t i n s t r o ng e r a nd we a k e r va r i e t i e s of i nvo l ve d . a c c e nt edne s s . - Aga i n , 148 - i n s evera l " c orre c t e d " , d i f f erent to b e s t ud i e d u s i ng t h e c ou l d arranged can A phon e t i c i a n c a n s i mu l a t e s p e ak e r r e a c t i o n s n a t i ve accents s ame v o i c e w i th s ev e r a l degre e s t o e s t a b l i sh ( 1 9 84 ) . I n s e c t i on 3 . 4 2 . is and a the t e c h n i q u e o f bu i l d i ng 4 . 2 Ma n i pu l a t i o n o f we a k e r a nd part l y the f e a t u r e s g i ve f e a t ur e s a r e mo s t i s not e nt i re l y new , a nd a Br i t i sh E ng l i sh a c c e nt i mp r e s s i o n o f a c e r t a i n f or e i g n a c c e n t i n f o rma n t s a nd 1 . c o n s o n a n t s b e i ng p r o consonant , dent a l F i nn i sh a c c e nt . m i n i ma l s u ch a s t h e grave a c c e n t t h e r e t ro f l e x o f ArA to i nvo l ve d we r e p a r t l y t h o s e wh i ch f or e i g n a c c e n t s , a s a c ut e , noun c e d a s a s e q u e n c e f e a t ur e s f eatures (c . f . of ways : f e a t ur e by such that a c c e nt e d n e s s Bannert 1 984 ) . i n t h e way d e s c r i be d c omb i n a t i o ns of t o o bt a i n s e ve r a l t h e s ame the in f e atures d i f f e re n t s p e ak e r wou l d b e 3. A new i mm i gr a n t , g o o d i m i t a t i t i ve i n s eve r a l A new 4. a b ove of c apab i l i t i e s , wi th a b a c kg r o u nd c a n be r e c orded s evera l Th e s p e ak e r ' s c ome we a k e r i n a n a t ur a l Th e t h i rd me t h o d at a l l ows c omp l e t e the be i ng r a t h e r p i l ot of c o n s um i ng imi t a t i on o f pho n e t i c i a n ' s i nvo l ve d , in th i s d i f f erent c omp l e t e , and s t age . p e r i od o f mo n t h s mo s t e f f e c t i ve a t Wh i l e s p e e c h s y n t he s i s i t has case . t h e drawba ck o f Th e s e c o n d me t hod , of the l e a d i ng t o c e r t a i n that t h a t d e s c r i be d can the n be expe c t e d t o be- a n a c c e nt h a s wh i l e o t h e r s a r e ove r l o o k e d . to ove r a s i mp l e s t s t e r e o t y p e mod e l i nh e r e n t d i s a dva n t a g e sible is c o n s i de r i ng s u c h d e t a i l e d ma n i pu l a - a ph o n e t i c i a n ' s r e c ord i ngs a r e s e ve r a l such as t h e s p e e ch , n e c e s s a ry rated , f or e i g n a c c e n t way . e x p e r i me nt c ontro l t i me t i me s a c cent i s probab l y the t i o n a s wou l d be e a s i l y b e c ome whe r e t h e i n t e n s i ve t r a i n i ng s u c h t h a t i mm i g r a n t , l east c a n be t r a i n e d t o r e ad a t e xt a c c e nt e dne s s a r e obt a i n e d . or years . the l i n gu i s t i c a l l y t r a i n e d a nd w i t h d i f f er e n t vers i ons c orre c t e d by d e gr e e s i de a l l y a The l o ng p e r i od u s i ng t h e d i s adva n t a g e a c c e nt in that q u e s t i o n may f e a t ur e s b e i ng e x agge f ou r t h me t h o d has the i s r e q u i r e d b e f or e t h e a nd t h e s p e ak e r ' s a c c e n t e d n e s s is i mp o s - c o n t ro l . Re f e r e n c e s : A r thur , B . , D . F a r r a r a nd B . Br a d f ord ( 1 9 7 4 ) Eva l u a t i ve r e a c t i o n s o f c o l l e ge s t ud e n t s t o d i a l e c t d i f f e r e n c e s i n t h e Eng l i sh o f Me x i c a n-Ame r i c a n s . L a ngu a g e a nd S p e e ch 1 7 , 2 5 5 - 2 7 0 . B a n n e r t , R . ( 1 9 8 0 ) SvA r i gh e t e r med s v e n s k t ut t a l : p r i or i t e r i ng . P r a k t i sk L i ngv i s t i k 5 . - 14 9 - i nve n t e r i ng o ch B a n n e r t , R . ( 1 9 8 4 ) P r o s o dy a nd i nt e l l i g i b i l i ty o f Swe d i sh s p ok e n w i t h a f or e i gn a c c e n t . I n C . -C . E l e r t , I . J oh a n s s o n a nd E . S t r a n g e r t ( e d s . ) : Nord i c P r o s ody I I I . Papers f rom a Symp o s i um . A c t a Un i v . Ume n s i s . Ume � S t ud i e s i n t h e Huma n i t i e s 5 9 , p p . 7 - 1 8 . Bourh i s , R . , H . G i l e s a nd H . Ta j f e l n a n t o f We l sh i de nt i ty . Euro p e a n J . ( 1 9 7 3 ) La ngu a g e a s a d e t e rm i o f S o c . Psych . 3 , 447-6 0 . Br e n n a n , E . a n d J . Bre n n a n ( 1 9 8 1 a ) A c c e nt s c a l i ng a nd l a n g u a g e at t i tudes : Re a c t i o ns to Mex i c a n- Ame r i c a n E ng l i sh s p e e ch . L a ng . Speech 24 , 2 07-2 2 1 . Br e n n a n , E . a nd J . Br e n n a n ( 1981b) Me a s ur e me n t s of a c c e n t a nd a t t i t u d e t oward Me x i c a n Ame r i c a n s p e e ch . J . P s y c h o l i ngu i s t . Re s . 1 0 , 487- 5 0 1 . Bre n n a n , E . , E . Rya n a nd D . Daws o n ( 1 9 7 5 ) S c a l i ng o f a p p a r e n t a c c e n t e d n e s s by ma g n i t u d e e s t i ma t i o n a nd s e n s o ry mod a l i ty ma t ch i ng . J . P s y c ho l i ng u i s t . Re s . 4 , 2 7 - 3 6 . Re n c h e r ( 1 973 ) P e r c e p t i o ns o f Brown , B . L . , W . J . S t r o ng & C. o f ma n i pu l a t i o n s o f a c ou s t i c a l p e r s o n a l i ty f r om s p e e ch : e f f e c t s p a r ame t e r s . J . A . S . A . 5 4 , 2 9 - 3 5 . Br own , B . L . , W . J . S t r o ng & C . Re n c h e r ( 1 9 7 4 ) F i f ty - f our vo i c e s f r om t wo : t h e e f f e c t s o f t h e s i mu l t a n e ou s ma n i p u l a t i o n s o f r a t e , me a n f un d ame n t a l f r e qu e n cy a nd v a r i e n c e o f f un dame nt a l f r e q u e n cy o n r a t i ng s o f p e r s o n a l i ty f rom s p e e ch . C a rr a nz a , M i chae l a nd E l l en B o u c h a rd Rya n ( 1 9 7 5 ) Eva l u a t i ve re a c t i o n s o f b i l i ng u a l A ng l o a nd Me x i c a n Ame r i c a n Ado l e s c e n t s t oward s p e ak e r s o f E ng l i sh a n d S p a n i sh . I n t e r n a t i o n a l J o ur n a l o f t h e S o c i o l ogy o f L a n g u a g e 6 , 8 3- 1 0 4 . C u n n i ngh am-Ande r s s o n , U . ( 1 9 8 6 ) A p p ro a ch i ng t h e Swe d i sh s p e ak e r ' s mode l o f Swe d i sh s p oke n by Eng l i s h s p e a k e r s - a p i l o t s t udy . RUUL 1 4 , 1 - 1 2 . I ns t f o r L i ngv i s t i k , U p p s a l a . D i m i t r i j ev i c , N . R . a nd D . D j or d j ev i c ( 1 9 7 1 ) Th e r e l i a b i l i ty o f t h e s ub j e c t i v e a s s e s sme n t o f t h e p u p i l s ' p r o n u n c i a t i o n o f E ng l i sh a s a f o re i g n l a ng u a g e . I n t . Rev . A p p l . L i ng . 9 , 2 4 5 - 6 5 . Edwa rd s , accents . J.R. ( 1 9 77 ) S t ud e n t s ' react i ons Langu a g e a n d S p e e ch 2 0 ( 3 ) , 2 8 0 - 2 8 6 . to I r i sh r eg i ona l Ei senste i n , M . ( 1 9 8 3 ) N a t i ve re a c t i ons t o n o n - n a t i ve s p e e c h : r e v i ew o f emp i r i c a l r e s e a r c h . S t ud i e s in Sec . Lang . A c qu i s . ( 2 ) , 1 60-176 . a 5 ( 1 9 8 7 ) The I R I S s p e e ch dat a bas e a s t a t us r e Eng s t r a nd , O . p o r t . I n Engs t r a nd , O. ( ed . ) , 1 987 : Papers f r om t h e Swe d i sh Phone t i c s Conference he l d in Up p s a l a , Oct ober 1 7- 1 8 , 1 9 8 6 , 1 2 1 - 1 2 6 . Re p o r t s f r om Up p s a l a Un i ve r s i ty , Dept . o f L i n gu i s t i c s ( RUUL ) 1 7 . Engs t r a nd , O . a nd C u n n i ngh am- A n d e r s s o n , U . 1 988 . I R I S - a da t a b a s e f or c r o s s - l i ngu i s t i c pho ne t i c rese arch . Unpub l i sh e d manu s c r i p t , D e p t . o f L i ng u i s t i c s , U p p s a l a Un i ve r s i ty . - 15 0 - F l e g e , J . a nd R . Hammo nd ( 1 9 8 2 ) M i m i c ry o f n o n -d i s t i n c t i ve pho n e t i c d i f f e r e n c e s b e twe e n l a ngu a g e var i e t i e s . S t ud . S e c o n d L a ng . A c qu i s . 5 , 1 - 1 8 . F l e g e , J . , ( 1 9 8 4 ) The d e t e c t i o n o f Fr e n c h a c c e n t by Ame r i c a n t e ners . J A S A 7 6 ( 3 ) 6 9 2-707 . G a s s , S . a nd V e r o n i s , t h e c ompr e h e n s i b i l i ty ( 1 ) 65-90 . E . M . ( 1 9 84 ) Th e e f f e c t o f f am i l i a r i ty o n o f n o n - na t i ve s p e e ch . L a ngu a g e L e a r n i ng 34 G i l e s , H . and P . F . P owe s l a nd ( 1 9 7 5 ) u a t i o n . A c a d em i c Pre s s New York . G i l e s , Howa rd ( 1 9 7 0 ) Rev i ew 2 2 , 2 1 1 - 2 7 . Johans s on , Lund . F. ( 1 973 ) l i s S p e e ch s ty l e a nd s o c i a l Eva l u a t i ve r e a c t i o n s t o a c c e n t s . I mm i gra n t Swe d i sh p h o n o l ogy . J o h a ns s o n , S . ( 1 9 7 8 ) S t ud i e s o f e rr o r g r a v i t y . Eng 44 ( Un i ve r s i ty o f G o t h e nburg , Swe d e n ) . eva l Edu c a t i o n a l CWK G l e e rup G o th e nburg S t ud . L a bov , W i l l i am ( 1 9 6 6 ) Th e s o c i a l s t r a t i f i c a t i o n o f E ng l i sh i n New York C i ty . Wa s h i ngt o n D . C . C e n t e r f or A p p l i e d L i ngu i s t i c s . Lamb e r t , W . E . , H . G i l es a nd O . P i c ard ( 1 9 7 5 ) La ngu a g e a t t i t u d e s in a F r e n c h - Ame r i c a n c ommu n i ty . I nt ernat i ona l J ourna l of the S o c i o l ogy o f L a ng u a g e 4 , 1 2 7 - 1 5 2 . Lambe r t , W . E . , R . C . Hodgs o n , R . C . G a r d e n e r & S . F i l l e nb a um ( 1 9 6 0 ) Eva l ua t i o n a l r e a c t i o n s t o s p ok e n l a ngu a g e s . J o ur n a l o f Abnorma l a n d S o c i a l P s y c h o l ogy 6 0 ( J a n ) 44- 5 1 . Leht o ne n , J . ( 1 9 8 1 ) P r o b l ems o f me a s ur i ng f l u e n cy a n d n o rma l r a t e o f s p e e c h . I n : J . -G . S avard a nd L . L a f orge ( e d s ) Pr o c e e d i ngs 5 t h I n t e r n a t i o n a l C o ngr e s s of the I n t e r na t i o n a l A s s o c i a t i o n A p p l i e d L i ngu i s t i c s , 3 2 2 - 3 3 1 . ( L ava l U . P . Que b e c ) . Mu l a c , A . , T . D . Ha n l ey a nd D . Y . P r i gg e ( 1 9 74 ) E f f e c t s o f phono l og i c a l s p e e c h f ore i gn e s s upon t h r e e d i me n s i o n s o f a t t i t ud e o f s e l e c t e d Ame r i c a n l i s t e n e r s . Qu a r t e r l y J o u rn a l o f s p e e ch 6 0 , 4 1 1 420 . Osgood , C . , o f me a n i ng . G . J . S u c i a nd P . H . Urb a n a I l l . Ta n n e n b a um ( 1 957 ) Th e me a s ur eme n t P a l me r , L . A . ( 1 9 7 3 ) A p r e l i m i n a ry r e p o r t o n a s t udy o f t h e l i n gu i s t i c c o rre l a t e s o f r a t e r ' s s u b j e c t i ve j udgeme n t s o f n o n - n a t i ve Eng l i sh s p e e c h . i n : R . W . Shuy a nd R.W. Fa s o l d ( e d s ) L a ng u a g e a t t i t ud e s : c urre n t t r e nd s a nd p ro s p e c t s , Wa s h i ngt o n D . C . G e o rg e t own Un i ve r s i ty P r e s s 4 1 - 5 9 . W.P . Rob i ns o n , ( 1972 ) Ha rmo nd swo r t h , P e ngu i n . Language - 1 51 - a nd soc i a l b e h av i our . Roma i ne , S . ( 1 980 ) t o s p e e c h : p r o b l ems i n S c o t l a nd . S ty l i s t i c var i a t i o n a n d eva l ua t i ve r e a c t i o n s i n t h e i nve s t i ga t i o n o f l i ngu i s t i c a t t i t ud e s Ry a n , E. , M. Carranza and R. Mo f f i e ( 1 9 7 7 ) R e a c t i o n s t oward vary i ng d e gr e e o f a c c e n t e dne s s i n the s p e e c h o f S p a n i sh-Eng l i sh b i l i ngu a l s . L a ng . S p e e c h 2 0 , 2 6 7 - 2 7 3 . Ry a n , E.B. ( 1 983 ) Soc i a l p s y c h o l og i c a l me c h a n i sms u n d e r l y i ng n a t i ve s p e ak e r eva l ua t i o n s of n o n - na t i ve s p e e ch . S t ud i e s i n S e c o nd La ngu a g e A c q u i s i t i o n 5 ( 2 ) 1 4 8 - 1 5 9 . Ry a n , E . B . a nd M . A . Carranza ( 1 977 ) I ng r o u p a nd o u t g r o u p r e a c t i o n s t o Me x i c a n-Ame r i c a n l a ng u a g e var i e t i e s . i n : H . G i l e s ( ed ) La ngu a g e , e t h n i c i ty a nd i n t e rg r o u p r e a c t i o n s . L o n d o n . 5 9 - 8 2 . Rya n , E . B . a nd R . J . S e ba s t i a n ( 1 9 8 0 ) Th e e f f e c t s o f s p e e ch s ty l e a n d s o c i a l c l a s s o n s o c i a l j udgeme n t s o f s p e ak e r s . Br i t i sh J our na l o f S o c i a l a n d C l i n i c a l P sy c h o l o gy 1 9 , 2 2 9 - 2 3 3 . ( 1 9 8 1 ) Th e r e c ogn i t i o n o f f or e i g n a c c e nt s i n Eng l i sh S c ove l , T . a nd i t s i mp l i c a t i o n s f o r p s y c h o l i ng u i s t i c theor i es o f l a ng u a g e a c qu i s i t i on . i n : J . -G . S avard and L . L a f o r g e ( e d s ) P r o c e e d i ngs 5th I nternat i ona l C o ngre s s of the I nt e rnat i on a l As s o c i a t i on A p p l i e d L i ngu i s t i c s , 3 8 -4 7 . ( L ava l U . P . Que b e c ) . S e ba s t i a n , R . J . , E . B . Ry a n a nd L . C o r s o ( 1 9 7 8 ) S o c i a l o f s p e ak e r s w i t h d i f f e r i ng d e gr e e s o f a c c e nt e d n e s s . sented at the me e t i ng o f t h e N i n t h Wor l d C o ngr e s s o f Up p s a l a . T a rd i f , C . a nd d ' A ng l e j a n , A. l a ng u e s e c o nd e e t l eurs e f f e t s d i a n Mod e r n L a ngu a g e R e v i ew 3 7 , Tu c k e r , G . l i s teners ' F o r c e s 47 , j udgeme n t s P a p e r pre S o c i o l ogy , ( 1981 ) Les erreurs en fran9a i s s u r l a c ommu n i c a t i o n o ra l e . C a n a 70 6-7 2 3 . R i c h a rd a n d Wa l l a c e E . Lamb e r t ( 1 9 6 9 ) Wh i t e a nd Negro r e a c t i o n s to v a r i o u s Ame r i c a n-Eng l i s h d i a l e c t s . S o c i a l 46 3-8 . W i l l i ams , F r e de r i c k ( 1 9 7 4 ) The tudes . I nt ernat i ona l J ourna l 2 1-32 . W i n g s t e d t , M . a nd R . S chu l ma n accents . In : I .W.U. Dre s s l e r br i dge Un i ve r s i ty P re s s . i de n t i f i c a t i o n o f l i ngu i s t i c a t t i of the S o c i o l o gy o f L a n g u a g e 3 , ( 1 9 84 ) C ompreh e n s i o n o f f or e i gn ( e d ) P h o n o l og i c a . C ambr i dg e : C am A c know l e dgme nt Th i s work wa s s u p p o r t e d i n part by Th e Swe d i sh C o u n c i l f or Re s e a r c h i n t h e Huma n i t i e s a nd S o c i a l S c i e n c e s ( gr a n t s F 5 7 1 / 8 7 a nd F 6 5 5 / 8 8 t o O . Engs t r a nd ) . - 1 52 - REPRESENTING P I TCH ACCENT IN SWED I SH Le s l i e M. B a i l e y* Abs t r act Words i n S t ockh o l m Swe d i sh h a v e one o f two p o s s i b l e a c c e nts . I n t h i s p a p e r , i t i s d emo n s trate d t h a t a c c e n t c h o i ce i s p r e d i c t a b l e f r om t h e word ' s me t r i c a l s t ru c t ure. Th at i s, words w i th A c c e n t 2 h a v e pr i ma ry s t r e s s a t t h e l e f t word e dg e or o n a sy l l a b l e dom i n a t e d by t h e l e f tmo s t f o o t o f t h e word (" l e f t s t r e s s e d g e" ) . Word s w i t h A c c e n t 1 h a v e p r i ma ry s t r e s s a t t h e r i gh t word e d g e o r o n a sy l l a b l e d omi n a t e d by t h e r i ghtmost f o o t o f t h e word ("r i gh t s t r e s s e dge") . I t i s f ur t h e r demo n s t r a t e d t h a t t h e t o n a l c o n t o ur o f a word may b e d e r i ve d w i t hou t k n ow i ng i f the word h a s Ac c e n t 1 or A c c e n t 2. Th i s i s a c c omp l i sh e d by a m a p ping ru l e th a t a s s o c i a t e s a H L t o n e s e qu e n c e t o t h e word a c c o rd i ng t o t h e word ' s s t r e s s e dg e t o d e r i ve t h e a cce n t t o n a l c o n t o ur. A c c e n t i n S t o ckho l m Swed i s h i s t h u s se e n t o b e t h e c omb i n a ti o n o f s t r e s s a nd t o n e r e p r e s e nt a t i o n s wh i ch i nt e r a c t t o d e r i ve two d i f f e r e n t s t r e s s- t o n e r e l a t i o n s h i p s , l a b e l e d " A c c e n t 1 " and "A c c e nt 2. " I n t r odu c t i o n 1 .0 Words A i n S t o ckh o l m Swe d i s h have o n e o f two po s s i b l e word w i t h A c c e n t 1 h a s a b l e. Whe n a n A c c e n t t h e d i s c ours e, t o ne. 1 word h owe v e r, the h i gh t o n e i .e ., h i gh l ight e d s t r e s s e d sy l l a b l e h i gh t o n e wh i ch t h e word ( c f . Bru c e f ocus (c f. in h a s a r i s i ng Bru c e l ow t o n e 1 977). A f a l l i ng t o n e o n t h e s t r e s s e d sy l l a- Wh e n a n A c c e n t 2 word t i ona l is in f ocus , o c curs o n 1977, Bru c e there i s a n add i - a p o s t s t r e s s sy l l a b l e & GArd i ng 1978 , in GArd i ng . Re c e nt l y, the i n f o c u s, wh i ch ma rks word w i t h A c c e nt 2 h a s a 1977) is s t r e s s e d sy l l a- Th i s c o n t o u r may be d e s c r i b e d a s t he Acc e n t 1 p l us the ble . l ow t o n e o n t h e accents . i nterest wi th i n t ona l t h e adve n t phe n ome n a , - 153 - of n o n- l i ne ar ph o n o l ogy a nd a u t o s e gme n t a l ana lyses o f t h e Swe d i sh be e n p u t a cc e n t s h a v e f or t h ( c f. & P i e rr eh umb e r t B a i l ey 1988, L o r e n t z 1984, S c hm i d 1987 , c om i ng) . f i r s t two a n a l y s e s p r o p o s e a s i ng l e t o n a l Th e whi c h ma p s in two d i f f e r e n t way s c o n t ours. Th e t i e r f or e a ch a c c e n t . Be ckma n in c l ud e cannot a c c e nt Lo r e nt z be not l ex i c a l the a c c e n t o f a In p l a c e. B a i l ey tha t Ac c e nt 1 Th ey a r e , state word be ( 1988) , the e xp l i c i t l y the word's a c c e nt there f ore , l ex i c a l e n t ry. wh e t h e r a c c e nt but h i s ana l ys i s r e q u i r e s k n own before o n t h e o t h e r h a nd, are that Th ey, e xp l i c i t l y or d e r i ve d , p h o n o l og i c a l thus , in t ona l a nd P i e rr e h umb e r t a nd i n Swe d i sh . state a nd A c c e n t 2 the poss i b l e (1987 ) p r e di c t e d does is a l so i nf o rmat i o n (1984) a s s i gnme n t that S chmid t i er t h e two a c c e n t l a t t e r two a n a l yses pr o p o s e a d i f f er e n t ( f o r t h c oming) a s signme nt t o d e r i ve B e c kma n f o r t h - c ov e r ma pp i n g t ak e s it i s pro p o s e d t e rms u s e d re pre s e nt a t i o n s o f i nt e r p re t a t i o n o f t o d e n o te Swe d i sh wo rds . p h ono l ogi c a l s t ru c t ur e r a t her t h a n a p r e c o nd i t i o n d e t e rm i n i ng s u c h s t r u c t ur e . l. Th i s pa p e r h a s i s to two r e l a t e d p u r p oses . d emo n s t r a t e t h a t Swe d i sh of a f i rs t word (l i s p r e d i c t a b l e for mo s t o f t h e vo c a bu l ary. i s n o t new (c f . J e n s e n 1980a Th e others a r e pred i ct e d rather than E l ert a nd b, amo n g o t h e r s ) . f r om t h e t h e a c c en t Th e 1977, Li n e l l 1972, a n a l y s i s pro p o s e d h e r e, f r om t h e d i re c t l y the GArd i ng L i n d a u 1970, (e x c e p t th e p r e s e nce o f a S p e c i f i c a l l y, 1972, J e n s e n 1980b) ph o n o l o g i c a l f r om pre - s ur f a c e or a c c e nt is - 154 - Th i s R i sche l h ow e v e r, s t r u c t ur e v owe l in i de a H a ug e n 1967 , i n that pred i cted s t ru c t ur e o f t h e word . or 2) 1963. d i ffers the a c c e nts o f t h e word c h a r a c t e r i s t i c s, p o s t�e s s pur p o s e in s u ch a s t h e word . f r om t h e me t r i c a l Th e s e c o nd p u r p o� e o f t ona l c o n t our of a word th i s p a p e r dir e c t l y t i e r a nd the f r om the me t ri c a l r a t h e r t h a n f r om k n ow l e d g e demo n s t r a t e d t h a t s h ow t h a t a c c e nt of s e gme n t a l the word's a c c e n t . i n Swe d i sh t i o n sh i p . ( 1988) , t ur e , 1 a nd A c c e n t 2 l i es i t dif f e r s s ub s t a n t i a l l y tona l t i er , Th i s i s p rima r i l y due here t ak e s i n t o a c c ou n t a to l ar g e r it is the d i f f er e n c e i n t h e s p i r i t o f B a i l ey i n c o n t e nt: the Thus, the s t r e s s -t o n e r e l a a nd ma p p i ng r u l e d i f f e r s e s. the word i s a c ov e r t e rm f o r t h e in Whi l e t h e a n a l y s i s h e r e i s t i e r) may s t ru c t ur e o f r e l a t i o n s h i p betwe e n s t r e s s a nd t o n e a n d t h a t b e t we e n Ac c e n t the ( r e pr e s e n t e d h e r e by a n a u t o s e gme n t a l ma p ping b e t we e n a t o n a l be d e r i ve d i s to the in fact f o o t s t ruc t he two a n a l y tha t the a n a l ysis of t h e Swe d i sh port i on vo c a bu l a ry . Th e p a p e r pr o c e e d s a s n o l o gy u s e d here i s s cr i pt i on of pre s e nt ed . Th i rd, of a word the d e f i n e d. accents it can f o l l ows . F i r s t, S e c o nd, based t h e a c c e n t t e rm i the d i s t r i bu t i o n d e o n me t r i c a l i s d e mo n s t r a t e d t h a t be d e t e rm i n e d the d i re c t l y s t r u c tu r e i s t ona l c o n t our f r om the me t r i c a l r e p r e s e n t a t i o n o f t h e word. Ac c e nt a c c e n t s, pa p e r, wo rd and Ac c e nt s i nc e h oweve r , a c c e nt a c c e n t s. tut e s 1 the In a word I wi l l be c au s e E ng l i s h , 2 are has at u s e the there f or o f t e n referred mo s t one d i f f erent e x amp l e, the i n t o n a ti o n c o n t o u r amo ng o t h e r s) . In P i t ch , ( c f. i n E ng l i sh, L i b e rma n Swe d i sh , on - 155 - 1975, the In t his i ns t e ad o f type s word a c c e nt prima ry s t re s s e d sy l l a b l e mi nus t h e t i o n of t h i s sy l l a b l e. t h em. accent t e rm p i t ch are of t o as word p i t ch o f word const i i n f o rma is s u p p l i e d by the P i e rr e h umb e r t other h a nd , 1980 , t h e word a c c e nt c ons t it u t e s t h e p i t ch sha p e others ) . t h e p rima ry s t r e s s ed sy l l a b l e of th i s sy l l a b l e (c f . B ru c e s t re s s a c c ent a nd the word a c c e n t o f Swe d i sh red to here as two as p i t ch a c c e n t s by t h e not of be re f er- Swe dish are a l so r e f e r r e d t o mor e s p e ci f i c a l l y , t e rms A c c e nt l a t t e r t e rms h e r e s i n c e grave are wi l l pit c h a c c e n t . a s word to n e s o r, we l l 1977 among word a c c en t o f English h a s b e e n r e f e rr e d t o a s Th e Th e a s we l l a s the a ccur a t e v i a n ac c e n t t y p o l ogy as and gr a v e , acute 1 a n d A c c e nt 2 . I wi l l i nt erpre t a t i o n s f or a l l of as u s e the a c u t e and acc e n t s w i th i n t h e S c a nd i n a- GArding 1977). (c f . 2 .0 A c c en t d i s t r i bution A c c ord i ng t o L i n d a u initial stres s (1970), wi th a pos t s t r e s s A c c e n t 1 o c c u r s on o t h e r s t r e sse d d i sh, i n c l ud e s f i na l , position a s we l l as A cc e n t 1 h owe ve r , " + t o na l a c cen t " � t on a 1 in i t i a l Et o n a I to and s tre s s. Th e s e wit h are s t r e s s on f o n . i n Swe - a n t e p e n u l t i ma t e n umb e r o f s t r e s s. pre s en t e d in Lin d a u's (1), whe r e 2. ro� [t t acc e n I U NAVAdV stres ] __ t h i s ru l e. e .g. , F i rs t , 2 bu t l ... � V... there are d o n o t have f i r s t memb e r is a t e l e f o nnumme r ' t e l e ph on e In Swe d i sh , - 156 - the word. ini t i a l c omp o un d s whos e s t re s s , in s u bs t a n t i a l i n Swe d i s h t h a t r e c e i ve A cc e n t word w i t h n o n- i n i t ia l n umb e r , ' is !J Th e r e are two e x c e p t i o n s com p o unds A a l so has f or A c c en t -� sy l lab l e wor d s w i th sy l l a b l e s wh i ch , monosy l la b l e s . s t a nd s acc e n o c c ur s on p e nu l t i mat e, a c c e n t a s s i g nme n t ru l e (1970) (1) . word s , 2 Acc e n t the p r i ma ry s t re s s o f a c omp ound the f i rs t Fre n c h is c omp o u n d l oans as in i t i a l a l wa y s t h e the s t r e s s, is o l a t ion . In memb e r in ord er the s e to the words in wi l l have f or i s pre s e n t e d accen . The a ng l e d t ona 1 1fs. the s e c a s e s, su f f i x , accen pr i ns e s s a a n a l y s i s, the s e n o t have initial J c a t e g o ry t o c o l l a p s e t he two e nv i r o nme n t s i n it i a l s t r e s s words ' w i t h a fo l l ow i ng f o l l ow i ng s t r e s s e d vowe l a l so r e c e iv e A c c e n t w i t h a n a t i ve d e riva t i o n a l pr i n c e s s . In ' s t r e s s or a f o l l ow i n g s t r e s s e d v owe l i n the 2 a s s i gn revis e d a c c e nt c orr e c t l y t o t h e s e wo r d s Th ere are t h u s A c c e nt ru l e ini t i a l ( 2) f or s tress wi th a stre s s w it h a fo l l ow i ng s t r e s s stressed n ouns. c o l l a p s e th e thr e e e nv ironme nts If not it is A c ce n t , it w i l l 2 in a f o l l ow i ng vowe l i n the wo rd , , a nd i nd e e d p o s s i b l e t o i n t o o n e ru l e t h e e nvir o nme n t s ) - 157 - does they do . t hr e e e nv i r o nme n t s framework : in s i nce (1970) r e c e i ve A c c e n t 1 , Th e f or t w o o f L i nd a u's wo r d s w i l l wo rd. don e Th e l .. l4s t re s � � s t r e s � L' V 1 0 L l exii:ca l p e nu l t ima t e s t r e s s e d n o u ns e . g., p e nu l t i ma t e vowe l . � <i:::. �G \--- s t re s s w i th a non n a t i ve n o nin i tia l Li nd a u s o tha t A c c e n t 2 w i l l b e (1) L= a ny are s e gme n t a l in . S e c o nd, Th e s e stress (2 ) . i n t o o n e ru l e : and n on i n i tia l in t h e word 2. E bra c k e t s s e rve f o r A c ce nt 2 vowe l, -� contain � may . � in not have f i na l a s s i g n e d a l so t o w or d s w i t h a fo l l ow i ng s t re s s e d r ev i s e d ru l e ass i gned t o C omp o un d s w i th there f ore, a c cou n t ru l e sy l l a b l e i s o l a tio n . f i r s t memb e r, s i nce rev i s e s (1970) s t res s e d (as L i ndau has not be an e l egant sta t eme n t . t i on o f to t h e Wha t I propos e , the A c c e n t loca t i o n o f unites the 2 e nvir onmen t s, prim a ry d i s j u n c t i ve t h e s e gme n t a l ana l ys i s s t ru c t ur e o f t h e word 2.1 Metrical f oot in a nd - s t a nd s n e c e s s a ry s + in foot o f u ns t re s s e d sy l l a ble s (1977, h e n c e f or t h LP) , b i n a ry b r a nchi n g o r w(e ak) t r e e who s e a c c o rd i n g t o the (3) , + s t a nd s wh ere . a nd Accor- the foot node s are f o l l ow i ng rule s. +s t r e s s sy l l a b l e, f or a s t r e s s e d sy l l a b l e f or an u n s t r e s s e d sy l l a ble. w s *w + f or n od e s h i g h e r if i t br a nch e s node i s s tro ng (4) w � s A , in the in t h e LP t ree , Oth e rw i s e , . illustrated in it the r i gh t node is i s we ak a nd t h e l e f t (4). � s w I Th e me t r i cal re pre s e n t a t i on s zona , i s ba s e d on t h e a s t r o n g n o d e a l ways dom i n a t e s a i l l ustrated s t r o ng 2 d e s c ri ptio n bu t wh i ch Liberman & Pr i n c e 1977) . (cf. ( i n c l ud i ng z e r o ) labe l e d s(t rong) Second , word 2 f oo t r e pre s e n t e d b y a (3) the Th e d e s c r i ptio n . L i b e rma n and Prince d i ng t o F i rst, in is a u n i t wh i ch c o n s i s t s o f one s t r e s s e d sy l l a b l e some numbe r is wh i ch c orr e l ate s A c c e n t stress A c c e nt a d i f f e r e nt d e s c r ip- repr e s e nt a t i o n 2 . 1. 1 Th e me t r i c a l A is in s t e a d , f or Ame r i ca , f r amework a re i l lus t r at e d in - 158 - Pame l a, (5). a n d Ar i - IA (5 ) A\ - trees wh i ch i l l us t ra t e thre e resu l t pr i ma ry s w s w Ar i z o n a + - + - s w w P ame l a + - - w s w w Ame r i c a - + - Th e s e J\S 1\ 1\ f r om stress t hr e e of e a ch d i f fe r e n t me t r i c a l d i f f erent word stress s t ruc t ur e s p a t t e r n s. Th e c o r r e s p o nds t o t h e sy l l a b l e d om i n a t e d e nt i r e l y by s t r o n g n o d e s . S i n c e the f r om me t r i c a l r e s u l t i ng t i on s t ru c t ur e i n t e rna l trees have as in ( a) feet , as in (b) b i nary S ec o n d , s t ru c t e d in is br a n c h i ng feet , w i th i n h e n c e f or t h Q). Q of word a f t e r f oot c on s t r u c - fo l l ow i ng t w o mod i f i c a t i o n s. ( 6a ) , are repre s e n t ed a s d u e to trees f or f l at t h e o v e r r i ch s t r u c t ur e ( c f. Pr i nc e 1 983, ph o n o l o g i ca l c o n s t i t u e n t s. the p h o n o lo g i c a l word ( t h e t e rm i n a l Ne spor & j us t i fic a t i o n ) . as e l eme n t of (W ) are con- i ns t e a d o f t h e sy n t a c t i c the tree , Th e d i s t i n c t i o n b e t we e n W and Q mu s t b e ma d e sma l l e r t h a n Q. those o f the - for e x amp l e. syn t a c t i c word wh e n W i s but w i t h t h e � Th e r e a s o n f or t h e ch a n g e 1986, 1981 ) , s w w Pame l a + - - Voge l Hay e s ( 6b ) . A\ the w i t h p r e d i c t i ng s t r e s s t h e LP mod e l w i t h th e s w w P ame l a + - of not (c f . be e n c o n s t ru c t e d , t e r n a ry t r e es , is c o ns t i t u e n t s t ru c t ur e i s ad o pt e d h e r e , F i rs t , (6) c o nc e r n h e r e cas e s wh e r e W i s d e t e rm i n e d t o b e sma l l e r t h a n an i nt erna l - 159 - s a ndh i ru l e d o e s not a p p l y acr o s s s t em o r that the d oma i n o f 198 6 f or the b o u n d a r i e s, f o l l ow i n g ways t h i s ca s e, A s t em may t h e a f f i xe s a comp ound may a l s o com p o u n d memb e r s pre f i xe s o f s u f f ixe s jo i n t o t h e as jo i n l ast W of in t h a n Q. the i n care f u l s l e e ve' , th e o t h e r G e rma n i c i n Swe d i sh . In Th i s doma i n in Th e memb e r s o f l an gu a g e s wh e re a f f ixes do not, f i rst W, 198 6 l a ng u a g e s a nd t h e (c f . f o r E ng l i sh ) , t h e memb e r s o f O n e s our c e o f s p e e ch. For e x amp l e, G A rd i ng ( 19 67 ) , pra s t arm Boo i j W may a c omp o u nd e v i de n c e f or (pras t+arm ) ru l e s (c f. e xamp l e G A rd i ng 'pr i e s t sy l l a b i f i e d a s pras t -arm . is Th i s sy l l a b i f i c a t i o n p a t t e r n r u n s cou n t e r t o tarm. Swe d i s h s y l l a- 1967 ) , wh i ch p r e d i ct pras- th a t the sy l l a b i f i c a t i o n i l l u s t ra t e s Swe d i sh c o i nc i d e s w i t h t h e s t em b o u n d a rie s w i t h i n in th e c omp ound rat h e r t ha n w i t h t h e com p o u nd b o u nd a ry. a wo rd In f r om b l o c k e d r e sy l l a bi f i ca t i o n w i t h i n a compound from b i f i ca t ion a f f i xe s . c omp o u n d . S p ec i f i ca l l y , f o rm i nde p e nd e n t W's t his c ome s from it s W's b u t to 1983 for Dutch a nd N e s p o r & V o g e l be sma l l er Ne s por & Vogel f orm W's o n th e i r own . i nd e p e n d e n t W may b e t h e W d e f i n i t i o n a cr o s s f orm a W a p a rt a comp o u n d I n Swe d i sh , (c f . f o rm i nd e p e nd e n t W's . f r om i n d i ca t i ng th a n Q. i s sma l l e r an e x t e n s i ve d i s cu s s i o n o f l a ngu a ge s ) . the affix t h e ru l e Q in sma l l e r t h a n c e rt a i n l e ve l d oma i n e x i s t s i n Swe d i s h wh i ch is Thu s sma l l e r t h a n Q. 2 . 1.2 C o n s t ruct i ng f e et i n Swe dish Swe dis h mak e s u s e o f two f o o t ty p e s , 'unmark e d' a nd 'ma rk e d' f o o t , i l l us t r a t e d (7) u nmarked s w f o ot mark e d w - 160 - wh i ch f oot s I wi l l in (7) .:-z. ca l l the Th e t wo fe e t r e f l ec t a s s i g nme n t on the i n Swe d i sh : Swe d i sh (c f. is a l so S p e c i f i ca l l y, wo rd s, a l so of a L i ne l l 1 972 ) . N o n n a t i ve s t re s s. Th e s e Fre nch s tres s r u l e. most l y o n s tr e s s are a s s i g nme n t for-, e nt - wh i ch i m f ormat i o n . mu s t i gnore S i m i l ar l y , mus t b e occu r (1972 ) , in found on a l l ru l e s a s s i g n i ng s t re s s . s u f f i xe s , the i gnored . mo s t l y a t t ract on pr i ma ry c o r r e c t l y a s s i g n e d s t re s s by t h e Words n a t i ve word s S t re s s e nd i n g s , accord i ng t o L i n e l l word s ( he nce f o r t h "Fr e nch morpho l o g i ca l Both ge - , be- , d er i va t i o na l n o n n a t i ve words, to in f l e ct i o n al word whe n s t re s s l e s s p r e f i x e s wi th n a t i ve (Li n e l l 1972 ) , d er i va t i o n s , f ou n d r e t ai n s t r e s s o n t h e acco rding t o the G e rma n i c s t ress rule . Feet , o f p r i ma ry i n Swe d i sh , s t re s s princ i p l e t h a t Both a r e co n s t r uct e d ba s e d o n t he in t h e word a nd o n t h e u nmark e d a nd mark e d a s t rong n o d e ha s a n t e rs . Th e two ty p e s voca bu l ary a s t h e mark e d of f o l l ows . f o o t co n struct i o n Whe n p r i mary s t re s s is f oot i l l u s t r a t e wo rds w i t h a r e u n b o u n d e d w i th i n W ; f e et a r e d i s t r i bu t e d pr i ma ry s tr e s s al l sy l l a b l e s i n a ny g ath e r s p o s t s tre s s sy l l a b l e (s ) fe e t u n s p ec i f i e d n umb e r Wh e n fo o t g a t h e r s u nmark e d l oca t i o n a s t r o ng n o d e mu s t dom i n a t e a s t re s s e d s y l l a i .e ., the s tr e s s s e n s i t i ve Swe d i s h " G e rma n i c s t r e s s r u l e" ) , f i na l a re n e v e r s t r e s s e d . t h i s p art ble . (h e nce f or t h l oans re t a i n s t re s s ru l e ") f or s t r e s s G e rma n i c wo r d s a r e t y p i c a l l y s t r e s s e d f i rst sy l l a b l e wh i l e Fre nch root, t h e co n f l i ct i ng t e nd e nc i e s i nto i n t h e Swe d i sh i s word i nt o a s i ng l e other pos i t i on the a o f we a k s i s - 16 1 - foot . i n the word, s t r e s s e d sy l l a b l e a nd the f oo t . i n i t i a l and f i na l , f i na l Th e e x amp l e s s tre s s. in (8 ) (8) /\ � w s s t ud e n t - + s w f l icka + , girl' In the s e e x amp l e s , the u nm a rk e d f oo t is f ou nd o n a G e rman ic word a n d the mark e d f o o t is f ou n d o n a Fre n c h word. The o n both k i nds o f words, u nmark e d h owe ver, f oot may due to the be p o s s ibi l i t y o f s u f f i x e s o n F r e nch loan s , (9) /\ s w b e t ala + ' t o pay ' - In there th e f ou n d unstressed as il l u s t rat e d in i n f l ec t i o n al (9) . s w s t ud e n t e n + ' t h e s t ud e n t ' - - add i t i o n to the u nmark e d foot i n th e s e e xample s , is a n o n branching f o o t wh i ch d om i n a t e s t h e syl l able l eft of th e "d e f au l t f o o t." reas o ns. In the p r i m a ry s tr e s s . I t s p r e s e nce LP is I will r e qu i r e d framework , feet cal l th i s f or t h e s u ch a s in to f oot a f ol l ow i n g ( 10) are allowe d . ( 1 0) w s w beta l a + - Bin a ry branch i ng f e e t , branching feet, so the how e v e r, have b e e n s tr uctur e in - 162 - ( 1 0) r e p l ac e d by n-ary is n o t allowe d in t h i s a n aly s i s . a l l owed s i n c e (1 1 ) Th e it n-a ry i s not s t ru c ture a poss i b l e in f oo t (11) is a l s o n ot i n Swe d i s h . � w s w be t a l a - + - Sy l l a b l e s t o form a the l eft of f o o t o n t h e i r own, a n u nmark e d as f o o t must , i l l u s t r a t ed by th e t h e r e f or e , e x amp l e s in (9) . D e r i vat i o n s b ehave words. Th ose w i t h f i n a l t e d ove r t h em u nmark e d is s t r e s s hav e both type s o f (13) wh i c h c o n s i s t f e e t w i t h i n e a ch w i th i n /\ the W. W, d e f au l t f o ot o f mo r e tha n o n e W, c o n- A a c c ording c om p o u nd, w i th a to the there f ore, f e e t. s w s ( huvud ) ( s t ad ) + + , c a p i ta l ' ( head t own ) - t h em, � - stre s s c on s t r u c- w w s f a br i k a t - + 'produ c t' s w w skape l s e + ' c r e a t i o n' s t ru c t f oot ( 1 2a ) . (b) comp o u n d s , a mark e d i n f l ect e d a nd t ho s e w i t h s t r e s s e l s e wh e r e h a v e an non i n i t i a l � F i na l l y , s ame way a s s i mp l e a nd f o ot c o n s t ru c t ed over wh e n s t r e s s ( 12 ) (a ) ( 12b ) , the w w s s w ( t e l e f o n ) ( numme r ) - - + + - 163 - lo c a t i o n o f may c o nt a i n Ther e c a n n o t be i s one type of ass i gned stre s s o r Fre n ch st r e s s ru l e. the word i n t r o du c t io n, d e r i va t i o n a l are n o n na t i ve stress. Th e G e rman i c st r e s s ru l e , i nitia l s t r e ss , 1972 ) . Th ese w i t h n a t i ve o r k op i a ' pr i n c ess' have p e nu l t i ma t e h oweve r , wi l l a ss i g n t h em a nd t h e Fre n c h s t r e s s ru l e w i l l a s s i g n t h em st r e ss . Fr e n c h s t ress ru l e su f f i x , as p r o p os e d in t h e me t r i c a l as the if c o rr e c t l y ass i g n e d s t ress by t h e t h e ru l e d o e s n o t s e e t h e d e r i va t i o na l i n L i ne l l fr amework, (1972 ) . Th i s i s a c c omp l i s h e d, by a n e x t r ame t r i c a l i t y ru l e, su c h f o l l ow i ng . I+ d e r i v s � f fl t: G e rma nIC J Eex� --> Ex t r ame t r i c a l i ty ru l e s , to s t r ess -J / E wh i c h se rve a s s i g nme n t a l g o r i t hms, N Fr e n c are d e r i va t i o n o f pr i n s e ss a in i l lu s t rate d 1 /p r i n s e ssa/ -� p r i nse s s ( a) i nv i s i b l e o r d e r e d b e f o re s t r e s s ( cf . is � t o r e nd e r e nd s a s s i g nme n t a nd f o ot c o ns t r u c t i o n ru l e s Hay e s 1982 ) . ( 15 ) . 2 -->pr i nse ss ( a ) 3 --> + A w s pr i nse ss ( a) A 4 -� w s pr i nsessa + 1. 2. 3. 4. in words Th e s e words may be (15) that me n t i o n e d st ems pr i n s e s sa e .g., ( c f. ( 14 ) h ow e ve r , th o s e wor d s , ' c o py' f i na l L i ne l l Swe d i sh, c or r e c t l y by e i t h e r t h e G e rma n i c Th e s e a r e that e nd i ngs , in mark s u f f i x a s e x t r ame t r i c a l assign s t ress c o nstru c t f e et c o n s t ru c t a f o o t o v e r t h e e xtr ame t r i c a l - 164 - sy l l a b l e Th e Step 4 i s necessary in i s ru l e d o u t . (11 ) 2 . 2 The s t r ess In L i n d au's t i on is (1970 ) a n a l ys i s , ini t i a l" a n o t h e r s t r e ss i n the s t r e ss ( or word. Th a t a s e c o nd st r e ss s t r ess d i st r i bu t i o n is Tha t i s, sy l l a b l e of wh e n a word , e dg e o f t h e word . the l as t sy l l a b l e of the is , s t r e ss e l s e wh e r e . s t r e ss d i s t r i bu- A c c e nt is 2 words have i n t h e word) , Here , wh i l e t h e r e l eva n t l e ft s t r e ss e dge v e rsus r i gh t s t r e ss p r i ma ry stre s s we may say t h a t S i m i l ar l y , a t t h e r i gh t e dg e o f not the r e l evant v e r s us "e l sewh e r e ," u n l e ss t h e r e A c c e n t 1 words have e dge . t h e s t ru c t ur e e dge "wo rd initial f or the s ame r e a s o n t h a t wh e n t h e word, t h e word . occurs it on the f irst i s f ound a t the p r i mary s t r e ss w e may say t h a t l e ft o c curs on it i s f ou nd Wor d s whose p rima ry s t r e s s is l e f t mo s t o r r i ghtmost sy l l a b l e o f t h e word may st i l l be ass i g n e d t o a s t r e ss i s f ou n d word. Th i s is ( 16 ) word e dg e in the i l l us t ra t e d l e f tmost in - A w s p r i nsessa + A s w w w s t e l e f o n numme r - + t h e s e words , o r r i gh t most foot prim a ry of t h e (16 ) . w w s t e l e fon + � in � A s w f l i c ka - s i n c e, + St r ess d i s t r i bu t i o n - + A s w be t a l a I + - � s w w be t a l a d e - + - ' p a i d' - i n Swe d i sh may now b e d e s c r i be d in t e rms o f a p a r ame t r i z e d s t r e ss e dge . ( 17 ) L e f t /R i gh t St r e ss Edge : Th e p r i mary s t r e ss e d sy l l a b l e o c c ur s a t t h e l e f t /r i gh t word e d g e o r i s d om i n a t e d by t h e l e f tmost/r i ghtmo s t fo o t of t h e word . - 165 - Aft er f e e t are c ons t r u c t e d d e t e rm i n e d . The f ir s t st e p sy l l a b l e is at a is f o und. a word e d g e , sponds to l o c at e d. ( 18 ) , then the If it t h e st r e s s e d g e d e t e rm i ne If it i s, if the t he word i s ass i gned e d g e wh e r e t h e s t r ess e d sy l l ab l e edge wh ere i ts i s not d om i n a t i ng a s s i g nme nt f o r b e t a l a l e ft s t re s s is the s t re s s ed f ou n d a t i s a s s ig n e d t h e s t r e s s e dge t h a t word L S E= word, c orre s p o n d s t o t h e word Str e s s e dg e whe r e is to word e dg e . t h e s t ress edge t h a t sy l l ab l e f or a is c o rre f oot is i l l u s tr a t e d in e d g e a n d RSE= r ight s t r e s s edge. LSE (18 ) RSE I� be ta la + The o p p o s i t e s i tua t i o n word wh e r e e dg e s , i .e . , the a f r om that stressed i l l ustrated sy l l ab l e monosy l l a b l e. is Th e s e in (18) is a f o u n d a t b o t h word words ar e ass i gne d a r igh t s t r e s s by a d e f a u l t ru l e. Th e d is t r i bu t i o n stated (19 ) i n te rms of the o f A c c e nt 1 and A c c e n t 2 may n ow b e s t r e s s edge . L e f t s tre s s e d g e = A c c en t 2 R ig h t stre s s e dg e A c c e nt 1 = Th e w o r d s i n the 2 word s, and th e Ac c e nt 1 words . be cau s e, l e ft c o l umn word s Th e word a l t h o ugh i ts ( 16) are e x amp l e s i n th e r i gh t t e l e f on str e s s stres s d e s cr ip t i o n d o e s. in c o l umn are e x amp l es of a p p ear s in patt ern does Wh e n th e word - 166 - o f A c c e nt is bo t h c o l umns n o t cha ng e , in i s o l at ion, its it h a s s t ress e dg e a n d c om p ou nd, t h e word o n t h e r i gh tmos t s y l l ab l e . i s a s s i g n e d A cce n t 1. As the i t s stress is f ound h owe ver, ( i .e. , the c om p o u n d ) . Th e s t re s s e d g e a nd i s a s sig n e d A c c e n t 2. cha n g e p r e d i ct s the I n s ummary, Acc e n t 2 in i t i a l s tre s s , acce n t the s t re s s e dg e e dg e of 2 . 3 L e f t s t r e s s e dge memb e r i n the l e ft c ompound has d e scr i p t i on Th a t is , s t r ess l oc a t i on Th e a n a l y s i s h e r e (20 ) s o l ut i o n t o s ome pre s e nted Mo n omo r ph em i c b i sy l l a b l e s - e n va t t e n ' wa t e r ' - e l s ege l ' sa i l ' - e r l age r ' s t oc k' u n d e r l y i ng re p r e sen t a t i on , El i a s s o n Lin d a u 1970 , 1 972 f or a p p l i e s a f t e r a c c e nt men t a l sur f ace st r e s s, h av e a a nd l eft 1 all t h e s e ca s e s. in (20 ) . e nd i ng a s i nc e Ris che l r e v i e w). a s s i g nme n t , f orm. Th e ba s e d Th e f i rs t ca s e in monosy l l a bl e s 1963, An als o i n p art i n the t h ey r e c e i v e Acc e n t ohma n 1 966; 1 cf e p e n t h e s i s ru l e , p r o du c e s t h e re s u l t - 167 - e xc e p t i on s is of The s e wor d s a r e o f t e n a n a l y z e d a s H a ug e n 1967, l e ft e x c e p t i on s ru l e . is a s t r ess e d g e . t o L i ndau' s t o b e c on s i der e d foot of Word s w i t h A c c e n t i n t h i s s e c t ion a r e on L i n d au's a word s w i t h al l Th e e xce p t i o n s d i s c uss e d ( 1 970 ) of un i t e s the w i t h pe n u l t i ma t e i n a dd i t i o n t o Ac c e n t 2 . i n c omm o n a r i gh t f ir s t Th e s t re s s d e s c rip tion e nv i ro nme n t s. f or e i g n n o u n s r e g a rd l e s s s tr e s s cha n g e . s tress d i s j u nct i ve comp ou n d s have I t h a s a r i gh t of ( c f. a l so wh i c h corr e c t s e g- t h e d e r i va t i o n i s an ini t i a l p l us c orre c t r e s u l t Th a t a s mo n o sy l l a b l e s i s, 'm i rror' is E l i as s on thus , ( 1972) t i on p a i rs s uch is to a s s ege l 1 f or that ' sa i l ' them , al l E l i asson gro u n d s that t h e s o l e mo t i va a nd b e twe e n spege l initial ' m i rr o r, ' s e c o nd rece i ve s s tre s s whe r e t h e A c c e n t 2. i n th i s it c a t e g ory b e b i sy l l ab i c wi th pre fers t h e p l us F i rs t , he states that a d i a cr i t i c bisy l l a b l e e pe nthe s i s in t o ass i g n p l u s d i a cr i t i c members one l o ok (wh i c h needed in A c c o rd i ng t o t h e e p e n t h e s i s rule c omp l i c a t e d ru l e s y nco p e a nd e p e n t h e s i s a n a l ys i s . t o e l im i n a t e it is a s o l u t i o n s) set pro p o s e s, f ew A c c e n t 2 c a s e s . i s d e s i rab l e f o r two r e a s o n s . he i t e l i m i nat e s t h e e p e n t h e sis ru l e mo n o sy l l a b l e of 1 mo n omor phem i c a nd t h e word s (1972) s o l ut i o n be c aus e S e c o nd, Accent according t o E l i a s s o n ) , t o n e t o t h e re l a t i ve l y Elia s s o n, the d i f f ere n t i a t e t h e u n d e r l ying re pre s e n t a t i o n, the wh e n a cce n t o f t h e s e mo n omor p h e m i c b i syl l a b l e s r e c e i ve A c c e n t (ne a r l y 4/5 o f i ns t e ad , spege l i n t h e u n d e r l ying r e p r e s t r e s s e dg e l e ft on the f i rs t rece i ve s A c c e n t 1 S i n c e mo s t e .g., argu e s a g a i n s t p o s i t i ng a mo n o sy l l ab i c h owe v e r, f or th i s type, th i s c orre c t l y r e c e i ve A c c e n t 2. u n d e r l y i ng re pre s e n t a t i o n b i sy l l a b l e s , t h e re f ore, s t re s s e d g e a nd, A c cent 2 wor d s o f Th e s e w o r d s have a a s s i g n e d a nd, f r amework. u n der l y i ng re pre s e n t a t i o n, i n the are p o s i t e d a s b i sy l l a b l e s s e n t a tion . to s i n c e i t pro w i t h i n th e s t re s s e dge t h e s e word s a r e a s s i g n e d a r i gh t re c e i ve A c c e n t 1 . l i t er a t ure i n the i s a p p e a l i ng 1 ca s e s f or t h e A c c e n t du c e s t h e f orth put a na l ys i s e p e n the s i s a c c ou n t Th e mo n o sy l l a b l e s t re s s e d b i sy l l a b l e w i t h A c c e n t 1 . is i n Swe d i s h . needed i n both "a p p l y t o disjo i n t s e t s mak i ng t h e exact l y - 168 - l i ke the memb e r s of the other s e t and v i c e l i ar s t ruct ur a l wh i c h may v e r s a. Thu s t h e se ru l e s e xh i b i t a nd s ugg e s t funct i ona l tha t c omp l eme n t ar i ty. j u s t o n e s i n g l e pro c e s s a pe cu s ome t h i ng is i nv o l ve d" (1972:187) . In s pit e o f t h e sy l l a b l e s fact that F i rs t. t a c t i c g r o u n d s by t h e disy l l a b l e s e nd i ng on s e gme n t o f t h e rhyme as a S e c o nd. l o ok 1. n. - e n. -e l . rhyme s . -er v i o l at e f or r. Ep e n t h e s i s for t h i s f orms o f s o n or i ty r e S p e c i fi c a l l y. f o rms the f i na l i s a h i gh l y i s mo t i v a t e d sy n ch r o n pho n o t a c t i c v i o l a t i o n. a l t h ough e p e n t h e s i s and a l ik e . e . g. , (1972) . r e s o l ve s Fur t h e rmo r e . Epenthe s i s d o e s de l e t e d by syn c o p e , Wh e n ' b i l l s' i f the it is is f or i n o rd e r t o r e du c e i n t h e word. ' m i rr ors ' o t h e r out. i n s e rt e d o n l y sync o p e make d i f f e r e n t t h ey d o s o for d i ffere n t r e a s o n s . Synco p e argue. s t r a t e d above . s e d l ar s o l u t i on b e c ome s s ege l . larl b e c ome s speg l ar v i o l at i on . to be t h e mon o sy l l ab i c a c c ord i ng t o E l i a s s o n I is pro f a c t that numb e r o f unstre s s e d sy l l a b l e s t h e s i s. d i a cr i t i c in the mo n osy l l a b i c r em e dy 's a i l ' p l us I c omp l i c a t e d. c a n b e mo t i va t e d o n ph o n o sy l l ab l e s o n o r o u s s e gme n t : oc c u r s , t o a d i a c r i t i c a nd e p e nthe s i s in st r i ct i ons word s is u nd e r l y i ng mo n o t h e mo n o sy l l ab l e p l us e p e n t he s i s s o l u t i o n thre e reas ons. I s e gll i s e qu a l i n Swed i sh pre f e rr e d ove r t h e b i sy l l ab l e i ca l ly p o s i t i ng f or the A c c e n t 1 c a s e s t h a t t h e e p e n t h e s i s ru l e pose that i ns t e ad o f s o no r i ty the not Is p e g e l l v i o l a t i o ns, i ns e rt a n leI wh i ch Th e s urfa c e form w i l l i n s e r t e d. it from - 169 - n ot as do i l l u cance l e p e nt he t i c + Epe n * spege l ar. ru l e s e x amp l e. d e r i ve d e . g. . the e a ch i s the n leI is r e su l t i n a s o n o r i ty s t ay s . Is e d l l Thus , th e f orm ' bi l l ' p l us larl (p l ur a l mark e r ) n i t e form the a n e p e n t h e t i c lei) s e d l ar d o e s rhyme - f i n a l not c o nt a i n s o nora nt There is n o n e e d, the i nd e f i - p l u s lar/ . The a s o n o r i ty v i o l a t i on i n t h e u n d er l y i ng r e pr e s e n- t a t i o n Is e d l l b e c ome s t h e o n s e t o f s urface. ( s ur f ace o f f rom s e d e l wh i c h c o n t a i n s s ur f a c e form s i nce and not t h e n e xt t h e re f or e , sy l l ab l e o n t h e to pos i t e p e n t he s i s in th i s c a s e . Th i rd, fe a sib l e t h e b i sy l l a b l e in the f ramework E l i a s s o n's a n a l y s i s , bi sy l l a b l e s d o t i ons , i .e . , Ac c e n t 2 hand , is ru l e , are f ew. w i th r e c e i ve A c c e n t 2 . mark t o o here . ma ny c a s e s . t h e mo n o sy l l a b l e A ccord i ng to t h a t mo n omorph em i c Thu s , e xce p- u nd e rgo t h e I n t h e framework h e r e , on the other b i sy l l ab l e s s e t, are initial t h e 400 d i acr i t i c to In t h e i s not t h a t do 2 be c au s e they have a s o l ut i on t h e Acc e nt 2 ru l e . mo n omor ph e m i c c omp l eme n t o f t h e be marked pro p o s e d s ome h ow k n own n o t undergo r eceiv e Acc e n t fore, it di acri t ic mo n omor ph em i c b i sy l l ab l e s the s e i s the p l us s t r e s s. e xce p t i o n s, i nd i ca t e t h a t a n a l ys i s p r e d i ct e d here , the Thu s , it t h a t mus t they shou l d not d i a c r i ti c mus t F o r the re a s on s o ut l i n e d a bove, p l us to e p e n t he s i s s o l u t i o n there- is a d o p t e d here. The s e cond Ac c e nt 1 (21) is case o f pr e s e n t e d in l ef t s t r e s s ( 21 ) edge words ( f r om E l e r t 1972 ) . B i sy l l ab l e s e nd i n g i n -e l s k e nge l sk ' E ng l i s h ' -e l s t me d e l s t ' by me a n s o f ' - ens f u f f e n s 'mis c h i e f' -ersk baje r s k ' Bava r i a n ' -e r s t 0vers t 'u p p e rmo s t ' -ert s t a ndert 'p e n n a n t ' - 170 - t h a t r e c e ive Wh a t i s s tr i k i ng abo u t the s e cases in ( 20 ) . mo n o sy l l ab l e s in the s i m i l a r l y to l yze d as the w i th e p e n t he s i s t age o f that Tha t c a n b e e x t e nd e d t o c ov e r t h e s e they there i s an r e l e v a n t e x amp l e s a r e A n o t h e r a dva n- . o n the cases . t h u s, is I n t he b i sy l t h e s e words o t h e r h a nd , i nd i c a t e , i s mor e h oweve r , c omp l i - i n t e ra c t i o n w i t h mor p h o l ogy o n t h e l i sted i s m o r e c omp l i c a t e d in here. The ( 22 ) . Mo n o sy l l ab l e + - e r ( p r e s en t t e n s e ) ' buy s ' kope r - e r (p l ur a l + um l a u t ) s t ad e r ' t own s ' ( s t ad ' t own ' ) - e n ( d e f i n i t e s u f f i x, a l so - e t ) a n d e n ' t h e d u c k' -r e (compa r a t i ve ) hogr e ' h i gh e r' 'typ i c a l ' typ i sk -isk ( a dj. s u f f ix ) Th e pre s e n t s u r f ace , t e n s e marke r , may be p o s it e d a s an s e n t a t i on . Thu s , p r e s ent s ur f a c e e nd in a a fter f orm , in a / r/ i s a dd e d, i s a s s i g ned , t h e s t em e nd s i n s ert e d , wh i c h in is - e r, or -r on t h e t h e u n d e r l y i ng r e pre- mon o syl l ab i c s t ems rema i n mono s y l l ab i c wh e n t e n s e marker Whe n a cce n t that t h ey may b e a n a- s h o u l d n o t r e ce i ve A c c e n t 2 . o n e h a n d a n d s i nc e e p e n t h e s is If t h e y behave would h av e t o b e mark e d w i t h a d i a cr i t i c t o ca t e d s ince the i s, a s s i g nme n t Th e t h i rd c a s e o f exc e p t i o n s ( 22 ) that u n d e r l y i ng re p r e s e n t a t i o n, f o l l ow i n g a c c e n t l a b l e p l u s s y nco p e s o l u t i o n , that is t h e mo n o sy l l ab l e p l u s e p e n t h e s i s s o l u t i on, it as we l l cas e s v owe l V e r b s t e m s t h a t end , however, a s s ig nme n t , , is n o n sy l l ab i c . then an t o produ c e the /ko p / + /r/ be c ome s kope r e .g., t h e s ur f a c e a s we l l it th e s e words t h e n r e c e i ve A c c e nt 1 . c on s o n a n t a c c e nt s i n ce /e / is c orre c t 'buy s ' . S t ems o t h e r t h a n / a/ r ema i n mo n o sy l l ab l e s o n e . g., i n -a, /bo/ + /r/ b e c ome s bor ' l i ve s ' . o n t h e o t h e r h a nd, - 17 1 - r e c e i ve A c c e n t 2 be c a u s e a s s i gn e d , the i r 1970 ) . a s s i gnme n t . um l a u t , c a s e s, Th e mark e r - e r c a s e s wh ere t h e r e L i nd a u are Words pos i t ed i s a l s o a n um l a u t i n t he e p e nthe s i s w i th re c e i ve ' s p ea k s . ' be Ag a i n, may a l s o the Acc e nt 2, a p p l i es -er p l ur a l , e .g. , f i l me r und e r l y i ng from th e surface f orms variation word a f t er in p l ur a l in ( cf . a c c e nt wi thout ' f i l ms . ' f or t h e -er p l ur a l structure a s /r/ but t h e u n d e r l y i n g r e pre s e n t a t i o n o f t h e two is accent wh e n b i sy l l a b i c / t a l a / + / r / be c ome s t a l ar e .g. , Th e p l u r a l those s t ems an I n the s e is jerI . a r e r e c overab l e the pr e s e n c e o r a bs e n c e o f a n a c c omp a ny i ng um l a u t . The defin i t e et , -t , -n a , may be r e p r e s e n t a t i o n. n umb e r s uff i x, pos i t ed s urface as an forms /n/ in A f t e r a c c e n t a s s i g nme n t , ( - n vs - n a, a p p l ies the s i s with - t vs a f t er pre s e n t t e n s e ma rk e r , -n ) a nd g e nder a c c e nt it (- n a s s i g nme n t o f -e n. -n , t h e u n d e r l y i ng is i nf l e c t e d vs a nd, o c c u r s wh e n t h e s t em e nd s f or E p e n -t ) . a s w i t h t he in a con s o nant . Whe n cases in the / n/ e p e nt he s i s ( 20 ) . i s the /sed l / 'b i l l ' the t h e de f i n i t e f o rm o f t h e i n s e r t i o n d e p e nd s o n wh e t h e r s i ng u l a r o r p l ura l 's a i l ' b e c ome s in to the d i re c t i on o f /s e g l / l arna app l i es s eg l e n in b e c ome s se d e l n d ef i n i t e -en suff i x . the i n the p l ur al ) . e p e n t h e s i s p l a c e s t h e s o n or a n t F o r examp l e. de f i n i t e p l ur a l d e f i n i te s i ngu l ar In both c a ses, i n t h e n e x t s y l l ab l e , but ( s e d h owever, r e l i ev i ng t h e s o nor i ty c l a s h. The c omp a r a t i ve e p e n t h e s i s s i nc e i s -re , e.g. , -re th i s w i l l /h o g / + /r/ c a n n ot be produce - e r, pos i ted a nd t h e bec ome s h o qre - 172 - as / r/ w i th s ur f a c e f orm 'h i gh e r' a nd n o t *hoge r . Th i s ca s e is a n o t h e r e xamp l e o f e p e n t h e s i s ru l e wh i ch i ns e r t s t h e vowe l t h e va r i a t i o n o f the t o the the r i gh t of s o n o r i ty v i o l a t i o n . F i na l l y , the s u f f i x - i sk may b e p o s i t e d a s /sk/ u n d e r l y i ng r e pr e s e nt a t i o n . E p e n t he s i s a s s i g nme n t b u t i ns er t e d an / i/ is i ns e r t i o n o f /e/ v e r s u s / i / b e f or e s o n o ra n t s, mo t i va t e d it than a wh e n / sk/ co n t i n u a n t, f o l l ows e .g . , a n / e/ . /e/ nonsonorant s . f or s o n o r i ty a f t e r acce n t i n s t e ad o f p r e d i ct a b l e : / i / be f ore i n t h i s case occurs is a p p l ie s :� is wh i ch / t y p+ s k/ is Th e i ns e r t e d Epenthe s i s re a s o n s a s a s t op, i n the we l l , is s i nce l e s s s o n or a n t b ecome s typ i s k but / g l om+sk/ b ecome s g l omsk ' f org e t f u l .' c e r t a i n word s I n s umm a ry, wit h a l e ft t h e s ur f ace may b e a n a l yz e d as s t age t h e s t r e s s e dg e of d e r i va t i o n mo n o sy l l ab l e s , e d g e a nd t h e s e words w i l l rece i ve Acce n t e p e n t he s i s ru l e , vowe l , 3.0 Der i v i ng t h e s ect i o n, co n t our o f acce n t t ona l it a r i gh t s tr e s s A f t e r t h e acce n t is d e r i v e d, an i ns e r t s a f o rm . i s d emo n s t ra t e d that d e r i ve d w i t h out the p i t ch acc e n t acce n t a s s i g nme n t The a n a l y s i s p r oc e e d s w i t h i n t h e a u t o s e gme n t a l and has two t i er mu s t b e d e t erm i n e d. mus t b e p r o p o s e d wh i ch t o nal As c o n t our ( G o l dsm i t h 1976) t ona l be a s s i gned on at the i s a s s i gned . t o t h e bisy l l a b i c s ur f a c e a word ca n b e i n f o rma t i o n . framework 1. mon o sy l l a b i c t r i gg e r e d by s o n or i ty v i o l a t i o n s , g i v i ng r i s e In th i s wh e n b e i ng s t re s s edge S eco nd, a s s oc i a t e s t h e tier t o t h e i r t o n e b e a r i ng u n i t s. - 173 - p ar t s. F i rs t , the a ma p p i ng ru l e t on e s o n t h e acce n t 3 . 1 Th e dat a Th e phono l o g i c a l an a l y s i s ph one t i c a n a l y s i s of A l t h ough Bru c e' s un c on t rovers i a l are both A c c en t The r e bl e 2, 1, Th e d i f f e r e n c e t h e p e ak th e p i t c h p e ak fa l l p i t ch pe ak by c omm.) , is data i s not h e p r ov i d e s exp l i - to l i es is the t h e s t r e s s e d sy l l a b l e . For in two the low p i t c h a c c en t s pr e s t r e s s s y l l ab l e . t h e s t re s s e d sy l l a- f or A c c en t 1. For A c c en t i n th e s tr e s s ed sy l l a b l e . a nd t h e p o s t s t r e s s sy l l a b l e . A c c en t 1 a nd A c c e n t 2 ( 1 977) . A c c en t 1 a nd A c c en t 2 the f a l l s w i t hin t h e s t r e s s e d sy l l ab l e, t i nu e s on t o Bru c e A c c ord i ng t o Bru c e i n p i t ch s o t hat l ies in a pit c h p e ak n e a r t h e s t r e s s e d b e t we e n r e l a t i ve c hara c t e r i zed by the pe rs. i s ba s e d on t h e of t h e f or p i t c h g e n e ra t i on. i s an abru p t is i n t e r pr e t a t i on ( Eng s t rand , chara c t e r i zed t i m i ng o f here Swed i s h a l s o Bru c e & G � rd i ng 1978) , c f. sy l l ab l e. S t o ckho l m ( 1977) c i t d a t a and r u l e s ( 1977 , p r e s en t e d the l ow The p i t ch p i t c h c o n- The d i f fe r en c e b e twe e n i l l us t r a t ed in ( 23) , wh e r e � = sy l l ab l e. A c c e nt 1 ( 23) Th e a c c en t s p i t c h p e ak wh i ch i ng t o Bru c e 2 A c c en t o f t en o c c ur is i n c on j un c t i on i n d e p end ent o f ( 1977) . f ocus , i .e . , in c o u r s e. t o Bru c e ' s da t a , ca l l s p i t ch Th i s extra p e ak s e rve s or phr a s e as being A c c ord i ng the a s e n t e n c e a c c en t ) o c c ur s accent the a c c o rd- t o mark a word h i gh l i gh t e d i n the d i s- f o c u s p e ak i n t h e word - 174 - w i th a s e c ond ( wh i ch h e o n t h e sy l l a b l e f o l l owin g t h e low t o n e t h e r e fore , focus ble . the p e ak is pit c h a c c e n t . fou n d o n t h e For A c c e n t 2 non- c omp o u n d s, fo l lowing t h e 2, o f the the pos tst r e s s sy l l a b l e . s e nte n c e (24 ) . in is a nd wit h A c c e n t focus Th e p e ak is i l l u- (24 ) . Ac c e n t 2 A c c e nt 2 0 ( c om p o u nd) � � (a o 0 . . . 0) According to GArd i ng a c c e nt s are Bru c e 1 979 , s t ab l e (1977 , GArd i ng cf . c o n t ou r s , a l s o Bru c e & G Ardi ng Lind b l ad & � * * * 1 978, f ou n d on t h e sy l l a b l e I n c ompou n d s pitch a c c e n t A c c e nt 1 a pos t s t r e s s sy l l a- a c c e n t i s t i e d t o t h e s e cond a ry s t r e s s . r e l a tionship be t we e n strated it For A c c e n t 1, whi l e t h e 1973 ) , the wor d s e n t e n c e a cc e n t is in flue n c e d by s e n t e n c e i n t o n a t i o n. 3.2 Th e ton al tier Th e me l ody pro p o s e d h e r e for Swe dis h include the the f o c u s p e ak. pit c h c urve here. Fir s t , p l aceme n t are c o n t ribu t io n s of t his ana l ysis is o f high a nd phono l og i c a l by s ema n t i c The r e l ow i n f orma t ion. give n/n ew a nd within the f ocus p e ak it a i.e . , bas e d on i s d e t e rmi n e d by can not be Th a t p r e di c t e d f rom is not t o s ay t h at t h e is phon o l ogi c a l l y u ni n t ere s t i ng , t h e discu s sio n - 175 - word f o c u s p eak i s a s sig n e d c r i t e ria, emph a s i s ma rking, not dire c t l y r e l a t e d t o f or s e p a r a t ing pit c h a c c e n t a nd t on e s ph o no l ogi c a l in f orma t io n a l o n e . focus peak t wo re a so n s doe s not conce r n e d wit h p r e di c ting t h e S i nce ( o r discour s e) pit c h a c c e n t only t h a t it h er e o f is t h e phono- l o gi c a l pre di c t a bi l ity o f t h e pit c h s h a p e by a c c o u n ting on l y pre s e n t e d here u s e Bru c e ' s f or t h e pit c h a c c e nt me l o dy, can be ( 1 977 ) o f a word . S e cond, t h e a na l y sis c ompar e d to t h r e e ot h e r a na l y s e s t hat d a t a bu t a l so do not ac c ou n t for t h e fo c u s peak.4 I wi l l b e gin by p r e s e n ting t hr e e r e c en t a na l y s e s o f S t o c kh o l m Swe dis h pit c h a c c e n t each of t h em. I wi l l whi c h ove r c ome s Fir s t , S chmid the then disadva n t a g e ( 1987 ) prop o s e s t h e wh ere L= (25 ) A c c ent 2 1 A c cord i ng to S c hm i d ( 1987 ) , be tone . s p e c i f ied f i l l e d i n by of t his for i n t onatio n, of a n a l y sis for A c c e n t and 1 s t an t f ea t ur e s word accent o t h e r a na l ys e s . f o l l ow i ng t o n a l of a n a l y sis is the Th e t iers f or rest of pos t s t r e s s as a c c e n t me l o dy Bru c e of is b o u n d a ry t o n e s . p i t ch shape t h e p re s t r e s s h i gh l ow for A c c e n t 2 a r e c o n ra ther tha n ( 1977) qu e s t i o n . t h a t know l e d g e be l onging t o t h e p r o po s e s . Th i s , of The main p rob l em w i t h t h i s a c c e n t a s sig nme n t d e t e rm i n e s r e p re s e n t a t i on . Tha t is, a Accent 1 P i e rrehumb e r t and Be c kman tie r s i n the f o c u s a nd c h o s e n bas e d on wh e t h e r t h e word has S e cond, s t r e s s e d sy l la b l e mu s t i t a s s ume s t hat i n t o n a t ion p rop er, t h e a u t o s e gme n t a l on l y the c orre c t l y c h a r a c t e riz e s t h e i s a n emp i ri c a l t h e t ona l the l ow t o n e and H= h i gh t o n e . s u c h as t h e s t r e s s e d sy l l a b l e , cou r s e , arguing a gain s t H L Whi l e a nd p r o p o s e a n a l t e r na t i ve a n a l y s i s Swe dish p i t c h a c c e n t , A c c e nt me l o dy ( 26 ) . - 176 - tona l t i e r is or A c c e nt ( f o r t h c om i ng ) 2. pro p o s e (26 ) Accent 2 Accent 1 H * * L H L I n the i r a n a l y s i s, wh i c h c l o s e r t o Br u c e's is both Swe d i sh p i t c h a cce n t d i f f e r e n c e, wh i c h a c c e nt s are than i s S chm i d's over the s t re s s e d sy l l a b l e. A c c ord i ng t o h i gh , Bruc e ' s B e c kma n ' s l ow t o n e . ana l ys i s . The t i m i ng 1 f r om A c c e n t 2, t one tha t s t r e s s e d sy l l ab l e w i t h is ma p s t o t h e t h e i r repre s e n t a t i o n, Th i s a n a l y s i s da t a . ( 1 977) My ( f or t h c om i ng) d i acri t i c A c c e nt a pre s t r e s s l i ke wh i c h , do not o f f er t h a n k n ow l e d g e Th i rd, f or both of a cce n t s . c o n c ern us h e r e + to r e pr e s e n t a ti o n S c hmid's is P i e rr e h umb e r t the use ( 1 987) a n a l y s i s, ma p ping t ak e s a ny c r i t e r i a p l ac e . a nd of the r e qu i r e s Th ey, as f or c h o o s i ng a t i er o t h e r t h e word ' s a c c e n t . Lore n t z (27) A c c e n t 1 c orr e c t l y c hara c t e ri z e s ob j e c t i o n t h a t a t i e r b e ch o s e n b e f o r e not essence of a nd A c c e n t 2 ma p s a h i gh t o t h e s t r e s s e d sy l l a b l e w i t h a pos t s t r e s s we l l , s e qu e nce, the d i s t i ng u i s h e s Ac c e n t l ow t o ne t o t h e L t one i n t e r p r e t a t i o n of r e pre s e n t e d by an as t e r i sk 1 ma ps a a H (1984) Th e ( see p r o pos e s s e c o nd H, the t ona l t i er in ( 27 ) t h e s e n t e nce a c c e nt , does f oot n o t e 4). Accent 2 * H It a ppe a r s , k n own be f ore L H in t h i s ana l ys i s , that ma p p i ng t ak e s p l a c e - 177 - the a c c e nt s i nce the t i er n e e d n o t be is t h e s ame f o r b o t h a c c e nt s. Lore ntz (1984 ) Th i s is pl aces not a dia crit i c i n s t e a d. H e p r o p o s e s t h a t t h e is mark e d A c c e nt 2 is wit h an the case , h owe v e r , o n the s e gme n t a l s t r e s s e d sy l l a b l e t o s h i f t t h e me l o dy a r r ow . Th e p u rp o s e f or A cc e n t of i n A c c e n t 2 wo r d s s o t h a t a l i gn s w i t h t h e s t r e s s e d s y l l a b l e a nd t h e L examp l e t i er 1 a s t e r i sk a nd t h e s t r e s s e d sy l l ab l e for i s mark e d w i t h an pos t s tress s i nce sy l l a b l e. to g i ve i s r e p e a t e d h e r e . Th e the so l id the a r r ow f i rs t H a l i gn s w i th the Accent l i ne the 2 c o nt our . His i nd i c a t e s t h e f i r s t a s s o ci a t i o n . (28 ) Accent A c c e nt 2 1 -;> och nunnorna * r o ch n umme � / \ , B e f ore * -..;> o ,ch n u n n o r n a , / , H L H H L H * * the c o n t our is " �/ / , � � , � H' L H * ma p p e d. there f ore , the t ona l t ie r i n t h e word a c c e n t mus t b e k n own . I propos e , the ( 29) i ns t e a d , a c c e n t me l o dy of Ac c e nt 1 + (29 ) t o r e p re s e nt S t o ckho l m Swe d i s h. A c c e nt 2 H L Th i s t i e r c o rre c t i s a l so c o n t o ur , d i f fe r e n t i a t e s resu l t uns pe c i f i ed i .e . , A c c e nt the 1 of t h e ma p p i ng r u l e, i s s e n s i t i ve t o metr i ca l f o r p i t ch a c c e n t p l a c eme n t from A c c e nt pro p o s e d c ho i c e . of the 2. i s d e r i ve d a s a in S e c t i o n fa l l The 3.3 , wh i c h wh i c h s t r u c t ur e r a t h e r t h a n d i re c t l y t o a c c e n t a s s i g nme n t . - 178 - 3 . 3 Th e mapp i ng ru l e Th e f i r s t ma p p i ng b e t we e n t h e t i e r o c c urs o n the s t re s s e d vowe l ca l l y, the word h a s a t one on map s t o wh e n t h e o t h er t o n e o n t h e t o na l at i o n l i n e s d o n o t ( 30 ) t i er map s c ro s s . stress edge . s t re s s e d vowe l a n d t h e r i gh t t o n e word h a s s e gme n t a l a r i gh t S pe c i f i wh e n t h e map s t o t h e s tre s s e d g e . i n s u c h a way t hat Th e r u l e s are s tat e d in Th e assoc i- ( 30 ) . Map p i ng Ru l e s 1a . Map H t o t h e s tre s s e d sy l l ab l e wh e n t h e word has a l e f t s tre s s e dg e . Map L to t h e s t r e s s e d sy l l ab l e wh e n t h e word has a r i gh t s t re s s e d g e . lb. 2. Map t h e rema i n i ng t o n e s u c h t hat as s o c iat i o n d o n o t cro s s . Whe n t h e word has l e ft ( 31 ) the the Wh i ch t o n e ma p s t o t h e t h e word ' s l e f t s tr e s s e d g e , s t re s s e d vowe l t i er and s t re s s e d vowe l . d e p e nd s l eft tona l to r i gh t , a l eft b e g i n n i ng s tre s s e d g e , the t on e s l i ne s map w i t h t h e s tre s s e d sy l l ab l e , f r om as in . ( 31 ) H L H L \ \ \'" f l i cka s w pr i n s e s sa w s 'V V Th e re s u l t i ng t o re s u l t of the (1977). I \ t e l e f o n n umme r w w s s w "'V of th i s ma p p i ng Bru c e H L i s the A c c e nt Th e ris e f o c u s p e ak , "V on wh i ch h ere. - 179 - nu has 2 c o n t o ur , a c c ord- i n t e l e f o nnummer not i s the bee n re p r e s e n t e d Wh e n t h e from r i ght in word h a s to l ef t , a r i gh t s t r e s s e dg e , the t o n e s ma p b e g i n n i ng w i t h t h e s t r e s s e d s y l l ab l e, as (32). (32) H L H L H L in I I \ \ t e l efon w w s be t a l a d s I 'W Th e r e s u l t of i ng t o Bru c e t h i s ma p p i ng is t h e A c ce n t 1 c o n t o u r, a c c o rd- ( 1977 ) . I n s umm a ry, p i t ch a c c e nt a t ona l me l o dy c h o i c e ma p s e dg e p a r ame t e r . to that is u n s p ec i f i e d a word a ccord i ng to Th i s p r o cedure d e r i ve s the for t he s t r e s s c or r e c t a c c e nt p a t t e r n o f Swe d i s h words . 4.0 S umma ry I n th i s o f a word paper (l s u ch t h a t i t wa s d e mo n s t r a t e d t h a t o r 2) c a n b e the d e r i v e d f r om a word w i t h a l e ft me t r i c a l t h e me t r i c a l a l s o b e u s e d t o d e r i ve w i thout a pr o p o s e d a nd here �A c c e n t � t h e word's t h us f rom t h e c omb i n a t i on re pre s e n t a t i o n s , st r e s s-t o n e wh i c h i n t era c t r e l a t i o n s h i p s, 2." Th e int era c t i o n is to l a be l e d dr i ve n by - 180 - t h e word c o u l d of e a ch a c c e nt a c c ent a s s i gnme n t . t h e fe a t ur e s i nv e n t ory o f Swe d i sh d i s St o ckho l m t he s t r u c t ure, wa s fur t h er e l i m i nat e s i n s t e ad is It c o n t our t i n c t i ve fe a t ur e s a nd sh ows Swe d i s h 1. s t ru c t ur e o f t o na l pr i o r i k n ow l e dg e o f Th e a n a l ys i s EA c c e n t ] the ac cent s t r e s s e d g e h a s A c c e n t 2 a nd a word w i t h a r i gh t s t r e s s e d g e h a s A c c e nt demo n s t r a t e d t h a t cho i c e o f that p i tch of d e r i ve "Ac c e nt the not i on a c c e nt in s t r e s s and t one two d i f f e r e n t 1" and "A c c e nt of t h e s t r e s s e dg e , and the ana l ys i s there fore doe s not requ i re k n ow l e d g e o f a word ' s a c c e nt a s s i g nme n t a p r i or i i n o rd e r t o p ro c e e d . * I wou l d l i k e t o t h a nk O l l e E ng s t ra nd , B j o r n Hamma r b e rg , Anders Ho l mb e r g , Fran c i s c o L a c e rd a , a nd Toma s R i a d f or r e a d i ng a nd c omme n t i ng o n a n e ar l i er dra f t o f t h i s p a p e r . A l l e r r o r s a r e my own . Th i s wo rk wa s s u p p or t e d by a s ch o l a r sh i p f r om t h e Swe d i s h I n s t i t u t e . Fo o t n o t e s 1 . Jensen ( 1 980 a ) a s s i g n s p i t ch t o Swe d i s h words b a s e d o n t h e s t r e s s p a t t e r n ( p r e s e n c e o r a bs e n c e o f s e c o n d a ry s t re s s ) o f the word a nd a r gu e s t h a t A c c e n t 1 a nd A c c e n t 2 a r e d e r i ve d n o t i o n s i n Swe d i sh ph o n o l o gy . 2 . C f . J e n s e n ( 1 980b ) a nd S chm i d ( 1 987 ) p o s a l s o f f o o t c o n s t r u c t i o n i n Swe d i s h . 3. Th i s wa s p o i n t e d o u t by f or t wo o t h e r p r o I r e n e V o ge l . 4. Lore n t z ( 1 98 4 ) i n c l ud e s the sentence ac cent H i n h i s t o na l t i e r , but h e d o e s n o t a c t u a l l y a c c ou n t f or i t s beha v i or . For e xamp l e , h i s a na l y s i s wou l d n o t p l a c e t h e s e n t e n c e a c c e n t c orr e c t l y i n a l o ng c om p o u n d w i t h s e v e r a l sy l l a b l e s be t we e n t h e p r i ma ry a nd s e c o nd a ry s t r e s s . Thu s , h i s t i e r i s t r e a t e d h e r e a s a word a c c e n t t i e r o n l y . Re f e r e n c e s B a i l e y , L . M. 1988 . Swe d i s h . L i ngu a 75 , A n o n- l i n e a r a n a l y s i s o f 1 0 3-12 4 . B o o i j , G . 1983 . P r i n c i p l e s a nd p a r ame t e r s l ogy . L i ngu i s t i c s 2 1 , 2 49- 280 . Bru c e . G . 1 977 . Swe d i sh Word A c c e n t s t i ve . Lund : G l e e ru p . p i t ch a c c e n t in i n p r o s o d i c pho n o i n S e nt e n c e Pe rspe c Bru c e , G . a nd E . G & rd i ng . 1978 . A p r o s od i c t y p o l o gy f or Swe d i sh d i a l e c t s . Nord i c P r o s o dy , e d . by E . G & rd i ng , G . Bru c e , R . B a n n e r t , 2 1 9- 2 28 . D e p t o f L i ng u i s t i c s : Lund U n i ve r s i ty . E l e r t , C . - C . 1972 . To n a l i ty i n Swe d i s h : Ru l e s a nd a l i s t o f m i n i ma l p a i r s . S t ud i e s f or E i n a r H a uge n , e d . by E . S . F i r ch ow , K . G r i ms t a d , N . Ha s s e l mo , W . A . O ' Ne i l , 1 5 1 -17 3 . Th e H a g u e : Mou t o n . E l i a s s o n . S . 1 972 . U n s t ab l e vowe l s i n Swe d i sh : Syn c o p e , e p e n t h e s i s or b o t h ? S t ud i e s f o r E i n a r Hauge n , e d . by E . - 18 1 - S. F i r c h ow , K . G r i ms t a d , Th e Hague : Mou t o n . G o l d sm i t h , MIT . G A rd i ng , G l e e ru p . ----- J. E. 1 977 . ----- 1 979 . 20 7- 21 5 . Ha s s e l mo , Au t o s egme n t a l 1 976 . 1 96 7 . N. I nt e r n a l W. A. O ' Ne i l , P h o n o l ogy . J un c t ure i n Swe d i s h . Th e S c a nd i n a v i a n Word A c c e n t s . Sentence i nt onat i on Ph . i n Swe d i s h . Lund : 1 74- 1 88 . D. D i ss . : Lund : G l e e ru p . Pho n e t i c a . 36 . G Ard i ng , E . a nd P . L i nd b l a d . 1 973 . C o ns t a n cy a nd var i a t i o n i n Swe d i sh word a c c e n t p a t t e r n s . Work i ng P ape r s 7 , 3 6 - 1 1 0 . Lund Un i ve r s i ty : P h o n e t i c s L a b or a t o ry . H a u g e n , E . 1 96 7 . On t h e ru l e s o f Norwe g i a n t o n a l i ty . u age 43 , 1 8 5 -20 2 . Hay e s , B . 1 98 1 . D i s s . MIT . A Me t r i c a l Th e o ry o f S t r e s s Ru l e s . ----- 1 982 . Ex t r ame t r i c a l i ty a nd Eng l i sh s t r e s s . I ngu i ry 1 3 , 227-276 . Jensen , Journa l J . T . 1 980 a . S t r e s s a n d a c c e n t o f L i ngu i s t i c s 3 , 25 - 5 6 . L a ng Ph . D. L i ngu i s t i c i n Swe d i sh . Nord i c ----- 1 980 b . Th e me t r i c a l s t r u c t ur e o f Swe d i sh A c c e nt . C a h i e r s L i ngu i s t i gu e s d ' O t t awa , e d . by J . T . J e n s e n , 27 1 282 . U n i ve r s i ty o f O t t awa : D e p a r tme n t o f L i ng u i s t i c s . L i b e rma n , D . Diss . : M . 1 975 . MIT . Th e I nt onat i o na l Sys t em o f E ng l i sh . L i b e rma n , M . a nd A . P r i n c e . 1 977 . O n s t r e s s rhy t hm . L i ngu i s t i c I ngu i ry 8 , 24 9 -33 6 . a nd Ph . l i ng u i s t i c L i nd a u , M . 1970 . P r o s o d i c p r o b l ems i n a g e n e r a t i ve ph o n o l o gy o f Swe d i s h . Work i ng P ape r s 2 , L u n d Un i ve r s i t y : Ph o n e t i c s L a b or a t o ry . L i n e l l , P . 1 972 . Rema rks o n Swe d i sh mor p h o l ogy . Up p s a l a Un i v e r s i t y : De p a r tme n t o f L i ng u i s t i c s . RUUL 1 . Lor e nt z , O . 1 984 . S t r e s s a nd t o n e i n a n a c c e nt l a ng u a g e . Nord i c P r o s o dy I I I , e d . by C - C . E l e r t , I . J oh a n s s o n , E . S t r a ng e r t , 1 6 5 - 1 78 . S t o ckho l m : A l mqv i s t & W i ks e l l . N e s p o r , M . a nd I . r e c h t : Fo r i s . Voge l . P i e r r e humb e r t , J . 1 980 . E ng l i sh I n t o n a t i o n . Ph . 1 986 . P r o s o d i c Ph o n o l ogy . Do rd The Ph o n o l ogy a nd Ph o n e t i c s o f D . D i ss . : MIT . - 1 82 - P i e r r e h umb e r t , J . a n d M . of Tone i n Japane s e . P r i n c e , A . 198 3 . 1 4 , 1 9- 1 0 0 . B e c kma n . f or t h c om i n g . Re l a t i ng t o t h e gr i d . Th e S t ru c t ur e L i ngu i s t i c R i s c h e l , J . 196 3 . Mor p h em i c t o n e a nd word t o n e Norwe g i a n . Ph o n e t i c a 10 , 1 5 4- 1 6 4 . S chm i d , K . B . 1987 . N o n- l i n e a r Appr o a c h e s P r o s o dy . Ph . D . D i s s . Br own Un i ve r s i t y . I nqu i ry i n Eas t e r n t o Swe d i sh Word ohma n , S . 196 6 . G e n e r a t i va r e g l e r f o r d e t s v e n s k a v e r b e t s f o n o l og i o c h pr o s od i . F o rh a nd l i nga r v i d S amma n k oms t f or a t t Dry f t a Fr �qor Ror a n d e Sve n s k a n s B e skr i vn i ng I I I , e d . by S . A l l e n , 71- 8 7 . G o t e bo r g Un i ve r s i t y : I n s t i t u t i o n e n f or n or d i sk a s pr �k , I n s t i t u t i o n e n f or e ng l e sk a s pr�k e t . - 183 -