Phonetic Difficulty Index (PDI) is an algorithm phonetically transcribing orthographic text and tagging it with sixty-one expected pronunciation error tags: phonemic, graphophonemic, allophonic, morphological, accentual, sandhi, etc. The resulting resource can be used for learning, teaching, testing, materials preparation, textbook evaluation, dictionary compilation, syllabus design and in many other capacities in EFL pronunciation pedagogy. In this presentation we show examples of how PDI can be used in (i) phonolapsologically analyzing and evaluating graded readers (Reading A-Z) targeting native American children, but adaptable for EFL, (ii) providing well-defined resources to help meet IFA pronunciation syllabus goals, such as "the student should know the phonemic shape of words within a basic English vocabulary, including word stress"
14. Frequent PDI codegrams in RA-Z 7 6 5 4 3 2 1 No used, lived, named 1777 rZ all, also, small, almost 2556 Y0 with, these, those 2569 LZ have, his, hand, whose 5476 UZ for, or, water , over, were, 8136 J0 the, that, them, than 22208 JL of, and, as, was 22399 JZ word examples RA-Z codegram
17. 1. Know the phonemic shape of words within a basic English vocabulary . ALL attested sentences of RA-Z (>3 words) with the maximum score for PDI(Z): all words end in voiced obstruents. text id RA-Z record (sentence) H01 Male s ha ve lar ge antler s H07 Sna g snee zes an d snee zes K16 Reptile s an d amphibian s ha ve backbon es N14 Kind s o f s nail s an d s lu gs Q06 Buildin gs an d bri dges destroy ed W04 Camel s cactu ses an d san d dun es
18. 2. Be able to recognize all the phonemic contrasts of English perceptively, and to implement them in their own production A sentence with 14 occurrences of PDI(0), British≠American , 73 words, PDI/word=1.8. When he crossed pea plants with other pea plants possessing opposite traits , like short with tall or smooth-seeded with wrinkled-seeded , he discovered that the resulting plant always looked like one or the other rather than like a mix of both .
19.
20.
21.
22.
23.
Notes de l'éditeur
You should mention that this does not HAVE to look so. This dump is a rather repelling sight to a typical windows-user
RA-Z is an acronym for Reading from level A to Z. a set of more than a thousand books for American schoolers. This screenshot presents the electronic version of the books available along with the print-out versions which have been the subject of our analyses. Out of the books available on 26 levels, categorised according to the letters of the alphabet plus some additional ones such as nursery rhymes, poetry books or vocabulary books, we have been granted access to the so called graded books, grade G to X, i.e. early school age. Unfortunately, we have no permission to study other types of books such as read aloud books, which – if PDI-zed, would certainly benefit EFL users phonetically.
RAZ library is not limited to English. It has books in American English, British English, French and Spanish. We have analysed the American books. This, considering the dominance of British English in Polish education, allows for phonetic contrasts between the two standards. RA-Z books were written with the American reader in mind. However, ESL and EFL learners also benefit from them. Taking into account the fact that PDI operates on the British Oxford dictionary lexicon, this opportunity to contrast English cross-Atlantically seems extremely promising for our phonetic goals. And as Professor Sobkowiak will explain in greater detail, these texts meet most IFA guidelines for phonetics teaching. First of all, these are authentic materials with fine grained-levels of semantic and syntactic difficulty combined with phonetic difficulty grading.
Which is shown on this graph. PDI confirms that these books demonstrate a steady, though uneven, growth in the phonetic difficulty across as many as 18 levels. PDI allows to identify phonetically most tricky levels. Obviously, application-wise it might be necessary to specify which particular books carry the stigma of some specific phonetic problems, such as a high number of occurrences of ‘th’ or ough’ phonograms. PDI does that as well.
18 levels and 317 analysed texts produce a rough picture of RAZ phonetic difficulty. As can be easily seen some texts on low sematic-syntactic level of difficulty (which means low RAZ level, i.e. to the left of the chart) are quite high as far as phonetic difficulty is concerned (vertically). This arrangement facilitates the process of text selection or materials design to the point when phonetics teachers could create lists of texts for practising specific texts for individual students.
Here is the phonolapsological profile of the RAZ corpus as compared against MEDAL and Brown. The selection of these two reference corpora is naturally not coincidental. MEDAL is a corpus that bears a number of similarities with RA-Z corpus, some of them are the encyclopedic style of writing, addressing learner needs and controlled language use. The choice of Brown is to some extent opportunistic: as no large corpus of general English has ever been phonolapsologically processed . However, it is commonly used as point of reference in corpus studies as one of the 'classic' low-level learner corpora and it is a corpus of American English, exactly like RA‑Z. Also MEDAL definitions have already been PDI-analysed by Professor Sobkowiak, and this has been very useful for our research. A glance at the chart reveals some surprises. MEDAL learner’s dictionary definitions might be expected to constitute a phonetically more challenging type of text than RAZ books for school children. T he American RA-Z corpus is about one third of the classic Brown corpus containing a million words. And yet RAZ has an almost identical phonetic profile as the other two EFL learner corpora.
Naturally, the fact that these texts have a rising phonetic difficulty, does not necessarily imply using them with adults. And yet, there is still another aspect that promotes such use. Regardless of their being books for early schoolers, these books present a variety of topics, that even IFA students might be interested in. Books on politics, genetics, famous people or events in history, of which random examples are listed on the screen.
This slide shows that even low level books may be used for phonetics training, Here for voicing practice. Ten slajd wykosić?
Naturally, using such texts for focus on particular phonetic elements depends on the IFA specific guidelines or individual student’s needs. Yet, as can be seen, the potential for phonetics training is extensive. This text covers a number of interesting phonetics aspects that are surely worth training, such as the words with ou/ow or words such as AIR, LARGE, TURN, WHERE.
Some difficulties occur more frequently than others, naturally. The most frequent one PDI(J) is indirectly related to the morphological criteria in that the more complex the word (length in syllables, compounding, secondary stress) the higher the chance of unstressed schwas occurring in the relatively many weak syllables. Schwa is one of the most typical pronuncing problems among Polish EFL learners and is heavily represented in the RAZ corpus. Considering a high frequency of occurrence of a limited selection of words with a schwa and voiced word-final obstruents in RAZ books, on skillful text selection (which means PDI-driven selection) the reader may be exposed to schwa- or word-final voiced obstruent-loaded texts covering words from the high frequency vocabulary pool. This may facilitate raising phonetics awareness and acquisition.
Selecting texts with a combination of these highly frequent aspects of phonetics may produce even more interesting findings. Bigrams involving short schwa PDI(J) are on the top of the list of most frequent codegrams in RAZ books. At the same time the word examples are words that many EFL students have problems with and their high occurrence in RAZ texts may facilitate noticing the schwa, especially in combination with word-final voicing PDI(JZ) or schwa with voiced apico-dentals PDI(JL). Similarly, PDI(J0), i.e. schwa and British vs American pronunciation patterns may well serve exercises involving high frequency basic vocabulary. Even though only f ive word types are responsible for exactly half of the incidence of PDI(J0) in RA‑Z: for, or, water, over , were, in frequency order , awareness that only these words will be encoutered by the student while reading may facilitate evaluation procedure by providing the teacher with a list of focus words available and a selection of sentences including as many words from the focal list as feasible. Tu przydałoby się pokazać zdanie z czymś takim.
This slide demonstrates visually the frequency of words including PDI(JZ), which stands for schwa and final voicing, a potential trap for most Polish readers of English texts. As one can see in RAZ the most frequent words with this difficult point are AROUND, TOWARD, COVERED, RIVERS, HOURS and KILOMETERS, i.e. basic level words, which in the context of what Professor Sobkowiak will say is a crucial point.
From the perspective of individual RA-Z texts and sentences, rather than levels, it is possible to locate local maxima for a given PDI code. There are, for example, 46 records in the RA-Z corpus, longer than one word, where each word ends in a voiced obstruent, thus getting the PDI(N) tag. Six of these records contain more than three words, thus constituting local word-final-devoicing maxima.