SlideShare a Scribd company logo
1 of 40
The notion of sub-skills in reading
comprehension tests:
An EAP example.
Lumley, T. (1993)

Cindy
2012/11/7
Outline
I. Perceptions of reading subskills in
  ESL
II. The relationship between reading
  subskills and test items
III. The study
IV. Results: Difficulty of subskills
V. Rasch IRT analysis and reading
  subskills
VII. Discussion
VIII. Conclusion
I. Perceptions of
   reading subskills in ESL
The    divisibility of reading comprehension
   into discrete subskills (e.g., Bloom, 1956;
   Gary, 1960; Davis, 1968; Munby, 1978).
1. Reading subskills in syllabus for ESL
Munby’s (1978) framework for specifying
   ESP syllabus content, including its
   extensive list of language microskills, has
   been strongly criticized.
 need analysis
I. Perceptions of
   reading subskills in ESL                  (Cont.)

2. Subskills in test construction
 Carroll’s (1980) identification of lg skills
 From Munby’s taxonomy of 54 lg skills, listing
   11 skills as suitable for testing.
Hughes’ (1989) identification of two levels of
   subskills…
(1)Macroskills: understanding the ideas in the text
   (info., gist, argument)
(2)Microskills: recognizing and interpreting the
   more linguistic features of the text (referents,
   word meanings, discourse indicators)
I. Perceptions of
   reading subskills in ESL             (Cont.)


What     is still unclear is HOW teachers
-- identify microskills
-- be involved in constructing tests, and

  What sort of reliability and validity might
  be attached to Ts’ judgments.
II. The relationship between
   reading subskills and test items
   In Alderson and Lukmani’s (1989) study, Ts….
(1) showed relatively little agreement about the
    subskills tested by a range of reading
    comprehension test items;
(2) disagreed over the order of cognitive abilities
    demanded by the same item;
(3) classified lower/higher achievers based on the
    test, but higher on the test requiring higher-
    order cognitive skills
 Cognitive levels were unrelated to levels of
    linguistic proficiency.
II. The relationship between
   reading subskills and test items
                                       (Cont.)



 low discrimination
 not knowing why the judgments made
 the choices they did with regard to the
 skills tested by test items
 The need for making explicit the
 interpretations of the subskills described.
III. The study
Research     questions:
1) Does a group of 5 experienced EAP Ts
  perceive a common hierarchy of difficulty
  among the subskills?
2) Is it possible for the same group of Ts to
  reach agreement upon subskills tested by
  individual test items in a test of reading
  comprehension?
1. The test
 EAP
Non-English-speaking background Ss
A university reading test with 2 texts,
 total length 1500 words
58 items
Item types: short answer, multiple choice,
 matching, T/F, completing a flow-chart,
 labeling a map
2. The subjects
3   groups of NNS (n = 158)
1) Oversea Ss (n = 90)
2) Ss from a language center (n = 50)
3) Preparing postgraduate qualification in
   business administration (n = 18)
?? Unequal number of participants in each group
3. Test analysis




Rasch analysis, using the program of QUEST
showed items misfitting (square values above
the acceptable limit of 1.3).
4. Procedure

    To establish a common interpretation of
    subskills descriptions and criteria

A    post hoc content analysis
5. Existing lists of subskills

Munby’s  (1978) framework
19 reading microskills were examined
6. Final selection of test items for
analysis

22   items

 difficulty (logit values: -1.875 to 1.875)
 discrimination levels (classical analysis in
 the range 0.55 to 0.97)
 facility levels (0.89 to 0.25)
7. Development of the list of subskills
To  develop the wording of
 subskill description for those analyzing
 the test
Final version: 22 items
8. The raters
 qualified ESL/EFL teachers
 at least five years’ experience
 MA degree in applied linguistics
 involved in language test construction
 the group included the two test developers
 completed the reading test before the
 rating session
9. The rating session
  Procedure
1) Perceived difficulty on a 4-point scale
   (A-D; with A, representing the easiest)
2) They rated the selected items on the same scale.
3) Selected the single, highest-level skill from the lists of
  subskills
4) Each person allocated a subskill to the item.
  To establish a level of agreement about the procedure
  and the interpretation of the subskill description, this
  process was repeated for 3 more items, of varying of
  difficulty.
5) Subskills were then matched to the remaining items
  by each group member.
9. The rating session (Cont.)
1)   The importance of determining the
     focus of each subskill
•    Subskill 4. Explaining a fact with:
     4.1 a single clause
     4.2 multiple clauses
•    Subskill 6. Analysis of the elements within a process,
     to examine methodically their causal/sequential
     relationship

    The difference was initially unclear.
9. The rating session (Cont.)
2) The concept of one or more subksills
  was necessary but not sufficient for
  answering Qs
•   Subskill 1 ‘Dealing with relatively uncommon
    vocabulary: matching of words/phrases referred to in
    text with given equivalent meanings’
    -- impossible to describe or measure it
9. The rating session (Cont.)
3) Some subskills would occur at several or
  all levels
•   Subskill 5: ‘Selecting a phrase as summarizing the main
    topic of a text’


4) Two subskills listed as important in
  reading comprehension, skimming and
  scanning were needed repeatedly
  throughout the test, but could not be
  identified as central to particular items.
9. The rating session (Cont.)
5) To alter the wording of some subskills
   To add a 9th subskill to the existing list
 No level of perceived difficulty was identified



6) Potential confusion remained between
  subskill 9, ‘understanding grammatical
  and semantic reference’ and
  subskill 3 ‘identification of information in
  the text.’
IV. Results: Difficulty of subskills
                                                                 the range of
                                                                 rating given




Bold
represents
cases where
80% or greater
agreement




          For 11 of the 14 subskills is seen to be substantial
          agreement about inherent level of difficulty.
No guidelines were given to the group
as to how the raters should interpret the
levels A to D.
IV. Results: Reading subskills
                   matched
                               to each item

For item 6,
although the
skill required
was seen as
subskill 2,
agreement was
not reached as
to whether this
was subskill 2.1
or 2.2.




         5 raters were able to reach almost complete agreement on which the skill
         to answer the 22 items.
V. Rasch IRT analysis and reading
subskills
1.   Use of IRT in tests of reading comprehension
    Rasch analysis in language testing is unique in
     mapping student ability and item difficulty on
     the same scale.
    Resulting analyses of strengths and weaknesses
     of both individuals and groups have the
     potential to provide useful guidance for
     teachers in planning their teaching,
     e.g., Tests of Reading Comprehension for
     children in primary schools (TORCH) scale of
     representing particular reading subskills.
V. Rasch IRT analysis and reading
subskills                   (Cont.)

2. IRT analysis as validation of teacher perception

  Q: Do items identified by the group of
 teachers as requiring the same subskills
 occur at roughly the same level of difficulty
 as each other, according to the Rasch
 analysis?
Figure 1. Item logit values and skill difficulty levels
VI. Summary of findings
 1) Teachers have a high level of agreement
 about subskills tested by particular test items,
 and they also share common perceptions
 about the relevant difficulty of the subskills.
  2) A significant correlation between teachers’
 perceptions of the difficulty of each subskill
 and the logit values obtained from the IRT
 analysis for items identified as testing as same
 skills. The subskills are seem to fit into broad
 bands of increasing difficulty.
VII. Discussion
 Not practicable to examine all items in the
 reading test
 Use larger, complete sets of test items and
 subskill description

 Limited generalizability because of the
 relationship between question difficulty,
 subskills and text properties.
 The test should be composed of reading texts
 similar to those commonly encountered in the
 final year or two of high school.
 Uncontrollable test-taking process
 Introspective studies of test-takers’ behavior could
 establish whether or not the results of this study
 are supported by the test-takers themselves

 The influence of test method facets (Bachman, 1990):
  To what extent do the item type and formulation of
 the Q affect reader performance, and to what
 extent is performance determined by the text itself?
 Employing various testing methods of the same
 texts, or parallel methods with different types and
 leveled texts.
VII. Discussion
Figure   1
 ?? One skill had to be fully acquired before the
 next could be mastered
 -- Gradually emerging mastery of linguistic skills
 of increasing difficulty, the ability increases
 (Griffin & Nix, 1991)

 ?? How widely the bands in Figure 1 may
 extend
VIII. Conclusion
RQ 1) Does a group of 5 experienced EAP
 Ts perceive a common hierarchy of
 difficulty among the subskills?

    After brief discussion of the use of Rasch
    IRT in analysis of reading comprehension
    test items, the Ts’ consensus regarding
    subskill difficulty level is compared to the
    Rasch analysis of item difficulty, and the
    significant correlation found gives some
    empirical validation to the Ts’
VIII. Conclusion          (Cont.)

RQ 2) Is it possible for the same group of
Ts to reach agreement upon subskills
tested by individual test items in a test of
reading comprehension?

    A high level of concordance between
    raters’ perceptions
Implications

 The  value of using Ts’ judgments in
  examining test content, and the
  procedure in test development, involving
  mapping skills from test content.
 The judgments Ts make about Linguistic
  matters in test design and content validity
  also have significant for teaching.
Reflection

The  diagnostic value of a subskill analysis
 of test performance, the information
 yielded by the identification of any subskill
 as inadequately developed in a group of
 Ss could signal to a T a useful area of
 work as a focus for teaching.
YLE Starters
YLE Movers
YLE Flyers
Thank you!

More Related Content

What's hot

Situational language teaching
Situational language teachingSituational language teaching
Situational language teachingLoc Le
 
Week 1 and week 2 what is young adult literature
Week 1 and week 2 what is young adult literatureWeek 1 and week 2 what is young adult literature
Week 1 and week 2 what is young adult literaturebersihmin
 
Module VIII Testing the Productive Skills
Module VIII Testing the Productive SkillsModule VIII Testing the Productive Skills
Module VIII Testing the Productive SkillsAnnalynInsic
 
ESP (Course Design and skills and strategies)
ESP (Course Design and skills and strategies)ESP (Course Design and skills and strategies)
ESP (Course Design and skills and strategies)Lord Mark Jayson Ilarde
 
Remedial English Instruction (5 of 16)
Remedial English Instruction (5 of 16)Remedial English Instruction (5 of 16)
Remedial English Instruction (5 of 16)Nheru Veraflor
 
Material development in elt current issues
Material development in elt current issues Material development in elt current issues
Material development in elt current issues Aprilianty Wid
 
testing the receptive skills
testing the receptive skillstesting the receptive skills
testing the receptive skillsPachica, Gerry B.
 
Chapter 7(assessing speaking )
Chapter 7(assessing speaking )Chapter 7(assessing speaking )
Chapter 7(assessing speaking )Kheang Sokheng
 
Testing Grammar
Testing GrammarTesting Grammar
Testing GrammarSamcruz5
 
How to assess and test reading
How to assess and test readingHow to assess and test reading
How to assess and test readingLeslie Gomez
 
4+Approaches+in+Teaching+Literature.pdf
4+Approaches+in+Teaching+Literature.pdf4+Approaches+in+Teaching+Literature.pdf
4+Approaches+in+Teaching+Literature.pdfgaylejoyligaya
 
ASSESSMENT: DISCRETE POINT TEST, INTEGRATIVE TESTING, PERFORMANCE-BASED ASSES...
ASSESSMENT: DISCRETE POINT TEST, INTEGRATIVE TESTING, PERFORMANCE-BASED ASSES...ASSESSMENT: DISCRETE POINT TEST, INTEGRATIVE TESTING, PERFORMANCE-BASED ASSES...
ASSESSMENT: DISCRETE POINT TEST, INTEGRATIVE TESTING, PERFORMANCE-BASED ASSES...A. Tenry Lawangen Aspat Colle
 
Summary on LANGUAGE TESTING & ASSESSMENT (Part I) Alderson & Banerjee
Summary on LANGUAGE TESTING & ASSESSMENT (Part I) Alderson & Banerjee Summary on LANGUAGE TESTING & ASSESSMENT (Part I) Alderson & Banerjee
Summary on LANGUAGE TESTING & ASSESSMENT (Part I) Alderson & Banerjee MissJillSmith
 

What's hot (20)

Situational language teaching
Situational language teachingSituational language teaching
Situational language teaching
 
Language Curriculum Design - Chapter 9
Language Curriculum Design - Chapter 9Language Curriculum Design - Chapter 9
Language Curriculum Design - Chapter 9
 
Week 1 and week 2 what is young adult literature
Week 1 and week 2 what is young adult literatureWeek 1 and week 2 what is young adult literature
Week 1 and week 2 what is young adult literature
 
Assessing listening
Assessing listeningAssessing listening
Assessing listening
 
LITERATURE AND LANGUAGE TEACHING
LITERATURE AND LANGUAGE TEACHINGLITERATURE AND LANGUAGE TEACHING
LITERATURE AND LANGUAGE TEACHING
 
Module VIII Testing the Productive Skills
Module VIII Testing the Productive SkillsModule VIII Testing the Productive Skills
Module VIII Testing the Productive Skills
 
ESP (Course Design and skills and strategies)
ESP (Course Design and skills and strategies)ESP (Course Design and skills and strategies)
ESP (Course Design and skills and strategies)
 
Remedial English Instruction (5 of 16)
Remedial English Instruction (5 of 16)Remedial English Instruction (5 of 16)
Remedial English Instruction (5 of 16)
 
Approaches to language testing
Approaches to language testingApproaches to language testing
Approaches to language testing
 
Material development in elt current issues
Material development in elt current issues Material development in elt current issues
Material development in elt current issues
 
testing the receptive skills
testing the receptive skillstesting the receptive skills
testing the receptive skills
 
Chapter 7(assessing speaking )
Chapter 7(assessing speaking )Chapter 7(assessing speaking )
Chapter 7(assessing speaking )
 
Testing Grammar
Testing GrammarTesting Grammar
Testing Grammar
 
Assessing Speaking
Assessing SpeakingAssessing Speaking
Assessing Speaking
 
How to assess and test reading
How to assess and test readingHow to assess and test reading
How to assess and test reading
 
4+Approaches+in+Teaching+Literature.pdf
4+Approaches+in+Teaching+Literature.pdf4+Approaches+in+Teaching+Literature.pdf
4+Approaches+in+Teaching+Literature.pdf
 
Testing Reading
Testing ReadingTesting Reading
Testing Reading
 
ASSESSMENT: DISCRETE POINT TEST, INTEGRATIVE TESTING, PERFORMANCE-BASED ASSES...
ASSESSMENT: DISCRETE POINT TEST, INTEGRATIVE TESTING, PERFORMANCE-BASED ASSES...ASSESSMENT: DISCRETE POINT TEST, INTEGRATIVE TESTING, PERFORMANCE-BASED ASSES...
ASSESSMENT: DISCRETE POINT TEST, INTEGRATIVE TESTING, PERFORMANCE-BASED ASSES...
 
Testing reading
Testing readingTesting reading
Testing reading
 
Summary on LANGUAGE TESTING & ASSESSMENT (Part I) Alderson & Banerjee
Summary on LANGUAGE TESTING & ASSESSMENT (Part I) Alderson & Banerjee Summary on LANGUAGE TESTING & ASSESSMENT (Part I) Alderson & Banerjee
Summary on LANGUAGE TESTING & ASSESSMENT (Part I) Alderson & Banerjee
 

Similar to Sub-skills in reading comprehension tests

The assessment of deep word knowledge in young learners
The assessment of deep word knowledge in young learnersThe assessment of deep word knowledge in young learners
The assessment of deep word knowledge in young learnersCindy Shen
 
A Comparison Of The Performance Of Analytic Vs. Holistic Scoring Rubrics To A...
A Comparison Of The Performance Of Analytic Vs. Holistic Scoring Rubrics To A...A Comparison Of The Performance Of Analytic Vs. Holistic Scoring Rubrics To A...
A Comparison Of The Performance Of Analytic Vs. Holistic Scoring Rubrics To A...Richard Hogue
 
Designing Common Core Assessments LILAC 2013
Designing Common Core Assessments LILAC 2013Designing Common Core Assessments LILAC 2013
Designing Common Core Assessments LILAC 2013lfoy
 
(1)assignment 7.1 a identifying letters, phonemes, and graphemes
(1)assignment 7.1 a identifying letters, phonemes, and graphemes (1)assignment 7.1 a identifying letters, phonemes, and graphemes
(1)assignment 7.1 a identifying letters, phonemes, and graphemes ssuserfa5723
 
(1) assignment 7.1 a identifying letters, phonemes, and grapheme
(1) assignment 7.1 a identifying letters, phonemes, and grapheme(1) assignment 7.1 a identifying letters, phonemes, and grapheme
(1) assignment 7.1 a identifying letters, phonemes, and graphemessuserfa5723
 
Method effects on reading comprehension test performance
Method effects on reading comprehension test performanceMethod effects on reading comprehension test performance
Method effects on reading comprehension test performanceMasoomehVedadtaghavi
 
Tooltip-type, Frame-type, and Concordance Glossing in L2 Reading
Tooltip-type, Frame-type, and Concordance Glossing in L2 ReadingTooltip-type, Frame-type, and Concordance Glossing in L2 Reading
Tooltip-type, Frame-type, and Concordance Glossing in L2 Readingengedukamall
 
Site2011 tomidaokibayashitamura
Site2011 tomidaokibayashitamuraSite2011 tomidaokibayashitamura
Site2011 tomidaokibayashitamuraEiji Tomida
 
Beachwood PTO Information Night
Beachwood PTO Information NightBeachwood PTO Information Night
Beachwood PTO Information NightKen Veon
 
Reading Comprehension Processes and Strategies in L1 and L2 in Malaysian Prim...
Reading Comprehension Processes and Strategies in L1 and L2 in Malaysian Prim...Reading Comprehension Processes and Strategies in L1 and L2 in Malaysian Prim...
Reading Comprehension Processes and Strategies in L1 and L2 in Malaysian Prim...juraikha
 
20080603 Assessment Final
20080603 Assessment Final20080603 Assessment Final
20080603 Assessment FinalElly Lin
 
Annotated Bibliography
Annotated BibliographyAnnotated Bibliography
Annotated Bibliographykstaff
 
What does language testing have to offier
What does language testing have to offierWhat does language testing have to offier
What does language testing have to offierAPSACS
 
CCSS Overview For Teacher Educators
CCSS Overview For Teacher EducatorsCCSS Overview For Teacher Educators
CCSS Overview For Teacher EducatorsEileen Murphy
 
Assessment &testing in the classroom
Assessment &testing in the classroomAssessment &testing in the classroom
Assessment &testing in the classroomCidher89
 
Assessment &testing in the classroom
Assessment &testing in the classroomAssessment &testing in the classroom
Assessment &testing in the classroomCidher89
 
Staircase of Complexity
Staircase of ComplexityStaircase of Complexity
Staircase of ComplexityTrish Huerster
 

Similar to Sub-skills in reading comprehension tests (20)

The assessment of deep word knowledge in young learners
The assessment of deep word knowledge in young learnersThe assessment of deep word knowledge in young learners
The assessment of deep word knowledge in young learners
 
A Comparison Of The Performance Of Analytic Vs. Holistic Scoring Rubrics To A...
A Comparison Of The Performance Of Analytic Vs. Holistic Scoring Rubrics To A...A Comparison Of The Performance Of Analytic Vs. Holistic Scoring Rubrics To A...
A Comparison Of The Performance Of Analytic Vs. Holistic Scoring Rubrics To A...
 
Designing Common Core Assessments LILAC 2013
Designing Common Core Assessments LILAC 2013Designing Common Core Assessments LILAC 2013
Designing Common Core Assessments LILAC 2013
 
(1)assignment 7.1 a identifying letters, phonemes, and graphemes
(1)assignment 7.1 a identifying letters, phonemes, and graphemes (1)assignment 7.1 a identifying letters, phonemes, and graphemes
(1)assignment 7.1 a identifying letters, phonemes, and graphemes
 
(1) assignment 7.1 a identifying letters, phonemes, and grapheme
(1) assignment 7.1 a identifying letters, phonemes, and grapheme(1) assignment 7.1 a identifying letters, phonemes, and grapheme
(1) assignment 7.1 a identifying letters, phonemes, and grapheme
 
L2 assessment
L2 assessmentL2 assessment
L2 assessment
 
Method effects on reading comprehension test performance
Method effects on reading comprehension test performanceMethod effects on reading comprehension test performance
Method effects on reading comprehension test performance
 
Tooltip-type, Frame-type, and Concordance Glossing in L2 Reading
Tooltip-type, Frame-type, and Concordance Glossing in L2 ReadingTooltip-type, Frame-type, and Concordance Glossing in L2 Reading
Tooltip-type, Frame-type, and Concordance Glossing in L2 Reading
 
Site2011 tomidaokibayashitamura
Site2011 tomidaokibayashitamuraSite2011 tomidaokibayashitamura
Site2011 tomidaokibayashitamura
 
Beachwood PTO Information Night
Beachwood PTO Information NightBeachwood PTO Information Night
Beachwood PTO Information Night
 
pr2 dll week 1.docx
pr2 dll week 1.docxpr2 dll week 1.docx
pr2 dll week 1.docx
 
Reading Comprehension Processes and Strategies in L1 and L2 in Malaysian Prim...
Reading Comprehension Processes and Strategies in L1 and L2 in Malaysian Prim...Reading Comprehension Processes and Strategies in L1 and L2 in Malaysian Prim...
Reading Comprehension Processes and Strategies in L1 and L2 in Malaysian Prim...
 
20080603 Assessment Final
20080603 Assessment Final20080603 Assessment Final
20080603 Assessment Final
 
Annotated Bibliography
Annotated BibliographyAnnotated Bibliography
Annotated Bibliography
 
What does language testing have to offier
What does language testing have to offierWhat does language testing have to offier
What does language testing have to offier
 
CCSS Overview For Teacher Educators
CCSS Overview For Teacher EducatorsCCSS Overview For Teacher Educators
CCSS Overview For Teacher Educators
 
Assessment &testing in the classroom
Assessment &testing in the classroomAssessment &testing in the classroom
Assessment &testing in the classroom
 
Assessment &testing in the classroom
Assessment &testing in the classroomAssessment &testing in the classroom
Assessment &testing in the classroom
 
Staircase of Complexity
Staircase of ComplexityStaircase of Complexity
Staircase of Complexity
 
Designing language test
Designing language testDesigning language test
Designing language test
 

More from Cindy Shen

Back-to-School BINGO班級經營第一堂課
Back-to-School BINGO班級經營第一堂課Back-to-School BINGO班級經營第一堂課
Back-to-School BINGO班級經營第一堂課Cindy Shen
 
106下學期四年級英語教學計畫
106下學期四年級英語教學計畫106下學期四年級英語教學計畫
106下學期四年級英語教學計畫Cindy Shen
 
106上學期四年級英語教學計畫
106上學期四年級英語教學計畫106上學期四年級英語教學計畫
106上學期四年級英語教學計畫Cindy Shen
 
天母國小沈佳慧老師
天母國小沈佳慧老師天母國小沈佳慧老師
天母國小沈佳慧老師Cindy Shen
 
輔導員沈佳慧老師公開授課觀課教師回饋與省思紀錄
輔導員沈佳慧老師公開授課觀課教師回饋與省思紀錄輔導員沈佳慧老師公開授課觀課教師回饋與省思紀錄
輔導員沈佳慧老師公開授課觀課教師回饋與省思紀錄Cindy Shen
 
2017Cindy英輔團公開授課0428照片成果表
2017Cindy英輔團公開授課0428照片成果表2017Cindy英輔團公開授課0428照片成果表
2017Cindy英輔團公開授課0428照片成果表Cindy Shen
 
Food support card in differentiated instruction
Food support card in differentiated instruction Food support card in differentiated instruction
Food support card in differentiated instruction Cindy Shen
 
Frogs Can't Fly reciprocal teaching worksheets (Cindy Shen)
Frogs Can't Fly reciprocal teaching worksheets (Cindy Shen)Frogs Can't Fly reciprocal teaching worksheets (Cindy Shen)
Frogs Can't Fly reciprocal teaching worksheets (Cindy Shen)Cindy Shen
 
105第二學期Cindy每週一句Week 2
105第二學期Cindy每週一句Week 2105第二學期Cindy每週一句Week 2
105第二學期Cindy每週一句Week 2Cindy Shen
 
天母國小英語每週一句設計理念
天母國小英語每週一句設計理念天母國小英語每週一句設計理念
天母國小英語每週一句設計理念Cindy Shen
 
全校英語推廣活動:每週一句英文廣播節目
全校英語推廣活動:每週一句英文廣播節目 全校英語推廣活動:每週一句英文廣播節目
全校英語推廣活動:每週一句英文廣播節目 Cindy Shen
 
英語口說Rubrics
英語口說Rubrics英語口說Rubrics
英語口說RubricsCindy Shen
 
Tech bio(沈佳慧cindy老師)
Tech bio(沈佳慧cindy老師)Tech bio(沈佳慧cindy老師)
Tech bio(沈佳慧cindy老師)Cindy Shen
 
我是小小設計師! I Am a Little Designer! (Cindy老師)
我是小小設計師! I Am a Little Designer! (Cindy老師)我是小小設計師! I Am a Little Designer! (Cindy老師)
我是小小設計師! I Am a Little Designer! (Cindy老師)Cindy Shen
 
英語教學好站抱抱(Cindy老師分享)
英語教學好站抱抱(Cindy老師分享)英語教學好站抱抱(Cindy老師分享)
英語教學好站抱抱(Cindy老師分享)Cindy Shen
 
公開授課行動研究(天母國小沈佳慧老師)
公開授課行動研究(天母國小沈佳慧老師)公開授課行動研究(天母國小沈佳慧老師)
公開授課行動研究(天母國小沈佳慧老師)Cindy Shen
 
105學年度 天母國小英語每週一句2016-2017
105學年度 天母國小英語每週一句2016-2017105學年度 天母國小英語每週一句2016-2017
105學年度 天母國小英語每週一句2016-2017Cindy Shen
 
開學第一堂的班級經營策略(天母國小沈佳慧)
開學第一堂的班級經營策略(天母國小沈佳慧)開學第一堂的班級經營策略(天母國小沈佳慧)
開學第一堂的班級經營策略(天母國小沈佳慧)Cindy Shen
 
Warm Up Activity BINGO card icebreaker
Warm Up Activity BINGO card icebreakerWarm Up Activity BINGO card icebreaker
Warm Up Activity BINGO card icebreakerCindy Shen
 
班際競賽蘋果班牌
班際競賽蘋果班牌班際競賽蘋果班牌
班際競賽蘋果班牌Cindy Shen
 

More from Cindy Shen (20)

Back-to-School BINGO班級經營第一堂課
Back-to-School BINGO班級經營第一堂課Back-to-School BINGO班級經營第一堂課
Back-to-School BINGO班級經營第一堂課
 
106下學期四年級英語教學計畫
106下學期四年級英語教學計畫106下學期四年級英語教學計畫
106下學期四年級英語教學計畫
 
106上學期四年級英語教學計畫
106上學期四年級英語教學計畫106上學期四年級英語教學計畫
106上學期四年級英語教學計畫
 
天母國小沈佳慧老師
天母國小沈佳慧老師天母國小沈佳慧老師
天母國小沈佳慧老師
 
輔導員沈佳慧老師公開授課觀課教師回饋與省思紀錄
輔導員沈佳慧老師公開授課觀課教師回饋與省思紀錄輔導員沈佳慧老師公開授課觀課教師回饋與省思紀錄
輔導員沈佳慧老師公開授課觀課教師回饋與省思紀錄
 
2017Cindy英輔團公開授課0428照片成果表
2017Cindy英輔團公開授課0428照片成果表2017Cindy英輔團公開授課0428照片成果表
2017Cindy英輔團公開授課0428照片成果表
 
Food support card in differentiated instruction
Food support card in differentiated instruction Food support card in differentiated instruction
Food support card in differentiated instruction
 
Frogs Can't Fly reciprocal teaching worksheets (Cindy Shen)
Frogs Can't Fly reciprocal teaching worksheets (Cindy Shen)Frogs Can't Fly reciprocal teaching worksheets (Cindy Shen)
Frogs Can't Fly reciprocal teaching worksheets (Cindy Shen)
 
105第二學期Cindy每週一句Week 2
105第二學期Cindy每週一句Week 2105第二學期Cindy每週一句Week 2
105第二學期Cindy每週一句Week 2
 
天母國小英語每週一句設計理念
天母國小英語每週一句設計理念天母國小英語每週一句設計理念
天母國小英語每週一句設計理念
 
全校英語推廣活動:每週一句英文廣播節目
全校英語推廣活動:每週一句英文廣播節目 全校英語推廣活動:每週一句英文廣播節目
全校英語推廣活動:每週一句英文廣播節目
 
英語口說Rubrics
英語口說Rubrics英語口說Rubrics
英語口說Rubrics
 
Tech bio(沈佳慧cindy老師)
Tech bio(沈佳慧cindy老師)Tech bio(沈佳慧cindy老師)
Tech bio(沈佳慧cindy老師)
 
我是小小設計師! I Am a Little Designer! (Cindy老師)
我是小小設計師! I Am a Little Designer! (Cindy老師)我是小小設計師! I Am a Little Designer! (Cindy老師)
我是小小設計師! I Am a Little Designer! (Cindy老師)
 
英語教學好站抱抱(Cindy老師分享)
英語教學好站抱抱(Cindy老師分享)英語教學好站抱抱(Cindy老師分享)
英語教學好站抱抱(Cindy老師分享)
 
公開授課行動研究(天母國小沈佳慧老師)
公開授課行動研究(天母國小沈佳慧老師)公開授課行動研究(天母國小沈佳慧老師)
公開授課行動研究(天母國小沈佳慧老師)
 
105學年度 天母國小英語每週一句2016-2017
105學年度 天母國小英語每週一句2016-2017105學年度 天母國小英語每週一句2016-2017
105學年度 天母國小英語每週一句2016-2017
 
開學第一堂的班級經營策略(天母國小沈佳慧)
開學第一堂的班級經營策略(天母國小沈佳慧)開學第一堂的班級經營策略(天母國小沈佳慧)
開學第一堂的班級經營策略(天母國小沈佳慧)
 
Warm Up Activity BINGO card icebreaker
Warm Up Activity BINGO card icebreakerWarm Up Activity BINGO card icebreaker
Warm Up Activity BINGO card icebreaker
 
班際競賽蘋果班牌
班際競賽蘋果班牌班際競賽蘋果班牌
班際競賽蘋果班牌
 

Recently uploaded

Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphThiyagu K
 
The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13Steve Thomason
 
fourth grading exam for kindergarten in writing
fourth grading exam for kindergarten in writingfourth grading exam for kindergarten in writing
fourth grading exam for kindergarten in writingTeacherCyreneCayanan
 
9548086042 for call girls in Indira Nagar with room service
9548086042  for call girls in Indira Nagar  with room service9548086042  for call girls in Indira Nagar  with room service
9548086042 for call girls in Indira Nagar with room servicediscovermytutordmt
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfciinovamais
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityGeoBlogs
 
Arihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfArihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfchloefrazer622
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)eniolaolutunde
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeThiyagu K
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsTechSoup
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdfQucHHunhnh
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactdawncurless
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxheathfieldcps1
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxiammrhaywood
 
Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDThiyagu K
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Krashi Coaching
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactPECB
 
Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104misteraugie
 

Recently uploaded (20)

Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot Graph
 
The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13
 
fourth grading exam for kindergarten in writing
fourth grading exam for kindergarten in writingfourth grading exam for kindergarten in writing
fourth grading exam for kindergarten in writing
 
9548086042 for call girls in Indira Nagar with room service
9548086042  for call girls in Indira Nagar  with room service9548086042  for call girls in Indira Nagar  with room service
9548086042 for call girls in Indira Nagar with room service
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
 
Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activity
 
Arihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfArihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdf
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and Mode
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The Basics
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impact
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
 
Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SD
 
Advance Mobile Application Development class 07
Advance Mobile Application Development class 07Advance Mobile Application Development class 07
Advance Mobile Application Development class 07
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global Impact
 
Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104
 

Sub-skills in reading comprehension tests

  • 1. The notion of sub-skills in reading comprehension tests: An EAP example. Lumley, T. (1993) Cindy 2012/11/7
  • 2. Outline I. Perceptions of reading subskills in ESL II. The relationship between reading subskills and test items III. The study IV. Results: Difficulty of subskills V. Rasch IRT analysis and reading subskills VII. Discussion VIII. Conclusion
  • 3. I. Perceptions of reading subskills in ESL The divisibility of reading comprehension into discrete subskills (e.g., Bloom, 1956; Gary, 1960; Davis, 1968; Munby, 1978). 1. Reading subskills in syllabus for ESL Munby’s (1978) framework for specifying ESP syllabus content, including its extensive list of language microskills, has been strongly criticized.  need analysis
  • 4. I. Perceptions of reading subskills in ESL (Cont.) 2. Subskills in test construction  Carroll’s (1980) identification of lg skills  From Munby’s taxonomy of 54 lg skills, listing 11 skills as suitable for testing. Hughes’ (1989) identification of two levels of subskills… (1)Macroskills: understanding the ideas in the text (info., gist, argument) (2)Microskills: recognizing and interpreting the more linguistic features of the text (referents, word meanings, discourse indicators)
  • 5. I. Perceptions of reading subskills in ESL (Cont.) What is still unclear is HOW teachers -- identify microskills -- be involved in constructing tests, and  What sort of reliability and validity might be attached to Ts’ judgments.
  • 6. II. The relationship between reading subskills and test items  In Alderson and Lukmani’s (1989) study, Ts…. (1) showed relatively little agreement about the subskills tested by a range of reading comprehension test items; (2) disagreed over the order of cognitive abilities demanded by the same item; (3) classified lower/higher achievers based on the test, but higher on the test requiring higher- order cognitive skills  Cognitive levels were unrelated to levels of linguistic proficiency.
  • 7. II. The relationship between reading subskills and test items (Cont.)  low discrimination  not knowing why the judgments made the choices they did with regard to the skills tested by test items  The need for making explicit the interpretations of the subskills described.
  • 8. III. The study Research questions: 1) Does a group of 5 experienced EAP Ts perceive a common hierarchy of difficulty among the subskills? 2) Is it possible for the same group of Ts to reach agreement upon subskills tested by individual test items in a test of reading comprehension?
  • 9. 1. The test  EAP Non-English-speaking background Ss A university reading test with 2 texts, total length 1500 words 58 items Item types: short answer, multiple choice, matching, T/F, completing a flow-chart, labeling a map
  • 10. 2. The subjects 3 groups of NNS (n = 158) 1) Oversea Ss (n = 90) 2) Ss from a language center (n = 50) 3) Preparing postgraduate qualification in business administration (n = 18) ?? Unequal number of participants in each group
  • 11. 3. Test analysis Rasch analysis, using the program of QUEST showed items misfitting (square values above the acceptable limit of 1.3).
  • 12. 4. Procedure  To establish a common interpretation of subskills descriptions and criteria A post hoc content analysis
  • 13. 5. Existing lists of subskills Munby’s (1978) framework 19 reading microskills were examined
  • 14. 6. Final selection of test items for analysis 22 items  difficulty (logit values: -1.875 to 1.875)  discrimination levels (classical analysis in the range 0.55 to 0.97)  facility levels (0.89 to 0.25)
  • 15. 7. Development of the list of subskills To develop the wording of subskill description for those analyzing the test Final version: 22 items
  • 16. 8. The raters  qualified ESL/EFL teachers  at least five years’ experience  MA degree in applied linguistics  involved in language test construction  the group included the two test developers  completed the reading test before the rating session
  • 17. 9. The rating session  Procedure 1) Perceived difficulty on a 4-point scale (A-D; with A, representing the easiest) 2) They rated the selected items on the same scale. 3) Selected the single, highest-level skill from the lists of subskills 4) Each person allocated a subskill to the item. To establish a level of agreement about the procedure and the interpretation of the subskill description, this process was repeated for 3 more items, of varying of difficulty. 5) Subskills were then matched to the remaining items by each group member.
  • 18. 9. The rating session (Cont.) 1) The importance of determining the focus of each subskill • Subskill 4. Explaining a fact with: 4.1 a single clause 4.2 multiple clauses • Subskill 6. Analysis of the elements within a process, to examine methodically their causal/sequential relationship  The difference was initially unclear.
  • 19. 9. The rating session (Cont.) 2) The concept of one or more subksills was necessary but not sufficient for answering Qs • Subskill 1 ‘Dealing with relatively uncommon vocabulary: matching of words/phrases referred to in text with given equivalent meanings’ -- impossible to describe or measure it
  • 20. 9. The rating session (Cont.) 3) Some subskills would occur at several or all levels • Subskill 5: ‘Selecting a phrase as summarizing the main topic of a text’ 4) Two subskills listed as important in reading comprehension, skimming and scanning were needed repeatedly throughout the test, but could not be identified as central to particular items.
  • 21. 9. The rating session (Cont.) 5) To alter the wording of some subskills To add a 9th subskill to the existing list  No level of perceived difficulty was identified 6) Potential confusion remained between subskill 9, ‘understanding grammatical and semantic reference’ and subskill 3 ‘identification of information in the text.’
  • 22. IV. Results: Difficulty of subskills the range of rating given Bold represents cases where 80% or greater agreement For 11 of the 14 subskills is seen to be substantial agreement about inherent level of difficulty.
  • 23. No guidelines were given to the group as to how the raters should interpret the levels A to D.
  • 24. IV. Results: Reading subskills matched to each item For item 6, although the skill required was seen as subskill 2, agreement was not reached as to whether this was subskill 2.1 or 2.2. 5 raters were able to reach almost complete agreement on which the skill to answer the 22 items.
  • 25. V. Rasch IRT analysis and reading subskills 1. Use of IRT in tests of reading comprehension  Rasch analysis in language testing is unique in mapping student ability and item difficulty on the same scale.  Resulting analyses of strengths and weaknesses of both individuals and groups have the potential to provide useful guidance for teachers in planning their teaching, e.g., Tests of Reading Comprehension for children in primary schools (TORCH) scale of representing particular reading subskills.
  • 26. V. Rasch IRT analysis and reading subskills (Cont.) 2. IRT analysis as validation of teacher perception Q: Do items identified by the group of teachers as requiring the same subskills occur at roughly the same level of difficulty as each other, according to the Rasch analysis?
  • 27.
  • 28. Figure 1. Item logit values and skill difficulty levels
  • 29. VI. Summary of findings 1) Teachers have a high level of agreement about subskills tested by particular test items, and they also share common perceptions about the relevant difficulty of the subskills. 2) A significant correlation between teachers’ perceptions of the difficulty of each subskill and the logit values obtained from the IRT analysis for items identified as testing as same skills. The subskills are seem to fit into broad bands of increasing difficulty.
  • 30. VII. Discussion  Not practicable to examine all items in the reading test  Use larger, complete sets of test items and subskill description  Limited generalizability because of the relationship between question difficulty, subskills and text properties.  The test should be composed of reading texts similar to those commonly encountered in the final year or two of high school.
  • 31.  Uncontrollable test-taking process  Introspective studies of test-takers’ behavior could establish whether or not the results of this study are supported by the test-takers themselves  The influence of test method facets (Bachman, 1990): To what extent do the item type and formulation of the Q affect reader performance, and to what extent is performance determined by the text itself?  Employing various testing methods of the same texts, or parallel methods with different types and leveled texts.
  • 32. VII. Discussion Figure 1 ?? One skill had to be fully acquired before the next could be mastered -- Gradually emerging mastery of linguistic skills of increasing difficulty, the ability increases (Griffin & Nix, 1991) ?? How widely the bands in Figure 1 may extend
  • 33. VIII. Conclusion RQ 1) Does a group of 5 experienced EAP Ts perceive a common hierarchy of difficulty among the subskills?  After brief discussion of the use of Rasch IRT in analysis of reading comprehension test items, the Ts’ consensus regarding subskill difficulty level is compared to the Rasch analysis of item difficulty, and the significant correlation found gives some empirical validation to the Ts’
  • 34. VIII. Conclusion (Cont.) RQ 2) Is it possible for the same group of Ts to reach agreement upon subskills tested by individual test items in a test of reading comprehension?  A high level of concordance between raters’ perceptions
  • 35. Implications The value of using Ts’ judgments in examining test content, and the procedure in test development, involving mapping skills from test content. The judgments Ts make about Linguistic matters in test design and content validity also have significant for teaching.
  • 36. Reflection The diagnostic value of a subskill analysis of test performance, the information yielded by the identification of any subskill as inadequately developed in a group of Ss could signal to a T a useful area of work as a focus for teaching.