SlideShare a Scribd company logo
1 of 26
Week 2
Sutrisno Sadji Evenddy, M.Pd.
   Practicality
   Reliability
   Validity
   Authenticity
   Washback
   Not expensive,
   Within appropriate time constraint,
   Relatively easy to administer,
   A scoring/evaluation procedure that is
    specific and time-efficient.
   1. Are administrative details clearly
       established before the test?
   2. Can students complete the test
       reasonably within the set time frame?
   3. Is the cost of the test within budget
        limits?
   Consistency of assessment results
    (Linn & Gronlund).

   A test is reliable if:
   “You give the same test to the same
student or matched students on two
different occasions, the test should yield
similar results.” (Brown,2004)
   Students-related reliability
   Rater reliability
   Test administration reliability
   Test reliability
The most common learner-
related issue in reliability is
caused by temporary
illness, fatigue, a “bad
day”, anxiety, and other
physical or psychological
factors.
    Inter-rater reliability:
    When two or more scorers yield
    inconsistent scores of the same test.

    Factors: lack of attention to scoring,
               inexperience, inattention, etc.
   Intra-rater
     Scoring criteria, fatigue, bias toward
     particular “good” and “bad” students, or
     simple carelessness.
   It can be caused by administration
    factors.
    e.g. noisy from outside, photocopying
          variations, room condition, even
          condition of desks and chair.
Factors cause unreliability:
  If a test too long, test takers may
   become fatigued by the time they reach
   the later items and hastily respond
   incorrectly.
  Ambiguous items.
“Measuring what should be measured”

o   Content-related evidence
o   Criterion-related evidence
o   Construct-related evidence
o   Consequential validity
o   Face validity
   If a test samples the subject matter
    about which conclusions are to be drawn.
   If a test requires the test-taker to
    perform the behavior that is being
    measured.
is used to demonstrate the accuracy of
a measure or procedure by comparing
it with another measure or procedure
which has been demonstrated to be
valid.
Example

imagine a hands-on driving test has been
shown to be an accurate test of driving
skills. By comparing the scores on the
written driving test with the scores from
the hands-on driving test, the written
test can be validated by using a criterion
related strategy in which the hands-on
driving test is compared to the written
test.
1.Concurrent validity/ empiric validity
    if a test result is supported by other
concurrent performance beyond
assessment itself.

e.g.

the validity of a high score on the final
exam of a foreign language course will be
substantiated by actual proficiency in the
language.
2. Predictive validity


    to assess (and predict) a test
taker’s likelihood of future success.

    e.g SNMPTN
How well performance on the
assessment can be interpreted as
meaningful measure of some characteristics
or quality.
How well use of assessment results
accomplishes intended purposes and avoids
unintended effect.
   It refers to the degree to which a test
    looks right, and appears to measure the
    knowledge or ability it claims to
    measure, based on the subjective
    judgment of the examinees who take
    it, the administrative personnel who
    decide on its use, and other
    psychometrically unsophisticated
    observers (Mousavi in Brown, 2004)
   The language       as natural as possible.
   Items      contextualized rather than
    isolated.
   Topics      meaningful
    (relevant, interesting) for the learner.
   Some thematic organization to items is
    provided, such as through a story line or
    episode.
   Tasks represent, or closely
    approximate, real-world tasks.
Contextualized              Decontextualized
‘Going to”

1. What _______ this        1. There are three countries I
   weekend?                    would like to visit. One is
   a. you are going to do      Italy.
   b. are you going to do        a. The other is New
   c. your gonna do                 Zealand
                                    and other is Nepal
                                 b. The others are New
                                     Zealand and Nepal
                                 c. Others are New
                                    Zealand and Nepal
Contextualized       Contextualized

2. I’m not sure.      2. When I was twelve
_______ anything         years old, I used
special?                  ______every day.
a.Are you going to       a. swimming
   do                    b. to swimming
b.You are going to do    c. to swim
c. Is going to do
   The effect of testing on teaching and
    learning (Hughes in Brown, 2004).
   Generally refers to the effects tests have
    on instruction in terms of how students
    prepare for the test (Brown, 2004).
Principles of language assessment

More Related Content

What's hot

Testing, assessing, and teaching
Testing, assessing, and teachingTesting, assessing, and teaching
Testing, assessing, and teaching
Sutrisno Evenddy
 
Communicative language testing
Communicative language testingCommunicative language testing
Communicative language testing
Ida Mantra
 
Communicative Testing
Communicative  TestingCommunicative  Testing
Communicative Testing
Ningsih SM
 
Lexical syllabus
Lexical syllabusLexical syllabus
Lexical syllabus
UNACHI
 
Language testing and evaluation validity and reliability.
Language testing and evaluation validity and reliability.Language testing and evaluation validity and reliability.
Language testing and evaluation validity and reliability.
Vadher Ankita
 
Testing for Language Teachers Arthur Hughes
Testing for Language TeachersArthur HughesTesting for Language TeachersArthur Hughes
Testing for Language Teachers Arthur Hughes
Rajputt Ainee
 
Chapter 7(assessing speaking )
Chapter 7(assessing speaking )Chapter 7(assessing speaking )
Chapter 7(assessing speaking )
Kheang Sokheng
 

What's hot (20)

Testing, assessing, and teaching
Testing, assessing, and teachingTesting, assessing, and teaching
Testing, assessing, and teaching
 
Communicative language testing
Communicative language testingCommunicative language testing
Communicative language testing
 
Language Testing : Principles of language assessment
Language Testing : Principles of language assessment Language Testing : Principles of language assessment
Language Testing : Principles of language assessment
 
Principles of Language Assessment
Principles of Language AssessmentPrinciples of Language Assessment
Principles of Language Assessment
 
Assessments, concepts and issues
Assessments, concepts and issuesAssessments, concepts and issues
Assessments, concepts and issues
 
Communicative Testing
Communicative  TestingCommunicative  Testing
Communicative Testing
 
Achieving beneficial blackwash
Achieving beneficial blackwashAchieving beneficial blackwash
Achieving beneficial blackwash
 
Introduction to Language Assessment by Brown
Introduction to Language Assessment by BrownIntroduction to Language Assessment by Brown
Introduction to Language Assessment by Brown
 
Test Techniques
Test TechniquesTest Techniques
Test Techniques
 
Principles of Language Assessment
Principles of Language AssessmentPrinciples of Language Assessment
Principles of Language Assessment
 
Beyond tests alternatives in assessment
Beyond tests alternatives in assessmentBeyond tests alternatives in assessment
Beyond tests alternatives in assessment
 
Summary on LANGUAGE TESTING & ASSESSMENT (Part I) Alderson & Banerjee
Summary on LANGUAGE TESTING & ASSESSMENT (Part I) Alderson & Banerjee Summary on LANGUAGE TESTING & ASSESSMENT (Part I) Alderson & Banerjee
Summary on LANGUAGE TESTING & ASSESSMENT (Part I) Alderson & Banerjee
 
Testing for Language Teachers
Testing for Language TeachersTesting for Language Teachers
Testing for Language Teachers
 
Lexical syllabus
Lexical syllabusLexical syllabus
Lexical syllabus
 
Designing classroom language tests
Designing classroom language testsDesigning classroom language tests
Designing classroom language tests
 
Language testing and evaluation validity and reliability.
Language testing and evaluation validity and reliability.Language testing and evaluation validity and reliability.
Language testing and evaluation validity and reliability.
 
Kinds of tests and testing
Kinds of tests and testingKinds of tests and testing
Kinds of tests and testing
 
ASSESSMENT CONCEPTS AND ISSUES
ASSESSMENT CONCEPTS AND ISSUESASSESSMENT CONCEPTS AND ISSUES
ASSESSMENT CONCEPTS AND ISSUES
 
Testing for Language Teachers Arthur Hughes
Testing for Language TeachersArthur HughesTesting for Language TeachersArthur Hughes
Testing for Language Teachers Arthur Hughes
 
Chapter 7(assessing speaking )
Chapter 7(assessing speaking )Chapter 7(assessing speaking )
Chapter 7(assessing speaking )
 

Similar to Principles of language assessment

constructionoftests-211015110341 (1).pptx
constructionoftests-211015110341 (1).pptxconstructionoftests-211015110341 (1).pptx
constructionoftests-211015110341 (1).pptx
GajeSingh9
 
Assessment &testing in the classroom
Assessment &testing in the classroomAssessment &testing in the classroom
Assessment &testing in the classroom
Cidher89
 
04 Formative and Summative Assessment Practices for the Co-Taught Classroom.ppt
04 Formative and Summative Assessment Practices for the Co-Taught Classroom.ppt04 Formative and Summative Assessment Practices for the Co-Taught Classroom.ppt
04 Formative and Summative Assessment Practices for the Co-Taught Classroom.ppt
TubiNaz1
 

Similar to Principles of language assessment (20)

CLASSROOM ACTIVITIES
CLASSROOM  ACTIVITIESCLASSROOM  ACTIVITIES
CLASSROOM ACTIVITIES
 
testing and evaluation
testing and evaluation testing and evaluation
testing and evaluation
 
Educational Evaluation for Special Education
Educational Evaluation for Special EducationEducational Evaluation for Special Education
Educational Evaluation for Special Education
 
PRINCIPLES OF ASSESSMENT 2.pptx
PRINCIPLES OF ASSESSMENT 2.pptxPRINCIPLES OF ASSESSMENT 2.pptx
PRINCIPLES OF ASSESSMENT 2.pptx
 
constructionoftests-211015110341 (1).pptx
constructionoftests-211015110341 (1).pptxconstructionoftests-211015110341 (1).pptx
constructionoftests-211015110341 (1).pptx
 
Construction of Tests
Construction of TestsConstruction of Tests
Construction of Tests
 
Testing
TestingTesting
Testing
 
Educational Psychology and Assessment
Educational Psychology and Assessment Educational Psychology and Assessment
Educational Psychology and Assessment
 
Assessment &testing in the classroom
Assessment &testing in the classroomAssessment &testing in the classroom
Assessment &testing in the classroom
 
Assessment &testing in the classroom
Assessment &testing in the classroomAssessment &testing in the classroom
Assessment &testing in the classroom
 
Pba1
Pba1Pba1
Pba1
 
A AND E.ppt
A AND E.pptA AND E.ppt
A AND E.ppt
 
3232423232323232323232323232323232323 .pptx
3232423232323232323232323232323232323 .pptx3232423232323232323232323232323232323 .pptx
3232423232323232323232323232323232323 .pptx
 
Kinds of testing (2nd)
Kinds of testing (2nd)Kinds of testing (2nd)
Kinds of testing (2nd)
 
ppt language as..pptx
ppt language as..pptxppt language as..pptx
ppt language as..pptx
 
Educational Assessment and Evaluation
Educational Assessment and Evaluation Educational Assessment and Evaluation
Educational Assessment and Evaluation
 
04 Formative and Summative Assessment Practices for the Co-Taught Classroom.ppt
04 Formative and Summative Assessment Practices for the Co-Taught Classroom.ppt04 Formative and Summative Assessment Practices for the Co-Taught Classroom.ppt
04 Formative and Summative Assessment Practices for the Co-Taught Classroom.ppt
 
construction and administration of unit test in science subject
construction and administration of unit test in science subjectconstruction and administration of unit test in science subject
construction and administration of unit test in science subject
 
B 190313162555
B 190313162555B 190313162555
B 190313162555
 
Testing : An important part of ELT
Testing : An important part of ELTTesting : An important part of ELT
Testing : An important part of ELT
 

Recently uploaded

Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
ZurliaSoop
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
heathfieldcps1
 

Recently uploaded (20)

Jamworks pilot and AI at Jisc (20/03/2024)
Jamworks pilot and AI at Jisc (20/03/2024)Jamworks pilot and AI at Jisc (20/03/2024)
Jamworks pilot and AI at Jisc (20/03/2024)
 
On_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptx
On_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptxOn_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptx
On_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptx
 
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
 
Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...
Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...
Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
 
Unit 3 Emotional Intelligence and Spiritual Intelligence.pdf
Unit 3 Emotional Intelligence and Spiritual Intelligence.pdfUnit 3 Emotional Intelligence and Spiritual Intelligence.pdf
Unit 3 Emotional Intelligence and Spiritual Intelligence.pdf
 
How to Add New Custom Addons Path in Odoo 17
How to Add New Custom Addons Path in Odoo 17How to Add New Custom Addons Path in Odoo 17
How to Add New Custom Addons Path in Odoo 17
 
Exploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptx
Exploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptxExploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptx
Exploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptx
 
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
 
Tatlong Kwento ni Lola basyang-1.pdf arts
Tatlong Kwento ni Lola basyang-1.pdf artsTatlong Kwento ni Lola basyang-1.pdf arts
Tatlong Kwento ni Lola basyang-1.pdf arts
 
Python Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxPython Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docx
 
Google Gemini An AI Revolution in Education.pptx
Google Gemini An AI Revolution in Education.pptxGoogle Gemini An AI Revolution in Education.pptx
Google Gemini An AI Revolution in Education.pptx
 
Understanding Accommodations and Modifications
Understanding  Accommodations and ModificationsUnderstanding  Accommodations and Modifications
Understanding Accommodations and Modifications
 
Fostering Friendships - Enhancing Social Bonds in the Classroom
Fostering Friendships - Enhancing Social Bonds  in the ClassroomFostering Friendships - Enhancing Social Bonds  in the Classroom
Fostering Friendships - Enhancing Social Bonds in the Classroom
 
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptxHMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
 
How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17
 
Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)
 
Graduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - EnglishGraduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - English
 
SOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning PresentationSOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning Presentation
 
Towards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptxTowards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptx
 

Principles of language assessment

  • 1. Week 2 Sutrisno Sadji Evenddy, M.Pd.
  • 2. Practicality  Reliability  Validity  Authenticity  Washback
  • 3. Not expensive,  Within appropriate time constraint,  Relatively easy to administer,  A scoring/evaluation procedure that is specific and time-efficient.
  • 4. 1. Are administrative details clearly established before the test?  2. Can students complete the test reasonably within the set time frame?  3. Is the cost of the test within budget limits?
  • 5. Consistency of assessment results (Linn & Gronlund).  A test is reliable if: “You give the same test to the same student or matched students on two different occasions, the test should yield similar results.” (Brown,2004)
  • 6. Students-related reliability  Rater reliability  Test administration reliability  Test reliability
  • 7. The most common learner- related issue in reliability is caused by temporary illness, fatigue, a “bad day”, anxiety, and other physical or psychological factors.
  • 8. Inter-rater reliability: When two or more scorers yield inconsistent scores of the same test. Factors: lack of attention to scoring, inexperience, inattention, etc.
  • 9. Intra-rater Scoring criteria, fatigue, bias toward particular “good” and “bad” students, or simple carelessness.
  • 10. It can be caused by administration factors. e.g. noisy from outside, photocopying variations, room condition, even condition of desks and chair.
  • 11. Factors cause unreliability:  If a test too long, test takers may become fatigued by the time they reach the later items and hastily respond incorrectly.  Ambiguous items.
  • 12. “Measuring what should be measured” o Content-related evidence o Criterion-related evidence o Construct-related evidence o Consequential validity o Face validity
  • 13. If a test samples the subject matter about which conclusions are to be drawn.  If a test requires the test-taker to perform the behavior that is being measured.
  • 14. is used to demonstrate the accuracy of a measure or procedure by comparing it with another measure or procedure which has been demonstrated to be valid.
  • 15. Example imagine a hands-on driving test has been shown to be an accurate test of driving skills. By comparing the scores on the written driving test with the scores from the hands-on driving test, the written test can be validated by using a criterion related strategy in which the hands-on driving test is compared to the written test.
  • 16. 1.Concurrent validity/ empiric validity if a test result is supported by other concurrent performance beyond assessment itself. e.g. the validity of a high score on the final exam of a foreign language course will be substantiated by actual proficiency in the language.
  • 17. 2. Predictive validity to assess (and predict) a test taker’s likelihood of future success. e.g SNMPTN
  • 18. How well performance on the assessment can be interpreted as meaningful measure of some characteristics or quality.
  • 19. How well use of assessment results accomplishes intended purposes and avoids unintended effect.
  • 20. It refers to the degree to which a test looks right, and appears to measure the knowledge or ability it claims to measure, based on the subjective judgment of the examinees who take it, the administrative personnel who decide on its use, and other psychometrically unsophisticated observers (Mousavi in Brown, 2004)
  • 21.
  • 22. The language as natural as possible.  Items contextualized rather than isolated.  Topics meaningful (relevant, interesting) for the learner.  Some thematic organization to items is provided, such as through a story line or episode.  Tasks represent, or closely approximate, real-world tasks.
  • 23. Contextualized Decontextualized ‘Going to” 1. What _______ this 1. There are three countries I weekend? would like to visit. One is a. you are going to do Italy. b. are you going to do a. The other is New c. your gonna do Zealand and other is Nepal b. The others are New Zealand and Nepal c. Others are New Zealand and Nepal
  • 24. Contextualized Contextualized 2. I’m not sure. 2. When I was twelve _______ anything years old, I used special? ______every day. a.Are you going to a. swimming do b. to swimming b.You are going to do c. to swim c. Is going to do
  • 25. The effect of testing on teaching and learning (Hughes in Brown, 2004).  Generally refers to the effects tests have on instruction in terms of how students prepare for the test (Brown, 2004).