Language Testing

LANGUAGE
PROFICIENCY
TESTING
A Critical Survey

Please God may I not fail
Please God may I get over sixty per cent
Please God may I get a high place
Please God may all those likely to beat me get killed in
road accidents and may they die roaring.
Irish novelist McGahern

Overview
Types of language tests
Ways of describing tests
Evaluating the usefulness of language tests
Overview of common language tests:
TOEFL, TOEIC, IELTS, and CAEL
Impact of testing on learning and teaching
Critical use of language tests
Testing Questions

Testing Questions
What is actually being tested by the test
we are using?
What is the “best” test to use?
What relevant information does the test
provide?
How is testing affecting teaching and
learning behaviour?
Is language testing “fair”?

Validity, reliability, feasibility
Reliability relates to the consistency of an
assessment.
A reliable assessment is one which
consistently achieves the same results with
the same (or similar) cohort of students.
A valid assessment is one which
measures what it is intended to
measure
Totally valid or reliable/Driving test
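
One common way to put a number on "consistently achieves the same results with the same cohort" is a test-retest correlation. The sketch below is only an illustration: the scores are invented, and treating the Pearson correlation between two sittings as the reliability estimate is an assumption, not something the slides prescribe.

```python
from math import sqrt


def pearson_r(first, second):
    """Pearson correlation between two equally long lists of scores."""
    if len(first) != len(second) or len(first) < 2:
        raise ValueError("need two score lists of equal length (at least 2)")
    n = len(first)
    mean_a = sum(first) / n
    mean_b = sum(second) / n
    cov = sum((a - mean_a) * (b - mean_b) for a, b in zip(first, second))
    var_a = sum((a - mean_a) ** 2 for a in first)
    var_b = sum((b - mean_b) ** 2 for b in second)
    if var_a == 0 or var_b == 0:
        raise ValueError("scores must vary within each sitting")
    return cov / sqrt(var_a * var_b)


# Hypothetical scores: the same five students sit the same test twice.
first_sitting = [62, 75, 81, 58, 90]
second_sitting = [65, 73, 84, 55, 92]

reliability = pearson_r(first_sitting, second_sitting)
print(round(reliability, 2))
```

A value close to 1 suggests the test ranks the cohort consistently. It says nothing about validity: a test can be highly reliable and still measure the wrong thing.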

Assessment
Process of observation and objective
accumulation of evidence about the
individual learning process of students.
- How to assess?
− Checklist
− Informal teaching observation

Consider the following:
o You apply for a part-time job to work your way through
school. You learn that as part of the application process,
you must take a test of word-processing speed and a
personality test.
o Mr. and Mrs. Gómez receive a call from their child’s
third-grade teacher, who says she is concerned about
Luis’ performance on a reading test. She would like to
refer Luis for further testing to see whether Luis has a
learning disability.
o Mr. and Mrs. Torres tell you that their son is not eligible
for special-education services because he scored “too
high” on an intelligence test.

Types of Assessment ( moments
of…)

Power of Testing
Assessment – The process of collecting data for the purpose of
making decisions about individuals and groups, and this
decision-making role is the reason that assessment touches
so many people’s lives.
People react strongly when test scores are used to make
interpersonal comparisons in which they or those they
love look inferior.

Facts
Testing – Consists of administering a particular set of questions to an
individual or group of individuals to obtain a score. The score is the
end product of testing.
Testing may be part of the larger process of assessment.
Testing and assessment are not synonymous.
Assessment is a multifaceted process that involves far
more than just administering a test.
High-quality assessment procedures take into account that anyone’s
performance on any task is influenced by (1) the
demands of the task itself, (2) the history and
characteristics the individual brings to the task, and (3)
the factors inherent in the context in which the
assessment is carried out.

Testing
• Results
• Formats
• Quantitative
• Grades
• Letters
• Indicators

Types of tests in language education
• Standard test: TOEFL – IELTS – PET – CAE
• Placement test: Licenciatura test for first-year students
• Proficiency test: TOEFL – IELTS
• Achievement test: Parciales (midterm exams) – workshops in ALx

Goal: the aim expected at the end of the
learning process.
Standard: accurate conceptual mastery of a
topic.
Descriptors: the achievements by
competence; they are written in the present tense, with
closed and specific characteristics.
Indicators: the “regulator” of the curriculum;
it is not a final result, because it is subject to

GOAL • To use English in common situations.
STANDARD • Students will use English to engage in social
situations.
DESCRIPTORS • To recognize social codes.
• To take a critical position on current events.
INDICATORS • Students recognize social codes.
• Students recognize social codes with difficulty.
• Students have great difficulty recognizing social
codes.
Avoid the use of “not”.

How do you create an evaluation?
1. Formulate the descriptors
2. Design a plan
3. Observe the learning process
4. Evaluate
5. Determine the efficiency of the
pedagogies.

Evaluation in Colombian settings
National standard for evaluation:
ICFES Saber 5 – 9 - 11 ECAES
Saber pro
National standard for grading:
LAW 230: E S A I D
Decreto 1290:
1 – 5 /10 - 100

EVALUATION AT “INITIAL SCHOOL”
1. Goals, standards, descriptors and indicators
based on the “Unified” Standards.
2. Strategies for evaluating the five skills.
3. Continuous assessment of students’ development
4. Supportive strategies for solving academic and
personal problems
5. Scales to compare national standards with
the school’s scales
6. Explicit self-evaluation
7. Participation of the educational community


Types of Language Tests
Achievement test
associated with process of instruction
assesses where progress has been
made
should support the teaching to which it
relates
Alternative Assessment
need for assessment to be integrated
with the goals of the curriculum

Proficiency test
aims to establish a test taker’s
readiness for a particular
communicative role
general measure of “language ability”
measures a relatively stable trait
used to make predictions about future
language performance (Hamp-Lyons,
1998)
high-stakes test

Some ways of describing tests
Objective vs. Subjective
Indirect vs. Direct
Discrete-point vs. Integrative
Aptitude / Proficiency vs. Achievement / Performance
External vs. Internal
Norm-Referenced vs. Criterion-Referenced
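
The last contrast in the list above can be made concrete with a small sketch. A norm-referenced reading places a score relative to the other test takers, while a criterion-referenced reading compares it with a fixed standard. The cohort scores and the cut-off of 70 below are invented for illustration, and both function names are hypothetical.

```python
def percentile_rank(score, cohort_scores):
    """Norm-referenced view: percentage of the cohort scoring below `score`."""
    if not cohort_scores:
        raise ValueError("cohort_scores must not be empty")
    below = sum(1 for s in cohort_scores if s < score)
    return 100.0 * below / len(cohort_scores)


def meets_criterion(score, cutoff):
    """Criterion-referenced view: does the score reach a fixed cut-off?"""
    return score >= cutoff


# Hypothetical cohort and cut-off, used only for this example.
cohort = [45, 52, 60, 68, 71, 75, 80, 88]
HYPOTHETICAL_CUTOFF = 70

for score in (68, 71):
    print(score, percentile_rank(score, cohort), meets_criterion(score, HYPOTHETICAL_CUTOFF))
```

The same raw score can look different under each reading: its percentile rank changes with the cohort, while the pass/fail decision changes only if the cut-off moves.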

Evaluating the usefulness of a
language test
Usefulness = reliability + validity + impact +
authenticity + interactiveness + practicality
(Bachman and Palmer, 1996)

Evaluating the usefulness of a
language test
Essential measurement qualities
reliability
construct validity
Evaluation: test taker - test task - Target
Language Use (TLU)

Overview of common language
proficiency tests
TOEFL TOEIC
IELTS
CAEL

Test of English as a Foreign
Language
One million test takers per year
Score range: paper-and-pencil (P&P) 310-677 / computer-based (CBT) 0-300
Three sections:
Listening
Structure and Written Expression
Reading Comprehension
TWE (Test of Written English)

Test of English as a Foreign
Language
Proficiency
Achievement
discord between test and understanding of
language and communication
passive recognition of language
cutoff scores are very problematic
general proficiency ≠ academic proficiency

Test of English for International
Communication
TOEFL equivalent for
workplace setting
two sections, 200 questions
listening
reading
entertainment,
manufacturing, health,
travel, finance, etc.
“objective and cost-efficient”

Test of English for International
Communication
Objective
Subjective
Discrete-point
Integrative
Proficiency
Achievement
lack of correspondence with TLU

International English Language
Testing System
Academic/General
Results reported in
band scores 1-9
Listening
General Reading / Academic Reading
General Writing / Academic Writing
Speaking

International English Language
Testing System
Objective
Subjective
Discrete-point
Integrative
Proficiency
Achievement
test tasks reflective of academic
tasks

Canadian Academic English
Language Assessment
Mirrors language use in university
Topic-based, integrated reading, listening,
and writing tasks
provides specific diagnostic information
scores are reported in bands 10-90

Canadian Academic English
Language Assessment
Proficiency Achievement
tests performance and use
diminished gap between test and classroom
validity is supported by teacher evaluations
studies on predicting academic success

Washback: The Impact of Tests on
Teaching and Learning
“The power of tests has a strong influence on
curriculum and learning outcomes”
(Shohamy, 1993)
good test ≠ positive washback
form of test impact depends on
antecedent: educational context and condition
process
consequences
(Wall, 2000)

Critical Language Testing
Focus on consequence and ethics of test
use
Tests are embedded in cultural,
educational, and political arenas
whose agenda?
Questions traditional testing knowledge
English proficiency = academic success?
English: got it or get it!
Responsible test use (Hamp-Lyons, 2000)

Testing Questions
What is actually being tested by the test we
are using?
What is the “best” test to use?
What relevant information does the test
provide?
How is testing affecting teaching and
learning behaviour?
Is language testing “fair”?

Test design criteria
Usefulness = reliability + validity + impact +
authenticity + interactiveness + practicality
• reliability = consistency of measurement
• validity = the extent to which the inferences that we make
on the basis of the test are valid given the target language
use situation
• authenticity = how closely does the test resemble the
actual language use situation
• interactiveness = to what extent is the test taker involved in
active communication
• impact = what is the effect of the test on test takers, test
users, teachers etc.

Time – language level – design
Layout
Theoretical support (one page to explain
the test: explain why your test is
useful and what type of test it is)
Score 1 – 5 (create bands for scores)
Make copies for the whole group
15 minutes per skill (except speaking)
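
For the "create bands for scores" step, one possible approach is a simple lookup from a percentage score to a 1 – 5 band. This is only a sketch: the cut-off percentages below are invented placeholders, and each test designer would need to set and justify their own.

```python
from bisect import bisect_right

# Hypothetical lower bounds (in percent) for bands 2, 3, 4 and 5.
# Anything below the first bound falls in band 1.
BAND_FLOORS = [40, 60, 75, 90]


def to_band(percent):
    """Map a percentage score (0-100) to a band from 1 to 5."""
    if not 0 <= percent <= 100:
        raise ValueError("percent must be between 0 and 100")
    return 1 + bisect_right(BAND_FLOORS, percent)


for raw in (35, 40, 74, 90):
    print(raw, "->", to_band(raw))
```

Writing a short descriptor for each band (what a test taker at that band can do) would make the 1 – 5 score easier to interpret than the number alone.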
