Corpus Linguistics for
Language Learning and Teaching
Martin Wynne martin.wynne@bodleian.ox.ac.uk
Bodleian Libraries
Faculty of Linguistics, Philology and Phonetics
The 'aftermath' of the seminar
Subject: Les Francais des Corpus – Aftermath
Dear colleagues,
First, many thanks for presenting at/attending
the Francais des Corpus Workshop and for making
it such a success.
I promised I would keep you in touch with one
another and hope that the full list of your
e-mail addresses above makes that possible.
…
'aftermath'
Collocates:
War
Gulf
coup
World
disaster
Tiananmen
death
revolution
defeat
Chernobyl
affair
riots
battle
massacre
wars
election
Crisis
events
explosion
invasion
trial
fire
June
Square
victory
accident
attempt
Significant collocates in the British National Corpus
(a representative corpus of British English released in 1994).
BNCWeb parameters:
There are 1486 different types in your collocation database
for the query "[word="aftermath"%c] [word="of"%c]".
(Your query "aftermath of" returned 544 hits in 337 different texts)
The selected range was 1 to 4.
Corpus basis for calculation: the whole BNC.
Type of calculation: Log-likelihood
Tag restriction: any noun
Collocates occur at least 5 times in the whole BNC.
Words collocate at least 5 times.
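The log-likelihood score behind a collocate list like this one can be sketched in a few lines. The Python below is a minimal illustration of the standard 2x2 log-likelihood (G2) statistic, not BNCweb's own code: it ignores the 1-to-4 window and the noun tag restriction, and the function name and every count in the usage lines are assumptions made up for illustration.

```python
import math


def log_likelihood(cooccurrences, node_freq, collocate_freq, corpus_size):
    """Log-likelihood (G2) for one node/collocate pair from a 2x2 table.

    Simplified sketch: window size and tag restrictions are ignored.
    """
    # Observed frequencies in the 2x2 contingency table
    o11 = cooccurrences                      # node with collocate
    o12 = node_freq - o11                    # node without collocate
    o21 = collocate_freq - o11               # collocate without node
    o22 = corpus_size - node_freq - collocate_freq + o11  # neither
    observed = [o11, o12, o21, o22]

    # Expected frequencies if node and collocate were independent
    row1, row2 = o11 + o12, o21 + o22
    col1, col2 = o11 + o21, o12 + o22
    expected = [
        row1 * col1 / corpus_size,
        row1 * col2 / corpus_size,
        row2 * col1 / corpus_size,
        row2 * col2 / corpus_size,
    ]

    # G2 = 2 * sum(O * ln(O / E)); empty cells contribute nothing
    return 2 * sum(o * math.log(o / e) for o, e in zip(observed, expected) if o > 0)


if __name__ == "__main__":
    # Hypothetical counts, NOT taken from the BNC
    print(round(log_likelihood(50, 544, 20_000, 100_000_000), 1))
```

A score of zero means the pair co-occurs exactly as often as chance would predict; the larger the score, the stronger the evidence of association, which is why collocates are ranked by it.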
What is a corpus?
“…a collection of pieces of language, selected and ordered according
to explicit linguistic criteria in order to be used as a sample of the
language.”
(Sinclair 1996)
What is Corpus Linguistics?
(1) Focus on linguistic performance, rather than competence
(2) Focus on linguistic description, rather than linguistic
universals
(3) Focus on quantitative, as well as qualitative models of
language
(4) Focus on a more empiricist, rather than rationalist view of
scientific inquiry.
(Leech 1992)
How do you know things about
language? Where do we get our
knowledge from?
What does your knowledge and
experience tell you about the use of
‘try to’ & ‘try and’?
Fill in the blanks
1. Did you try … talk her out of swimming?
2. Mr. Kissinger, try … explain to us what might happen
3. He did it to try … score points
4. They both wanted to try … have a family
5. They try … treat you like machines
6. Sometimes, people try … make fun of you by
imitating you.
7. Now the government will try … sell all of this.
8. Did you try … get out of it?
9. I will try … understand this.
Fill in the blanks
1. Did you try and talk her out of swimming?
2. Mr. Kissinger, try and explain to us what might happen
3. He did it to try and score points
4. They both wanted to try and have a family
5. They try to treat you like machines
6. Sometimes, people try to make fun of you by imitating you.
7. Now the government will try to sell all of this.
8. Did you try to get out of it?
9. I will try ? understand this. [This one was made up!]
• “Try and do something is incorrect for try to do…” [Partridge and Greet 1947]
• “Try and is well established in conversational use … Try to is to be preferred in serious writing” [Plain Words 1986]
• “… try and has been socially acceptable for these two centuries … is not used in an elevated style” [Webster’s Dictionary 1989]
What are the factors governing the choice
and distribution of try to vs. try and ?
How would you investigate this question?
[Figure: bar chart showing the percentage shares (0% to 100%) of try to and try and in spoken British English, written British English, spoken American English and written American English.]
Based on CobuildDirect and Longman Spoken American Corpus.
[Figure: bar chart showing the frequency per million words (pmw, axis 0 to 350) of try and and try to in BNC, COCA, Hansard, GloWbE, COHA, soap operas and Wikipedia.]
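Comparisons like these rest on two simple calculations: a frequency normalised per million words (pmw), so that corpora of different sizes can be compared, and the percentage share of each variant. The Python below is a rough sketch under stated assumptions: the function names are invented here, the sample sentences are taken from the fill-in-the-blanks slide rather than from any corpus, and the plain string match also counts cases where 'to' or 'and' is not part of the verb construction.

```python
import re


def count_variants(text):
    """Count 'try to' and 'try and' (base form only, case-insensitive)."""
    return {
        "try to": len(re.findall(r"\btry to\b", text, flags=re.IGNORECASE)),
        "try and": len(re.findall(r"\btry and\b", text, flags=re.IGNORECASE)),
    }


def per_million(count, corpus_tokens):
    """Normalise a raw count to a frequency per million words (pmw)."""
    return count / corpus_tokens * 1_000_000


def share(count, total):
    """Percentage share of one variant among all occurrences."""
    return 100 * count / total if total else 0.0


if __name__ == "__main__":
    # Tiny illustrative sample, far too small for real conclusions
    sample = (
        "Did you try and talk her out of swimming? "
        "They try to treat you like machines. "
        "Did you try to get out of it?"
    )
    counts = count_variants(sample)
    tokens = len(sample.split())  # crude whitespace token count
    total = sum(counts.values())
    for variant, n in counts.items():
        print(variant, n, round(per_million(n, tokens)), round(share(n, total), 1))
```

With a real corpus, the token total would come from the corpus documentation or the query tool rather than from a whitespace split.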
Try to or try and? Verb
complementation in British and
American English
Hommerberg & Tottie (2007)
ICAME Journal 31:45-64
http://icame.uib.no/ij31/ij31-page45-64.pdf
Uses of corpus linguistics in
language pedagogy
● Developing new theories (e.g. differences between regional varieties, identifying new varieties such as 'English as a Lingua Franca')
● A source of primary data for developing e.g.:
  ➢ dictionaries
  ➢ grammars
  ➢ textbooks (and other teaching materials)
● Preparing materials for classes (e.g. as a source of examples)
● Studying learner language (in a learner corpus)
● Data-driven learning in the classroom
“Why not just Google it?”
Linguistic
● Biased distribution of text-types and genres
● Repeated and reused text
● Unknown provenance (“Who wrote this, when, and why?”)
● Mixture of native and non-native producers of language
● Mixture of varieties
Technical
● Unclear separation of elements of the webpage (body text, sidebars, adverts, etc.)
● Accessing the ‘hidden web’ (content which is not visible to search)
● Accessing language embedded in audio and video streams
● Lack of persistent locations and identifiers
Methodological
● Difficult to compare frequencies of occurrence
● Unknown (or undesirable) sampling and ranking strategies of search engines (e.g. promoting commercial products and services, prioritizing words in titles and headers, user-specific settings)
Problems with language in the corpus
● Limited by copyright (and other legal and ethical barriers)
● Expensive, time-consuming and slow to make
● Limited size
● Not up to date
● Incomplete information about provenance and context
● Design decisions were made by someone else
● Not easily comparable to other corpora
● Access restrictions
● Limited functions available for analysis and exploration
● Not connected to other resources or tools
● Difficult to deploy in the classroom
Exercise
Find evidence in one or more corpora to help explain the sources of irony and humour in Homer’s utterance:
“I'm just going out to commit certain deeds”
http://kisscartoon.eu/watch/the-simpsons-season-9-episode-16-dumbbell-indemnity/ (9:27-9:52)
Links for Practical Work
● http://bncweb.lancs.ac.uk/ (register using ac.uk email address)
● http://corpus.byu.edu
● All links can be found via: https://ota.ox.ac.uk/oxonly/oxford.xml
Data-driven language learning in the
classroom: some reflections
● Can you use a corpus to reveal 'real language'?
● Do we want to teach ‘real language’? Should teachers prefer to control the rate and order of exposure to linguistic features?
● Can teachers easily deal with unrestricted language in the classroom?
● Effective reading and interpretation of concordance lines and collocation lists require practice, and the acquisition of skills.
● There are often difficult technical issues in effective deployment of corpora in the classroom.
AntConc
● Download for free from http://www.antlab.sci.waseda.ac.jp/software.html
● Use with any 'plain' text (txt, html, xml)
● Multilingual capabilities
● Does not interpret mark-up or metadata
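What a concordancer does with plain text can be illustrated with a keyword-in-context (KWIC) sketch. The Python below is only an approximation of the idea, assuming a simple character window around each hit; it is not how AntConc is implemented, and the function name, window width and sample sentences are invented for illustration.

```python
import re


def kwic(text, keyword, width=30):
    """Return keyword-in-context lines for every hit of `keyword`.

    `width` is the number of characters of context shown on each side.
    """
    text = " ".join(text.split())  # collapse line breaks and repeated spaces
    pattern = re.compile(r"\b" + re.escape(keyword) + r"\b", flags=re.IGNORECASE)
    lines = []
    for match in pattern.finditer(text):
        left = text[max(0, match.start() - width):match.start()]
        right = text[match.end():match.end() + width]
        # Right-align the left context so the keywords line up in a column
        lines.append(f"{left:>{width}} [{match.group(0)}] {right:<{width}}")
    return lines


if __name__ == "__main__":
    # Made-up sample text, not corpus data
    sample = (
        "In the aftermath of the war, prices rose sharply. "
        "The aftermath of the coup was chaotic."
    )
    for line in kwic(sample, "aftermath"):
        print(line)
```

Aligning the keyword in a central column is what lets a reader scan the words to its left and right for recurring patterns.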
CQPweb: an online interface for many corpora
http://cqpweb.lancs.ac.uk
Finding resources
https://ota.ox.ac.uk/oxonly/oxford.xml
References
● Chambers, A. and M. Wynne. ‘Sharing corpus resources in language learning.’ In F. Zhang and B. Barber (eds.) Handbook of Research on Computer-Enhanced Language Acquisition and Learning. Hershey, PA: IGI Global, 2008, 438-451.
● Hommerberg, C. and G. Tottie (2007). Try to or Try and? Verb complementation in British and American English. ICAME Journal 31: 45-64. http://icame.uib.no/ij31/ij31-page45-64.pdf
● Leech, G. (1992). Corpora and theories of linguistic performance. In J. Svartvik (Ed.), Directions in corpus linguistics (pp. 105-122). Berlin: Mouton de Gruyter.
● McEnery, A. & Z. Xiao (2010), What corpora can offer in language teaching and learning. In E. Hinkel (ed.) Handbook of Research in Second Language Teaching and Learning (Vol. 2). London / New York: Routledge. [http://www.lancaster.ac.uk/fass/projects/corpus/ZJU/xpapers/McEnery_Xiao_teaching.PDF]
● McEnery, A., R. Xiao and Y. Tono, (2006). Corpus-based Language Studies: An Advanced Resource Book. Routledge.
● Sinclair, J.McH. 1996. 'Preliminary recommendations on corpus typology' EAGLES Document TCWG-CTYP/P (available from http://www.ilc.cnr.it/EAGLES/corpustyp/corpustyp.html).
Online resources
● Corpora for users in the University of Oxford https://ota.ox.ac.uk/oxonly/oxford.xml
● Brigham Young Corpora http://corpus.byu.edu/ (also via Solo)
● British National Corpus http://bncweb.lancs.ac.uk/ (free registration required here)
● Linguee (bilingual translations) https://www.linguee.com/
● VOICE. 2013. The Vienna-Oxford International Corpus of English (version 2.0 Online) http://voice.univie.ac.at, also available for download from the Oxford Text Archive (http://purl.ox.ac.uk/ota/2542).

Corpus Linguistics for Language Teaching and Learning

  • 1. Corpus Linguistics for Language Learning and Teaching Martin Wynne martin.wynne@bodleian.ox.ac.uk Bodleian Libraries Faculty of Linguistics, Philology and Phonetics
  • 2. The 'aftermath' of the seminar Subject: Les Francais des Corpus – Aftermath Dear colleagues, First, many thanks for presenting at/attending the Francais des Corpus Workshop and for making it such a success. I promised I would keep you in touch with one another and hope that the full list of your e-mail addresses above makes that possible. …
  • 3. 'aftermath' Collocates: War, Gulf, coup, World, disaster, Tiananmen, death, revolution, defeat, Chernobyl, affair, riots, battle, massacre, wars, election, Crisis, events, explosion, invasion, trial, fire, June, Square, victory, accident, attempt. Significant collocates in the British National Corpus (a representative corpus of British English released in 1994). BNCWeb parameters: There are 1486 different types in your collocation database for the query "[word="aftermath"%c] [word="of"%c]". (Your query "aftermath of" returned 544 hits in 337 different texts.) The selected range was 1 to 4. Corpus basis for calculation: the whole BNC. Type of calculation: Log-likelihood. Tag restriction: any noun. Collocates occur at least 5 times in the whole BNC. Words collocate at least 5 times.
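A minimal sketch of how a log-likelihood score of the kind selected on slide 3 can be computed from a 2x2 contingency table. This is the textbook G² statistic, not necessarily BNCweb's exact implementation (which may differ in how it accounts for the 1 to 4 window). The function names and all counts in the example are hypothetical and are not BNC figures.

```python
import math


def collocation_table(pair_count, node_count, collocate_count, corpus_size):
    """Build the 2x2 table of observed counts for a node/collocate pair."""
    o11 = pair_count                      # collocate near the node
    o12 = node_count - pair_count         # node without the collocate
    o21 = collocate_count - pair_count    # collocate away from the node
    o22 = corpus_size - node_count - collocate_count + pair_count
    return o11, o12, o21, o22


def log_likelihood(o11, o12, o21, o22):
    """G2 statistic: 2 * sum(O * ln(O / E)) over the four cells."""
    n = o11 + o12 + o21 + o22
    rows = (o11 + o12, o21 + o22)
    cols = (o11 + o21, o12 + o22)
    observed = ((o11, o12), (o21, o22))
    total = 0.0
    for i in range(2):
        for j in range(2):
            o = observed[i][j]
            e = rows[i] * cols[j] / n     # expected count under independence
            if o > 0:                     # a zero cell contributes nothing
                total += o * math.log(o / e)
    return 2 * total


# Invented counts for illustration only (not taken from the BNC).
table = collocation_table(pair_count=50, node_count=544,
                          collocate_count=20_000, corpus_size=100_000_000)
score = log_likelihood(*table)
print(f"G2 = {score:.2f}")
```

A higher score means the pair co-occurs more often than independence would predict; ranking candidate collocates by this score yields a list like the one on the slide.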
  • 4. What is a corpus? “…a collection of pieces of language, selected and ordered according to explicit linguistic criteria in order to be used as a sample of the language.” (Sinclair 1996)
  • 5. What is Corpus Linguistics? (1) Focus on linguistic performance, rather than competence (2) Focus on linguistic description, rather than linguistic universals (3) Focus on quantitative, as well as qualitative models of language (4) Focus on a more empiricist, rather than rationalist view of scientific inquiry. (Leech 1992)
  • 6. How do you know things about language? Where do we get our knowledge from?
  • 7. What does your knowledge and experience tell you about the use of ‘try to’ & ‘try and’?
  • 8. Fill in the blanks 1. Did you try … talk her out of swimming? 2. Mr. Kissinger, try … explain to us what might happen 3. He did it to try … score points 4. They both wanted to try ... have a family 5. They try … treat you like machines 6. Sometimes, people try … make fun of you by imitating you. 7. Now the government will try … sell all of this. 8. Did you try … get out of it? 9. I will try … understand this.
  • 9. Fill in the blanks 1. Did you try and talk her out of swimming? 2. Mr. Kissinger, try and explain to us what might happen 3. He did it to try and score points 4. They both wanted to try and have a family 5. They try to treat you like machines 6. Sometimes, people try to make fun of you by imitating you. 7. Now the government will try to sell all of this. 8. Did you try to get out of it? 9. I will try ? understand this. [This one was made up!]
  • 10. • “Try and do something is incorrect for try to do…” [Partridge and Greet 1947] • “Try and is well established in conversational use … Try to is to be preferred in serious writing” [Plain Words 1986] • “… try and has been socially acceptable for these two centuries … is not used in an elevated style” [Webster’s Dictionary 1989]
  • 11. What are the factors governing the choice and distribution of try to vs. try and ? How would you investigate this question?
  • 13. [Bar chart: frequency per million words (pmw) of ‘try and’ vs. ‘try to’ in BNC, COCA, Hansard, GloWbE, COHA, soap operas and Wikipedia; axis range 0 to 350 pmw]
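The chart on slide 13 compares corpora of very different sizes, which is why the counts are expressed per million words (pmw). A short sketch of that normalisation follows; the function names, corpus labels and all counts are hypothetical placeholders, not the figures behind the chart.

```python
def per_million(raw_count, corpus_size):
    """Normalise a raw frequency to occurrences per million words."""
    return raw_count * 1_000_000 / corpus_size


def share_of_try_and(try_and_count, try_to_count):
    """Proportion of 'try and' among all 'try and' + 'try to' hits."""
    return try_and_count / (try_and_count + try_to_count)


# Hypothetical corpora: (size in words, 'try and' hits, 'try to' hits).
corpora = {
    "corpus_a": (100_000_000, 4_000, 12_000),
    "corpus_b": (500_000_000, 5_000, 90_000),
}

for name, (size, n_and, n_to) in corpora.items():
    print(name,
          f"try and: {per_million(n_and, size):.1f} pmw,",
          f"try to: {per_million(n_to, size):.1f} pmw,",
          f"share of 'try and': {share_of_try_and(n_and, n_to):.0%}")
```

Normalised rates make corpora comparable in absolute terms, while the share shows the preference between the two variants independently of how often the verb occurs overall.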
  • 14. Try to or try and? Verb complementation in British and American English Hommerberg & Tottie (2007) ICAME Journal 31:45-64 http://icame.uib.no/ij31/ij31-page45-64.pdf
  • 15. Uses of corpus linguistics in language pedagogy ● Developing new theories (e.g. differences between regional varieties, identifying new varieties such as 'English as a Lingua Franca') ● A source of primary data for developing, e.g.: ➢ dictionaries ➢ grammars ➢ textbooks (and other teaching materials) ● Preparing materials for classes (e.g. as a source of examples) ● Studying learner language (in a learner corpus) ● Data-driven learning in the classroom
  • 16. “Why not just Google it?” Linguistic ● Biased distribution of text-types and genres ● Repeated and reused text ● Unknown provenance (“Who wrote this, when, and why?”) ● Mixture of native and non-native producers of language ● Mixture of varieties Technical ● Unclear separation of elements of the webpage (body text, sidebars, adverts, etc.) ● Accessing the ‘hidden web’ (content which is not visible to search) ● Accessing language embedded in audio and video streams ● Lack of persistent locations and identifiers Methodological ● Difficult to compare frequencies of occurrence ● Unknown (or undesirable) sampling and ranking strategies of search engines (e.g. promoting commercial products and services, prioritizing words in titles and headers, user-specific settings)
  • 17. Problems with language in the corpus ● Limited by copyright (and other legal and ethical barriers) ● Expensive, time-consuming and slow to make ● Limited size ● Not up to date ● Incomplete information about provenance and context ● Design decisions were made by someone else ● Not easily comparable to other corpora ● Access restrictions ● Limited functions available for analysis and exploration ● Not connected to other resources or tools ● Difficult to deploy in the classroom
  • 18. Exercise: Find evidence in one or more corpora to help explain the sources of irony and humour in Homer’s utterance: “I'm just going out to commit certain deeds” http://kisscartoon.eu/watch/the-simpsons-season-9-episode-16-dumbbell-indemnity/ (9:27-9:52)
  • 19. Links for Practical Work ● http://bncweb.lancs.ac.uk/ (register using ac.uk email address) ● http://corpus.byu.edu ● All links can be found via: https://ota.ox.ac.uk/oxonly/oxford.xml
  • 20. Data-driven language learning in the classroom: some reflections ● Can you use a corpus to reveal 'real language'? ● Do we want to teach ‘real language’? Should teachers prefer to control the rate and order of exposure to linguistic features? ● Can teachers easily deal with unrestricted language in the classroom? ● Effective reading and interpretation of concordance lines and collocation lists require practice, and the acquisition of skills. ● There are often difficult technical issues in effective deployment of corpora in the classroom.
  • 21. AntConc ● Download for free from http://www.antlab.sci.waseda.ac.jp/software.html ● Use with any 'plain' text (txt, html, xml) ● Multilingual capabilities ● Does not interpret mark-up or metadata
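To illustrate what a concordancer does with plain text, here is a rough key-word-in-context (KWIC) sketch using only the Python standard library. It is not AntConc's implementation; the `kwic` function and its `width` parameter are hypothetical, and the sample text reuses three sentences from slide 9.

```python
import re
from collections import Counter


def kwic(text, pattern, width=30):
    """Return concordance lines: left context, [match], right context."""
    lines = []
    for m in re.finditer(pattern, text, flags=re.IGNORECASE):
        left = text[max(0, m.start() - width):m.start()]
        right = text[m.end():m.end() + width]
        # Right-align the left context so the matches line up in a column.
        lines.append(f"{left:>{width}} [{m.group(0)}] {right}")
    return lines


sample = ("Did you try and talk her out of swimming? "
          "They try to treat you like machines. "
          "Now the government will try to sell all of this.")

pattern = r"\btry (?:to|and)\b"
hits = kwic(sample, pattern)
for line in hits:
    print(line)

# Count how often each variant occurs in the sample.
variants = Counter(m.group(0).lower()
                   for m in re.finditer(pattern, sample, flags=re.IGNORECASE))
print(variants)
```

Like the tool described on the slide, this treats the input as raw characters and does not interpret mark-up or metadata, so tags in HTML or XML files would be searched as ordinary text.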
  • 22. CQPweb: an online interface for many corpora http://cqpweb.lancs.ac.uk
  • 24. References ● Chambers, A. and M. Wynne. ‘Sharing corpus resources in language learning.’ In F. Zhang and B. Barber (eds.) Handbook of Research on Computer-Enhanced Language Acquisition and Learning. Hershey, PA: IGI Global, 2008, 438-451. ● Hommerberg, C. and G. Tottie (2007). Try to or Try and? Verb complementation in British and American English. ICAME Journal 31: 45-64. http://icame.uib.no/ij31/ij31-page45-64.pdf ● Leech, G. (1992). Corpora and theories of linguistic performance. In J. Svartvik (Ed.), Directions in corpus linguistics (pp. 105-122). Berlin: Mouton de Gruyter. ● McEnery, A. & Z. Xiao (2010), What corpora can offer in language teaching and learning. In E. Hinkel (ed.) Handbook of Research in Second Language Teaching and Learning (Vol. 2). London / New York: Routledge. [http://www.lancaster.ac.uk/fass/projects/corpus/ZJU/xpapers/McEnery_Xiao_teaching.PDF] ● McEnery, A., R. Xiao and Y. Tono, (2006). Corpus-based Language Studies: An Advanced Resource Book. Routledge. ● Sinclair, J.McH. 1996. 'Preliminary recommendations on corpus typology' EAGLES Document TCWG-CTYP/P (available from http://www.ilc.cnr.it/EAGLES/corpustyp/corpustyp.html). Online resources ● Corpora for users in the University of Oxford https://ota.ox.ac.uk/oxonly/oxford.xml ● Brigham Young Corpora http://corpus.byu.edu/ (also via Solo) ● British National Corpus http://bncweb.lancs.ac.uk/ (free registration required here) ● Linguee (bilingual translations) https://www.linguee.com/ ● VOICE. 2013. The Vienna-Oxford International Corpus of English (version 2.0 Online) http://voice.univie.ac.at, also available for download from the Oxford Text Archive (http://purl.ox.ac.uk/ota/2542).