Effectiveness of Gamesourcing Expert Painting Annotations


1) The document explores whether non-experts can accurately annotate paintings with subject types by playing an online game. 2) It finds that individual non-expert annotations are less accurate than expert annotations, but that aggregating many non-expert annotations increases precision and yields notable agreement with experts. 3) Users' annotation accuracy improves when they label paintings with "perfect" type data and when they can apply what they have learned to new paintings of known subject types.
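As a rough illustration of why aggregation can raise precision, the sketch below compares the accuracy of single annotations with the accuracy of a per-painting majority vote. The source does not say how annotations were aggregated, so majority voting is an assumption here, and the function names, painting ids, and labels are hypothetical toy data.

```python
from collections import Counter


def majority_label(labels):
    """Return the most frequent label; ties go to the label seen first."""
    return Counter(labels).most_common(1)[0][0]


def individual_accuracy(annotations, expert):
    """Share of single annotations that match the expert label."""
    pairs = [
        (label, expert[painting])
        for painting, labels in annotations.items()
        for label in labels
    ]
    return sum(label == truth for label, truth in pairs) / len(pairs)


def aggregated_accuracy(annotations, expert):
    """Share of paintings whose majority label matches the expert label."""
    hits = sum(
        majority_label(labels) == expert[painting]
        for painting, labels in annotations.items()
    )
    return hits / len(annotations)


# Hypothetical toy data: painting id -> labels given by different players.
annotations = {
    "p1": ["landscape", "landscape", "seascape"],
    "p2": ["portrait", "portrait", "portrait"],
    "p3": ["still life", "genre", "still life"],
    "p4": ["genre", "history", "history"],
}
# Hypothetical expert labels for the same paintings.
expert = {"p1": "landscape", "p2": "portrait", "p3": "still life", "p4": "genre"}

print(individual_accuracy(annotations, expert))  # 8 of 12 single annotations match
print(aggregated_accuracy(annotations, expert))  # 3 of 4 majority labels match
```

In this toy data the majority vote removes isolated mistakes (p1, p3) but cannot help when most players agree on a label the expert did not choose (p4).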


Effectiveness of Gamesourcing
Expert Painting Annotations
Are there features of images or subject types that can predict high or low agreement?
Start to play!
Can a simplified version of an expert annotation task be carried out by non-experts?
[Figure: number of annotations (bars) and percentage of correct annotations (dots) per user; series: baseline and imperfect]
[Figure: number of annotations (bars, logarithmic scale) and percentage of correct annotations (lines) by number of repetitions; series: baseline and imperfect]
Do users learn to correctly label subject types of paintings?
Can they apply what they have learned to new paintings of known subject types?
[Figures: four matrices comparing non-expert labels with expert labels per subject type, with cell counts and a Percent scale. Panels: baseline condition, aggregated annotations; imperfect condition, aggregated annotations; baseline condition, individual annotations; imperfect condition, individual annotations. Subject-type axis labels: othe, figu, land, full, port, alle, half, genr, hist, kach, city, seas, stil, anim, town, flow, mari, maes.]
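The matrices above appear to cross-tabulate expert labels against non-expert labels. A minimal sketch of such a cross-tabulation follows. Normalising each expert-label row to percentages is an assumption, since the source only shows a Percent scale without stating how it was computed. The function names and the counts are hypothetical; the short labels are taken from the axis labels.

```python
from collections import Counter, defaultdict


def cross_tabulate(pairs):
    """Count how often each (expert_label, non_expert_label) pair occurs."""
    table = defaultdict(Counter)
    for expert_label, player_label in pairs:
        table[expert_label][player_label] += 1
    return table


def row_percentages(table):
    """Convert each expert-label row to percentages that sum to 100."""
    result = {}
    for expert_label, row in table.items():
        total = sum(row.values())
        result[expert_label] = {
            label: 100 * count / total for label, count in row.items()
        }
    return result


# Hypothetical (expert label, non-expert label) pairs.
pairs = [
    ("land", "land"), ("land", "land"), ("land", "seas"), ("land", "land"),
    ("port", "port"), ("port", "half"),
]
percent = row_percentages(cross_tabulate(pairs))
for expert_label, row in percent.items():
    print(expert_label, row)
```

Cells on the diagonal (same label on both sides) correspond to agreement. Off-diagonal cells show which subject types non-experts confuse with one another.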
How do they compare with experts, both individually and as a crowd?
Top players:
1. Myriam C. Traub
2. Jacco van Ossenbruggen
3. Jiyin He
4. Lynda Hardman
Label paintings with subject types from the Art and Architecture Thesaurus!
Game over! Congratulations!
You found out that our results show a notable agreement between experts and non-experts, that users improve when playing on “perfect” data, and that aggregating annotations increases their precision. Future research will focus on peer-feedback and using judgements to improve the selection of candidates.
[Figure: number of annotations (bars) and percentage of correct annotations (lines) by sequence number of new images, in bins of 20 from [1,20] to (360,380]; series: baseline and imperfect]
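The axis of this last figure groups new images into intervals of 20 by sequence number. A minimal sketch of that kind of binning, with the percentage of correct annotations computed per bin, could look as follows. The record format and the function names are assumptions made for illustration, not taken from the source.

```python
def bin_label(sequence_number, width=20):
    """Map a 1-based sequence number to an interval label such as '(20,40]'."""
    index = (sequence_number - 1) // width
    lower, upper = index * width, (index + 1) * width
    # The first interval is closed on both sides, as on the figure axis.
    return f"[1,{upper}]" if index == 0 else f"({lower},{upper}]"


def accuracy_per_bin(records, width=20):
    """records: iterable of (sequence_number, is_correct) pairs.

    Returns the percentage of correct annotations for each interval label.
    """
    totals = {}
    for sequence_number, is_correct in records:
        label = bin_label(sequence_number, width)
        hits, count = totals.get(label, (0, 0))
        totals[label] = (hits + int(is_correct), count + 1)
    return {label: 100 * hits / count for label, (hits, count) in totals.items()}


# Hypothetical records: (sequence number of the new image, annotation correct?)
records = [(1, True), (2, False), (25, True)]
print(accuracy_per_bin(records))
```

A rising percentage across successive bins would suggest that players transfer what they have learned to paintings they have not seen before.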
