SlideShare une entreprise Scribd logo
1  sur  66
Télécharger pour lire hors ligne
Crowdsourcing beyond                      …


  Building Crowdmining Services for
         Your Own Research

                           Kuan-Ta Chen
                Institute of Information Science
                        Academia Sinica


CrowdKDD’12 Aug 12, 2012
What I’m going to talk
Crowdsourcing?
Crowdsourcing + Data Mining Research?
Common Fallacies of CS4DM Research
Pomics: A Crowdmining Service
Conclusion
Crowdsourcing
            = Crowd + Outsourcing

               “soliciting solutions via open calls
                   to large-scale communities”


CrowdKDD’12: Crowdsourcing beyond Mechanical Turk / Kuan-Ta Chen   3
A more formal definition


           “Crowdsourcing is the act of taking a job traditionally
           performed by a designated agent (usually an employee)
           and outsourcing it to an undefined, generally large
           group of people in the form of an open call.” [1]




       [1] Howe, Jeff. Crowdsourcing: A Definition, http://crowdsourcing.typepad.com/



CrowdKDD’12: Crowdsourcing beyond Mechanical Turk / Kuan-Ta Chen                        4
What Can
                Crowdsourcing Do?



CrowdKDD’12: Crowdsourcing beyond Mechanical Turk / Kuan-Ta Chen   5
Brand Tagging




CrowdKDD’12: Crowdsourcing beyond Mechanical Turk / Kuan-Ta Chen   7
Data Entry
        Reward: 4.4 USD/hour




CrowdKDD’12: Crowdsourcing beyond Mechanical Turk / Kuan-Ta Chen   8
General Questions
        Reward: points on Yahoo! Answers




CrowdKDD’12: Crowdsourcing beyond Mechanical Turk / Kuan-Ta Chen   9
When crowdsourcing meets data
               mining…



                         Crowdsourcing                             Data mining




                                                   What’s in here?

CrowdKDD’12: Crowdsourcing beyond Mechanical Turk / Kuan-Ta Chen                 10
Crowdsourcing for Data Mining:
                   Issues
        Purposes                                                   Methodologies
              Annotation                                             Recruiting
              (ground-truth generation)                              Incentives
              Evaluation                                             Task Design
              Retrieval                                              Workflow
              Human-in-the-loop                                      Learning from crowd
              computation                                            Quality control
                                                                     Cheat detection




CrowdKDD’12: Crowdsourcing beyond Mechanical Turk / Kuan-Ta Chen                           11
Crowdsourcing Uses
      in Data Mining Research



CrowdKDD’12: Crowdsourcing beyond Mechanical Turk / Kuan-Ta Chen   12
Image Semantics
        Reward: 0.04 USD / task

    main theme?
     key objects?
  unique attributes?




CrowdKDD’12: Crowdsourcing beyond Mechanical Turk / Kuan-Ta Chen   13
find out photos of revolvers!




                                    0.02 USD/ task
CrowdKDD’12: Crowdsourcing beyond Mechanical Turk / Kuan-Ta Chen   14
Human Skeleton




                                     0.01 USD/ task
CrowdKDD’12: Crowdsourcing beyond Mechanical Turk / Kuan-Ta Chen   15
Photo Orientation




                                   0.01 USD/ task
CrowdKDD’12: Crowdsourcing beyond Mechanical Turk / Kuan-Ta Chen   17
Perspectives for 3D Objects




         Thi Phuong Nghiem, Axel Carlier, Geraldine Morin, and Vincent Charvillat,
         "Enhancing online 3D products through crowdsourcing," ACM CrowdMM'12.

CrowdKDD’12: Crowdsourcing beyond Mechanical Turk / Kuan-Ta Chen                     18
Web Site Classifier




                                        12 USD / hour
   Panos Ipeirotis, “Crowdsourcing using Mechanical Turk: Quality Management and
   Scalability,” Invited Talk at CSDM 2011.
CrowdKDD’12: Crowdsourcing beyond Mechanical Turk / Kuan-Ta Chen                   19
Photographers’ Intention
                                                              to support a task?
                                                              to capture a bad feeling?
                                                              to preserve a good feeling?
                                                              to recall later on?
                                                              to publish it online?
                                                              to show it to friends and
                                                              family?




        Mathias Lux, Mario Taschwer, and Oge Marques, “A Closer Look at Photographers’
        Intentions: a Test Dataset,” ACM CrowdMM’12.

CrowdKDD’12: Crowdsourcing beyond Mechanical Turk / Kuan-Ta Chen                            20
Linguistic Affective Judgement
        Affective response (Snow et al. 2008)


                                                   “Closing and cancellations
                                                  top advice on flu outbreak”




        USD 0.4 to label 20 headlines (140 labels)
CrowdKDD’12: Crowdsourcing beyond Mechanical Turk / Kuan-Ta Chen                22
A Lot More Examples
        Document relevance evaluation
            Alonso et al. (2008)
        Document rating collection
            Kittur et al. (2008)
        Noun compound paraphrasing
            Nakov (2008)
        Person name resolution
            Su et al. (2007)
        And so on...

CrowdKDD’12: Crowdsourcing beyond Mechanical Turk / Kuan-Ta Chen   24
THE COMMON FALLACIES
         -- EXPERIENCES FROM CROWDMM’12



                                             Thanks to CrowdMM’12 co-organizers: Wei-Tsang
                                             Ooi, Martha Larson, and Wei-Ta Chu; also thanks
                                             to “Crowdsourcing for Multimedia” SI co-guest-
                                             editors Paul Bennent and Matt Lease.



CrowdKDD’12: Crowdsourcing beyond Mechanical Turk / Kuan-Ta Chen                               25
Common Fallacies #1

                    Crowdsourcing is NOT JUST
                      conducting user studies


                   Crowd is uncontrollable with
            tasks performed in uncontrolled conditions
                          How to manage the crowd?


CrowdKDD’12: Crowdsourcing beyond Mechanical Turk / Kuan-Ta Chen   26
Common Fallacies #2

               Crowdsourcing is NOT JUST
             analyzing user-generated content

               Cope with the noise in UGC rather than
                       only the information.
                How to manage the imperfectness &
                        diversity in UGC?


CrowdKDD’12: Crowdsourcing beyond Mechanical Turk / Kuan-Ta Chen   27
Common Fallacies #2

               Crowdsourcing is NOT JUST
             analyzing user-generated content

                       Put the task element in the loop
                Re-purposing the creation of UGC as
                      your own microtasks



CrowdKDD’12: Crowdsourcing beyond Mechanical Turk / Kuan-Ta Chen   28
Common Fallacies #3
                Crowdsourcing is NOT JUST
              posting tasks on Mechanical Turk
                        Explicit Crowdsourcing                Implicit Crowdsourcing




                                                             Piggyback Crowdsourcing




    Doan et al, "Crowdsourcing systems on the World-Wide Web," CACM, vol 54, no 4, 2011.
CrowdKDD’12: Crowdsourcing beyond Mechanical Turk / Kuan-Ta Chen                           29
An implicit crowdmining platform for multimedia content
Crowdsourcing for Data Mining:
                   Issues
        Purposes                                                   Methodologies
              Annotation                                             Recruiting
              (ground-truth generation)                              Incentives
              Evaluation                                             Task Design
              Retrieval                                              Workflow
              Human-in-the-loop                                      Learning from crowd
              computation                                            Quality control
                                                                     Cheat detection




CrowdKDD’12: Crowdsourcing beyond Mechanical Turk / Kuan-Ta Chen                           31
The Era of Too Many Photos
        People today use pictures to write down their daily experience
        (with the prevalence of digital cameras)




CrowdKDD’12: Crowdsourcing beyond Mechanical Turk / Kuan-Ta Chen     32
How to Share Photos?
3 Common Ways
        Photo browsing
        Photo/video slideshow
        Illustrated text




CrowdKDD’12: Crowdsourcing beyond Mechanical Turk / Kuan-Ta Chen   34
Photo Browsing




CrowdKDD’12: Crowdsourcing beyond Mechanical Turk / Kuan-Ta Chen   35
Photo/Video slideshow




CrowdKDD’12: Crowdsourcing beyond Mechanical Turk / Kuan-Ta Chen   36
Illustrated Text




CrowdKDD’12: Crowdsourcing beyond Mechanical Turk / Kuan-Ta Chen   37
A MISSING PIECE

CrowdKDD’12: Crowdsourcing beyond Mechanical Turk / Kuan-Ta Chen   38
Comics




CrowdKDD’12: Crowdsourcing beyond Mechanical Turk / Kuan-Ta Chen   39
Photo Comics – Baby Born




CrowdKDD’12: Crowdsourcing beyond Mechanical Turk / Kuan-Ta Chen   40
Photo Comics – Birthday Party




CrowdKDD’12: Crowdsourcing beyond Mechanical Turk / Kuan-Ta Chen   41
Photo Comics – Daily Fun




CrowdKDD’12: Crowdsourcing beyond Mechanical Turk / Kuan-Ta Chen   42
Media Comparison
                         Creation           Viewer            Viewer                  Port-
                                                                          Richness
                           Cost              Req.             Control                ability

       Photo
                            Low               Low                  High     Low       Low
      browsing

     Slideshow           Medium               Low                  Low    Medium      Low

     Illustrated
                            High              High                 High    High      High
         Text

        Comic               High              Low                  High    High      High
               How to lower it?
CrowdKDD’12: Crowdsourcing beyond Mechanical Turk / Kuan-Ta Chen                               43
Comic Making – Cartoonist’s Way




CrowdKDD’12: Crowdsourcing beyond Mechanical Turk / Kuan-Ta Chen   44
http://www.pomics.net
Goal of Pomics
Pomics = Picture to Comics




CrowdKDD’12: Crowdsourcing beyond Mechanical Turk / Kuan-Ta Chen   47
                                                                   47
Computer-Aided Storytelling

               Location
       Timing Analysis
    Aesthetics Analysis          Picture                         Automated
    Semantics Analysis
                                                 Auto            Draft        User
                                              Storytelling       Story       Editing
                                              Machine Learning

             Own rating            User
             Popularity         Preference     Adjustment                              Final Story




CrowdKDD’12: Crowdsourcing beyond Mechanical Turk / Kuan-Ta Chen                                     48
Technical Challenges #1
        Semantics Analysis
              Human recognition
              Emotion recognition
              Behavior recognition
              Object recognition
              Location identification
              Natural language processing
        Aesthetics Analysis
              Exposure
              Composition
        Timing Analysis
        Contextual Analysis
CrowdKDD’12: Crowdsourcing beyond Mechanical Turk / Kuan-Ta Chen   49
                                                                   49
Technical Challenges #2
       Automatic Storytelling
             Significant photo selection
             Paginating and page layouting
             Narrative design




CrowdKDD’12: Crowdsourcing beyond Mechanical Turk / Kuan-Ta Chen   50
Pomics as a Social Service


                         Web albums

                                                                   Publish &
                                                                    share
                              Web
                           resources




CrowdKDD’12: Crowdsourcing beyond Mechanical Turk / Kuan-Ta Chen               51
Live Demo




CrowdKDD’12: Crowdsourcing beyond Mechanical Turk / Kuan-Ta Chen   52
                                                                   52
HOW IS
         RELATED TO CROWDSOURCING?




CrowdKDD’12: Crowdsourcing beyond Mechanical Turk / Kuan-Ta Chen   55
USERS ARE IMPLICITLY DOING
         IMAGE ANNOTATION AND
         EVALUATION




CrowdKDD’12: Crowdsourcing beyond Mechanical Turk / Kuan-Ta Chen   56
What pictures are used?




                                                             Aesthetics
                                                             information

   Why the 3
   pictures were
   used?

CrowdKDD’12: Crowdsourcing beyond Mechanical Turk / Kuan-Ta Chen           57
Wizard Interface



                                                                   Aesthetics
                                                                   information




CrowdKDD’12: Crowdsourcing beyond Mechanical Turk / Kuan-Ta Chen                 58
The Page Layout




                                                                   Saliency info




                 Semantics
CrowdKDD’12: Crowdsourcing beyond Mechanical Turk / Kuan-Ta Chen                   59
Usage Statistics of Pomics
                   (since July 15 2012)
        352 authors
        434 comic books
              4,362 frames
              4,332 images used
              1,057 image annotations
              3,789 text balloons


        3000+ shares on Facebook


CrowdKDD’12: Crowdsourcing beyond Mechanical Turk / Kuan-Ta Chen   60
WHAT WE HAVE GATHERED SO
         FAR?




CrowdKDD’12: Crowdsourcing beyond Mechanical Turk / Kuan-Ta Chen   61
Picture Aesthetics Info




CrowdKDD’12: Crowdsourcing beyond Mechanical Turk / Kuan-Ta Chen   62
Picture Aesthetics (cont.)




CrowdKDD’12: Crowdsourcing beyond Mechanical Turk / Kuan-Ta Chen   63
Picture Saliency Info




CrowdKDD’12: Crowdsourcing beyond Mechanical Turk / Kuan-Ta Chen   64
Picture Semantics
        Love / Like / Dear
        Happy
        Sleepy / sleeping
        Tears
        Wearing a hat
        NO!




CrowdKDD’12: Crowdsourcing beyond Mechanical Turk / Kuan-Ta Chen   65
Can Pomics Do Micro-tasks?
        The answer is YES!




        Users were asked to create comics using a specific
        album
        Rewarded by 200 MB quota if their books are “shared”
        by 20+ FB users
CrowdKDD’12: Crowdsourcing beyond Mechanical Turk / Kuan-Ta Chen   66
Picture Aesthetics from
                          Microtasks




CrowdKDD’12: Crowdsourcing beyond Mechanical Turk / Kuan-Ta Chen   67
Picture Saliency from Microtasks




CrowdKDD’12: Crowdsourcing beyond Mechanical Turk / Kuan-Ta Chen   68
Crowdmining Services
        Advantages
              No or little hiring cost once right incentives are given
              Easily scale up
              Can change the game rules to fit to research


        Disadvantages
              High development cost
              Less flexible
              Hard to find the right incentives (besides money)



CrowdKDD’12: Crowdsourcing beyond Mechanical Turk / Kuan-Ta Chen         69
Conclusion
Crowdmining is a potential and exciting area
Crowdsourcing != Mechanical Turking
A lot more can be done with crowdmining
services

                     Building your own
                    crowdmining service
                           today!
CrowdMM 2012
                                           (in conjunction with ACM Multimedia 2012)


        Keynote: Prof. Masataka Goto
        (AIST, Japan)
        11 oral+poster presentations
              Annotation, Evaluation, Novel applications
        An industrial panel discussion
        Welcome to join us!            http://crowdmm.org/




CrowdKDD’12: Crowdsourcing beyond Mechanical Turk / Kuan-Ta Chen                  71
Unleash the power of
      Crowd!

         Thank You!
                 Kuan-Ta Chen
                Academia Sinica

         http://www.iis.sinica.edu.tw/~swc

Contenu connexe

En vedette

Research Skills I Learned in UIUC from Pi-Cheng Hsiu
Research Skills I Learned in UIUC from Pi-Cheng HsiuResearch Skills I Learned in UIUC from Pi-Cheng Hsiu
Research Skills I Learned in UIUC from Pi-Cheng HsiuSheng-Wei (Kuan-Ta) Chen
 
Beyond Mechanical Turk: An Analysis of Paid Crowd Work Platforms
Beyond Mechanical Turk: An Analysis of Paid Crowd Work PlatformsBeyond Mechanical Turk: An Analysis of Paid Crowd Work Platforms
Beyond Mechanical Turk: An Analysis of Paid Crowd Work PlatformsMatthew Lease
 
20160802 那些年,關於大學的那些事...
20160802 那些年,關於大學的那些事...20160802 那些年,關於大學的那些事...
20160802 那些年,關於大學的那些事...羅左欣
 
20150911 [社團事務] 跑酷社創社審查
20150911 [社團事務] 跑酷社創社審查20150911 [社團事務] 跑酷社創社審查
20150911 [社團事務] 跑酷社創社審查羅左欣
 
從鍵盤走向社會 @KCG
從鍵盤走向社會 @KCG從鍵盤走向社會 @KCG
從鍵盤走向社會 @KCGHsiao-hsien Yang
 
Computational Social Science:The Collaborative Futures of Big Data, Computer ...
Computational Social Science:The Collaborative Futures of Big Data, Computer ...Computational Social Science:The Collaborative Futures of Big Data, Computer ...
Computational Social Science:The Collaborative Futures of Big Data, Computer ...Academia Sinica
 
20150104各類型社團文件的製作,上傳以及活動舉辦的說明與操作
20150104各類型社團文件的製作,上傳以及活動舉辦的說明與操作20150104各類型社團文件的製作,上傳以及活動舉辦的說明與操作
20150104各類型社團文件的製作,上傳以及活動舉辦的說明與操作羅左欣
 
20161211 給社團人的一封信
20161211 給社團人的一封信20161211 給社團人的一封信
20161211 給社團人的一封信羅左欣
 
20160829 夢想,Loading...
20160829 夢想,Loading...20160829 夢想,Loading...
20160829 夢想,Loading...羅左欣
 
20141214擔任活動執秘應注意之事項
20141214擔任活動執秘應注意之事項20141214擔任活動執秘應注意之事項
20141214擔任活動執秘應注意之事項羅左欣
 
R統計軟體簡介
R統計軟體簡介R統計軟體簡介
R統計軟體簡介Person Lin
 

En vedette (20)

Research Skills I Learned in UIUC from Pi-Cheng Hsiu
Research Skills I Learned in UIUC from Pi-Cheng HsiuResearch Skills I Learned in UIUC from Pi-Cheng Hsiu
Research Skills I Learned in UIUC from Pi-Cheng Hsiu
 
Inside VCL
Inside VCLInside VCL
Inside VCL
 
一位年輕探索者的建議
一位年輕探索者的建議一位年輕探索者的建議
一位年輕探索者的建議
 
當學術研究者遇見線上遊戲
當學術研究者遇見線上遊戲當學術研究者遇見線上遊戲
當學術研究者遇見線上遊戲
 
Beyond Mechanical Turk: An Analysis of Paid Crowd Work Platforms
Beyond Mechanical Turk: An Analysis of Paid Crowd Work PlatformsBeyond Mechanical Turk: An Analysis of Paid Crowd Work Platforms
Beyond Mechanical Turk: An Analysis of Paid Crowd Work Platforms
 
線上遊戲與雲端運算
線上遊戲與雲端運算線上遊戲與雲端運算
線上遊戲與雲端運算
 
20160802 那些年,關於大學的那些事...
20160802 那些年,關於大學的那些事...20160802 那些年,關於大學的那些事...
20160802 那些年,關於大學的那些事...
 
20150911 [社團事務] 跑酷社創社審查
20150911 [社團事務] 跑酷社創社審查20150911 [社團事務] 跑酷社創社審查
20150911 [社團事務] 跑酷社創社審查
 
從鍵盤走向社會 @KCG
從鍵盤走向社會 @KCG從鍵盤走向社會 @KCG
從鍵盤走向社會 @KCG
 
Web 2.0世代的資安議題
Web 2.0世代的資安議題Web 2.0世代的資安議題
Web 2.0世代的資安議題
 
Computational Social Science:The Collaborative Futures of Big Data, Computer ...
Computational Social Science:The Collaborative Futures of Big Data, Computer ...Computational Social Science:The Collaborative Futures of Big Data, Computer ...
Computational Social Science:The Collaborative Futures of Big Data, Computer ...
 
20150104各類型社團文件的製作,上傳以及活動舉辦的說明與操作
20150104各類型社團文件的製作,上傳以及活動舉辦的說明與操作20150104各類型社團文件的製作,上傳以及活動舉辦的說明與操作
20150104各類型社團文件的製作,上傳以及活動舉辦的說明與操作
 
20161211 給社團人的一封信
20161211 給社團人的一封信20161211 給社團人的一封信
20161211 給社團人的一封信
 
一位程式人 PI 在中研院
一位程式人 PI 在中研院一位程式人 PI 在中研院
一位程式人 PI 在中研院
 
Borland C++Builder 進階課程
Borland C++Builder 進階課程Borland C++Builder 進階課程
Borland C++Builder 進階課程
 
Borland C++Builder 入門課程
Borland C++Builder 入門課程Borland C++Builder 入門課程
Borland C++Builder 入門課程
 
20160829 夢想,Loading...
20160829 夢想,Loading...20160829 夢想,Loading...
20160829 夢想,Loading...
 
Network and Multimedia QoE Management
Network and Multimedia QoE ManagementNetwork and Multimedia QoE Management
Network and Multimedia QoE Management
 
20141214擔任活動執秘應注意之事項
20141214擔任活動執秘應注意之事項20141214擔任活動執秘應注意之事項
20141214擔任活動執秘應注意之事項
 
R統計軟體簡介
R統計軟體簡介R統計軟體簡介
R統計軟體簡介
 

Similaire à Crowdsourcing beyond Mechanical Turk: Building Crowdmining Services for Your Own Research

Crowdsourcing challenges and opportunities 2012
Crowdsourcing challenges and opportunities 2012Crowdsourcing challenges and opportunities 2012
Crowdsourcing challenges and opportunities 2012xin wang
 
Middeware2012 crowd
Middeware2012 crowdMiddeware2012 crowd
Middeware2012 crowdmjfrankli
 
Innovations_For_Social_Upliftment_18Feb2012
Innovations_For_Social_Upliftment_18Feb2012Innovations_For_Social_Upliftment_18Feb2012
Innovations_For_Social_Upliftment_18Feb2012Sandeep Goyal
 
The acceleration of disruption: opportunities and threats for construction - ...
The acceleration of disruption: opportunities and threats for construction - ...The acceleration of disruption: opportunities and threats for construction - ...
The acceleration of disruption: opportunities and threats for construction - ...Comit Projects Ltd
 
Dell B2B: APT by Brilliant Noise
Dell B2B: APT by Brilliant NoiseDell B2B: APT by Brilliant Noise
Dell B2B: APT by Brilliant NoiseAntony Mayfield
 
Bigger than Any One: Solving Large Scale Data Problems with People and Machines
Bigger than Any One: Solving Large Scale Data Problems with People and MachinesBigger than Any One: Solving Large Scale Data Problems with People and Machines
Bigger than Any One: Solving Large Scale Data Problems with People and MachinesTyler Bell
 
Opportunities with data science
Opportunities with data scienceOpportunities with data science
Opportunities with data scienceAshiq Rahman
 
Building customer value through innovation
Building customer value through innovationBuilding customer value through innovation
Building customer value through innovationLorenzo Vicens
 
The Future of News
The Future of NewsThe Future of News
The Future of Newspriscillamok
 
Innovation Roundtable: The (actual and potential) impacts of 3D printing on b...
Innovation Roundtable: The (actual and potential) impacts of 3D printing on b...Innovation Roundtable: The (actual and potential) impacts of 3D printing on b...
Innovation Roundtable: The (actual and potential) impacts of 3D printing on b...Tim Minshall
 
MITRE-ATARC-Cloud-Computing-White-Paper-2016-05-02
MITRE-ATARC-Cloud-Computing-White-Paper-2016-05-02MITRE-ATARC-Cloud-Computing-White-Paper-2016-05-02
MITRE-ATARC-Cloud-Computing-White-Paper-2016-05-02Nick Hill
 
Damss scurt v2 dss an evolving class ...
Damss scurt  v2 dss an evolving class ...Damss scurt  v2 dss an evolving class ...
Damss scurt v2 dss an evolving class ...ISSIP
 
Deep Learning - a Path from Big Data Indexing to Robotic Applications
Deep Learning - a Path from Big Data Indexing to Robotic ApplicationsDeep Learning - a Path from Big Data Indexing to Robotic Applications
Deep Learning - a Path from Big Data Indexing to Robotic ApplicationsDarius Burschka
 
Data mining and homeland security rl31798
Data mining and homeland security rl31798Data mining and homeland security rl31798
Data mining and homeland security rl31798Daniel John
 
姜俊宇/從資料到知識:從零開始的資料探勘
姜俊宇/從資料到知識:從零開始的資料探勘姜俊宇/從資料到知識:從零開始的資料探勘
姜俊宇/從資料到知識:從零開始的資料探勘台灣資料科學年會
 
Kedengkedeng B2
Kedengkedeng B2Kedengkedeng B2
Kedengkedeng B2cmidg1
 
Machine Learning
Machine LearningMachine Learning
Machine LearningVivek Garg
 
Emerging data haven_technologies_humtech2014_madhusudan_raman
Emerging data haven_technologies_humtech2014_madhusudan_ramanEmerging data haven_technologies_humtech2014_madhusudan_raman
Emerging data haven_technologies_humtech2014_madhusudan_ramanMadhu Raman
 

Similaire à Crowdsourcing beyond Mechanical Turk: Building Crowdmining Services for Your Own Research (20)

Crowdsourcing challenges and opportunities 2012
Crowdsourcing challenges and opportunities 2012Crowdsourcing challenges and opportunities 2012
Crowdsourcing challenges and opportunities 2012
 
Middeware2012 crowd
Middeware2012 crowdMiddeware2012 crowd
Middeware2012 crowd
 
Innovations_For_Social_Upliftment_18Feb2012
Innovations_For_Social_Upliftment_18Feb2012Innovations_For_Social_Upliftment_18Feb2012
Innovations_For_Social_Upliftment_18Feb2012
 
The acceleration of disruption: opportunities and threats for construction - ...
The acceleration of disruption: opportunities and threats for construction - ...The acceleration of disruption: opportunities and threats for construction - ...
The acceleration of disruption: opportunities and threats for construction - ...
 
Dell B2B: APT by Brilliant Noise
Dell B2B: APT by Brilliant NoiseDell B2B: APT by Brilliant Noise
Dell B2B: APT by Brilliant Noise
 
Bigger than Any One: Solving Large Scale Data Problems with People and Machines
Bigger than Any One: Solving Large Scale Data Problems with People and MachinesBigger than Any One: Solving Large Scale Data Problems with People and Machines
Bigger than Any One: Solving Large Scale Data Problems with People and Machines
 
Opportunities with data science
Opportunities with data scienceOpportunities with data science
Opportunities with data science
 
Daren Brabham Stakeholder Engagement Conference 2010
Daren Brabham Stakeholder Engagement Conference 2010Daren Brabham Stakeholder Engagement Conference 2010
Daren Brabham Stakeholder Engagement Conference 2010
 
Building customer value through innovation
Building customer value through innovationBuilding customer value through innovation
Building customer value through innovation
 
The Future of News
The Future of NewsThe Future of News
The Future of News
 
Innovation Roundtable: The (actual and potential) impacts of 3D printing on b...
Innovation Roundtable: The (actual and potential) impacts of 3D printing on b...Innovation Roundtable: The (actual and potential) impacts of 3D printing on b...
Innovation Roundtable: The (actual and potential) impacts of 3D printing on b...
 
MITRE-ATARC-Cloud-Computing-White-Paper-2016-05-02
MITRE-ATARC-Cloud-Computing-White-Paper-2016-05-02MITRE-ATARC-Cloud-Computing-White-Paper-2016-05-02
MITRE-ATARC-Cloud-Computing-White-Paper-2016-05-02
 
Damss scurt v2 dss an evolving class ...
Damss scurt  v2 dss an evolving class ...Damss scurt  v2 dss an evolving class ...
Damss scurt v2 dss an evolving class ...
 
Deep Learning - a Path from Big Data Indexing to Robotic Applications
Deep Learning - a Path from Big Data Indexing to Robotic ApplicationsDeep Learning - a Path from Big Data Indexing to Robotic Applications
Deep Learning - a Path from Big Data Indexing to Robotic Applications
 
Data mining and homeland security rl31798
Data mining and homeland security rl31798Data mining and homeland security rl31798
Data mining and homeland security rl31798
 
姜俊宇/從資料到知識:從零開始的資料探勘
姜俊宇/從資料到知識:從零開始的資料探勘姜俊宇/從資料到知識:從零開始的資料探勘
姜俊宇/從資料到知識:從零開始的資料探勘
 
Kedengkedeng B2
Kedengkedeng B2Kedengkedeng B2
Kedengkedeng B2
 
Machine Learning
Machine LearningMachine Learning
Machine Learning
 
A Server-Assigned Crowdsourcing Framework
A Server-Assigned Crowdsourcing FrameworkA Server-Assigned Crowdsourcing Framework
A Server-Assigned Crowdsourcing Framework
 
Emerging data haven_technologies_humtech2014_madhusudan_raman
Emerging data haven_technologies_humtech2014_madhusudan_ramanEmerging data haven_technologies_humtech2014_madhusudan_raman
Emerging data haven_technologies_humtech2014_madhusudan_raman
 

Dernier

Narcotic and Non Narcotic Analgesic..pdf
Narcotic and Non Narcotic Analgesic..pdfNarcotic and Non Narcotic Analgesic..pdf
Narcotic and Non Narcotic Analgesic..pdfPrerana Jadhav
 
ESP 4-EDITED.pdfmmcncncncmcmmnmnmncnmncmnnjvnnv
ESP 4-EDITED.pdfmmcncncncmcmmnmnmncnmncmnnjvnnvESP 4-EDITED.pdfmmcncncncmcmmnmnmncnmncmnnjvnnv
ESP 4-EDITED.pdfmmcncncncmcmmnmnmncnmncmnnjvnnvRicaMaeCastro1
 
Unraveling Hypertext_ Analyzing Postmodern Elements in Literature.pptx
Unraveling Hypertext_ Analyzing  Postmodern Elements in  Literature.pptxUnraveling Hypertext_ Analyzing  Postmodern Elements in  Literature.pptx
Unraveling Hypertext_ Analyzing Postmodern Elements in Literature.pptxDhatriParmar
 
Student Profile Sample - We help schools to connect the data they have, with ...
Student Profile Sample - We help schools to connect the data they have, with ...Student Profile Sample - We help schools to connect the data they have, with ...
Student Profile Sample - We help schools to connect the data they have, with ...Seán Kennedy
 
Blowin' in the Wind of Caste_ Bob Dylan's Song as a Catalyst for Social Justi...
Blowin' in the Wind of Caste_ Bob Dylan's Song as a Catalyst for Social Justi...Blowin' in the Wind of Caste_ Bob Dylan's Song as a Catalyst for Social Justi...
Blowin' in the Wind of Caste_ Bob Dylan's Song as a Catalyst for Social Justi...DhatriParmar
 
Using Grammatical Signals Suitable to Patterns of Idea Development
Using Grammatical Signals Suitable to Patterns of Idea DevelopmentUsing Grammatical Signals Suitable to Patterns of Idea Development
Using Grammatical Signals Suitable to Patterns of Idea Developmentchesterberbo7
 
Active Learning Strategies (in short ALS).pdf
Active Learning Strategies (in short ALS).pdfActive Learning Strategies (in short ALS).pdf
Active Learning Strategies (in short ALS).pdfPatidar M
 
How to Make a Duplicate of Your Odoo 17 Database
How to Make a Duplicate of Your Odoo 17 DatabaseHow to Make a Duplicate of Your Odoo 17 Database
How to Make a Duplicate of Your Odoo 17 DatabaseCeline George
 
Reading and Writing Skills 11 quarter 4 melc 1
Reading and Writing Skills 11 quarter 4 melc 1Reading and Writing Skills 11 quarter 4 melc 1
Reading and Writing Skills 11 quarter 4 melc 1GloryAnnCastre1
 
4.16.24 21st Century Movements for Black Lives.pptx
4.16.24 21st Century Movements for Black Lives.pptx4.16.24 21st Century Movements for Black Lives.pptx
4.16.24 21st Century Movements for Black Lives.pptxmary850239
 
Transaction Management in Database Management System
Transaction Management in Database Management SystemTransaction Management in Database Management System
Transaction Management in Database Management SystemChristalin Nelson
 
Mental Health Awareness - a toolkit for supporting young minds
Mental Health Awareness - a toolkit for supporting young mindsMental Health Awareness - a toolkit for supporting young minds
Mental Health Awareness - a toolkit for supporting young mindsPooky Knightsmith
 
Congestive Cardiac Failure..presentation
Congestive Cardiac Failure..presentationCongestive Cardiac Failure..presentation
Congestive Cardiac Failure..presentationdeepaannamalai16
 
week 1 cookery 8 fourth - quarter .pptx
week 1 cookery 8  fourth  -  quarter .pptxweek 1 cookery 8  fourth  -  quarter .pptx
week 1 cookery 8 fourth - quarter .pptxJonalynLegaspi2
 
Oppenheimer Film Discussion for Philosophy and Film
Oppenheimer Film Discussion for Philosophy and FilmOppenheimer Film Discussion for Philosophy and Film
Oppenheimer Film Discussion for Philosophy and FilmStan Meyer
 
4.11.24 Poverty and Inequality in America.pptx
4.11.24 Poverty and Inequality in America.pptx4.11.24 Poverty and Inequality in America.pptx
4.11.24 Poverty and Inequality in America.pptxmary850239
 
4.16.24 Poverty and Precarity--Desmond.pptx
4.16.24 Poverty and Precarity--Desmond.pptx4.16.24 Poverty and Precarity--Desmond.pptx
4.16.24 Poverty and Precarity--Desmond.pptxmary850239
 

Dernier (20)

Narcotic and Non Narcotic Analgesic..pdf
Narcotic and Non Narcotic Analgesic..pdfNarcotic and Non Narcotic Analgesic..pdf
Narcotic and Non Narcotic Analgesic..pdf
 
Faculty Profile prashantha K EEE dept Sri Sairam college of Engineering
Faculty Profile prashantha K EEE dept Sri Sairam college of EngineeringFaculty Profile prashantha K EEE dept Sri Sairam college of Engineering
Faculty Profile prashantha K EEE dept Sri Sairam college of Engineering
 
ESP 4-EDITED.pdfmmcncncncmcmmnmnmncnmncmnnjvnnv
ESP 4-EDITED.pdfmmcncncncmcmmnmnmncnmncmnnjvnnvESP 4-EDITED.pdfmmcncncncmcmmnmnmncnmncmnnjvnnv
ESP 4-EDITED.pdfmmcncncncmcmmnmnmncnmncmnnjvnnv
 
Unraveling Hypertext_ Analyzing Postmodern Elements in Literature.pptx
Unraveling Hypertext_ Analyzing  Postmodern Elements in  Literature.pptxUnraveling Hypertext_ Analyzing  Postmodern Elements in  Literature.pptx
Unraveling Hypertext_ Analyzing Postmodern Elements in Literature.pptx
 
Student Profile Sample - We help schools to connect the data they have, with ...
Student Profile Sample - We help schools to connect the data they have, with ...Student Profile Sample - We help schools to connect the data they have, with ...
Student Profile Sample - We help schools to connect the data they have, with ...
 
Blowin' in the Wind of Caste_ Bob Dylan's Song as a Catalyst for Social Justi...
Blowin' in the Wind of Caste_ Bob Dylan's Song as a Catalyst for Social Justi...Blowin' in the Wind of Caste_ Bob Dylan's Song as a Catalyst for Social Justi...
Blowin' in the Wind of Caste_ Bob Dylan's Song as a Catalyst for Social Justi...
 
Using Grammatical Signals Suitable to Patterns of Idea Development
Using Grammatical Signals Suitable to Patterns of Idea DevelopmentUsing Grammatical Signals Suitable to Patterns of Idea Development
Using Grammatical Signals Suitable to Patterns of Idea Development
 
Paradigm shift in nursing research by RS MEHTA
Paradigm shift in nursing research by RS MEHTAParadigm shift in nursing research by RS MEHTA
Paradigm shift in nursing research by RS MEHTA
 
Active Learning Strategies (in short ALS).pdf
Active Learning Strategies (in short ALS).pdfActive Learning Strategies (in short ALS).pdf
Active Learning Strategies (in short ALS).pdf
 
How to Make a Duplicate of Your Odoo 17 Database
How to Make a Duplicate of Your Odoo 17 DatabaseHow to Make a Duplicate of Your Odoo 17 Database
How to Make a Duplicate of Your Odoo 17 Database
 
Reading and Writing Skills 11 quarter 4 melc 1
Reading and Writing Skills 11 quarter 4 melc 1Reading and Writing Skills 11 quarter 4 melc 1
Reading and Writing Skills 11 quarter 4 melc 1
 
4.16.24 21st Century Movements for Black Lives.pptx
4.16.24 21st Century Movements for Black Lives.pptx4.16.24 21st Century Movements for Black Lives.pptx
4.16.24 21st Century Movements for Black Lives.pptx
 
Transaction Management in Database Management System
Transaction Management in Database Management SystemTransaction Management in Database Management System
Transaction Management in Database Management System
 
Mental Health Awareness - a toolkit for supporting young minds
Mental Health Awareness - a toolkit for supporting young mindsMental Health Awareness - a toolkit for supporting young minds
Mental Health Awareness - a toolkit for supporting young minds
 
Congestive Cardiac Failure..presentation
Congestive Cardiac Failure..presentationCongestive Cardiac Failure..presentation
Congestive Cardiac Failure..presentation
 
week 1 cookery 8 fourth - quarter .pptx
week 1 cookery 8  fourth  -  quarter .pptxweek 1 cookery 8  fourth  -  quarter .pptx
week 1 cookery 8 fourth - quarter .pptx
 
Oppenheimer Film Discussion for Philosophy and Film
Oppenheimer Film Discussion for Philosophy and FilmOppenheimer Film Discussion for Philosophy and Film
Oppenheimer Film Discussion for Philosophy and Film
 
4.11.24 Poverty and Inequality in America.pptx
4.11.24 Poverty and Inequality in America.pptx4.11.24 Poverty and Inequality in America.pptx
4.11.24 Poverty and Inequality in America.pptx
 
INCLUSIVE EDUCATION PRACTICES FOR TEACHERS AND TRAINERS.pptx
INCLUSIVE EDUCATION PRACTICES FOR TEACHERS AND TRAINERS.pptxINCLUSIVE EDUCATION PRACTICES FOR TEACHERS AND TRAINERS.pptx
INCLUSIVE EDUCATION PRACTICES FOR TEACHERS AND TRAINERS.pptx
 
4.16.24 Poverty and Precarity--Desmond.pptx
4.16.24 Poverty and Precarity--Desmond.pptx4.16.24 Poverty and Precarity--Desmond.pptx
4.16.24 Poverty and Precarity--Desmond.pptx
 

Crowdsourcing beyond Mechanical Turk: Building Crowdmining Services for Your Own Research

  • 1. Crowdsourcing beyond … Building Crowdmining Services for Your Own Research Kuan-Ta Chen Institute of Information Science Academia Sinica CrowdKDD’12 Aug 12, 2012
  • 2. What I’m going to talk Crowdsourcing? Crowdsourcing + Data Mining Research? Common Fallacies of CS4DM Research Pomics: A Crowdmining Service Conclusion
  • 3. Crowdsourcing = Crowd + Outsourcing “soliciting solutions via open calls to large-scale communities” CrowdKDD’12: Crowdsourcing beyond Mechanical Turk / Kuan-Ta Chen 3
  • 4. A more formal definition “Crowdsourcing is the act of taking a job traditionally performed by a designated agent (usually an employee) and outsourcing it to an undefined, generally large group of people in the form of an open call.” [1] [1] Howe, Jeff. Crowdsourcing: A Definition, http://crowdsourcing.typepad.com/ CrowdKDD’12: Crowdsourcing beyond Mechanical Turk / Kuan-Ta Chen 4
  • 5. What Can Crowdsourcing Do? CrowdKDD’12: Crowdsourcing beyond Mechanical Turk / Kuan-Ta Chen 5
  • 6. Brand Tagging CrowdKDD’12: Crowdsourcing beyond Mechanical Turk / Kuan-Ta Chen 7
  • 7. Data Entry Reward: 4.4 USD/hour CrowdKDD’12: Crowdsourcing beyond Mechanical Turk / Kuan-Ta Chen 8
  • 8. General Questions Reward: points on Yahoo! Answers CrowdKDD’12: Crowdsourcing beyond Mechanical Turk / Kuan-Ta Chen 9
  • 9. When crowdsourcing meets data mining… Crowdsourcing Data mining What’s in here? CrowdKDD’12: Crowdsourcing beyond Mechanical Turk / Kuan-Ta Chen 10
  • 10. Crowdsourcing for Data Mining: Issues Purposes Methodologies Annotation Recruiting (ground-truth generation) Incentives Evaluation Task Design Retrieval Workflow Human-in-the-loop Learning from crowd computation Quality control Cheat detection CrowdKDD’12: Crowdsourcing beyond Mechanical Turk / Kuan-Ta Chen 11
  • 11. Crowdsourcing Uses in Data Mining Research CrowdKDD’12: Crowdsourcing beyond Mechanical Turk / Kuan-Ta Chen 12
  • 12. Image Semantics Reward: 0.04 USD / task main theme? key objects? unique attributes? CrowdKDD’12: Crowdsourcing beyond Mechanical Turk / Kuan-Ta Chen 13
  • 13. find out photos of revolvers! 0.02 USD/ task CrowdKDD’12: Crowdsourcing beyond Mechanical Turk / Kuan-Ta Chen 14
  • 14. Human Skeleton 0.01 USD/ task CrowdKDD’12: Crowdsourcing beyond Mechanical Turk / Kuan-Ta Chen 15
  • 15. Photo Orientation 0.01 USD/ task CrowdKDD’12: Crowdsourcing beyond Mechanical Turk / Kuan-Ta Chen 17
  • 16. Perspectives for 3D Objects Thi Phuong Nghiem, Axel Carlier, Geraldine Morin, and Vincent Charvillat, "Enhancing online 3D products through crowdsourcing," ACM CrowdMM'12. CrowdKDD’12: Crowdsourcing beyond Mechanical Turk / Kuan-Ta Chen 18
  • 17. Web Site Classifier 12 USD / hour Panos Ipeirotis, “Crowdsourcing using Mechanical Turk: Quality Management and Scalability,” Invited Talk at CSDM 2011. CrowdKDD’12: Crowdsourcing beyond Mechanical Turk / Kuan-Ta Chen 19
  • 18. Photographers’ Intention to support a task? to capture a bad feeling? to preserve a good feeling? to recall later on? to publish it online? to show it to friends and family? Mathias Lux, Mario Taschwer, and Oge Marques, “A Closer Look at Photographers’ Intentions: a Test Dataset,” ACM CrowdMM’12. CrowdKDD’12: Crowdsourcing beyond Mechanical Turk / Kuan-Ta Chen 20
  • 19. Linguistic Affective Judgement Affective response (Snow et al. 2008) “Closing and cancellations top advice on flu outbreak” USD 0.4 to label 20 headlines (140 labels) CrowdKDD’12: Crowdsourcing beyond Mechanical Turk / Kuan-Ta Chen 22
  • 20. A Lot More Examples Document relevance evaluation Alonso et al. (2008) Document rating collection Kittur et al. (2008) Noun compound paraphrasing Nakov (2008) Person name resolution Su et al. (2007) And so on... CrowdKDD’12: Crowdsourcing beyond Mechanical Turk / Kuan-Ta Chen 24
  • 21. THE COMMON FALLACIES -- EXPERIENCES FROM CROWDMM’12 Thanks to CrowdMM’12 co-organizers: Wei-Tsang Ooi, Martha Larson, and Wei-Ta Chu; also thanks to “Crowdsourcing for Multimedia” SI co-guest- editors Paul Bennent and Matt Lease. CrowdKDD’12: Crowdsourcing beyond Mechanical Turk / Kuan-Ta Chen 25
  • 22. Common Fallacies #1 Crowdsourcing is NOT JUST conducting user studies Crowd is uncontrollable with tasks performed in uncontrolled conditions  How to manage the crowd? CrowdKDD’12: Crowdsourcing beyond Mechanical Turk / Kuan-Ta Chen 26
  • 23. Common Fallacies #2 Crowdsourcing is NOT JUST analyzing user-generated content Cope with the noise in UGC rather than only the information.  How to manage the imperfectness & diversity in UGC? CrowdKDD’12: Crowdsourcing beyond Mechanical Turk / Kuan-Ta Chen 27
  • 24. Common Fallacies #2 Crowdsourcing is NOT JUST analyzing user-generated content Put the task element in the loop  Re-purposing the creation of UGC as your own microtasks CrowdKDD’12: Crowdsourcing beyond Mechanical Turk / Kuan-Ta Chen 28
  • 25. Common Fallacies #3 Crowdsourcing is NOT JUST posting tasks on Mechanical Turk Explicit Crowdsourcing Implicit Crowdsourcing Piggyback Crowdsourcing Doan et al, "Crowdsourcing systems on the World-Wide Web," CACM, vol 54, no 4, 2011. CrowdKDD’12: Crowdsourcing beyond Mechanical Turk / Kuan-Ta Chen 29
  • 26. An implicit crowdmining platform for multimedia content
  • 27. Crowdsourcing for Data Mining: Issues Purposes Methodologies Annotation Recruiting (ground-truth generation) Incentives Evaluation Task Design Retrieval Workflow Human-in-the-loop Learning from crowd computation Quality control Cheat detection CrowdKDD’12: Crowdsourcing beyond Mechanical Turk / Kuan-Ta Chen 31
  • 28. The Era of Too Many Photos People today use pictures to write down their daily experience (with the prevalence of digital cameras) CrowdKDD’12: Crowdsourcing beyond Mechanical Turk / Kuan-Ta Chen 32
  • 29. How to Share Photos?
  • 30. 3 Common Ways Photo browsing Photo/video slideshow Illustrated text CrowdKDD’12: Crowdsourcing beyond Mechanical Turk / Kuan-Ta Chen 34
  • 31. Photo Browsing CrowdKDD’12: Crowdsourcing beyond Mechanical Turk / Kuan-Ta Chen 35
  • 32. Photo/Video slideshow CrowdKDD’12: Crowdsourcing beyond Mechanical Turk / Kuan-Ta Chen 36
  • 33. Illustrated Text CrowdKDD’12: Crowdsourcing beyond Mechanical Turk / Kuan-Ta Chen 37
  • 34. A MISSING PIECE CrowdKDD’12: Crowdsourcing beyond Mechanical Turk / Kuan-Ta Chen 38
  • 35. Comics CrowdKDD’12: Crowdsourcing beyond Mechanical Turk / Kuan-Ta Chen 39
  • 36. Photo Comics – Baby Born CrowdKDD’12: Crowdsourcing beyond Mechanical Turk / Kuan-Ta Chen 40
  • 37. Photo Comics – Birthday Party CrowdKDD’12: Crowdsourcing beyond Mechanical Turk / Kuan-Ta Chen 41
  • 38. Photo Comics – Daily Fun CrowdKDD’12: Crowdsourcing beyond Mechanical Turk / Kuan-Ta Chen 42
  • 39. Media Comparison Creation Viewer Viewer Port- Richness Cost Req. Control ability Photo Low Low High Low Low browsing Slideshow Medium Low Low Medium Low Illustrated High High High High High Text Comic High Low High High High How to lower it? CrowdKDD’12: Crowdsourcing beyond Mechanical Turk / Kuan-Ta Chen 43
  • 40. Comic Making – Cartoonist’s Way CrowdKDD’12: Crowdsourcing beyond Mechanical Turk / Kuan-Ta Chen 44
  • 43. Pomics = Picture to Comics CrowdKDD’12: Crowdsourcing beyond Mechanical Turk / Kuan-Ta Chen 47 47
  • 44. Computer-Aided Storytelling Location Timing Analysis Aesthetics Analysis Picture Automated Semantics Analysis Auto Draft User Storytelling Story Editing Machine Learning Own rating User Popularity Preference Adjustment Final Story CrowdKDD’12: Crowdsourcing beyond Mechanical Turk / Kuan-Ta Chen 48
  • 45. Technical Challenges #1 Semantics Analysis Human recognition Emotion recognition Behavior recognition Object recognition Location identification Natural language processing Aesthetics Analysis Exposure Composition Timing Analysis Contextual Analysis CrowdKDD’12: Crowdsourcing beyond Mechanical Turk / Kuan-Ta Chen 49 49
  • 46. Technical Challenges #2  Automatic Storytelling  Significant photo selection  Paginating and page layouting  Narrative design CrowdKDD’12: Crowdsourcing beyond Mechanical Turk / Kuan-Ta Chen 50
  • 47. Pomics as a Social Service Web albums Publish & share Web resources CrowdKDD’12: Crowdsourcing beyond Mechanical Turk / Kuan-Ta Chen 51
  • 48. Live Demo CrowdKDD’12: Crowdsourcing beyond Mechanical Turk / Kuan-Ta Chen 52 52
  • 49. HOW IS RELATED TO CROWDSOURCING? CrowdKDD’12: Crowdsourcing beyond Mechanical Turk / Kuan-Ta Chen 55
  • 50. USERS ARE IMPLICITLY DOING IMAGE ANNOTATION AND EVALUATION CrowdKDD’12: Crowdsourcing beyond Mechanical Turk / Kuan-Ta Chen 56
  • 51. What pictures are used? Aesthetics information Why the 3 pictures were used? CrowdKDD’12: Crowdsourcing beyond Mechanical Turk / Kuan-Ta Chen 57
  • 52. Wizard Interface Aesthetics information CrowdKDD’12: Crowdsourcing beyond Mechanical Turk / Kuan-Ta Chen 58
  • 53. The Page Layout Saliency info Semantics CrowdKDD’12: Crowdsourcing beyond Mechanical Turk / Kuan-Ta Chen 59
  • 54. Usage Statistics of Pomics (since July 15 2012) 352 authors 434 comic books 4,362 frames 4,332 images used 1,057 image annotations 3,789 text balloons 3000+ shares on Facebook CrowdKDD’12: Crowdsourcing beyond Mechanical Turk / Kuan-Ta Chen 60
  • 55. WHAT WE HAVE GATHERED SO FAR? CrowdKDD’12: Crowdsourcing beyond Mechanical Turk / Kuan-Ta Chen 61
  • 56. Picture Aesthetics Info CrowdKDD’12: Crowdsourcing beyond Mechanical Turk / Kuan-Ta Chen 62
  • 57. Picture Aesthetics (cont.) CrowdKDD’12: Crowdsourcing beyond Mechanical Turk / Kuan-Ta Chen 63
  • 58. Picture Saliency Info CrowdKDD’12: Crowdsourcing beyond Mechanical Turk / Kuan-Ta Chen 64
  • 59. Picture Semantics Love / Like / Dear Happy Sleepy / sleeping Tears Wearing a hat NO! CrowdKDD’12: Crowdsourcing beyond Mechanical Turk / Kuan-Ta Chen 65
  • 60. Can Pomics Do Micro-tasks? The answer is YES! Users were asked to create comics using a specific album Rewarded by 200 MB quota if their books are “shared” by 20+ FB users CrowdKDD’12: Crowdsourcing beyond Mechanical Turk / Kuan-Ta Chen 66
  • 61. Picture Aesthetics from Microtasks CrowdKDD’12: Crowdsourcing beyond Mechanical Turk / Kuan-Ta Chen 67
  • 62. Picture Saliency from Microtasks CrowdKDD’12: Crowdsourcing beyond Mechanical Turk / Kuan-Ta Chen 68
  • 63. Crowdmining Services Advantages No or little hiring cost once right incentives are given Easily scale up Can change the game rules to fit to research Disadvantages High development cost Less flexible Hard to find the right incentives (besides money) CrowdKDD’12: Crowdsourcing beyond Mechanical Turk / Kuan-Ta Chen 69
  • 64. Conclusion Crowdmining is a potential and exciting area Crowdsourcing != Mechanical Turking A lot more can be done with crowdmining services Building your own crowdmining service today!
  • 65. CrowdMM 2012 (in conjunction with ACM Multimedia 2012) Keynote: Prof. Masataka Goto (AIST, Japan) 11 oral+poster presentations Annotation, Evaluation, Novel applications An industrial panel discussion Welcome to join us! http://crowdmm.org/ CrowdKDD’12: Crowdsourcing beyond Mechanical Turk / Kuan-Ta Chen 71
  • 66. Unleash the power of Crowd! Thank You! Kuan-Ta Chen Academia Sinica http://www.iis.sinica.edu.tw/~swc