Introduction to Mechanical Turk
Artificial Artificial Intelligence




AWS User Group Berlin
Thomas Metschke
25.03.2010
Peritor GmbH
Amazon Mechanical Turk is a marketplace for work.

Mechanical Turk Marketplace

• 400,000+ Workers
• In 100+ Countries
• Available 24/7
• Programmatically Accessible

Image: http://www.flickr.com/photos/diamond_rain/2543837414/

So there are basically two roles

• Workers (image: http://www.flickr.com/photos/saad/1968774)
• Requesters (image: http://www.flickr.com/photos/chicagobart/4181533461)

Mechanical Turk as a Worker

Workers
• Make money by working on Human Intelligence Tasks (HITs)
• Can work from home and choose their own work hours

Image: http://www.flickr.com/photos/saad/1968774

Your Dashboard

• The number of available tasks
• Total Earnings and Bonuses
• HIT Status and Totals

How do I get the money?

• U.S. Bank account
• Amazon Gift Certificate
• Bank Checks in Rupees

Mechanical Turk as a Requester

Requesters
• Have access to a global, on-demand, 24 x 7 workforce
• Can get thousands of HITs completed in minutes
• Pay only when they are satisfied with the results

Image: http://www.flickr.com/photos/chicagobart/4181533461

Requesting HITs

Requesters
• define and create your HITs
• load HITs to Mechanical Turk

Workers
• work on your HITs
• submit results

Requesters
• approve and pay for completed HITs
• use the results

Design HITs

• Enter Properties
• Design Layout

Design HITs - faster

Take a developer and use
• CSV files
• SOAP / REST
• Amazon Mechanical Turk developer tools

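A minimal sketch of the idea behind the CSV option, assuming each CSV row is merged into a HIT layout to produce one HIT per row. Python's string.Template stands in for the HIT layout here; the template text, the column names (name, address) and the helper render_hits are hypothetical and only illustrate the merge step.

```python
import csv
import io
from string import Template

# Hypothetical HIT layout; ${name} and ${address} are placeholder names
# chosen for this sketch, matched against the CSV column headers.
HIT_TEMPLATE = Template("Is '${name}' at '${address}' a real business? (YES/NO)")

# Hypothetical input data; in practice this would be a CSV file on disk.
CSV_DATA = """name,address
Peritor GmbH,"Blücherstraße 22, 10961 Berlin"
Peritor Consulting,"Blücherstraße 22, Hof III Aufgang 6, 10961 Berlin"
"""


def render_hits(csv_text: str, template: Template) -> list[str]:
    """Return one rendered question per CSV row."""
    reader = csv.DictReader(io.StringIO(csv_text))
    return [template.substitute(row) for row in reader]


if __name__ == "__main__":
    for question in render_hits(CSV_DATA, HIT_TEMPLATE):
        print(question)
```

One row in, one question out: adding more work means adding rows to the file, not designing more HITs.
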
What would it look like?

http://mechanicalturk.amazonaws.com/
          ?Service=AWSMechanicalTurkRequester
          &AWSAccessKeyId=[the Requester's Access Key ID]
          &Version=2008-08-02
          &Operation=CreateHIT
          &Signature=[signature for this request]
          &Timestamp=[your system's local time]
          &Title=Location%20and%20Photograph%20Identification
          &Description=Select%20the%20image%20that%20best%20represents
          &Reward.1.Amount=5
          &Reward.1.CurrencyCode=USD
          &Question=[URL-encoded question data]
          &AssignmentDurationInSeconds=30
          &LifetimeInSeconds=604800
          &Keywords=location,%20photograph,%20image,%20identification,%20opinion

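The request above can be assembled programmatically. The sketch below only builds the query string with Python's standard library, using the parameter names and values shown on the slide. The function name build_create_hit_url and the argument values in the usage example are placeholders, and computing a valid Signature is deliberately left out, so this is not a working client.

```python
from urllib.parse import quote, urlencode

# Endpoint and API version as shown on the slide.
ENDPOINT = "http://mechanicalturk.amazonaws.com/"


def build_create_hit_url(access_key_id: str, signature: str,
                         timestamp: str, question_xml: str) -> str:
    """Build the CreateHIT request URL from the slide's parameters.

    The signature must be computed separately; that step is not shown here.
    """
    params = {
        "Service": "AWSMechanicalTurkRequester",
        "AWSAccessKeyId": access_key_id,
        "Version": "2008-08-02",
        "Operation": "CreateHIT",
        "Signature": signature,
        "Timestamp": timestamp,
        "Title": "Location and Photograph Identification",
        "Description": "Select the image that best represents",
        "Reward.1.Amount": "5",
        "Reward.1.CurrencyCode": "USD",
        "Question": question_xml,
        "AssignmentDurationInSeconds": "30",
        "LifetimeInSeconds": "604800",
        "Keywords": "location, photograph, image, identification, opinion",
    }
    # quote_via=quote encodes spaces as %20 (as on the slide) instead of "+".
    return ENDPOINT + "?" + urlencode(params, quote_via=quote)


if __name__ == "__main__":
    # All four arguments are placeholder values for illustration.
    print(build_create_hit_url("ACCESS_KEY_ID", "SIGNATURE",
                               "2010-03-25T12:00:00Z", "<QuestionForm/>"))
```

Unlike the slide, urlencode also percent-encodes the commas in Keywords; both forms decode to the same value.
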
Publish HITs

• HITs have to be paid in advance
• Amazon takes 10% on top

Payment methods: credit card, debit card, Amazon Payments account, U.S. bank account

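As a worked example of "10% on top", the sketch below computes the amount to prepay for a batch. The 10% rate is the figure from the slide; the reward, the batch size and the function name batch_cost are made up for illustration, and any minimum fees or other pricing rules are ignored.

```python
from decimal import Decimal

# Commission rate taken from the slide ("Amazon takes 10% on top").
FEE_RATE = Decimal("0.10")


def batch_cost(reward_per_assignment: Decimal, hits: int,
               assignments_per_hit: int = 1) -> Decimal:
    """Return the total to prepay: Worker rewards plus the commission."""
    rewards = reward_per_assignment * hits * assignments_per_hit
    return rewards + rewards * FEE_RATE


if __name__ == "__main__":
    # Hypothetical batch: 1,000 HITs at a reward of 0.05 USD each.
    print(batch_cost(Decimal("0.05"), 1000))
```

For this hypothetical batch the rewards come to 50 USD and the commission to 5 USD, so 55 USD has to be paid in advance.
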
Use Mechanical Turk for

• Work that requires Human Judgment
• Work that algorithms cannot completely solve
• Work that has unpredictable or spiky volume

Improving Data Quality

Are these two businesses the same?

  Peritor GmbH          Peritor Consulting
  Blücherstraße 22      Blücherstraße 22
  10961 Berlin          Hof III Aufgang 6
  http://peritor.com    10961 Berlin

  YES / NO

Background
• Data is the company’s business
• Accuracy and breadth are key to differentiation

Process
• 1 million data points to ingest each day
• 200 data sources

Problem
• Data needs to be normalized, enhanced and de-duped
• Algorithms could get data about 70% clean

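One way to read "algorithms could get data about 70% clean" is that software settles the clear cases and only the ambiguous pairs become HITs. The sketch below illustrates that split under stated assumptions: difflib's string similarity stands in for whatever matching algorithm a real pipeline would use, and both thresholds and the function names are hypothetical.

```python
from difflib import SequenceMatcher

# Hypothetical thresholds; a real pipeline would tune these on labelled data.
AUTO_MATCH = 0.95     # at or above: treat the records as the same business
AUTO_DISTINCT = 0.50  # at or below: treat the records as different businesses


def similarity(a: str, b: str) -> float:
    """Case-insensitive string similarity between 0.0 and 1.0."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def route(record_a: str, record_b: str) -> str:
    """Decide automatically when the score is clear, else ask a Worker."""
    score = similarity(record_a, record_b)
    if score >= AUTO_MATCH:
        return "same"
    if score <= AUTO_DISTINCT:
        return "different"
    return "ask_worker"


if __name__ == "__main__":
    a = "Peritor GmbH, Blücherstraße 22, 10961 Berlin"
    b = "Peritor Consulting, Blücherstraße 22, Hof III Aufgang 6, 10961 Berlin"
    print(route(a, b))
```

The two records from the slide share a street and city but differ in name and address detail, so their score falls between the thresholds and the pair goes to a Worker as a YES/NO question.
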
Moderating User Generated Content

Is this image explicit? YES / NO
Image: http://www.flickr.com/photos/cmak/1521356521/

Background
• User generated content is a key part of a web 2.0 experience

Process
• Millions of photos uploaded every day

Problem
• Need to ensure user generated content meets site guidelines

Categorization

What kind of dress is this?
Image: http://www.flickr.com/photos/34801476@N00/296743627/
• Cocktail
• Bridal dress

Background
• Consumers need to be able to quickly find a product when shopping online

The Business Process
• Millions of new products are introduced every day
• Products are sourced from hundreds of merchants and manufacturers, each with their own taxonomy

Problem
• Need to properly categorize new products quickly in order to monetize them

Optimizing your HITs for

• Price
• Accuracy
• Speed

Check it out!

http://mturk.com
http://turkers.proboards.com

Thank you for your attention
Peritor GmbH
Blücherstr. 22, Hof III Aufgang 6
10961 Berlin
Tel.: +49 (0)30 69 20 09 84 0
Fax: +49 (0)30 69 20 09 84 9
Internet: www.peritor.com
E-Mail: info@peritor.com



© Peritor GmbH - All rights reserved
