SlideShare une entreprise Scribd logo
1  sur  37
Télécharger pour lire hors ligne
rNews
 Embedded Data For
The News Industry




                     1
Hello!

§ Stuart Myles – @smyles
 Lead of the IPTC Semantic Web WG &
 Deputy Director of Schema Standards,
 The Associated Press

§ Evan Sandhaus – @kansandhaus
 Lead Architect, Semantic Platforms,
 The New York Times Company

§ Andreas Gebhard – @agebhard
 Managing Editor,
 Getty Images
                                        2
...And 50
Others
            4
Story components which are obvious to a person…




     STORY


PHOTO
...are not so obvious to a machine.




   STORY


PHOTO
The Problem of Structured Data
§ Modern Web Sites
  Built with 3 Tier
                               Display
  Architecture
  • Data Tier: Database
                                Tier
    Where Content Lives.
  • Presentation Tier:
    HTML Document that is
    sent to user.
  • Logic Tier: Software
                               Logic
    that reads from the Data
    Tier and outputs the        Tier
    Presentation Tier.




                                Data
                                Tier
                                         8
The Problem Of Structured Data:
           Continued
            Data Tier                                      Logic Tier             Display Tier

Label      Type     Value                                               <html>
                                                                          <head>
id         number   1248069162607                                           <title>
Headline   text     New Web Code Draws Concern...                              New Web Code Draws Concern...
                                                                            </title>
Byline     text     By TANZINA VEGA                                       </head>
Date       date     20101010                                              <body>
                                                                            <div>
Body       text     In the next few years, a powerful...                       New Web Code Draws Concern...
Length     number   1123                                                    </div>
                                                                            <div>
Tag        text     Privacy                                                    By TANZINA VEGA
Tag        text     Computers and the Internet                              </div>
                                                                            <div>
Tag        text     Web Browsers                                               October 10, 2010
                                                                            </div>
                                                                            <div>
                                                                               In the next few years, a powerful...
                                                                            </div>
                                                                          </body>
§ Content very well structured on Data                                 </html>

       Tier, but all of this structure is lost in
       translation to presentation tier.


                                                                                                                      9
The Problem Of Structured Data:
    Continued


<html>
          Display Tier                                        ?
  <head>
    <title>
       New Web Code Draws Concern...
    </title>
  </head>
  <body>
    <div>




                                              =
       New Web Code Draws Concern...
    </div>
    <div>
       By TANZINA VEGA
    </div>
    <div>
                                                  § Search engines, social
       October 10, 2010                             networks, aggregators and
    </div>
    <div>                                           other sites only see the
       In the next few years, a powerful...
    </div>                                          Display Tier, and cannot
  </body>
</html>
                                                    leverage the underlying
                                                    structure of the data.




                                                                                10
Semantic Markup Standards

Microformats      RDFa          Microdata         JSON




 § First      § Official    § Unofficial   § Official
 § Simple     § Complex     § Flexible     § Developers
 § Rigid      § OpenGraph   § Schema.org   § External




                                                              11
rNews
        12
rNews Defined


  rNews is a data model for
  embedding machine-readable
  publishing metadata in web
  documents and a set of
  suggested implementations.




                               13
rNews is a data model

                                          PostalAddress



                         address     address                            address




                                                   geo
                                               Coordinates   Geo
         Person                    Location               Coordinates                   Organization




            creator                                                                     creator
            editor                                           name                        editor
            contributor                                                             contributor
            provider                                                                   provider
            copyrightHolder                     Concept                         copyrightHolder
            accountablePerson                                                sourceOrganization
                                              about   mentions

       Comment

                         comment




                                               NewsItem




                                          associatedMedia


                                         associatedArticle


       ImageObject
       VideoObject                                                                          Article
       AudioObject                                                                                     14
for embedding machine-readable
publishing metadata in web
documents


Headline
Byline
Tags
Creator
...




                                 15
and a set of suggested implementations



      RDFa      Microdata    JSON




      Today     Very Soon   Maybe?




                                         16
rNews - Working Example




                          17
1<!DOCTYPE4html4PUBLIC4">//W3C//DTD4XHTML41.04Transitional//EN"44"http://www.w3.org/TR/xhtml1/DTD/xhtml1>transitional.dtd">44
 2<html>
 3<head></head>
 4<body>
 5444<div>
 64444<div>
 74444444<div>Allies4Are44Split...</div>
 8444444<div>NATO4Takes44Command</div>
 9444444<div>
1044444444<img44src="img/libya_sample_reuters.jpg"/>
1144444444<div>Credit:4Goran4Tomasevic/Reuters</div>
12444444444<div>Rebel4fighters44take...</div>
13444444</div>
14444444<div>By4STEVEN4LEE44MYERS</div>
15444444<div>WASHINGTON44|4March424,42011</div>
16444444<div>
1744444444<p>Having44largely4succeeded...</p>
18444444</div>
19444444<div>
2044444444<p><a44href="http://www.nytimes.com/content/help/rights/copyright/copyright>notice.html">
2144444444444©4Copyright442011
22444444444</a><span>The4New4York44Times44Company</span></p>
2344444444<p><a44href="http://www.nytimes.com/ref/membercenter/help/agree.html">
2444444444444Disclaimer
25444444444</a></p>
264444444</div>
274444</div>
2844444<div>
29444444<div>
3044444444<div>Section</div>
3144444444<div>World</div>
32444444</div>
33444444<div>Tags</div>
344444444<div>
3544444444<div>
364444444444<div>People</div>
374444444444<div>Qaddafi,4Muammar44el></div>
3844444444</div>
39444444</div>444444
40444444<div>
4144444444<div>Discussion44(3)</div>
4244444444<div>
434444444444<div>So4the4question44is..."</div>
444444444444<div>
454444444444<a44href="http://timespeople.nytimes.com/view/user/27242827/activities.html">Chuck</a></div>
464444444444<div>March425th,44201148:274am</div>
4744444444</div>
48444444</div>
494444</div>
50444</div>
51</body>
52</html>
                                                                                                                                 18
HTML 5 Microdata
<!DOCTYPE HTML>
<html itemscope itemtype="http://schema.org/NewsArticle" >
<head>
! <style type="text/css">@import url(css/iptc_times2.css);</style>
! <meta itemprop="dateCreated" content="2011-03-23"/>
! <meta itemprop="description" content="The questions about the command..."/>
! <meta itemprop="inLanguage" content="en-US"/>
! <meta itemprop="thumbnailUrl" content="http://graphics8.nytimes.com/images/common/icons/t_wb_75.gif"/>
! <meta itemprop="genre" content="Current"/>
! <meta itemprop="id" content="1248069687395"/>
! <meta itemprop="version" content="2"/>
! <meta itemprop="publishingPrinciples" content="http://www.nytco.com/press/ethics.html"/>
! <meta itemprop="wordCount" content="879"/>!
</head>
<body>
! <div style="height:900px" class="article">
! ! <div class="a_column">
! ! ! <div itemprop="headline" class="headline">Allies Are Split on Goal and Exit Strategy in Libya</div>
! ! ! <div itemprop="alternativeHeadline" class="rider">NATO Takes Command</div>
! ! ! <div itemprop="associatedMedia" itemscope itemtype="http://schema.org/ImageObject">
! ! ! ! <img itemprop="URL" class="image" src="img/libya_sample_reuters.jpg"/>
! ! ! ! <div class="image_credit">Credit:
! ! ! ! ! <span itemprop="creator" itemscope itemtype="http://schema.org/Person">
! ! ! ! ! ! <span itemprop="name">Goran Tomasevic</span>
! ! ! ! ! </span>
! ! ! ! ! /
! ! ! ! ! <span itemprop="sourceOrganization" itemscope itemtype="http://schema.org/Organization">
! ! ! ! ! ! <span itemprop="name">Reuters</span>
! ! ! ! ! ! <meta itemprop="tickerSymbol" content="NYSE TRI"/>
! ! ! ! ! </span>
! ! ! ! </div>
                                                                                                            19
RDFa
<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML+RDFa 1.0//EN" "http://www.w3.org/MarkUp/DTD/xhtml-rdfa-1.dtd">
<html xmlns:rnews="http://dec.iptc.org/rnews/0.1/">
<head>
! <style type="text/css">@import url(css/iptc_times2.css);</style>
</head>
<body>
! <div class="article" style="height:623px">
! ! <div class="a_column">
! ! ! <div property="rnews:headline" class="headline">Allies Are Split on Goal and Exit Strategy in Libya</div>
! ! ! <div class="rider">NATO Takes Command</div>
! ! ! <div class="main_image">
! ! ! ! <img class="image" src="img/libya_sample_reuters.jpg"/>
! ! ! ! <div class="image_credit">Credit: Goran Tomasevic/Reuters</div>
! ! ! ! <div class="image_caption">
! ! ! ! ! Rebel fighters take cover during a shelling near Ajdabiyah, Libya on Thursday.
! ! ! ! </div>
! ! ! </div>
! ! ! <div rel="rnews:createdBy" class="byline">By
! ! ! ! <span about="http://demo.iptc.org/per/steven_lee_myers" typeof="rnews:Person">
! ! ! ! ! <span property="rnews:name">STEVEN LEE MYERS</span>
! ! ! ! </span>
! ! ! </div>
! ! ! <div class="publication_date">
! ! ! ! <span property="rnews:dateline">WASHINGTON</span>
! ! ! ! |
! ! ! ! <span property="rnews:dateCreated" content="2011-03-24">March 24, 2011</span>
! ! ! </div>



                                                                                                                  20
The Way to rNews
The Way To rNews

       § June: Genesis of rNews - Evan at SemTech 2010
2010



       § November 5 - Rome: chartered
       § internal discussions about NYT draft
       § March 9 - Dubai: rNews 0.1
2011




       § lots of feedback, changes and additions
       § June 9 - Berlin: rNews 0.5
       § June 28: rNews 0.6
       § September 6: rNews 0.7 [aligned w/ schema.org]
       § October 7 - Vienna: rNews 1.0                    22
Engaging Our Community




                         23
Engaging Our Community




                         24
Engaging Our Community




                         25
Engaging Our Community




                         26
Feedback we incorporated...

§ In Person
  • 3 Meetups: New York, Berlin, London
  • Over a dozen one-on-one meetings with leading media and
    technology companies.
§ Online
  • Rnews.org forum
  • Numerous Blog Posts
§ In The Standard’s Community
  • W3C Community Group
  • Media Standards Trust


                                                              27
Feedback we incorporated...



                                         point
                                         circle
 latitude                                elevation
longitude                                polygon
 altitude                                box
                                         line
                              Geo
            Location       Coordinates




                                                     28
Feedback we incorporated...




          editor

 Person                  NewsItem




                                    29
rNews
    Benefits
Or Why You Should Care
     About rNews
Benefit #1: Better Links

With Structured
Data
No Structured
Data




                                   31
Benefit #2: Better Analytics




Javascript can extract richer news metadata
    Analytics per item, not just per page

                                              32
Benefit #3: Better Ad Placement


                    Leverage metadata
                       not just text




Avoid unfortunate
 juxtapositions                         33
rNews as a news API




  Level the Playing Field
Encourage Open Innovation   34
35
How Can You Help Us Get to rNews 1.0?

§ Check out the rNews 0.7 spec
§ Mark up some pages using rNews
§ Extract rNews properties using your favourite distiller
§ Dream up The Next Metadata Killer App™


             Let us know what you think
           Let us know how we can help
        @smyles • @agebhard • @kansandhaus

                                                             36
rNews
Thank
 You

        37

Contenu connexe

Plus de Stuart Myles

Plus de Stuart Myles (20)

IPTC Rights Statements For News
IPTC Rights Statements For NewsIPTC Rights Statements For News
IPTC Rights Statements For News
 
IPTC New Taxonomies Ideas
IPTC New Taxonomies IdeasIPTC New Taxonomies Ideas
IPTC New Taxonomies Ideas
 
IPTC Board Spring 2019
IPTC Board Spring 2019IPTC Board Spring 2019
IPTC Board Spring 2019
 
IPTC Spring 2019 Conference
IPTC Spring 2019 ConferenceIPTC Spring 2019 Conference
IPTC Spring 2019 Conference
 
Photomation or Fauxtomation?
Photomation or Fauxtomation?Photomation or Fauxtomation?
Photomation or Fauxtomation?
 
Image Tagging at the Associated Press
Image Tagging at the Associated PressImage Tagging at the Associated Press
Image Tagging at the Associated Press
 
IPTC Rights Working Group Toronto October 2018
IPTC Rights Working Group Toronto October 2018IPTC Rights Working Group Toronto October 2018
IPTC Rights Working Group Toronto October 2018
 
IPTC AGM 2018 Welcome
IPTC AGM 2018 WelcomeIPTC AGM 2018 Welcome
IPTC AGM 2018 Welcome
 
How Can We Make Algorithmic News More Transparent?
How Can We Make Algorithmic News More Transparent?How Can We Make Algorithmic News More Transparent?
How Can We Make Algorithmic News More Transparent?
 
IPTC EXTRA Spring 2018
IPTC EXTRA Spring 2018IPTC EXTRA Spring 2018
IPTC EXTRA Spring 2018
 
IPTC Machine Readable Rights for News and Media: Solving Three Challenges wit...
IPTC Machine Readable Rights for News and Media: Solving Three Challenges wit...IPTC Machine Readable Rights for News and Media: Solving Three Challenges wit...
IPTC Machine Readable Rights for News and Media: Solving Three Challenges wit...
 
Ap Taxonomy Localization Requirements and Challenges
Ap Taxonomy Localization Requirements and ChallengesAp Taxonomy Localization Requirements and Challenges
Ap Taxonomy Localization Requirements and Challenges
 
IPTC Spring Meeting Welcome To Athens April 2018
IPTC Spring Meeting Welcome To Athens April 2018IPTC Spring Meeting Welcome To Athens April 2018
IPTC Spring Meeting Welcome To Athens April 2018
 
Sustaining Television News Technical Challenges
Sustaining Television News Technical ChallengesSustaining Television News Technical Challenges
Sustaining Television News Technical Challenges
 
How to Train Your Classifier: Create a Serverless Machine Learning System wit...
How to Train Your Classifier: Create a Serverless Machine Learning System wit...How to Train Your Classifier: Create a Serverless Machine Learning System wit...
How to Train Your Classifier: Create a Serverless Machine Learning System wit...
 
The Search for IPTC's Next Managing Director
The Search for IPTC's Next Managing DirectorThe Search for IPTC's Next Managing Director
The Search for IPTC's Next Managing Director
 
IPTC Approach to News in JSON
IPTC Approach to News in JSONIPTC Approach to News in JSON
IPTC Approach to News in JSON
 
IPTC News in JSON November 2017
IPTC News in JSON November 2017IPTC News in JSON November 2017
IPTC News in JSON November 2017
 
IPTC EXTRA and EXTRA+ November 2017
IPTC EXTRA and EXTRA+ November 2017IPTC EXTRA and EXTRA+ November 2017
IPTC EXTRA and EXTRA+ November 2017
 
Welcome to Barcelona - IPTC November 2017
Welcome to Barcelona - IPTC November 2017Welcome to Barcelona - IPTC November 2017
Welcome to Barcelona - IPTC November 2017
 

Dernier

Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 

Dernier (20)

DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
 
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 

On our way to IPTC rNews 1.0 - Inception and Design of a Standard

  • 1. rNews Embedded Data For The News Industry 1
  • 2. Hello! § Stuart Myles – @smyles Lead of the IPTC Semantic Web WG & Deputy Director of Schema Standards, The Associated Press § Evan Sandhaus – @kansandhaus Lead Architect, Semantic Platforms, The New York Times Company § Andreas Gebhard – @agebhard Managing Editor, Getty Images 2
  • 3.
  • 5.
  • 6. Story components which are obvious to a person… STORY PHOTO
  • 7. ...are not so obvious to a machine. STORY PHOTO
  • 8. The Problem of Structured Data § Modern Web Sites Built with 3 Tier Display Architecture • Data Tier: Database Tier Where Content Lives. • Presentation Tier: HTML Document that is sent to user. • Logic Tier: Software Logic that reads from the Data Tier and outputs the Tier Presentation Tier. Data Tier 8
  • 9. The Problem Of Structured Data: Continued Data Tier Logic Tier Display Tier Label Type Value <html> <head> id number 1248069162607 <title> Headline text New Web Code Draws Concern... New Web Code Draws Concern... </title> Byline text By TANZINA VEGA </head> Date date 20101010 <body> <div> Body text In the next few years, a powerful... New Web Code Draws Concern... Length number 1123 </div> <div> Tag text Privacy By TANZINA VEGA Tag text Computers and the Internet </div> <div> Tag text Web Browsers October 10, 2010 </div> <div> In the next few years, a powerful... </div> </body> § Content very well structured on Data </html> Tier, but all of this structure is lost in translation to presentation tier. 9
  • 10. The Problem Of Structured Data: Continued <html> Display Tier ? <head> <title> New Web Code Draws Concern... </title> </head> <body> <div> = New Web Code Draws Concern... </div> <div> By TANZINA VEGA </div> <div> § Search engines, social October 10, 2010 networks, aggregators and </div> <div> other sites only see the In the next few years, a powerful... </div> Display Tier, and cannot </body> </html> leverage the underlying structure of the data. 10
  • 11. Semantic Markup Standards Microformats RDFa Microdata JSON § First § Official § Unofficial § Official § Simple § Complex § Flexible § Developers § Rigid § OpenGraph § Schema.org § External 11
  • 12. rNews 12
  • 13. rNews Defined rNews is a data model for embedding machine-readable publishing metadata in web documents and a set of suggested implementations. 13
  • 14. rNews is a data model PostalAddress address address address geo Coordinates Geo Person Location Coordinates Organization creator creator editor name editor contributor contributor provider provider copyrightHolder Concept copyrightHolder accountablePerson sourceOrganization about mentions Comment comment NewsItem associatedMedia associatedArticle ImageObject VideoObject Article AudioObject 14
  • 15. for embedding machine-readable publishing metadata in web documents Headline Byline Tags Creator ... 15
  • 16. and a set of suggested implementations RDFa Microdata JSON Today Very Soon Maybe? 16
  • 17. rNews - Working Example 17
  • 18. 1<!DOCTYPE4html4PUBLIC4">//W3C//DTD4XHTML41.04Transitional//EN"44"http://www.w3.org/TR/xhtml1/DTD/xhtml1>transitional.dtd">44 2<html> 3<head></head> 4<body> 5444<div> 64444<div> 74444444<div>Allies4Are44Split...</div> 8444444<div>NATO4Takes44Command</div> 9444444<div> 1044444444<img44src="img/libya_sample_reuters.jpg"/> 1144444444<div>Credit:4Goran4Tomasevic/Reuters</div> 12444444444<div>Rebel4fighters44take...</div> 13444444</div> 14444444<div>By4STEVEN4LEE44MYERS</div> 15444444<div>WASHINGTON44|4March424,42011</div> 16444444<div> 1744444444<p>Having44largely4succeeded...</p> 18444444</div> 19444444<div> 2044444444<p><a44href="http://www.nytimes.com/content/help/rights/copyright/copyright>notice.html"> 2144444444444©4Copyright442011 22444444444</a><span>The4New4York44Times44Company</span></p> 2344444444<p><a44href="http://www.nytimes.com/ref/membercenter/help/agree.html"> 2444444444444Disclaimer 25444444444</a></p> 264444444</div> 274444</div> 2844444<div> 29444444<div> 3044444444<div>Section</div> 3144444444<div>World</div> 32444444</div> 33444444<div>Tags</div> 344444444<div> 3544444444<div> 364444444444<div>People</div> 374444444444<div>Qaddafi,4Muammar44el></div> 3844444444</div> 39444444</div>444444 40444444<div> 4144444444<div>Discussion44(3)</div> 4244444444<div> 434444444444<div>So4the4question44is..."</div> 444444444444<div> 454444444444<a44href="http://timespeople.nytimes.com/view/user/27242827/activities.html">Chuck</a></div> 464444444444<div>March425th,44201148:274am</div> 4744444444</div> 48444444</div> 494444</div> 50444</div> 51</body> 52</html> 18
  • 19. HTML 5 Microdata <!DOCTYPE HTML> <html itemscope itemtype="http://schema.org/NewsArticle" > <head> ! <style type="text/css">@import url(css/iptc_times2.css);</style> ! <meta itemprop="dateCreated" content="2011-03-23"/> ! <meta itemprop="description" content="The questions about the command..."/> ! <meta itemprop="inLanguage" content="en-US"/> ! <meta itemprop="thumbnailUrl" content="http://graphics8.nytimes.com/images/common/icons/t_wb_75.gif"/> ! <meta itemprop="genre" content="Current"/> ! <meta itemprop="id" content="1248069687395"/> ! <meta itemprop="version" content="2"/> ! <meta itemprop="publishingPrinciples" content="http://www.nytco.com/press/ethics.html"/> ! <meta itemprop="wordCount" content="879"/>! </head> <body> ! <div style="height:900px" class="article"> ! ! <div class="a_column"> ! ! ! <div itemprop="headline" class="headline">Allies Are Split on Goal and Exit Strategy in Libya</div> ! ! ! <div itemprop="alternativeHeadline" class="rider">NATO Takes Command</div> ! ! ! <div itemprop="associatedMedia" itemscope itemtype="http://schema.org/ImageObject"> ! ! ! ! <img itemprop="URL" class="image" src="img/libya_sample_reuters.jpg"/> ! ! ! ! <div class="image_credit">Credit: ! ! ! ! ! <span itemprop="creator" itemscope itemtype="http://schema.org/Person"> ! ! ! ! ! ! <span itemprop="name">Goran Tomasevic</span> ! ! ! ! ! </span> ! ! ! ! ! / ! ! ! ! ! <span itemprop="sourceOrganization" itemscope itemtype="http://schema.org/Organization"> ! ! ! ! ! ! <span itemprop="name">Reuters</span> ! ! ! ! ! ! <meta itemprop="tickerSymbol" content="NYSE TRI"/> ! ! ! ! ! </span> ! ! ! ! </div> 19
  • 20. RDFa <!DOCTYPE html PUBLIC "-//W3C//DTD XHTML+RDFa 1.0//EN" "http://www.w3.org/MarkUp/DTD/xhtml-rdfa-1.dtd"> <html xmlns:rnews="http://dec.iptc.org/rnews/0.1/"> <head> ! <style type="text/css">@import url(css/iptc_times2.css);</style> </head> <body> ! <div class="article" style="height:623px"> ! ! <div class="a_column"> ! ! ! <div property="rnews:headline" class="headline">Allies Are Split on Goal and Exit Strategy in Libya</div> ! ! ! <div class="rider">NATO Takes Command</div> ! ! ! <div class="main_image"> ! ! ! ! <img class="image" src="img/libya_sample_reuters.jpg"/> ! ! ! ! <div class="image_credit">Credit: Goran Tomasevic/Reuters</div> ! ! ! ! <div class="image_caption"> ! ! ! ! ! Rebel fighters take cover during a shelling near Ajdabiyah, Libya on Thursday. ! ! ! ! </div> ! ! ! </div> ! ! ! <div rel="rnews:createdBy" class="byline">By ! ! ! ! <span about="http://demo.iptc.org/per/steven_lee_myers" typeof="rnews:Person"> ! ! ! ! ! <span property="rnews:name">STEVEN LEE MYERS</span> ! ! ! ! </span> ! ! ! </div> ! ! ! <div class="publication_date"> ! ! ! ! <span property="rnews:dateline">WASHINGTON</span> ! ! ! ! | ! ! ! ! <span property="rnews:dateCreated" content="2011-03-24">March 24, 2011</span> ! ! ! </div> 20
  • 21. The Way to rNews
  • 22. The Way To rNews § June: Genesis of rNews - Evan at SemTech 2010 2010 § November 5 - Rome: chartered § internal discussions about NYT draft § March 9 - Dubai: rNews 0.1 2011 § lots of feedback, changes and additions § June 9 - Berlin: rNews 0.5 § June 28: rNews 0.6 § September 6: rNews 0.7 [aligned w/ schema.org] § October 7 - Vienna: rNews 1.0 22
  • 27. Feedback we incorporated... § In Person • 3 Meetups: New York, Berlin, London • Over a dozen one-on-one meetings with leading media and technology companies. § Online • Rnews.org forum • Numerous Blog Posts § In The Standard’s Community • W3C Community Group • Media Standards Trust 27
  • 28. Feedback we incorporated... point circle latitude elevation longitude polygon altitude box line Geo Location Coordinates 28
  • 29. Feedback we incorporated... editor Person NewsItem 29
  • 30. rNews Benefits Or Why You Should Care About rNews
  • 31. Benefit #1: Better Links With Structured Data No Structured Data 31
  • 32. Benefit #2: Better Analytics Javascript can extract richer news metadata Analytics per item, not just per page 32
  • 33. Benefit #3: Better Ad Placement Leverage metadata not just text Avoid unfortunate juxtapositions 33
  • 34. rNews as a news API Level the Playing Field Encourage Open Innovation 34
  • 35. 35
  • 36. How Can You Help Us Get to rNews 1.0? § Check out the rNews 0.7 spec § Mark up some pages using rNews § Extract rNews properties using your favourite distiller § Dream up The Next Metadata Killer App™ Let us know what you think Let us know how we can help @smyles • @agebhard • @kansandhaus 36