SlideShare a Scribd company logo
1 of 26
Calais
SDFORUM / Semantic Web SIG
Sep 3, 2008
Calais?
ClearForest
              • Founded in 1998 by text analytics
                pioneers

              • A software organization that enables
                Intelligent Information

              • Enterprise and government customers
              • Led the market in the establishment of
                unstructured text as a key corporate
                asset

              • Acquired by Reuters June 2007
              • Offices: Boston, Israel
Today:
Toolkit for Building Next Generation Solutions
Semantic Web and Advertising




                                   Right Offer



        +                      =
                                   Right Way

                                   Right Person

                                   Right Time
The Real World



• Most advertising driving content is text
• Most of it isn’t semantically enabled
• Most of it won’t be semantically enabled


• Why: Latency, cost and short shelf-life
Calais’ Piece of the Puzzle

         Unstructured Documents                      • A semantic metadata
          (Text / HTML / XML)
                                                       generation service that extracts
                                                       entities, facts and events from
                 Calais                                unstructured text

                                                     • Two new capabilities: topics &
  Named
                    Facts              Events
                                                       relevance
  Entities

                                     Management
                                                     • Available for commercial or
   People,          Position,
 Companies,         Alliance,
                                     Change, IPO,
                                     Labor Action,
                                                       non-commercial use up to
 Geographies,     Education,
   Albums,           Political
                                       Sporting,
                                     Entertainment
                                                       40,000 times per day
 Authors, etc.   Affiliation, etc.
                                          etc.
<Topic>M&A</Topic>

                                                             <Company>Reuters</Company>

Reuters Announced the Acquisition of ClearForest             <Company>ClearForest Ltd.</Company>
New York - April 30, 2007
                                                             <Acquisition offset=quot;494quot; length=quot;130quot;>
Reuters, the global information company, has entered           <Company_Acquirer>Reuters</Company_Acquirer>
into an agreement to acquire all of the outstanding            <Company_Acquired>ClearForest Ltd.</Company_Acquired>
shares of ClearForest Ltd., a privately held provider of       <Status>Planned</Status>
Text Analytics solutions, whose tagging platform and         </Acquisition>
analytical products allow clients to derive precise
business information from huge amounts of textual            <Product>Text Analytic Solution </Product>
content.
                                                             <Company>ClearForest Ltd.</Company>
ClearForest has received sufficient shareholder approval
to complete the transaction, which is expected to close      <Company>Reuters</Company>
in approximately 30 days, subject to customary closing
conditions. The financial terms were not disclosed.          <Country>United States</Country>
Reuters plans to retain and continue to work with the
existing management team and their highly skilled            <Country>Israel</Country>
workforces in the US and Israel. It also plans to continue
to support existing products and customers.                  <Company>Reuters</Company>
Reuters believes that search will be a pivotal element to
the future of how financial information is sourced and       <Person>Gerry Campbell</Person>
consumed. As part of its drive into this space, Reuters
has created a new strategic group and appointed Gerry        <ManagementChange offset=quot;2789quot; length=quot;92quot;>
Campbell, who will oversee the integration of                  <Person>Gerry Campbell</Person>
ClearForest and drive this innovation.                         <Company>Reuters</Company>
                                                               <Action>Enters</Position>
                                                             </ManagementChange>
What Calais Understands Today
• Entities
   –   City , Company , Continent , Country , Currency , EmailAddress , EntertainmentAwardEvent ,
       Facility , FaxNumber , Holiday , IndustryTerm , MarketIndex , MedicalCondition , Movie ,
       MusicAlbum , MusicGroup , NaturalDisaster , NaturalFeature , Organization , Person ,
       PhoneNumber , Product , ProvinceOrState , PublishedMedium , RadioProgram , RadioStation ,
       Region , SportsEvent , SportsGame , Technology , TVShow , TVStation , URL



• Events & Facts
   –   Acquisition , Alliance , AnalystEarningsEstimate , AnalystRecommendation , Bankruptcy ,
       BusinessRelation , Buybacks , CompanyAffiliates , CompanyCustomer ,
       CompanyEarningsAnnouncement , CompanyEarningsGuidance , CompanyInvestment ,
       CompanyLegalIssues , CompanyLocation , CompanyMeeting , CompanyReorganization ,
       CompanyTechnology , ConferenceCall , CreditRating , FamilyRelation , IPO , JointVenture ,
       ManagementChange , Merger , PersonAttributes , PersonCommunication , PersonEducation ,
       PersonPolitical , PersonPoliticalPast , PersonProfessional, PersonTravel , Quotation , StockSplit


• Topics
   –   Business, technology, health, sports. etc. – Significant growth planned

• Growing
   –   10 – 15 new concepts are added every couple of months
Live Example

Viewer Demo


Gnosis Demo
Extending Calais’ Reach
  More than just a web service – a growing collection of tools
    and applications to make it valuable in the real world

                                                            FeedShaver
                                                             Wirecatch
                                                            LinkedFacts
                             NET.                           Powerhouse
          UIMA               JAVA                           RSS Tagger
          Drupal             Ruby                            TopBraid
        WordPress             PHP               Gnosis      And more…
         Content         Development            Browser
                                                            Applications
     Management Tools   Tools & Libraries      Extensions


                                      Calais
How Calais is Being Used Today
• Mail & Guardian Online is using Calais to consolidate multiple
  content sources into sections and provide enhanced navigation in
  those news sections.
How Calais is Being Used Today
• Gist Automatically aggregates multiple news sources and automatically slots them
  into topic, etc.
How Calais is Being Used Today
• A few other examples

  –   Event based monitoring
  –   Investing and risk assessment
  –   Various topical syndication plays
  –   Intelligent RSS aggregators
  –   Automated Micro-Sites
A Question for Discussion Later..
• Where are the advertising people?

  – Almost 6,000 registered users
  – Dozens of deployed applications
  – Close to 1,000,000 uses per day
Making it Applicable to Advertising
• Disclaimer….
• What do we need for effective advertising?


• Four key components
  –   Something to sell
  –   Contextual Framework (keywords…?)
  –   Knowledge about the potential buyer &
  –   Knowledge of the buyers behavior
Context
• Calais can provide context – not just keywords
• Event & relationship detection is your friend
• Examples
  –   Sporting Event (& team)
  –   Album Release (& artist)
  –   Management Change (& person, company, position)
  –   Family Relation
  –   Person Political
  –   Quotes
Knowing the Buyer & Behavior
• We have two fundamental tools


  – Profiles & other volunteered information
  – Behavioral breadcrumbs

  – Calais allows you to create a much richer behavioral
    profile of the consumer – a contextual profile

  – Example: What kinds of content do they consume?
     • Sports, business, technology, health, lifestyle
     • What people do they read about, what companies?
Five Ideas
• Again…


  – I am not an advertising guy

  – Let’s get the discussion started
Context-Driven Ad Placement
• Moving beyond keywords


  – Can we use the semantic metadata generated by Calais
    to create richer context for placing an correctly?

  – For example – sporting events, album releases
Topic Hubs & Microsites
“Aboutness” and Relevancy
    FAA outage reveals odd practises<
When a computer glitch at a Federal Aviation Administration centre caused widespread airline delays this week, it served as a reminder that the US flight system is waiting for a modernising overhaul. But it also appears
the FAA's management of its existing technologies falls short of standards in other vital sectors. By using computing practises that would be considered poor in credit card networks or power plant operators, for example,
the FAA was vulnerable to a problem caused when new software was loaded at the Atlanta centre that distributes flight plans. Because the FAA relies on just two computing systems, one in Atlanta and one in Salt Lake
City, to handle that chore for the entire nation, the software glitch all but sank the system on Tuesday. The Salt Lake centre remained up and served as a backup, but it became overloaded by information coming from
airlines. More than 600 flights were delayed from Atlanta all the way to Boston and Chicago. A failure at the same Atlanta centre caused major delays across the East Coast in June 2007. Such breakdowns often can be
prevented with sufficient redundancy, or enough different computers and communication channels to handle the same workload in an emergency. Redundancy is so critical for power and water utilities that they can be
fined hundreds of thousands of dollars a day if they're found insufficiently prepared - and $1-million (about R8-million) per day if they're found to be wilfully negligent. 'In the industries I work in, if you have something that
critical, you generally build more redundancy,' said Jason Larsen, a security researcher with consultancy IOActive who previously spent five years at Idaho National Laboratory examining electrical plants' control
systems. 'If this (FAA outage) happened at a power plant, I'd be telling them to open up their checkbook and expect to be fined.' FAA spokesperson Tammy Jones stressed that these types of problems 'don't happen on
a mass scale or a regular basis,' and noted that the FAA handles 50 000 to 60 000 fights a day. And flying on US airlines has never been safer. 'The system is working,' she said. 'We are making sure people are getting
from one place to another.' Basil Barimo, vice president of operations and safety for the Air Transport Association of America, a trade association that represents the nation's largest carriers, says the fundamental
problem is that the FAA still relies on outdated technology, including a radar-based control system designed in the 1940s and '50s. Barimo is optimistic that the FAA's NextGen modernisation program - a $15-billion-plus
upgrade to satellite-based technology that will take nearly 20 years to complete - will help make more efficient use of the nation's airspace and safely allow more planes in the sky. At the Atlanta centre that saw this
week's failure, the National Airspace Data Interchange Network computer has been owned and operated by the FAA since the 1980s, after the Dutch company that developed it went out of business. The network is
being upgraded, and will have much more memory, process data much more quickly and be more robust and 'fault-tolerant.' 'We should see significant improvements by the end of September ... which should prevent
the type of problem we had on Tuesday,' said FAA spokesperson Laura Brown. The agency also is considering adding a third backup site for that and other systems at a technology centre in New Jersey, but no final
decisions have been made, she added. However, Doug Church, a spokesperson for the National Air Traffic Controllers Association - a union that has been locked in a contract dispute with the FAA since 2006 - argues
that the agency has tried to focus on future technology to deflect its lack of diligence in maintaining its current systems. Not only did Church cite the agency's lack of a 'safety net of redundancy,' but he also pointed to its
'fix-on-fail' policy of waiting for something to break before addressing a problem. Indeed, in December, the agency exempted its computer maintenance personnel from having to perform some periodic certification
checks as required by government handbooks for technical equipment. The FAA said that would eliminate unnecessary certifications that historically had little or no effect on total system performance and safety. And a
2006 report from the Government Accountability Office had found support for the idea in some instances. But computing experts say they often advise private companies to reject that approach. 'It's common, you see it
in retail too - it's the whole 'don't fix it if it ain't broke' thing,' said Branden Williams, director of a unit of VeriSign that assesses the security of retailers' payment systems. 'It's unfortunate because it's very reactive, and it
typically winds up costing you more. If you do fix-on-fail, it usually costs you more.' Of course, there's a difference between a private company's outage that delays your DVD order, and one at the agency administering
airline traffic. And such events have happened to the FAA multiple times. Communications between an air traffic control centre in Memphis, which directs planes passing through a 250-mile radius from the city, and an
unknown number of airplanes were disrupted this month when a car struck a utility pole, severing a fibre-optic cable. Last September, the same centre lost all its communications and some air traffic controllers had to
use their personal cell phones to route planes out of the seven-state area. The FAA blamed that outage on the failure of a major AT&T phone line. In May, the FAA system that issues preflight notices to pilots about
runway, equipment and security issues went down for about a day when a server crashed and the backup operated too slowly to be effective. The database was not able to issue updates or new notices, but pilots
continued to receive relevant information from local air traffic controllers and through alternate systems. After this week's outage, Paul Proctor, a Gartner analyst focused on security and regulatory compliance for large
corporations, said it appeared that the FAA didn't deploy the flight-plan computers with nearly as much redundancy as big companies generally have in systems critical to their operations. 'You need to do a good analysis
about whether this is acceptable risk,' Proctor said. 'One of the things the government is betting on is the fact that if there's ... a failure, it's not a safety issue.' Sid McGuirk, associate professor and coordinator of the air
traffic management program at Embry-Riddle Aeronautical University in Daytona Beach, Fla., believes that given the budget realities facing the FAA, the agency has maintained a good balance. It keeps the system
running efficiently without compromising safety, said McGuirk, a former air traffic controller and FAA manager for 35 years. 'From time to time, we are going to have a glitch, but it's a tradeoff,' he said. 'Would I like to see
more modern equipment in the system? Sure. But most folks would not want to see their taxes tripled to pay for new technology every two years.'
“Aboutness” and Relevancy
 FAA outage reveals odd practises<

• What’s in that?
  – Dozens of people, places, things
  – Dozens of quotes
  – Hundreds? of keywords


• What’s Important?
  –   Federal Aviation Administration – 0.781
  –   Atlanta – 0.773
  –   United States – 0.443
  –   Boston – 0.401
Mashup Ads
• Context-enhanced ads


  – Can we use metadata generated by Calais (places,
    events, etc) to create customized ads?

  – Mashupads from Dapper are a start – but they rely on
    structure.
Contextual Profiling
• Can we create much richer customer profiles based
  on behavior?


  – What people do I read about?
  – What geographies do I read about?
  – How much time do I spend reading business news vs.
    lifestyle news?
• www.opencalais.com

 – Gallery – code and applications examples
 – Forums
 – Documentation

More Related Content

What's hot

How to Reveal Hidden Relationships in Data and Risk Analytics
How to Reveal Hidden Relationships in Data and Risk AnalyticsHow to Reveal Hidden Relationships in Data and Risk Analytics
How to Reveal Hidden Relationships in Data and Risk AnalyticsOntotext
 
Search Engines After The Semanatic Web
Search Engines After The Semanatic WebSearch Engines After The Semanatic Web
Search Engines After The Semanatic Websamar_slideshare
 
Campaign for Richer Metadata
Campaign for Richer MetadataCampaign for Richer Metadata
Campaign for Richer MetadataCrossref
 
Open Data and News Analytics Demo
Open Data and News Analytics DemoOpen Data and News Analytics Demo
Open Data and News Analytics DemoOntotext
 
Faceted Navigation of User-Generated Metadata (Calit2 Rescue Seminar Series 2...
Faceted Navigation of User-Generated Metadata (Calit2 Rescue Seminar Series 2...Faceted Navigation of User-Generated Metadata (Calit2 Rescue Seminar Series 2...
Faceted Navigation of User-Generated Metadata (Calit2 Rescue Seminar Series 2...Bradley Allen
 
Introduction to Linked Data 1/5
Introduction to Linked Data 1/5Introduction to Linked Data 1/5
Introduction to Linked Data 1/5Juan Sequeda
 
Adding Semantic Edge to Your Content – From Authoring to Delivery
Adding Semantic Edge to Your Content – From Authoring to DeliveryAdding Semantic Edge to Your Content – From Authoring to Delivery
Adding Semantic Edge to Your Content – From Authoring to DeliveryOntotext
 
Structured SEO Data: An overview and how to for Drupal
Structured SEO Data:  An overview and how to for DrupalStructured SEO Data:  An overview and how to for Drupal
Structured SEO Data: An overview and how to for Drupalcgmonroe
 
Building the Inform Semantic Publishing Ecosystem: from Author to Audience
Building the Inform Semantic Publishing Ecosystem: from Author to AudienceBuilding the Inform Semantic Publishing Ecosystem: from Author to Audience
Building the Inform Semantic Publishing Ecosystem: from Author to AudienceVital.AI
 
Boost your data analytics with open data and public news content
Boost your data analytics with open data and public news contentBoost your data analytics with open data and public news content
Boost your data analytics with open data and public news contentOntotext
 
Diving in Panama Papers and Open Data to Discover Emerging News
Diving in Panama Papers and Open Data to Discover Emerging NewsDiving in Panama Papers and Open Data to Discover Emerging News
Diving in Panama Papers and Open Data to Discover Emerging NewsOntotext
 
Making the Web searchable
Making the Web searchableMaking the Web searchable
Making the Web searchablePeter Mika
 
Why Semantics Matter? Adding the semantic edge to your content, right from au...
Why Semantics Matter? Adding the semantic edge to your content,right from au...Why Semantics Matter? Adding the semantic edge to your content,right from au...
Why Semantics Matter? Adding the semantic edge to your content, right from au...Ontotext
 
Structured SEO Data Overview and How To
Structured SEO Data Overview and How ToStructured SEO Data Overview and How To
Structured SEO Data Overview and How Tocgmonroe
 
Linked Data Presentation at TDWI Mpls
Linked Data Presentation at TDWI MplsLinked Data Presentation at TDWI Mpls
Linked Data Presentation at TDWI MplsJay Myers
 
WT - Web & Working of Search Engine
WT - Web & Working of Search EngineWT - Web & Working of Search Engine
WT - Web & Working of Search Enginevinay arora
 
Practical Applications of Semantic Web in Retail -- Semtech 2014
Practical Applications of Semantic Web in Retail -- Semtech 2014 Practical Applications of Semantic Web in Retail -- Semtech 2014
Practical Applications of Semantic Web in Retail -- Semtech 2014 Jay Myers
 
How search engines work
How search engines workHow search engines work
How search engines workChinna Botla
 

What's hot (19)

How to Reveal Hidden Relationships in Data and Risk Analytics
How to Reveal Hidden Relationships in Data and Risk AnalyticsHow to Reveal Hidden Relationships in Data and Risk Analytics
How to Reveal Hidden Relationships in Data and Risk Analytics
 
Search Engines After The Semanatic Web
Search Engines After The Semanatic WebSearch Engines After The Semanatic Web
Search Engines After The Semanatic Web
 
Campaign for Richer Metadata
Campaign for Richer MetadataCampaign for Richer Metadata
Campaign for Richer Metadata
 
Open Data and News Analytics Demo
Open Data and News Analytics DemoOpen Data and News Analytics Demo
Open Data and News Analytics Demo
 
Faceted Navigation of User-Generated Metadata (Calit2 Rescue Seminar Series 2...
Faceted Navigation of User-Generated Metadata (Calit2 Rescue Seminar Series 2...Faceted Navigation of User-Generated Metadata (Calit2 Rescue Seminar Series 2...
Faceted Navigation of User-Generated Metadata (Calit2 Rescue Seminar Series 2...
 
Introduction to Linked Data 1/5
Introduction to Linked Data 1/5Introduction to Linked Data 1/5
Introduction to Linked Data 1/5
 
Adding Semantic Edge to Your Content – From Authoring to Delivery
Adding Semantic Edge to Your Content – From Authoring to DeliveryAdding Semantic Edge to Your Content – From Authoring to Delivery
Adding Semantic Edge to Your Content – From Authoring to Delivery
 
Structured SEO Data: An overview and how to for Drupal
Structured SEO Data:  An overview and how to for DrupalStructured SEO Data:  An overview and how to for Drupal
Structured SEO Data: An overview and how to for Drupal
 
Building the Inform Semantic Publishing Ecosystem: from Author to Audience
Building the Inform Semantic Publishing Ecosystem: from Author to AudienceBuilding the Inform Semantic Publishing Ecosystem: from Author to Audience
Building the Inform Semantic Publishing Ecosystem: from Author to Audience
 
Boost your data analytics with open data and public news content
Boost your data analytics with open data and public news contentBoost your data analytics with open data and public news content
Boost your data analytics with open data and public news content
 
Diving in Panama Papers and Open Data to Discover Emerging News
Diving in Panama Papers and Open Data to Discover Emerging NewsDiving in Panama Papers and Open Data to Discover Emerging News
Diving in Panama Papers and Open Data to Discover Emerging News
 
Information Update Feb 2008
Information Update Feb  2008Information Update Feb  2008
Information Update Feb 2008
 
Making the Web searchable
Making the Web searchableMaking the Web searchable
Making the Web searchable
 
Why Semantics Matter? Adding the semantic edge to your content, right from au...
Why Semantics Matter? Adding the semantic edge to your content,right from au...Why Semantics Matter? Adding the semantic edge to your content,right from au...
Why Semantics Matter? Adding the semantic edge to your content, right from au...
 
Structured SEO Data Overview and How To
Structured SEO Data Overview and How ToStructured SEO Data Overview and How To
Structured SEO Data Overview and How To
 
Linked Data Presentation at TDWI Mpls
Linked Data Presentation at TDWI MplsLinked Data Presentation at TDWI Mpls
Linked Data Presentation at TDWI Mpls
 
WT - Web & Working of Search Engine
WT - Web & Working of Search EngineWT - Web & Working of Search Engine
WT - Web & Working of Search Engine
 
Practical Applications of Semantic Web in Retail -- Semtech 2014
Practical Applications of Semantic Web in Retail -- Semtech 2014 Practical Applications of Semantic Web in Retail -- Semtech 2014
Practical Applications of Semantic Web in Retail -- Semtech 2014
 
How search engines work
How search engines workHow search engines work
How search engines work
 

Similar to Calais @ the SD Forum

Cloud Choices- Quantifying the Cost and Risk Implications of Cloud.pdf
Cloud Choices- Quantifying the Cost and Risk Implications of Cloud.pdfCloud Choices- Quantifying the Cost and Risk Implications of Cloud.pdf
Cloud Choices- Quantifying the Cost and Risk Implications of Cloud.pdfAmazon Web Services
 
What IT Transformation Really Means for the Enterprise
What IT Transformation Really Means for the EnterpriseWhat IT Transformation Really Means for the Enterprise
What IT Transformation Really Means for the EnterpriseTom Laszewski
 
Harness the Power of Crowdsourcing with Amazon Mechanical Turk (AIM351) - AWS...
Harness the Power of Crowdsourcing with Amazon Mechanical Turk (AIM351) - AWS...Harness the Power of Crowdsourcing with Amazon Mechanical Turk (AIM351) - AWS...
Harness the Power of Crowdsourcing with Amazon Mechanical Turk (AIM351) - AWS...Amazon Web Services
 
AI and IoT innovation - an industry focus
AI and IoT innovation - an industry focusAI and IoT innovation - an industry focus
AI and IoT innovation - an industry focusAmazon Web Services
 
Better Business From Exploring Ideas - AWS Summit Sydney 2018
Better Business From Exploring Ideas - AWS Summit Sydney 2018Better Business From Exploring Ideas - AWS Summit Sydney 2018
Better Business From Exploring Ideas - AWS Summit Sydney 2018Amazon Web Services
 
Hadoop’s Impact on Recruit Company
Hadoop’s Impact on Recruit CompanyHadoop’s Impact on Recruit Company
Hadoop’s Impact on Recruit CompanyRecruit Technologies
 
How Startups can leverage big data?
How Startups can leverage big data?How Startups can leverage big data?
How Startups can leverage big data?Rackspace
 
AWS Initiate - Tendências da Transformação Digital
AWS Initiate - Tendências da Transformação DigitalAWS Initiate - Tendências da Transformação Digital
AWS Initiate - Tendências da Transformação DigitalAmazon Web Services LATAM
 
Culture of Innovation - AWS Transformation Day Boston 2018
Culture of Innovation - AWS Transformation Day Boston 2018Culture of Innovation - AWS Transformation Day Boston 2018
Culture of Innovation - AWS Transformation Day Boston 2018Amazon Web Services
 
Trends in Digital Transformation (ARC212) - AWS re:Invent 2018
Trends in Digital Transformation (ARC212) - AWS re:Invent 2018Trends in Digital Transformation (ARC212) - AWS re:Invent 2018
Trends in Digital Transformation (ARC212) - AWS re:Invent 2018Amazon Web Services
 
Innovation for Everyone - Transformation Day Montreal 2018
Innovation for Everyone - Transformation Day Montreal 2018Innovation for Everyone - Transformation Day Montreal 2018
Innovation for Everyone - Transformation Day Montreal 2018Amazon Web Services
 
BIG Data & Hadoop Applications in Finance
BIG Data & Hadoop Applications in FinanceBIG Data & Hadoop Applications in Finance
BIG Data & Hadoop Applications in FinanceSkillspeed
 
How we think about Innovation at Amazon, AWS Startup Day Cape Town 2018
How we think about Innovation at Amazon, AWS Startup Day Cape Town 2018How we think about Innovation at Amazon, AWS Startup Day Cape Town 2018
How we think about Innovation at Amazon, AWS Startup Day Cape Town 2018Amazon Web Services
 
Better Business from Exploring Ideas - Modern Data Architectures on AWS
Better Business from Exploring Ideas - Modern Data Architectures on AWSBetter Business from Exploring Ideas - Modern Data Architectures on AWS
Better Business from Exploring Ideas - Modern Data Architectures on AWSAmazon Web Services
 
An Overview of Machine Learning on AWS
An Overview of Machine Learning on AWSAn Overview of Machine Learning on AWS
An Overview of Machine Learning on AWSAmazon Web Services
 
Data Is The New Oil: How Shell Has Become A Data-Driven And AI-Enabled Business
Data Is The New Oil: How Shell Has Become A Data-Driven And AI-Enabled Business Data Is The New Oil: How Shell Has Become A Data-Driven And AI-Enabled Business
Data Is The New Oil: How Shell Has Become A Data-Driven And AI-Enabled Business Bernard Marr
 
ProTips for Scaling AWS Training to Accelerate Adoption (DVC203) - AWS re:Inv...
ProTips for Scaling AWS Training to Accelerate Adoption (DVC203) - AWS re:Inv...ProTips for Scaling AWS Training to Accelerate Adoption (DVC203) - AWS re:Inv...
ProTips for Scaling AWS Training to Accelerate Adoption (DVC203) - AWS re:Inv...Amazon Web Services
 

Similar to Calais @ the SD Forum (20)

Cloud Choices- Quantifying the Cost and Risk Implications of Cloud.pdf
Cloud Choices- Quantifying the Cost and Risk Implications of Cloud.pdfCloud Choices- Quantifying the Cost and Risk Implications of Cloud.pdf
Cloud Choices- Quantifying the Cost and Risk Implications of Cloud.pdf
 
What IT Transformation Really Means for the Enterprise
What IT Transformation Really Means for the EnterpriseWhat IT Transformation Really Means for the Enterprise
What IT Transformation Really Means for the Enterprise
 
Keynote
KeynoteKeynote
Keynote
 
Interwoven Brochure
Interwoven BrochureInterwoven Brochure
Interwoven Brochure
 
Harness the Power of Crowdsourcing with Amazon Mechanical Turk (AIM351) - AWS...
Harness the Power of Crowdsourcing with Amazon Mechanical Turk (AIM351) - AWS...Harness the Power of Crowdsourcing with Amazon Mechanical Turk (AIM351) - AWS...
Harness the Power of Crowdsourcing with Amazon Mechanical Turk (AIM351) - AWS...
 
AI and IoT innovation - an industry focus
AI and IoT innovation - an industry focusAI and IoT innovation - an industry focus
AI and IoT innovation - an industry focus
 
Better Business From Exploring Ideas - AWS Summit Sydney 2018
Better Business From Exploring Ideas - AWS Summit Sydney 2018Better Business From Exploring Ideas - AWS Summit Sydney 2018
Better Business From Exploring Ideas - AWS Summit Sydney 2018
 
Hadoop’s Impact on Recruit Company
Hadoop’s Impact on Recruit CompanyHadoop’s Impact on Recruit Company
Hadoop’s Impact on Recruit Company
 
How Startups can leverage big data?
How Startups can leverage big data?How Startups can leverage big data?
How Startups can leverage big data?
 
Tendências na Transformação Digital
Tendências na Transformação DigitalTendências na Transformação Digital
Tendências na Transformação Digital
 
AWS Initiate - Tendências da Transformação Digital
AWS Initiate - Tendências da Transformação DigitalAWS Initiate - Tendências da Transformação Digital
AWS Initiate - Tendências da Transformação Digital
 
Culture of Innovation - AWS Transformation Day Boston 2018
Culture of Innovation - AWS Transformation Day Boston 2018Culture of Innovation - AWS Transformation Day Boston 2018
Culture of Innovation - AWS Transformation Day Boston 2018
 
Trends in Digital Transformation (ARC212) - AWS re:Invent 2018
Trends in Digital Transformation (ARC212) - AWS re:Invent 2018Trends in Digital Transformation (ARC212) - AWS re:Invent 2018
Trends in Digital Transformation (ARC212) - AWS re:Invent 2018
 
Innovation for Everyone - Transformation Day Montreal 2018
Innovation for Everyone - Transformation Day Montreal 2018Innovation for Everyone - Transformation Day Montreal 2018
Innovation for Everyone - Transformation Day Montreal 2018
 
BIG Data & Hadoop Applications in Finance
BIG Data & Hadoop Applications in FinanceBIG Data & Hadoop Applications in Finance
BIG Data & Hadoop Applications in Finance
 
How we think about Innovation at Amazon, AWS Startup Day Cape Town 2018
How we think about Innovation at Amazon, AWS Startup Day Cape Town 2018How we think about Innovation at Amazon, AWS Startup Day Cape Town 2018
How we think about Innovation at Amazon, AWS Startup Day Cape Town 2018
 
Better Business from Exploring Ideas - Modern Data Architectures on AWS
Better Business from Exploring Ideas - Modern Data Architectures on AWSBetter Business from Exploring Ideas - Modern Data Architectures on AWS
Better Business from Exploring Ideas - Modern Data Architectures on AWS
 
An Overview of Machine Learning on AWS
An Overview of Machine Learning on AWSAn Overview of Machine Learning on AWS
An Overview of Machine Learning on AWS
 
Data Is The New Oil: How Shell Has Become A Data-Driven And AI-Enabled Business
Data Is The New Oil: How Shell Has Become A Data-Driven And AI-Enabled Business Data Is The New Oil: How Shell Has Become A Data-Driven And AI-Enabled Business
Data Is The New Oil: How Shell Has Become A Data-Driven And AI-Enabled Business
 
ProTips for Scaling AWS Training to Accelerate Adoption (DVC203) - AWS re:Inv...
ProTips for Scaling AWS Training to Accelerate Adoption (DVC203) - AWS re:Inv...ProTips for Scaling AWS Training to Accelerate Adoption (DVC203) - AWS re:Inv...
ProTips for Scaling AWS Training to Accelerate Adoption (DVC203) - AWS re:Inv...
 

More from Krista Thomas

The OpenCalais Workshop at WeMedia 2010
The OpenCalais Workshop at WeMedia 2010The OpenCalais Workshop at WeMedia 2010
The OpenCalais Workshop at WeMedia 2010Krista Thomas
 
Simple OpenCalais Whitepaper
Simple OpenCalais WhitepaperSimple OpenCalais Whitepaper
Simple OpenCalais WhitepaperKrista Thomas
 
OpenCalais At The San Diego Software Industry Council
OpenCalais At The San Diego Software Industry CouncilOpenCalais At The San Diego Software Industry Council
OpenCalais At The San Diego Software Industry CouncilKrista Thomas
 
OpenCalais @ UC Berkeley Media Technology Summit 9/29/09
OpenCalais @ UC Berkeley Media Technology Summit 9/29/09OpenCalais @ UC Berkeley Media Technology Summit 9/29/09
OpenCalais @ UC Berkeley Media Technology Summit 9/29/09Krista Thomas
 
Open Calais @ Transparent Text
Open Calais @ Transparent TextOpen Calais @ Transparent Text
Open Calais @ Transparent TextKrista Thomas
 
Tague Semtech Keynote 2009
Tague Semtech Keynote 2009Tague Semtech Keynote 2009
Tague Semtech Keynote 2009Krista Thomas
 
Phase2 OpenPublish Presentation SF SemWeb Meetup, April 28, 2009
Phase2 OpenPublish Presentation SF SemWeb Meetup, April 28, 2009Phase2 OpenPublish Presentation SF SemWeb Meetup, April 28, 2009
Phase2 OpenPublish Presentation SF SemWeb Meetup, April 28, 2009Krista Thomas
 
Open Calais For SF And LA Meetups
Open Calais For SF And LA MeetupsOpen Calais For SF And LA Meetups
Open Calais For SF And LA MeetupsKrista Thomas
 
Open Calais Release 4.0
Open Calais Release 4.0Open Calais Release 4.0
Open Calais Release 4.0Krista Thomas
 
Final Calais For ONA
Final Calais For ONAFinal Calais For ONA
Final Calais For ONAKrista Thomas
 

More from Krista Thomas (12)

Ad.ly Introduction
Ad.ly IntroductionAd.ly Introduction
Ad.ly Introduction
 
San diego
San diegoSan diego
San diego
 
The OpenCalais Workshop at WeMedia 2010
The OpenCalais Workshop at WeMedia 2010The OpenCalais Workshop at WeMedia 2010
The OpenCalais Workshop at WeMedia 2010
 
Simple OpenCalais Whitepaper
Simple OpenCalais WhitepaperSimple OpenCalais Whitepaper
Simple OpenCalais Whitepaper
 
OpenCalais At The San Diego Software Industry Council
OpenCalais At The San Diego Software Industry CouncilOpenCalais At The San Diego Software Industry Council
OpenCalais At The San Diego Software Industry Council
 
OpenCalais @ UC Berkeley Media Technology Summit 9/29/09
OpenCalais @ UC Berkeley Media Technology Summit 9/29/09OpenCalais @ UC Berkeley Media Technology Summit 9/29/09
OpenCalais @ UC Berkeley Media Technology Summit 9/29/09
 
Open Calais @ Transparent Text
Open Calais @ Transparent TextOpen Calais @ Transparent Text
Open Calais @ Transparent Text
 
Tague Semtech Keynote 2009
Tague Semtech Keynote 2009Tague Semtech Keynote 2009
Tague Semtech Keynote 2009
 
Phase2 OpenPublish Presentation SF SemWeb Meetup, April 28, 2009
Phase2 OpenPublish Presentation SF SemWeb Meetup, April 28, 2009Phase2 OpenPublish Presentation SF SemWeb Meetup, April 28, 2009
Phase2 OpenPublish Presentation SF SemWeb Meetup, April 28, 2009
 
Open Calais For SF And LA Meetups
Open Calais For SF And LA MeetupsOpen Calais For SF And LA Meetups
Open Calais For SF And LA Meetups
 
Open Calais Release 4.0
Open Calais Release 4.0Open Calais Release 4.0
Open Calais Release 4.0
 
Final Calais For ONA
Final Calais For ONAFinal Calais For ONA
Final Calais For ONA
 

Recently uploaded

TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.Curtis Poe
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxLoriGlavin3
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteDianaGray10
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfLoriGlavin3
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxLoriGlavin3
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxLoriGlavin3
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningLars Bell
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxLoriGlavin3
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyAlfredo García Lavilla
 
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESSALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESmohitsingh558521
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 

Recently uploaded (20)

TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test Suite
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdf
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine Tuning
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESSALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 

Calais @ the SD Forum

  • 1. Calais SDFORUM / Semantic Web SIG Sep 3, 2008
  • 3. ClearForest • Founded in 1998 by text analytics pioneers • A software organization that enables Intelligent Information • Enterprise and government customers • Led the market in the establishment of unstructured text as a key corporate asset • Acquired by Reuters June 2007 • Offices: Boston, Israel
  • 4. Today: Toolkit for Building Next Generation Solutions
  • 5. Semantic Web and Advertising Right Offer + = Right Way Right Person Right Time
  • 6. The Real World • Most advertising driving content is text • Most of it isn’t semantically enabled • Most of it won’t be semantically enabled • Why: Latency, cost and short shelf-life
  • 7. Calais’ Piece of the Puzzle Unstructured Documents • A semantic metadata (Text / HTML / XML) generation service that extracts entities, facts and events from Calais unstructured text • Two new capabilities: topics & Named Facts Events relevance Entities Management • Available for commercial or People, Position, Companies, Alliance, Change, IPO, Labor Action, non-commercial use up to Geographies, Education, Albums, Political Sporting, Entertainment 40,000 times per day Authors, etc. Affiliation, etc. etc.
  • 8. <Topic>M&A</Topic> <Company>Reuters</Company> Reuters Announced the Acquisition of ClearForest <Company>ClearForest Ltd.</Company> New York - April 30, 2007 <Acquisition offset=quot;494quot; length=quot;130quot;> Reuters, the global information company, has entered <Company_Acquirer>Reuters</Company_Acquirer> into an agreement to acquire all of the outstanding <Company_Acquired>ClearForest Ltd.</Company_Acquired> shares of ClearForest Ltd., a privately held provider of <Status>Planned</Status> Text Analytics solutions, whose tagging platform and </Acquisition> analytical products allow clients to derive precise business information from huge amounts of textual <Product>Text Analytic Solution </Product> content. <Company>ClearForest Ltd.</Company> ClearForest has received sufficient shareholder approval to complete the transaction, which is expected to close <Company>Reuters</Company> in approximately 30 days, subject to customary closing conditions. The financial terms were not disclosed. <Country>United States</Country> Reuters plans to retain and continue to work with the existing management team and their highly skilled <Country>Israel</Country> workforces in the US and Israel. It also plans to continue to support existing products and customers. <Company>Reuters</Company> Reuters believes that search will be a pivotal element to the future of how financial information is sourced and <Person>Gerry Campbell</Person> consumed. As part of its drive into this space, Reuters has created a new strategic group and appointed Gerry <ManagementChange offset=quot;2789quot; length=quot;92quot;> Campbell, who will oversee the integration of <Person>Gerry Campbell</Person> ClearForest and drive this innovation. <Company>Reuters</Company> <Action>Enters</Position> </ManagementChange>
  • 9. What Calais Understands Today • Entities – City , Company , Continent , Country , Currency , EmailAddress , EntertainmentAwardEvent , Facility , FaxNumber , Holiday , IndustryTerm , MarketIndex , MedicalCondition , Movie , MusicAlbum , MusicGroup , NaturalDisaster , NaturalFeature , Organization , Person , PhoneNumber , Product , ProvinceOrState , PublishedMedium , RadioProgram , RadioStation , Region , SportsEvent , SportsGame , Technology , TVShow , TVStation , URL • Events & Facts – Acquisition , Alliance , AnalystEarningsEstimate , AnalystRecommendation , Bankruptcy , BusinessRelation , Buybacks , CompanyAffiliates , CompanyCustomer , CompanyEarningsAnnouncement , CompanyEarningsGuidance , CompanyInvestment , CompanyLegalIssues , CompanyLocation , CompanyMeeting , CompanyReorganization , CompanyTechnology , ConferenceCall , CreditRating , FamilyRelation , IPO , JointVenture , ManagementChange , Merger , PersonAttributes , PersonCommunication , PersonEducation , PersonPolitical , PersonPoliticalPast , PersonProfessional, PersonTravel , Quotation , StockSplit • Topics – Business, technology, health, sports. etc. – Significant growth planned • Growing – 10 – 15 new concepts are added every couple of months
  • 11. Extending Calais’ Reach More than just a web service – a growing collection of tools and applications to make it valuable in the real world FeedShaver Wirecatch LinkedFacts NET. Powerhouse UIMA JAVA RSS Tagger Drupal Ruby TopBraid WordPress PHP Gnosis And more… Content Development Browser Applications Management Tools Tools & Libraries Extensions Calais
  • 12. How Calais is Being Used Today • Mail & Guardian Online is using Calais to consolidate multiple content sources into sections and provide enhanced navigation in those news sections.
  • 13. How Calais is Being Used Today • Gist Automatically aggregates multiple news sources and automatically slots them into topic, etc.
  • 14. How Calais is Being Used Today • A few other examples – Event based monitoring – Investing and risk assessment – Various topical syndication plays – Intelligent RSS aggregators – Automated Micro-Sites
  • 15. A Question for Discussion Later.. • Where are the advertising people? – Almost 6,000 registered users – Dozens of deployed applications – Close to 1,000,000 uses per day
  • 16. Making it Applicable to Advertising • Disclaimer…. • What do we need for effective advertising? • Four key components – Something to sell – Contextual Framework (keywords…?) – Knowledge about the potential buyer & – Knowledge of the buyers behavior
  • 17. Context • Calais can provide context – not just keywords • Event & relationship detection is your friend • Examples – Sporting Event (& team) – Album Release (& artist) – Management Change (& person, company, position) – Family Relation – Person Political – Quotes
  • 18. Knowing the Buyer & Behavior • We have two fundamental tools – Profiles & other volunteered information – Behavioral breadcrumbs – Calais allows you to create a much richer behavioral profile of the consumer – a contextual profile – Example: What kinds of content do they consume? • Sports, business, technology, health, lifestyle • What people do they read about, what companies?
  • 19. Five Ideas • Again… – I am not an advertising guy – Let’s get the discussion started
  • 20. Context-Driven Ad Placement • Moving beyond keywords – Can we use the semantic metadata generated by Calais to create richer context for placing an correctly? – For example – sporting events, album releases
  • 21. Topic Hubs & Microsites
  • 22. “Aboutness” and Relevancy FAA outage reveals odd practises< When a computer glitch at a Federal Aviation Administration centre caused widespread airline delays this week, it served as a reminder that the US flight system is waiting for a modernising overhaul. But it also appears the FAA's management of its existing technologies falls short of standards in other vital sectors. By using computing practises that would be considered poor in credit card networks or power plant operators, for example, the FAA was vulnerable to a problem caused when new software was loaded at the Atlanta centre that distributes flight plans. Because the FAA relies on just two computing systems, one in Atlanta and one in Salt Lake City, to handle that chore for the entire nation, the software glitch all but sank the system on Tuesday. The Salt Lake centre remained up and served as a backup, but it became overloaded by information coming from airlines. More than 600 flights were delayed from Atlanta all the way to Boston and Chicago. A failure at the same Atlanta centre caused major delays across the East Coast in June 2007. Such breakdowns often can be prevented with sufficient redundancy, or enough different computers and communication channels to handle the same workload in an emergency. Redundancy is so critical for power and water utilities that they can be fined hundreds of thousands of dollars a day if they're found insufficiently prepared - and $1-million (about R8-million) per day if they're found to be wilfully negligent. 'In the industries I work in, if you have something that critical, you generally build more redundancy,' said Jason Larsen, a security researcher with consultancy IOActive who previously spent five years at Idaho National Laboratory examining electrical plants' control systems. 'If this (FAA outage) happened at a power plant, I'd be telling them to open up their checkbook and expect to be fined.' FAA spokesperson Tammy Jones stressed that these types of problems 'don't happen on a mass scale or a regular basis,' and noted that the FAA handles 50 000 to 60 000 fights a day. And flying on US airlines has never been safer. 'The system is working,' she said. 'We are making sure people are getting from one place to another.' Basil Barimo, vice president of operations and safety for the Air Transport Association of America, a trade association that represents the nation's largest carriers, says the fundamental problem is that the FAA still relies on outdated technology, including a radar-based control system designed in the 1940s and '50s. Barimo is optimistic that the FAA's NextGen modernisation program - a $15-billion-plus upgrade to satellite-based technology that will take nearly 20 years to complete - will help make more efficient use of the nation's airspace and safely allow more planes in the sky. At the Atlanta centre that saw this week's failure, the National Airspace Data Interchange Network computer has been owned and operated by the FAA since the 1980s, after the Dutch company that developed it went out of business. The network is being upgraded, and will have much more memory, process data much more quickly and be more robust and 'fault-tolerant.' 'We should see significant improvements by the end of September ... which should prevent the type of problem we had on Tuesday,' said FAA spokesperson Laura Brown. The agency also is considering adding a third backup site for that and other systems at a technology centre in New Jersey, but no final decisions have been made, she added. However, Doug Church, a spokesperson for the National Air Traffic Controllers Association - a union that has been locked in a contract dispute with the FAA since 2006 - argues that the agency has tried to focus on future technology to deflect its lack of diligence in maintaining its current systems. Not only did Church cite the agency's lack of a 'safety net of redundancy,' but he also pointed to its 'fix-on-fail' policy of waiting for something to break before addressing a problem. Indeed, in December, the agency exempted its computer maintenance personnel from having to perform some periodic certification checks as required by government handbooks for technical equipment. The FAA said that would eliminate unnecessary certifications that historically had little or no effect on total system performance and safety. And a 2006 report from the Government Accountability Office had found support for the idea in some instances. But computing experts say they often advise private companies to reject that approach. 'It's common, you see it in retail too - it's the whole 'don't fix it if it ain't broke' thing,' said Branden Williams, director of a unit of VeriSign that assesses the security of retailers' payment systems. 'It's unfortunate because it's very reactive, and it typically winds up costing you more. If you do fix-on-fail, it usually costs you more.' Of course, there's a difference between a private company's outage that delays your DVD order, and one at the agency administering airline traffic. And such events have happened to the FAA multiple times. Communications between an air traffic control centre in Memphis, which directs planes passing through a 250-mile radius from the city, and an unknown number of airplanes were disrupted this month when a car struck a utility pole, severing a fibre-optic cable. Last September, the same centre lost all its communications and some air traffic controllers had to use their personal cell phones to route planes out of the seven-state area. The FAA blamed that outage on the failure of a major AT&T phone line. In May, the FAA system that issues preflight notices to pilots about runway, equipment and security issues went down for about a day when a server crashed and the backup operated too slowly to be effective. The database was not able to issue updates or new notices, but pilots continued to receive relevant information from local air traffic controllers and through alternate systems. After this week's outage, Paul Proctor, a Gartner analyst focused on security and regulatory compliance for large corporations, said it appeared that the FAA didn't deploy the flight-plan computers with nearly as much redundancy as big companies generally have in systems critical to their operations. 'You need to do a good analysis about whether this is acceptable risk,' Proctor said. 'One of the things the government is betting on is the fact that if there's ... a failure, it's not a safety issue.' Sid McGuirk, associate professor and coordinator of the air traffic management program at Embry-Riddle Aeronautical University in Daytona Beach, Fla., believes that given the budget realities facing the FAA, the agency has maintained a good balance. It keeps the system running efficiently without compromising safety, said McGuirk, a former air traffic controller and FAA manager for 35 years. 'From time to time, we are going to have a glitch, but it's a tradeoff,' he said. 'Would I like to see more modern equipment in the system? Sure. But most folks would not want to see their taxes tripled to pay for new technology every two years.'
  • 23. “Aboutness” and Relevancy FAA outage reveals odd practises< • What’s in that? – Dozens of people, places, things – Dozens of quotes – Hundreds? of keywords • What’s Important? – Federal Aviation Administration – 0.781 – Atlanta – 0.773 – United States – 0.443 – Boston – 0.401
  • 24. Mashup Ads • Context-enhanced ads – Can we use metadata generated by Calais (places, events, etc) to create customized ads? – Mashupads from Dapper are a start – but they rely on structure.
  • 25. Contextual Profiling • Can we create much richer customer profiles based on behavior? – What people do I read about? – What geographies do I read about? – How much time do I spend reading business news vs. lifestyle news?
  • 26. • www.opencalais.com – Gallery – code and applications examples – Forums – Documentation

Editor's Notes

  1. <number>First draft, with beautiful work by Sagit. Note that ALL text is editable.
  2. <number>
  3. Background slide updated to reflect more recent background. Note that Bloomberg logo is here. Should be okay for internal use.
  4. Tom has reviewed this slide and words.
  5. Tom asks that you add this extra slide.
  6. The first of three examples from openCalais developers. Note that all 3 show tag clouds; Tom is going to find an alternative that shows a different implementation.
  7. This would should stay in; it was built by a TR developer.