Treparel
Delftechpark 26
2628 XH Delft
The Netherlands
www.treparel.com

Dr. Anton Heijs, CEO/CTO
anton@treparel.com
September 24, 2012

Analysing large patent portfolios: Nothing remains uncovered

Agenda
• Introduction to Treparel &
  KMX Patent Analytics
• How to deal with Big Data
  in patents?
• Landscape analysis of
  large patent sets
• Use Cases: SWOT analysis

Fig 1: Patent landscape of ebook technologies
Treparel KMX – All rights reserved 2012   www.treparel.com                                  2
About Treparel
         • Treparel is an innovative technology solution provider of
                – Big Data Text Analytics and Visualization technology
                – Patent Analytics solutions
         • KMX is an integrated data analysis toolset which provides
                – Fast and accurate insights into large unstructured document sets
                  to allow companies to make better-informed decisions.
         • KMX software platform
                – Strong focus on R&D with a university ecosystem
                – Over 30 man-years of software development
                – Used by knowledge-driven organizations in technology, chemical,
                  and life sciences
         • Based in Delft, The Netherlands since 2006.
IP landscape trend example: 1970-2007
[Figure: four IP landscape panels, one each for Alstom, GE, Dong Feng, and Siemens]
The importance of analytics in IP
[Figure: investment ($) plotted against time (years) across the stages Research, Discovery, Development, Market Launch, and Life Cycle Management. Labelled elements: Research Analysis; Patent Analysis; Market, Legal and Competitive Analysis; Decreasing Return; Increasing Return.]
Economic changes in the last decades
         • Globalization:
            – More companies, increased competition, lower margins
            – Innovation for shorter product life cycles
            – Manufacturing has become a commodity through outsourcing to
              low-wage countries
         • The cost of R&D has increased, but competition leads to price
           erosion
         • Return on R&D investment drives the importance of value
           creation from IP
         • The competitive edge of companies shifts from
           production-based to knowledge-based




The Intellectual Economy
         Past: Selling products finances R&D
[Diagram: value chain of Research, Discovery; Development, Manufacturing; Marketing, Sales; Life Cycle Management. Revenues at the end of the chain provide the return on investments.]
         Today: Value creation and revenue generation from IP (to finance R&D)
[Diagram: the same value chain. Innovations and IP Licensing generate revenues in addition to the revenues at the end of the chain; both provide the return on investments.]
Adapting to economic developments

The New Reality
• IP strategy over time
    – Increase the value of the IP portfolio
    – Maximize the ROI from R&D
• License management
    – Analyze the impact of economic developments
      and their effect on the IP strategy

Value from IP portfolio
• Protect market share
• Save royalty costs via cross-licensing
• Generate income from royalties
• Benefit from licensing out from joint ventures or spin-offs

Drivers
• The number of patent filings is growing
• Growing need to drive revenues from
  licensing to fund R&D investment
Trends and implications for the future
         • Economies depend more strongly on each other
         • Innovation is a driver of economic growth
         • Globalization generates many patents from China
           and South Korea
         • The number and complexity of patent filings are
           growing

         Growing need for fast and accurate analysis of
           large sets of patents
Getting more insight using fewer experts
               Big Data Paradox:
               • Limited (human) resources available for in-depth analysis
               • Growing need for data-driven decisions

               Growing Data, Faster Insights: Big Data Analytics
[Figure: four dimensions of Big Data: Velocity, Volume, Variety, Complexity]
The Big Data paradox: more data but less knowledge (the 'Information Gap')
[Figure: line chart, x-axis 1990 to 2020, y-axis 0 to 10, with curves labelled "Available data", "Data driven decisions", and "Available experts for supporting decision making"; the "Information Gap" is marked on the chart.]
Information democracy:
         Information Creators and Consumers
         • Creators
                – Define and prepopulate analysis pipelines and test them on
                  the data
                – Deploy these pipelines using cloud computing
                       • Required computing capacity can scale up with the business /
                         analytical needs
         • Consumers
                – Receive predefined analytical reports
                – Share feedback and input on the results to optimize
                  the analysis

                   Sharing information empowers collaboration
The traditional IP search and analysis
[Diagram: an Information Consumer sends a Request for information to an Information Creator, who uses Tools on a Database (the Data) and returns Results.]

   The traditional approach:
   • Each search/analysis request is focussed on a specific question from one user
   • When the number of requests increases, more human searchers are required
   • When the searches involve analysis of more patents, more time is required
   • Very specific searches cannot be automated
   • Analysis of large document sets can be automated, which is an opportunity to
     analyse more and become more competitive
Information democracy:
         Proactive approach to search and analysis
[Diagram: Patent, Research, and Marketing databases (Data) feed a running analytics pipeline set up by the Information Creators, which pushes information to the Information Consumers (Analyst, Business User).]

   Liberate the information search and facilitate the discovery process:
   • The knowledge creator:
        • defines the analysis pipeline and tests it on the data
        • deploys the analyses using cloud computing resources
   • Information consumers get direct access for in-depth analyses
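
A minimal sketch of the push model described above, assuming a pipeline is simply a chain of analysis steps. The function names, the record fields (`title`, `owner`) and the sample records are hypothetical and are not part of KMX; cloud deployment is not shown.

```python
from collections import Counter


def make_pipeline(*steps):
    """Chain analysis steps; each step receives the previous step's output."""
    def run(documents):
        result = documents
        for step in steps:
            result = step(result)
        return result
    return run


def keep_matching(keyword):
    """Build a step that keeps documents whose title mentions the keyword."""
    def step(documents):
        return [d for d in documents if keyword in d["title"].lower()]
    return step


def count_per_owner(documents):
    """Final step: aggregate into a small report, here a count per owner."""
    return Counter(d["owner"] for d in documents)


# Hypothetical records; a real pipeline would read from the databases.
SAMPLE_DOCUMENTS = [
    {"title": "Electrophoretic display panel", "owner": "Company A"},
    {"title": "Flexible display driver", "owner": "Company A"},
    {"title": "Battery charging circuit", "owner": "Company B"},
    {"title": "Display backlight control", "owner": "Company B"},
]

# The creator defines the pipeline once and tests it on sample data ...
display_report = make_pipeline(keep_matching("display"), count_per_owner)

# ... and consumers receive the result without issuing a request of their own.
report = display_report(SAMPLE_DOCUMENTS)
```

Because the pipeline is an ordinary function, the same definition can be re-run whenever the underlying data changes.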
Liberate more information using new
         technologies
         • Enable a small group of experts with tools to set up IP
           analysis pipelines
            – Extend the search-on-request approach with a proactive
               analysis approach on large document sets
            – Use analysis pipelines to auto-generate visualizations
               in a browser
            – Invest in new technology coming from Big Data
               Analytics
         • Give a large group of users access to the internal
           web pages, providing them with rich statistical
           information and interactive visualizations


Combining proactive analysis with traditional IP search and discovery
[Diagram: the Patent, Research, and Marketing databases feed a running analytics pipeline that serves the Analyst and the Business User, who exchange information. Alongside it, a Personal Request is answered with Tailor-made Results using Tools on a Database. Column labels: Data, Information Creators, Information, Information Consumers.]
Performing small to large scale SWOT analysis
SWOT analysis example
• What are the most important patents?
• Who owns them?
• What is the growth of patents by:
    • Technology?
    • Owner?
    • Country?
    • Year?
[Diagram: Queries on the Patent Database return 5000 patents; Ranking narrows these to 1000 patents and Filtering to 500 patents, giving the Business User an overview and details.]
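
The funnel on this slide can be sketched as a rank-then-filter step, assuming each patent already carries a relevance score. The record fields (`id`, `relevance`, `year`), the synthetic data and the year filter are hypothetical; the slide does not say how KMX scores or filters patents.

```python
import heapq


def rank_top(patents, k):
    """Ranking: keep the k patents with the highest relevance score."""
    return heapq.nlargest(k, patents, key=lambda p: p["relevance"])


def filter_by(patents, keep):
    """Filtering: keep only the patents that satisfy the predicate."""
    return [p for p in patents if keep(p)]


# Synthetic query result standing in for the "5000 patents" on the slide.
query_result = [
    {"id": i, "relevance": i / 5000, "year": 2000 + i % 10}
    for i in range(5000)
]

ranked = rank_top(query_result, 1000)                        # 5000 -> 1000
shortlist = filter_by(ranked, lambda p: p["year"] >= 2005)   # 1000 -> 500
```

With this synthetic data the shortlist happens to hold 500 records, mirroring the slide; real counts depend on the data and on the filter chosen.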

Auto reporting & analysis for multiple users

  • Reporting of aggregated
    results:
         – Pie & bar charts
  • Providing overview of the
    subject:
         – landscape visualization
  • Enabling rich interaction




Use Case 1: SWOT analysis of ebooks
• Perform a proactive SWOT analysis of the ebooks market:
   Amazon Kindle – Apple – Samsung/Google and other players

• Who owns what?
     • What can we learn from the competitive technology landscape?
• Why?
     • Determine a company/technology position and opportunities
• We do this in KMX by:
     1. Query to get patents on electronic paper technology
     2. Landscape analysis
     3. Classification/Ranking
     4. Filter and select subset
     5. Iterate steps 1 to 4




Fig 2: Overview landscape visualization of 4257 patents
Analysis of ebook technology




Fig 3: Overview landscape visualization of 4257 patents
Use document classification to rank the patents
Purple = most important patents
Red = least relevant patents

Fig 4: Ranked patents using a classifier for ebook technology (in purple, the selection of relevant patents for deeper analysis)
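
The slides do not say which classifier KMX uses. As an illustration only, the sketch below ranks texts with a small multinomial Naive Bayes model trained on a handful of labelled examples; the function names and the training texts are hypothetical, and it assumes at least one relevant and one irrelevant example.

```python
import math
from collections import Counter


def tokenize(text):
    return text.lower().split()


def train(labelled):
    """labelled: list of (text, is_relevant) pairs marked by an expert."""
    word_counts = {True: Counter(), False: Counter()}
    doc_counts = Counter()
    for text, is_relevant in labelled:
        word_counts[is_relevant].update(tokenize(text))
        doc_counts[is_relevant] += 1
    return word_counts, doc_counts


def log_odds(text, model):
    """Log-odds that a text is relevant; higher means more relevant."""
    word_counts, doc_counts = model
    vocabulary = set(word_counts[True]) | set(word_counts[False])
    score = math.log(doc_counts[True] / doc_counts[False])
    for label, sign in ((True, 1), (False, -1)):
        total = sum(word_counts[label].values())
        for word in tokenize(text):
            if word in vocabulary:  # words never seen in training are ignored
                # Add-one smoothing avoids zero probabilities.
                p = (word_counts[label][word] + 1) / (total + len(vocabulary))
                score += sign * math.log(p)
    return score


def rank(texts, model):
    """Order texts from most to least relevant."""
    return sorted(texts, key=lambda t: log_odds(t, model), reverse=True)


# Hypothetical training examples.
model = train([
    ("electronic paper display with electrophoretic ink", True),
    ("ebook reader page turning display", True),
    ("combustion engine fuel injection", False),
    ("gas turbine blade cooling", False),
])
candidates = [
    "turbine engine cooling system",
    "electrophoretic display for ebook reader",
]
ranking = rank(candidates, model)
```

The continuous score is what makes a colour scale such as the purple-to-red one in Fig 4 possible, rather than a plain yes/no label.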
Drill deeper in the data to learn more




                                          Fig 5: Landscape visualization going from 4257 to 1049 to 369 patents



         After removing the irrelevant patents we use filtering to
         determine:
         • Who are the important players (assignees, inventors)?
         • Where are the important patents filed (countries)?
         • What is the trend over time (growth of patents over the years)?
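
The three questions above amount to simple aggregations over the filtered set. A minimal sketch, assuming each patent record has `assignee`, `country` and `year` fields; the field names and sample records are hypothetical.

```python
from collections import Counter
from itertools import accumulate


def count_by(patents, field):
    """Count patents per value of one field (assignee, country, ...)."""
    return Counter(p[field] for p in patents)


def cumulative_by_year(patents):
    """Growth over time: running total of patents per filing year."""
    per_year = count_by(patents, "year")
    years = sorted(per_year)
    totals = accumulate(per_year[y] for y in years)
    return dict(zip(years, totals))


# Hypothetical filtered set.
SAMPLE = [
    {"assignee": "Company A", "country": "US", "year": 2008},
    {"assignee": "Company A", "country": "US", "year": 2009},
    {"assignee": "Company B", "country": "CN", "year": 2009},
    {"assignee": "Company A", "country": "KR", "year": 2010},
    {"assignee": "Company C", "country": "CN", "year": 2010},
]

players = count_by(SAMPLE, "assignee")
countries = count_by(SAMPLE, "country")
growth = cumulative_by_year(SAMPLE)
```

Counts like these are the kind of aggregated result that could feed pie and bar charts.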



Define the relevant set of patents to identify your strengths & opportunities
Purple = most important patents
Red = least relevant patents

Fig 6: Landscape visualization of 369 most important patents in ebook technology
The role of language:
         Clustering of patents in Chinese text

Fig 7: Patent landscape visualization using the Chinese or English text
Use Case 2: SWOT analysis of technical & biological patents

• Perform a SWOT analysis in a converging market: analyze claims with mixed
  technologies
• Who owns what?
      • What does the technology landscape look like?
• Why?
      • Determine a company/technology position and opportunities
• We do this in KMX by:
      1. Query to get patents on mechanical/electronic/optics mixed with
         biological technology
      2. Landscape analysis
      3. Classification/Ranking
      4. Filter and select subset
      5. Iterate steps 1 to 4

Fig 8: Landscape visualization of 10920 patents
Use Case: SWOT analysis of patents covering
         multiple technology areas (engineering & biology)

• Patents become more
  complex to analyse

• Examples:
       • More detailed claims
       • Mixed technologies in
         the claims

• Obtaining a landscape
  overview is then key

• Analysis from the user's
  perspective is essential
  (classification/ranking)

Fig 9: Landscape visualization of 10920 patents
Patents from the total set with a biological focus
• Using the text from the
  title/abstract and claims

• Landscape analysis
  provides an overview

• Using document
  classification to
  determine sub-clusters

• Using classification and
  ranking to determine
  the most relevant
  documents from the
  user's perspective

Fig 10: Landscape visualization of 134 patents
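
The slides do not describe how KMX turns text into a landscape. One common basis for such maps is text similarity, so the sketch below assumes TF-IDF weighting with cosine similarity: documents that share distinctive words score as close. The function names and sample texts are hypothetical.

```python
import math
from collections import Counter


def tfidf_vectors(texts):
    """Turn each text into a sparse {word: weight} vector."""
    tokenized = [t.lower().split() for t in texts]
    # Number of documents in which each word occurs.
    doc_freq = Counter(word for tokens in tokenized for word in set(tokens))
    n = len(texts)
    vectors = []
    for tokens in tokenized:
        counts = Counter(tokens)
        vectors.append(
            {w: c * math.log(n / doc_freq[w]) for w, c in counts.items()}
        )
    return vectors


def cosine(a, b):
    """Cosine similarity of two sparse vectors; 0.0 if either is all zeros."""
    dot = sum(weight * b.get(word, 0.0) for word, weight in a.items())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


# Hypothetical title/abstract snippets.
texts = [
    "optical sensor for cell culture imaging",
    "microscope optics imaging living cell samples",
    "gear mechanism in wind turbine drive",
]
vectors = tfidf_vectors(texts)
bio_similarity = cosine(vectors[0], vectors[1])
mech_similarity = cosine(vectors[0], vectors[2])
```

Here the two biology-related snippets share the words "cell" and "imaging" and so score higher than the unrelated mechanical one; a layout or clustering step would then place them near each other.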
Key takeaways
          The global economy demands value generation from an IP strategy
          Big Data Paradox:
            • Limited (human) resources available for in-depth analysis
            • Growing need for data-driven decisions

          The information creators (patent searchers) focus on
            • Providing proactive information on generic analysis tasks
            • Performing specific analyses for single-user requests

          The information consumers (patent counsel)
            • Get knowledge from automated analysis with
              interactive capabilities
            • Obtain SWOT analysis knowledge of competitors
            • Build and optimize patent strategies

Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine Tuning
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 

Finding trends in large scale document sets

  • 1. Treparel, Delftechpark 26, 2628 XH Delft, The Netherlands, www.treparel.com. Dr. Anton Heijs, CEO/CTO, anton@treparel.com. September 24, 2012.
  • 2. Analysing large patent portfolios: Nothing remains uncovered. Agenda: • Introduction to Treparel & KMX Patent Analytics • How to deal with Big Data in patents? • Landscape analysis of large patent sets • Use Cases: SWOT analysis. Fig 1: Patent landscape of ebook technologies. Treparel KMX – All rights reserved 2012 www.treparel.com
  • 3. About Treparel • Treparel is an innovative technology solution provider of – Big Data Text Analytics and Visualization technology – Patent Analytics solutions • KMX is an integrated data analysis toolset which provides – Fast and accurate insights into large unstructured document sets, allowing companies to make better-informed decisions • KMX software platform – Strong focus on R&D with a university ecosystem – Over 30 man-years of software development – Used by knowledge-driven organizations in technology, chemical, and life sciences • Based in Delft, The Netherlands, since 2006.
  • 4. IP landscape trend example: 1970-2007. [Figure: landscape panels labelled Alstom, GE, Dong Feng, Siemens]
  • 5. The importance of analytics in IP. [Figure: investment ($) plotted against years, with decreasing-return and increasing-return labels, across the stages Research, Discovery, Development, Market Launch, and Life Cycle Management; analysis bands: Research Analysis; Patent Analysis; Market, Legal and Competitive Analysis]
  • 6. Economic changes in the last decades • Globalization: – More companies, increased competition, lower margins – Innovation for shorter product life cycles – Manufacturing has become a commodity through outsourcing to low-wage countries • The cost of R&D has increased, but competition leads to price erosion • The need for a return on R&D investment drives the importance of value creation from IP • The competitive edge of companies shifts from production-based to knowledge-based
  • 7. The Intellectual Economy. Past: selling products finances R&D. [Diagram stages: Research, Discovery; Development, Manufacturing; Marketing, Sales; Life Cycle Management; outputs: Revenues; Return on investments] Today: value creation and revenue generation from IP (to finance R&D). [Diagram: the same stages; outputs: Innovations; IP Licensing Revenues; Revenues; Return on investments]
  • 8. Adapting to economic developments. The New Reality: • IP strategy over time – Increase the value of the IP portfolio – Maximize the ROI from R&D • License management – Analyze the impact of economic developments and their effect on the IP strategy. Value from IP portfolio: • Protect market share • Save cost for royalties via cross-licensing • Generate income from royalties • Benefit from License Out from joint ventures or spin-offs. Drivers: • The number of patent filings is growing • Growing need to drive revenues from licensing to fund R&D investment
  • 9. Trends and implications for the future • Economies depend more strongly on each other • Innovation is a driver of economic growth • Globalization generates many patents from China and South Korea • The number and complexity of patent filings is growing. Growing need for fast and accurate analysis of large sets of patents.
  • 10. Getting more insight using fewer experts. Big Data Paradox: • Limited (human) resources available for in-depth analysis • Growing need for data-driven decisions. Growing Data, Faster Insights. [Figure: Big Data Analytics: Velocity, Volume, Variety, Complexity]
  • 11. The big data paradox: more data but less knowledge. [Figure: the 'Information Gap', 1990 to 2020, on a vertical scale of 0 to 10; labelled series: Available data; Data driven decisions; Available experts for supporting decision making; annotation: Information Gap]
  • 12. Information democracy: Information Creators and Consumers • Creators – Define and prepopulate analysis pipelines and test them on the data – Deploy these pipelines using cloud computing (the required computing capacity can scale up with the business and analytical needs) • Consumers – Use predefined analytical reports – Share feedback and input on the results to optimize the analysis. Sharing information empowers collaboration.
  • 13. The traditional IP search and analysis. [Diagram: Database and Tools (Data); Information Creator; Information Consumer; flows labelled Request, Results, and Information] The traditional approach: • Each search/analysis request is focused on a specific question from one user • When the number of requests increases, more human searchers are required • When the searches involve analysis of more patents, more time is required • Very specific searches cannot be automated • Analysis of large document sets can be automated, which is an opportunity to analyse more and become more competitive
  • 14. Information democracy: Proactive analysis to search and analysis. [Diagram: Patent Database, Research Database, and Marketing Database (Data); Analyst running the analytics pipeline (Information Creators); Push Information; Business User (Information Consumers)] Liberate the information search, facilitate the discovery process: • The knowledge creator: – defines the analysis pipeline and tests it on the data – deploys the analyses using cloud computing resources • Direct access for information consumers for in-depth analyses
  • 15. Liberate more information using new technologies • Enable a small group of experts with tools to set up IP analysis pipelines – Extend the search-on-request approach with a proactive analysis approach on large document sets – Use analysis pipelines to automatically generate visualizations in a browser – Invest in new technology coming from Big Data Analytics • Give a large group of users access to internal web pages that provide rich statistical information and interactive visualizations
  • 16. Combining proactive analysis with traditional IP search and discovery. [Diagram: Patent Database, Research Database, and Marketing Database (Data); Analyst running the analytics pipeline, with Tools (Information Creators); Business User (Information Consumers); flows labelled Personal Request, Tailor made Results, and Exchange Information]
  • 17. Performing small to large scale SWOT analysis. SWOT analysis example, patent database queries: • What are the most important patents? • Who owns them? • What is the growth of patents by: – Technology? – Owner? – Country? – Year? [Diagram labels: Patent Database; Queries; 5000 patents, 1000 patents, 500 patents; Ranking; Filtering; Overview and details; Business User]
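The growth questions on slide 17 amount to grouped counts over patent metadata. A minimal Python sketch, assuming each patent is a record with the hypothetical fields `owner`, `country`, and `year`; the slides do not describe KMX's data model, and the sample records are invented for illustration:

```python
from collections import Counter

# Hypothetical sample records; field names and values are illustrative only.
patents = [
    {"owner": "A", "country": "US", "year": 2009},
    {"owner": "A", "country": "US", "year": 2010},
    {"owner": "A", "country": "CN", "year": 2010},
    {"owner": "B", "country": "KR", "year": 2010},
    {"owner": "B", "country": "CN", "year": 2011},
]

def count_by(records, field):
    """Count records per distinct value of one metadata field."""
    return Counter(r[field] for r in records)

def yearly_growth(records):
    """Return (year, count, change versus the previous listed year) tuples."""
    counts = count_by(records, "year")
    rows, previous = [], None
    for year in sorted(counts):
        change = None if previous is None else counts[year] - previous
        rows.append((year, counts[year], change))
        previous = counts[year]
    return rows

by_owner = count_by(patents, "owner")
by_country = count_by(patents, "country")
growth = yearly_growth(patents)
```

The same `count_by` helper would cover a technology field, if one were available in the records.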
  • 18. Auto reporting & analysis for multiple users • Reporting of aggregated results: – Pie & bar charts • Providing an overview of the subject: – Landscape visualization • Enabling rich interaction
  • 19. Use Case 1: SWOT analysis of ebooks • Perform a proactive SWOT analysis of the ebook market: Amazon Kindle, Apple, Samsung/Google and other players • Who owns what? • What can we learn from the competitive technology landscape? • Why? To determine a company/technology position and opportunities • We do this in KMX by: 1. Query to get patents on electronic paper technology 2. Landscape analysis 3. Classification/Ranking 4. Filter and select subset 5. Iterate steps 1 to 4. Fig 2: Overview landscape visualization of 4257 patents
  • 20. Analysis of ebook technology. Fig 3: Overview landscape visualization of 4257 patents
  • 21. Use document classification to rank the patents. Purple = most important patents; red = least relevant patents. Fig 4: Ranked patents using a classifier for ebook technology (in purple, the selection of relevant patents for deeper analysis)
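Slide 21 ranks patents with a classifier but does not say which one. As an illustrative stand-in only (not KMX's method), the sketch below scores each document by the cosine similarity between its word counts and the combined word counts of a few user-labelled relevant examples, then sorts by score. All texts, identifiers, and function names are hypothetical.

```python
import math
import re
from collections import Counter

def bag_of_words(text):
    """Lowercase word counts; a deliberately simple text representation."""
    return Counter(re.findall(r"[a-z]+", text.lower()))

def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0

def rank(documents, relevant_examples):
    """Sort (doc_id, text) pairs by similarity to the labelled examples."""
    profile = Counter()
    for text in relevant_examples:
        profile.update(bag_of_words(text))
    scored = [(cosine(bag_of_words(text), profile), doc_id)
              for doc_id, text in documents]
    return sorted(scored, reverse=True)

# Hypothetical example marked as relevant by a user, and documents to rank.
relevant = ["electronic paper display with electrophoretic ink"]
docs = [
    ("P1", "electrophoretic display using electronic ink particles"),
    ("P2", "combustion engine fuel injection valve"),
    ("P3", "paper feeding mechanism for a printer"),
]
ranking = rank(docs, relevant)
```

A production system would more likely use weighted terms and a trained model; the point here is only that ranking reduces to scoring each document against labelled examples and sorting.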
  • 22. Drill deeper into the data to learn more. Fig 5: Landscape visualization going from 4257 to 1049 to 369 patents. After removing the irrelevant patents we use filtering to determine: • Who are the important players (assignees, inventors)? • Where are the important patents filed (countries)? • What is the trend over time (growth of patents over the years)?
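The drill-down on slide 22 can be read as repeatedly keeping the higher-ranked patents and then summarising the survivors. A hedged sketch, assuming each patent already carries a relevance `score` from a ranking step; the field names, thresholds, and records are hypothetical and not taken from KMX:

```python
from collections import Counter

# Hypothetical scored records; in practice the scores would come from ranking.
patents = [
    {"id": "P1", "assignee": "A", "country": "US", "year": 2010, "score": 0.9},
    {"id": "P2", "assignee": "B", "country": "KR", "year": 2011, "score": 0.7},
    {"id": "P3", "assignee": "A", "country": "CN", "year": 2011, "score": 0.4},
    {"id": "P4", "assignee": "C", "country": "US", "year": 2009, "score": 0.1},
]

def drill_down(records, thresholds):
    """Apply increasing score thresholds, returning the subset after each one."""
    stages, current = [], list(records)
    for threshold in thresholds:
        current = [r for r in current if r["score"] >= threshold]
        stages.append(current)
    return stages

def summarise(records):
    """Answer the who / where / when questions for one subset."""
    return {
        "assignees": Counter(r["assignee"] for r in records).most_common(),
        "countries": Counter(r["country"] for r in records).most_common(),
        "per_year": sorted(Counter(r["year"] for r in records).items()),
    }

stages = drill_down(patents, thresholds=[0.3, 0.6])
summary = summarise(stages[-1])
```

Each entry in `stages` corresponds to one narrowing step, analogous to the 4257 to 1049 to 369 sequence in Fig 5.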
  • 23. Define the relevant set of patents to identify your strengths & opportunities. Purple = most important patents; red = least relevant patents. Fig 6: Landscape visualization of the 369 most important patents in ebook technology
  • 24. The role of language: Clustering of patents in Chinese text. Fig 7: Patent landscape visualization using the Chinese or English text
  • 25. Use Case 2: SWOT analysis of technical & biological patents • Perform a SWOT analysis in a converging market: analyze claims with mixed technologies • Who owns what? • What does the technology landscape look like? • Why? To determine a company/technology position and opportunities • We do this in KMX by: 1. Query to get patents on mechanical/electronic/optics mixed with biological technology 2. Landscape analysis 3. Classification/Ranking 4. Filter and select subset 5. Iterate steps 1 to 4. Fig 8: Landscape visualization of 10920 patents
  • 26. Use Case: SWOT analysis of patents covering multiple technology areas (engineering & biology) • Patents become more complex to analyse • Examples: – More detailed claims – Mixed technologies in the claims • Obtaining a landscape overview is then key • Analysis from the user's perspective is essential (classification/ranking). Fig 9: Landscape visualization of 10920 patents
  • 27. Patents from the total set with a biological focus • Using the text from the title/abstract and claims • Landscape analysis provides an overview • Using document classification to determine sub-clusters • Using classification and ranking to determine the most relevant documents from the user's perspective. Fig 10: Landscape visualization of 134 patents
  • 28. Key takeaways • The global economy demands value generation from an IP strategy • Big Data Paradox: – Limited (human) resources available for in-depth analysis – Growing need for data-driven decisions • The information creators (patent searchers) focus on – Providing proactive information on generic analysis tasks – Performing specific analyses for single-user requests • The information consumers (patent counsel) – Get knowledge from automated analysis with interactive capabilities – Obtain SWOT analysis knowledge of competitors – Build and optimize patent strategies