SlideShare une entreprise Scribd logo
1  sur  30
Télécharger pour lire hors ligne
The Challenges of Managing “Big Data” in the
Patent Field
14-15 April 2014, Nice
Olivier Huc
Specialists
in Patent
Information
Building
Intelligent
Patent
Information
Solutions
since 1996
What we do
Trusted
by IP experts
Worldwide
Corporations,
National Patent
Offices, Patent
Attorneys and
Patent Search
Firms worldwide
International
Customer
Support
Global client base
With Offices and
Support across
Europe North
America, and Asia
Patent Families
Analytics
Quality Control
Fast Search
Legal Status
Review
Alerts
• 23 Full Text Collections
• 48 Million Families
• 103 Issuing Authorities
• IPC, CPC US and JP classes
• Quality Controlled content
• Normalised data
3 Patent Data Myths
• Myth #1: Patent data is just another type of
“Big Data”
• Myth #2: Patent Data is handled automatically
• Myth #3: Patent Data is consistent worldwide
• Patent Data volume might be smaller, data is
more complex (languages, text, fields)
• Patent data is not retrieved on the fly, it is
hosted, indexed and optimized
• There are multiple sources with overlap
• Data quality is a major issue
• Users have a low tolerance for errors
The reality
• Total data volume exceeds 35 Tb
• 49 million families and 103 publishing bodies
• 95 million publications
• 47 million full-texts including over 23 million non-Latin into
English machine translations
• 54 million clipped images and 45 million complete sets of
drawings
Database Facts
• Minesoft and RWS host their own data center, located just
outside of London
• Control
• Confidentiality
• Reactivity
• Speed
• Distributed search engine
• Continuous data update and indexing => no need to interrupt
or restart the online services,
+ new data immediately searchable
Hardware & Search Engine
• Multiple data sources:
• DOCDB weekly feeds (EPO)
• National Patent Offices
• Commercial collections
• External information (such as National Registers)
• Despite the complexity, having multiple sources for
the same country is a great advantage:
• Complementarity
• Improved quality
• Security
• Speed
Sources
• We perform stringent quality checks
• Human
• Programmatic
• Manual checks on some source data collections as they arrive: e.g.
Indian (IN), Thai (TH) and The Philippines (PH)
• Errors in data are identified programmatically by strict pre-set
parameters which are then manually corrected by our data team
• e.g. IC8=AO1G1/00
• Although we follow EPO’s INPADOC rules for families (extended),
we recreate all our families to ensure consistency
Data Quality
Adding extra value to PatBase data:
• Families are automatically reviewed and, then if necessary, rebuilt
when we receive new and/or corrected information (e.g. priority)
• Tagging of examples, paragraphs and claims is done in order to
facilitate searching specific sections of text
• Machine translation: when a family gets new text, the family is
reassessed to see if a machine translation needs to be
added/replaced/deleted.
Data Quality
TW AN/PR inputs TW AN/PR outputs
083303675 Emperor year conversion
& Type of application
TW19940303675F
092128911 TW20030128911
092128911 TW20040201682U
US AN/PR inputs US AN/PR outputs
US29/356,858 20100303 Type of application & Year US20100356858F
1301618611 A US20110016186
AT AN/PR inputs AT AN/PR outputs
A 709/95 Type of application & Year AT19950000709
GM647/96 AT19960000647U
Standardisation of patent data
Formatting application and priority information
• Formatting patent numbers and kind codes
• Formatting dates
Thailand use Buddhist years (Gregorian calendar year plus 543)
US date format - 2011/09/02 (9 February 2011)
European date format – 2011/09/02 (2 September 2011)
2007
Standardisation of patent data
The EPO standardize names to assist searching.
PatBase contains both standard and non-standard names.
Standard name assigned by the EPO
Non-standard name consists of whatever
is filed or published on the patent
Standard Non-standard
PIRELLI IND PIRELI SPA
PIRELLI IND PIRELLA SPA
PIRELLI IND PIRELLE S P A
PIRELLI IND PIRELLI DPA
PIRELLI IND PIRELLI S p A
PIRELLI IND PIRELLI S A
PIRELLI IND PIRELLI S P A
PIRELLI IND PIRELLI S P A FIRMA
PIRELLI IND PIRELLI S P A IT
PIRELLI IND PIRELLI S P CA
PIRELLI IND PIRELLI SPA IT
PIRELLI IND PIRELLI SPP
PIRELLI IND PIRELLU SPA
PIRELLI IND PIRELLY SPA
This is a small example set of the
non-standard names that The EPO
assign the standard name ‘Pirelli’
There are currently 188 non-standard names for the
standard name ‘Pirelli’
Standardization of patent data
• Date Formats
• All fields, e.g. patent classifications, assignees, text etc. have set
parameters. Where these are not matched data errors are
identified for manual editing.
• If a text is illegible (we have programmatic systems in place
measuring this) it will not be allowed into the database and be
identified as requiring manual attention (often manual typing).
• Character conversions
We have thousands of symbol / letter conversions in our
programs:
• & is replaced by and
• œ is replaced by oe
• β is replaced by ss
Data Improvements
Insertion of paragraph breaks and paragraph numbers
Data Improvements
Output in PatBase
Source text
• Errors appear in source data so manual checks are essential
• Example – Granted patent information from the Indian Patent Office
Journal. Three different inventions have incorrectly been given the same
publication number
Manual checks
IN000008
Data quality issues
On the Thai patent office website - the same publication number is used for two
different applications
Patent copy for TH48405 A
In PatBase
Application number: TH19981004295
Publication number: TH48406 A
Application number: TH19981002185
Publication number: TH48405 A
Wrong number
Correct number
Manual checks
• Acquiring data from multiple sources enables us to supplement records,
but also alerts us to errors thus ensuring accuracy
KR20010012826 A – Glial Cell Line-
Derived Neurotrophic Factor
Receptors
KR20010112826 A – Single phase six
pole DC brushless axial fan motor of
transistor type
Source EPO – Error in information This EPO record is a
combination of two
inventions. The
publication number
does not match with
the invention.
Identifying data errors
Incorrect data received from source
In cases such as these we correct the error in PatBase and inform the EPO
NULL values were
supplied in the
EPO’s DOCDB file
as Applicants
Identifying data errors
Example of an incorrect assignment from the USPTO
PatBase family 41683901
Excerpt from USPTO assignments database
Identifying data errors
Translations
• Principle: the English text of an equivalent is
always better than the Machine Translation
• All non-latin Texts are machine translated into
English and indexed when added to PatBase
• On a rolling basis we re-translate texts to
benefit from the continuous improvements of
translation engines
Machine translation
• Machine translations are made as data is added, removed / rebuilt. This
is all done before indexing.
• We run a rolling re-translate and re-index program to optimize the quality
of our machine translated full-text
Original translation, Thai into English Re-translation, Thai into English
Original translation, Thai into English Re-translation, Thai into English
Translations
Re-translation Korean into EnglishOriginal translation, Korean into English
Translations
Assignee translations
• Non-latin assignees are indexed
• Non-latin assignees are also translated
• First 10,000 CN and JP assignees have been
manually translated by RWS
• All others are Machine Translated until an “official”
Latin names appear in the family
Cross-lingual Tool
• Initially developed by WIPO, CLIR (Cross Lingual
Information Retrieval) allows our users to generate
multilingual searches
• Using an advanced statistical text analysis system
based on the PCT corpus, the cross-lingual search tool
identifies variants in multiple languages for search
terms entered by the user.
=> Better translation – translated words originate from
PCT applications
• Source: INPADOC
• All legal status events are categorised with a PRS
code
• Challenge: 2628 different PRS codes, some no longer
in use
• Solution: Grouping similar legal events together:
Legal Status
Reassignment
Deemed Withdrawn/Abandoned
Examined
Renewal Fees Paid
Granted
Lapsed/Expired/Ceased/Dead
Licence
Non-Entry into National Phase
National Phase Entry
Opposition Filed/Request for revocation
Published
Restored/Reinstated/Amended
Revoked/Rejected/Annuled/Invalid
Withdrawn/Abandoned/Terminated/Void
Legal Status Timeline
• Most patent databases are structured and optimized for
Patent Searching, not for Analytics
• At Minesoft, we developed a special database with
proprietary meta tags dedicated to the analytics
• Coverage is important – Beware of data gaps
• Importance of a web service (API)
• Importance of incorporating your own custom data or
legal status information in your analysis
Analytics
Thank you
PatBase celebrates its 10th anniversary
Olivier Huc – olivier@minesoft.com

Contenu connexe

Tendances

ICIC 2014 Patent Citation Analysis: Tools and Techniques
ICIC 2014 Patent Citation Analysis: Tools and Techniques ICIC 2014 Patent Citation Analysis: Tools and Techniques
ICIC 2014 Patent Citation Analysis: Tools and Techniques Dr. Haxel Consult
 
II-SDV 2015, 20 - 21 April, in Nice
II-SDV 2015, 20 - 21 April, in NiceII-SDV 2015, 20 - 21 April, in Nice
II-SDV 2015, 20 - 21 April, in NiceDr. Haxel Consult
 
II-SV 2017: How to effectively monitor Technological Developments in IP
II-SV 2017: How to effectively monitor Technological Developments in IPII-SV 2017: How to effectively monitor Technological Developments in IP
II-SV 2017: How to effectively monitor Technological Developments in IPDr. Haxel Consult
 
II-SDV 2015, 20 - 21 April, in Nice
II-SDV 2015, 20 - 21 April, in NiceII-SDV 2015, 20 - 21 April, in Nice
II-SDV 2015, 20 - 21 April, in NiceDr. Haxel Consult
 
II-SDV 2015, 20 - 21 April, in Nice
II-SDV 2015, 20 - 21 April, in NiceII-SDV 2015, 20 - 21 April, in Nice
II-SDV 2015, 20 - 21 April, in NiceDr. Haxel Consult
 
ICIC 2013 Conference Proceedings Ricardo Eito Brun Uni Madrid
ICIC 2013 Conference Proceedings Ricardo Eito Brun Uni MadridICIC 2013 Conference Proceedings Ricardo Eito Brun Uni Madrid
ICIC 2013 Conference Proceedings Ricardo Eito Brun Uni MadridDr. Haxel Consult
 
II-PIC 2017: China: Life after the Patent Tsunami
II-PIC 2017: China: Life after the Patent TsunamiII-PIC 2017: China: Life after the Patent Tsunami
II-PIC 2017: China: Life after the Patent TsunamiDr. Haxel Consult
 
II-SDV 2016 Manish Sinka - Taking Patent Research platforms beyond Search
II-SDV 2016 Manish Sinka - Taking Patent Research platforms beyond SearchII-SDV 2016 Manish Sinka - Taking Patent Research platforms beyond Search
II-SDV 2016 Manish Sinka - Taking Patent Research platforms beyond SearchDr. Haxel Consult
 
ICIC 2014 Valuing IP in the Chemical Space – Science, Art and Special Conside...
ICIC 2014 Valuing IP in the Chemical Space – Science, Art and Special Conside...ICIC 2014 Valuing IP in the Chemical Space – Science, Art and Special Conside...
ICIC 2014 Valuing IP in the Chemical Space – Science, Art and Special Conside...Dr. Haxel Consult
 
ICIC 2017: The Use of Patent Information for Innovation and Competitive Intel...
ICIC 2017: The Use of Patent Information for Innovation and Competitive Intel...ICIC 2017: The Use of Patent Information for Innovation and Competitive Intel...
ICIC 2017: The Use of Patent Information for Innovation and Competitive Intel...Dr. Haxel Consult
 
Efficient and Effective Patent Landscaping Using PatBase: a Case Study
Efficient and Effective Patent Landscaping Using PatBase: a Case Study    Efficient and Effective Patent Landscaping Using PatBase: a Case Study
Efficient and Effective Patent Landscaping Using PatBase: a Case Study Dr. Haxel Consult
 
IC-SDV 2018: Jane List (Extract Information) Machine Translation for patents....
IC-SDV 2018: Jane List (Extract Information) Machine Translation for patents....IC-SDV 2018: Jane List (Extract Information) Machine Translation for patents....
IC-SDV 2018: Jane List (Extract Information) Machine Translation for patents....Dr. Haxel Consult
 
II-SDV 2015, 20 - 21 April, in Nice
II-SDV 2015, 20 - 21 April, in NiceII-SDV 2015, 20 - 21 April, in Nice
II-SDV 2015, 20 - 21 April, in NiceDr. Haxel Consult
 
II-SDV 2016 Questel Intellixir
II-SDV 2016 Questel IntellixirII-SDV 2016 Questel Intellixir
II-SDV 2016 Questel IntellixirDr. Haxel Consult
 
II-SDV 2016 Irene Kitsara - Patent Landscape Reports and Other WIPO Activitie...
II-SDV 2016 Irene Kitsara - Patent Landscape Reports and Other WIPO Activitie...II-SDV 2016 Irene Kitsara - Patent Landscape Reports and Other WIPO Activitie...
II-SDV 2016 Irene Kitsara - Patent Landscape Reports and Other WIPO Activitie...Dr. Haxel Consult
 
IC-SDV 2018: Search Technology / VanatagePoint
IC-SDV 2018: Search Technology / VanatagePointIC-SDV 2018: Search Technology / VanatagePoint
IC-SDV 2018: Search Technology / VanatagePointDr. Haxel Consult
 
II-SDV 2017: Spotting the Stars in your Galaxy of Patent Data
II-SDV 2017: Spotting the Stars in your Galaxy of Patent DataII-SDV 2017: Spotting the Stars in your Galaxy of Patent Data
II-SDV 2017: Spotting the Stars in your Galaxy of Patent DataDr. Haxel Consult
 
II-SDV 2017: Gridlogics Technologies
II-SDV 2017: Gridlogics TechnologiesII-SDV 2017: Gridlogics Technologies
II-SDV 2017: Gridlogics TechnologiesDr. Haxel Consult
 
II-PIC 201: Product Presentation CAS / STN
II-PIC 201: Product Presentation CAS / STN II-PIC 201: Product Presentation CAS / STN
II-PIC 201: Product Presentation CAS / STN Dr. Haxel Consult
 

Tendances (20)

ICIC 2014 Patent Citation Analysis: Tools and Techniques
ICIC 2014 Patent Citation Analysis: Tools and Techniques ICIC 2014 Patent Citation Analysis: Tools and Techniques
ICIC 2014 Patent Citation Analysis: Tools and Techniques
 
II-SDV 2015, 20 - 21 April, in Nice
II-SDV 2015, 20 - 21 April, in NiceII-SDV 2015, 20 - 21 April, in Nice
II-SDV 2015, 20 - 21 April, in Nice
 
II-SV 2017: How to effectively monitor Technological Developments in IP
II-SV 2017: How to effectively monitor Technological Developments in IPII-SV 2017: How to effectively monitor Technological Developments in IP
II-SV 2017: How to effectively monitor Technological Developments in IP
 
II-SDV 2015, 20 - 21 April, in Nice
II-SDV 2015, 20 - 21 April, in NiceII-SDV 2015, 20 - 21 April, in Nice
II-SDV 2015, 20 - 21 April, in Nice
 
II-SDV 2015, 20 - 21 April, in Nice
II-SDV 2015, 20 - 21 April, in NiceII-SDV 2015, 20 - 21 April, in Nice
II-SDV 2015, 20 - 21 April, in Nice
 
ICIC 2013 Conference Proceedings Ricardo Eito Brun Uni Madrid
ICIC 2013 Conference Proceedings Ricardo Eito Brun Uni MadridICIC 2013 Conference Proceedings Ricardo Eito Brun Uni Madrid
ICIC 2013 Conference Proceedings Ricardo Eito Brun Uni Madrid
 
II-PIC 2017: China: Life after the Patent Tsunami
II-PIC 2017: China: Life after the Patent TsunamiII-PIC 2017: China: Life after the Patent Tsunami
II-PIC 2017: China: Life after the Patent Tsunami
 
II-SDV 2016 Manish Sinka - Taking Patent Research platforms beyond Search
II-SDV 2016 Manish Sinka - Taking Patent Research platforms beyond SearchII-SDV 2016 Manish Sinka - Taking Patent Research platforms beyond Search
II-SDV 2016 Manish Sinka - Taking Patent Research platforms beyond Search
 
ICIC 2014 Valuing IP in the Chemical Space – Science, Art and Special Conside...
ICIC 2014 Valuing IP in the Chemical Space – Science, Art and Special Conside...ICIC 2014 Valuing IP in the Chemical Space – Science, Art and Special Conside...
ICIC 2014 Valuing IP in the Chemical Space – Science, Art and Special Conside...
 
ICIC 2017: The Use of Patent Information for Innovation and Competitive Intel...
ICIC 2017: The Use of Patent Information for Innovation and Competitive Intel...ICIC 2017: The Use of Patent Information for Innovation and Competitive Intel...
ICIC 2017: The Use of Patent Information for Innovation and Competitive Intel...
 
Efficient and Effective Patent Landscaping Using PatBase: a Case Study
Efficient and Effective Patent Landscaping Using PatBase: a Case Study    Efficient and Effective Patent Landscaping Using PatBase: a Case Study
Efficient and Effective Patent Landscaping Using PatBase: a Case Study
 
IC-SDV 2018: Jane List (Extract Information) Machine Translation for patents....
IC-SDV 2018: Jane List (Extract Information) Machine Translation for patents....IC-SDV 2018: Jane List (Extract Information) Machine Translation for patents....
IC-SDV 2018: Jane List (Extract Information) Machine Translation for patents....
 
II-SDV 2015, 20 - 21 April, in Nice
II-SDV 2015, 20 - 21 April, in NiceII-SDV 2015, 20 - 21 April, in Nice
II-SDV 2015, 20 - 21 April, in Nice
 
II-SDV 2016 Questel Intellixir
II-SDV 2016 Questel IntellixirII-SDV 2016 Questel Intellixir
II-SDV 2016 Questel Intellixir
 
II-SDV 2016 Irene Kitsara - Patent Landscape Reports and Other WIPO Activitie...
II-SDV 2016 Irene Kitsara - Patent Landscape Reports and Other WIPO Activitie...II-SDV 2016 Irene Kitsara - Patent Landscape Reports and Other WIPO Activitie...
II-SDV 2016 Irene Kitsara - Patent Landscape Reports and Other WIPO Activitie...
 
IC-SDV 2018: Search Technology / VanatagePoint
IC-SDV 2018: Search Technology / VanatagePointIC-SDV 2018: Search Technology / VanatagePoint
IC-SDV 2018: Search Technology / VanatagePoint
 
II-SDV 2017: Spotting the Stars in your Galaxy of Patent Data
II-SDV 2017: Spotting the Stars in your Galaxy of Patent DataII-SDV 2017: Spotting the Stars in your Galaxy of Patent Data
II-SDV 2017: Spotting the Stars in your Galaxy of Patent Data
 
II-SDV 2016 Expert System
II-SDV 2016 Expert SystemII-SDV 2016 Expert System
II-SDV 2016 Expert System
 
II-SDV 2017: Gridlogics Technologies
II-SDV 2017: Gridlogics TechnologiesII-SDV 2017: Gridlogics Technologies
II-SDV 2017: Gridlogics Technologies
 
II-PIC 201: Product Presentation CAS / STN
II-PIC 201: Product Presentation CAS / STN II-PIC 201: Product Presentation CAS / STN
II-PIC 201: Product Presentation CAS / STN
 

En vedette

II-SDV 2014 Automated Relevancy Check of Patents and Scientific Literature (K...
II-SDV 2014 Automated Relevancy Check of Patents and Scientific Literature (K...II-SDV 2014 Automated Relevancy Check of Patents and Scientific Literature (K...
II-SDV 2014 Automated Relevancy Check of Patents and Scientific Literature (K...Dr. Haxel Consult
 
II-SDV 2014 A New Approach to Flexible, Meaning-Rich Document Parsing (Paul B...
II-SDV 2014 A New Approach to Flexible, Meaning-Rich Document Parsing (Paul B...II-SDV 2014 A New Approach to Flexible, Meaning-Rich Document Parsing (Paul B...
II-SDV 2014 A New Approach to Flexible, Meaning-Rich Document Parsing (Paul B...Dr. Haxel Consult
 
II-SDV 2014 Competitive Positioning and Technological Complementarity in the ...
II-SDV 2014 Competitive Positioning and Technological Complementarity in the ...II-SDV 2014 Competitive Positioning and Technological Complementarity in the ...
II-SDV 2014 Competitive Positioning and Technological Complementarity in the ...Dr. Haxel Consult
 
II-SDV 2014 The Digital Workplace – The death of desktop search? (Martin Whit...
II-SDV 2014 The Digital Workplace – The death of desktop search? (Martin Whit...II-SDV 2014 The Digital Workplace – The death of desktop search? (Martin Whit...
II-SDV 2014 The Digital Workplace – The death of desktop search? (Martin Whit...Dr. Haxel Consult
 
II-SDV 2014 Analysing Patent Full Text – Comparison against analysis of abstr...
II-SDV 2014 Analysing Patent Full Text – Comparison against analysis of abstr...II-SDV 2014 Analysing Patent Full Text – Comparison against analysis of abstr...
II-SDV 2014 Analysing Patent Full Text – Comparison against analysis of abstr...Dr. Haxel Consult
 
II-SDV 2014 Hybrid Intelligence – foresight for opportunities (Youri Aksenov ...
II-SDV 2014 Hybrid Intelligence – foresight for opportunities (Youri Aksenov ...II-SDV 2014 Hybrid Intelligence – foresight for opportunities (Youri Aksenov ...
II-SDV 2014 Hybrid Intelligence – foresight for opportunities (Youri Aksenov ...Dr. Haxel Consult
 
II-SDV 2014 Search and Data Mining Open Source Platforms (Patrick Beaucamp - ...
II-SDV 2014 Search and Data Mining Open Source Platforms (Patrick Beaucamp - ...II-SDV 2014 Search and Data Mining Open Source Platforms (Patrick Beaucamp - ...
II-SDV 2014 Search and Data Mining Open Source Platforms (Patrick Beaucamp - ...Dr. Haxel Consult
 

En vedette (7)

II-SDV 2014 Automated Relevancy Check of Patents and Scientific Literature (K...
II-SDV 2014 Automated Relevancy Check of Patents and Scientific Literature (K...II-SDV 2014 Automated Relevancy Check of Patents and Scientific Literature (K...
II-SDV 2014 Automated Relevancy Check of Patents and Scientific Literature (K...
 
II-SDV 2014 A New Approach to Flexible, Meaning-Rich Document Parsing (Paul B...
II-SDV 2014 A New Approach to Flexible, Meaning-Rich Document Parsing (Paul B...II-SDV 2014 A New Approach to Flexible, Meaning-Rich Document Parsing (Paul B...
II-SDV 2014 A New Approach to Flexible, Meaning-Rich Document Parsing (Paul B...
 
II-SDV 2014 Competitive Positioning and Technological Complementarity in the ...
II-SDV 2014 Competitive Positioning and Technological Complementarity in the ...II-SDV 2014 Competitive Positioning and Technological Complementarity in the ...
II-SDV 2014 Competitive Positioning and Technological Complementarity in the ...
 
II-SDV 2014 The Digital Workplace – The death of desktop search? (Martin Whit...
II-SDV 2014 The Digital Workplace – The death of desktop search? (Martin Whit...II-SDV 2014 The Digital Workplace – The death of desktop search? (Martin Whit...
II-SDV 2014 The Digital Workplace – The death of desktop search? (Martin Whit...
 
II-SDV 2014 Analysing Patent Full Text – Comparison against analysis of abstr...
II-SDV 2014 Analysing Patent Full Text – Comparison against analysis of abstr...II-SDV 2014 Analysing Patent Full Text – Comparison against analysis of abstr...
II-SDV 2014 Analysing Patent Full Text – Comparison against analysis of abstr...
 
II-SDV 2014 Hybrid Intelligence – foresight for opportunities (Youri Aksenov ...
II-SDV 2014 Hybrid Intelligence – foresight for opportunities (Youri Aksenov ...II-SDV 2014 Hybrid Intelligence – foresight for opportunities (Youri Aksenov ...
II-SDV 2014 Hybrid Intelligence – foresight for opportunities (Youri Aksenov ...
 
II-SDV 2014 Search and Data Mining Open Source Platforms (Patrick Beaucamp - ...
II-SDV 2014 Search and Data Mining Open Source Platforms (Patrick Beaucamp - ...II-SDV 2014 Search and Data Mining Open Source Platforms (Patrick Beaucamp - ...
II-SDV 2014 Search and Data Mining Open Source Platforms (Patrick Beaucamp - ...
 

Similaire à II-SDV 2014 The Challenges of Managing “Big Data” in the Patent Field: Patents for business (Olivier Huc, Minesoft, UK)

Aeren -Company Collateral - 2015
Aeren -Company Collateral - 2015Aeren -Company Collateral - 2015
Aeren -Company Collateral - 2015Aeren IP
 
Big Data Analysis for Standard Essential Patents
Big Data Analysis for Standard Essential PatentsBig Data Analysis for Standard Essential Patents
Big Data Analysis for Standard Essential PatentsAlex G. Lee, Ph.D. Esq. CLP
 
II-SDV 2015, 20 - 21 April, in Nice
II-SDV 2015, 20 - 21 April, in NiceII-SDV 2015, 20 - 21 April, in Nice
II-SDV 2015, 20 - 21 April, in NiceDr. Haxel Consult
 
ICIC 2014 High volume, High Quality Patent Translation across Multiple Domain...
ICIC 2014 High volume, High Quality Patent Translation across Multiple Domain...ICIC 2014 High volume, High Quality Patent Translation across Multiple Domain...
ICIC 2014 High volume, High Quality Patent Translation across Multiple Domain...Dr. Haxel Consult
 
ICIC 2013 New Product Introductions Minesoft
ICIC 2013 New Product Introductions MinesoftICIC 2013 New Product Introductions Minesoft
ICIC 2013 New Product Introductions MinesoftDr. Haxel Consult
 
Patent: Patent Searching / A Presentation at NALSAR Hyderabad - Nitin Nair
Patent: Patent Searching / A Presentation at NALSAR Hyderabad - Nitin NairPatent: Patent Searching / A Presentation at NALSAR Hyderabad - Nitin Nair
Patent: Patent Searching / A Presentation at NALSAR Hyderabad - Nitin NairBananaIP Counsels
 
Protecting Your Intellectual Property: Cost-Saving Techniques, Legal Updates ...
Protecting Your Intellectual Property: Cost-Saving Techniques, Legal Updates ...Protecting Your Intellectual Property: Cost-Saving Techniques, Legal Updates ...
Protecting Your Intellectual Property: Cost-Saving Techniques, Legal Updates ...Knobbe Martens - Intellectual Property Law
 
ICIC 2014 New Product Introduction Gridlogisc
ICIC 2014 New Product Introduction GridlogiscICIC 2014 New Product Introduction Gridlogisc
ICIC 2014 New Product Introduction GridlogiscDr. Haxel Consult
 
Ipph introduction of ipph patent products and service
Ipph introduction of ipph patent products and serviceIpph introduction of ipph patent products and service
Ipph introduction of ipph patent products and servicexiaonengfan
 
II-SDV 2014 Product Presentations Minsoft
II-SDV 2014 Product Presentations MinsoftII-SDV 2014 Product Presentations Minsoft
II-SDV 2014 Product Presentations MinsoftDr. Haxel Consult
 
The EPO document collection: A technical treasure chest
The EPO document collection:A technical treasure chestThe EPO document collection:A technical treasure chest
The EPO document collection: A technical treasure chestGO opleidingen
 
Big Data Expo 2015 - HP Information Management & Governance
Big Data Expo 2015 - HP Information Management & GovernanceBig Data Expo 2015 - HP Information Management & Governance
Big Data Expo 2015 - HP Information Management & GovernanceBigDataExpo
 
Search patent literature 2011 esther arias [modo de compatibilidad]
Search patent literature 2011 esther arias  [modo de compatibilidad]Search patent literature 2011 esther arias  [modo de compatibilidad]
Search patent literature 2011 esther arias [modo de compatibilidad]Esther Arias Pérez-Ilzarbe
 
PatSeer Premier Overview
PatSeer Premier OverviewPatSeer Premier Overview
PatSeer Premier OverviewGridlogics
 
WIPS Global Brochure, New
WIPS Global Brochure, NewWIPS Global Brochure, New
WIPS Global Brochure, Newshikha gupta
 
IANA: Who, What, Why?
IANA: Who, What, Why?IANA: Who, What, Why?
IANA: Who, What, Why?APNIC
 

Similaire à II-SDV 2014 The Challenges of Managing “Big Data” in the Patent Field: Patents for business (Olivier Huc, Minesoft, UK) (20)

Aeren -Company Collateral - 2015
Aeren -Company Collateral - 2015Aeren -Company Collateral - 2015
Aeren -Company Collateral - 2015
 
Big Data Analysis for Standard Essential Patents
Big Data Analysis for Standard Essential PatentsBig Data Analysis for Standard Essential Patents
Big Data Analysis for Standard Essential Patents
 
II-SDV 2015, 20 - 21 April, in Nice
II-SDV 2015, 20 - 21 April, in NiceII-SDV 2015, 20 - 21 April, in Nice
II-SDV 2015, 20 - 21 April, in Nice
 
ICIC 2014 High volume, High Quality Patent Translation across Multiple Domain...
ICIC 2014 High volume, High Quality Patent Translation across Multiple Domain...ICIC 2014 High volume, High Quality Patent Translation across Multiple Domain...
ICIC 2014 High volume, High Quality Patent Translation across Multiple Domain...
 
ICIC 2013 New Product Introductions Minesoft
ICIC 2013 New Product Introductions MinesoftICIC 2013 New Product Introductions Minesoft
ICIC 2013 New Product Introductions Minesoft
 
II-SDV 2016 GRIDLOGICS
II-SDV 2016 GRIDLOGICSII-SDV 2016 GRIDLOGICS
II-SDV 2016 GRIDLOGICS
 
Patent: Patent Searching / A Presentation at NALSAR Hyderabad - Nitin Nair
Patent: Patent Searching / A Presentation at NALSAR Hyderabad - Nitin NairPatent: Patent Searching / A Presentation at NALSAR Hyderabad - Nitin Nair
Patent: Patent Searching / A Presentation at NALSAR Hyderabad - Nitin Nair
 
Protecting Your Intellectual Property: Cost-Saving Techniques, Legal Updates ...
Protecting Your Intellectual Property: Cost-Saving Techniques, Legal Updates ...Protecting Your Intellectual Property: Cost-Saving Techniques, Legal Updates ...
Protecting Your Intellectual Property: Cost-Saving Techniques, Legal Updates ...
 
ICIC 2014 New Product Introduction Gridlogisc
ICIC 2014 New Product Introduction GridlogiscICIC 2014 New Product Introduction Gridlogisc
ICIC 2014 New Product Introduction Gridlogisc
 
Ipph introduction of ipph patent products and service
Ipph introduction of ipph patent products and serviceIpph introduction of ipph patent products and service
Ipph introduction of ipph patent products and service
 
II-SDV 2014 Product Presentations Minsoft
II-SDV 2014 Product Presentations MinsoftII-SDV 2014 Product Presentations Minsoft
II-SDV 2014 Product Presentations Minsoft
 
The EPO document collection: A technical treasure chest
The EPO document collection:A technical treasure chestThe EPO document collection:A technical treasure chest
The EPO document collection: A technical treasure chest
 
Big Data Expo 2015 - HP Information Management & Governance
Big Data Expo 2015 - HP Information Management & GovernanceBig Data Expo 2015 - HP Information Management & Governance
Big Data Expo 2015 - HP Information Management & Governance
 
AI-SDV 2021: Dolcera
AI-SDV 2021: DolceraAI-SDV 2021: Dolcera
AI-SDV 2021: Dolcera
 
Search patent literature 2011 esther arias [modo de compatibilidad]
Search patent literature 2011 esther arias  [modo de compatibilidad]Search patent literature 2011 esther arias  [modo de compatibilidad]
Search patent literature 2011 esther arias [modo de compatibilidad]
 
PatSeer Premier Overview
PatSeer Premier OverviewPatSeer Premier Overview
PatSeer Premier Overview
 
Querying Patent Data for Empirical Scholarship : Tools and Strategies
Querying Patent Data for Empirical Scholarship : Tools and StrategiesQuerying Patent Data for Empirical Scholarship : Tools and Strategies
Querying Patent Data for Empirical Scholarship : Tools and Strategies
 
II-SDV 2016 Minesoft
II-SDV 2016 MinesoftII-SDV 2016 Minesoft
II-SDV 2016 Minesoft
 
WIPS Global Brochure, New
WIPS Global Brochure, NewWIPS Global Brochure, New
WIPS Global Brochure, New
 
IANA: Who, What, Why?
IANA: Who, What, Why?IANA: Who, What, Why?
IANA: Who, What, Why?
 

Plus de Dr. Haxel Consult

AI-SDV 2022: Henry Chang Patent Intelligence and Engineering Management
AI-SDV 2022: Henry Chang Patent Intelligence and Engineering ManagementAI-SDV 2022: Henry Chang Patent Intelligence and Engineering Management
AI-SDV 2022: Henry Chang Patent Intelligence and Engineering ManagementDr. Haxel Consult
 
AI-SDV 2022: Creation and updating of large Knowledge Graphs through NLP Anal...
AI-SDV 2022: Creation and updating of large Knowledge Graphs through NLP Anal...AI-SDV 2022: Creation and updating of large Knowledge Graphs through NLP Anal...
AI-SDV 2022: Creation and updating of large Knowledge Graphs through NLP Anal...Dr. Haxel Consult
 
AI-SDV 2022: The race to net zero: Tracking the green industrial revolution t...
AI-SDV 2022: The race to net zero: Tracking the green industrial revolution t...AI-SDV 2022: The race to net zero: Tracking the green industrial revolution t...
AI-SDV 2022: The race to net zero: Tracking the green industrial revolution t...Dr. Haxel Consult
 
AI-SDV 2022: Accommodating the Deep Learning Revolution by a Development Proc...
AI-SDV 2022: Accommodating the Deep Learning Revolution by a Development Proc...AI-SDV 2022: Accommodating the Deep Learning Revolution by a Development Proc...
AI-SDV 2022: Accommodating the Deep Learning Revolution by a Development Proc...Dr. Haxel Consult
 
AI-SDV 2022: Domain Knowledge makes Artificial Intelligence Smart Linda Ander...
AI-SDV 2022: Domain Knowledge makes Artificial Intelligence Smart Linda Ander...AI-SDV 2022: Domain Knowledge makes Artificial Intelligence Smart Linda Ander...
AI-SDV 2022: Domain Knowledge makes Artificial Intelligence Smart Linda Ander...Dr. Haxel Consult
 
AI-SDV 2022: Embedding-based Search Vs. Relevancy Search: comparing the new w...
AI-SDV 2022: Embedding-based Search Vs. Relevancy Search: comparing the new w...AI-SDV 2022: Embedding-based Search Vs. Relevancy Search: comparing the new w...
AI-SDV 2022: Embedding-based Search Vs. Relevancy Search: comparing the new w...Dr. Haxel Consult
 
AI-SDV 2022: Rolling out web crawling at Boehringer Ingelheim - 10 years of e...
AI-SDV 2022: Rolling out web crawling at Boehringer Ingelheim - 10 years of e...AI-SDV 2022: Rolling out web crawling at Boehringer Ingelheim - 10 years of e...
AI-SDV 2022: Rolling out web crawling at Boehringer Ingelheim - 10 years of e...Dr. Haxel Consult
 
AI-SDV 2022: Machine learning based patent categorization: A success story in...
AI-SDV 2022: Machine learning based patent categorization: A success story in...AI-SDV 2022: Machine learning based patent categorization: A success story in...
AI-SDV 2022: Machine learning based patent categorization: A success story in...Dr. Haxel Consult
 
AI-SDV 2022: Machine learning based patent categorization: A success story in...
AI-SDV 2022: Machine learning based patent categorization: A success story in...AI-SDV 2022: Machine learning based patent categorization: A success story in...
AI-SDV 2022: Machine learning based patent categorization: A success story in...Dr. Haxel Consult
 
AI-SDV 2022: Finding the WHAT – Will AI help? Nils Newman (Search Technology,...
AI-SDV 2022: Finding the WHAT – Will AI help? Nils Newman (Search Technology,...AI-SDV 2022: Finding the WHAT – Will AI help? Nils Newman (Search Technology,...
AI-SDV 2022: Finding the WHAT – Will AI help? Nils Newman (Search Technology,...Dr. Haxel Consult
 
AI-SDV 2022: New Insights from Trademarks with Natural Language Processing Al...
AI-SDV 2022: New Insights from Trademarks with Natural Language Processing Al...AI-SDV 2022: New Insights from Trademarks with Natural Language Processing Al...
AI-SDV 2022: New Insights from Trademarks with Natural Language Processing Al...Dr. Haxel Consult
 
AI-SDV 2022: Extracting information from tables in documents Holger Keibel (K...
AI-SDV 2022: Extracting information from tables in documents Holger Keibel (K...AI-SDV 2022: Extracting information from tables in documents Holger Keibel (K...
AI-SDV 2022: Extracting information from tables in documents Holger Keibel (K...Dr. Haxel Consult
 
AI-SDV 2022: Scientific publishing in the age of data mining and artificial i...
AI-SDV 2022: Scientific publishing in the age of data mining and artificial i...AI-SDV 2022: Scientific publishing in the age of data mining and artificial i...
AI-SDV 2022: Scientific publishing in the age of data mining and artificial i...Dr. Haxel Consult
 
AI-SDV 2022: AI developments and usability Linus Wretblad (IPscreener / Uppdr...
AI-SDV 2022: AI developments and usability Linus Wretblad (IPscreener / Uppdr...AI-SDV 2022: AI developments and usability Linus Wretblad (IPscreener / Uppdr...
AI-SDV 2022: AI developments and usability Linus Wretblad (IPscreener / Uppdr...Dr. Haxel Consult
 
AI-SDV 2022: Where’s the one about…? Looney Tunes® Revisited Jay Ven Eman (CE...
AI-SDV 2022: Where’s the one about…? Looney Tunes® Revisited Jay Ven Eman (CE...AI-SDV 2022: Where’s the one about…? Looney Tunes® Revisited Jay Ven Eman (CE...
AI-SDV 2022: Where’s the one about…? Looney Tunes® Revisited Jay Ven Eman (CE...Dr. Haxel Consult
 
AI-SDV 2022: Copyright Clearance Center
AI-SDV 2022: Copyright Clearance CenterAI-SDV 2022: Copyright Clearance Center
AI-SDV 2022: Copyright Clearance CenterDr. Haxel Consult
 
AI-SDV 2022: New Product Introductions: CENTREDOC
AI-SDV 2022: New Product Introductions: CENTREDOCAI-SDV 2022: New Product Introductions: CENTREDOC
AI-SDV 2022: New Product Introductions: CENTREDOCDr. Haxel Consult
 
AI-SDV 2022: Possibilities and limitations of AI-boosted multi-categorization...
AI-SDV 2022: Possibilities and limitations of AI-boosted multi-categorization...AI-SDV 2022: Possibilities and limitations of AI-boosted multi-categorization...
AI-SDV 2022: Possibilities and limitations of AI-boosted multi-categorization...Dr. Haxel Consult
 
AI-SDV 2022: Big data analytics platform at Bayer – Turning bits into insight...
AI-SDV 2022: Big data analytics platform at Bayer – Turning bits into insight...AI-SDV 2022: Big data analytics platform at Bayer – Turning bits into insight...
AI-SDV 2022: Big data analytics platform at Bayer – Turning bits into insight...Dr. Haxel Consult
 

Plus de Dr. Haxel Consult (20)

AI-SDV 2022: Henry Chang Patent Intelligence and Engineering Management
AI-SDV 2022: Henry Chang Patent Intelligence and Engineering ManagementAI-SDV 2022: Henry Chang Patent Intelligence and Engineering Management
AI-SDV 2022: Henry Chang Patent Intelligence and Engineering Management
 
AI-SDV 2022: Creation and updating of large Knowledge Graphs through NLP Anal...
AI-SDV 2022: Creation and updating of large Knowledge Graphs through NLP Anal...AI-SDV 2022: Creation and updating of large Knowledge Graphs through NLP Anal...
AI-SDV 2022: Creation and updating of large Knowledge Graphs through NLP Anal...
 
AI-SDV 2022: The race to net zero: Tracking the green industrial revolution t...
AI-SDV 2022: The race to net zero: Tracking the green industrial revolution t...AI-SDV 2022: The race to net zero: Tracking the green industrial revolution t...
AI-SDV 2022: The race to net zero: Tracking the green industrial revolution t...
 
AI-SDV 2022: Accommodating the Deep Learning Revolution by a Development Proc...
AI-SDV 2022: Accommodating the Deep Learning Revolution by a Development Proc...AI-SDV 2022: Accommodating the Deep Learning Revolution by a Development Proc...
AI-SDV 2022: Accommodating the Deep Learning Revolution by a Development Proc...
 
AI-SDV 2022: Domain Knowledge makes Artificial Intelligence Smart Linda Ander...
AI-SDV 2022: Domain Knowledge makes Artificial Intelligence Smart Linda Ander...AI-SDV 2022: Domain Knowledge makes Artificial Intelligence Smart Linda Ander...
AI-SDV 2022: Domain Knowledge makes Artificial Intelligence Smart Linda Ander...
 
AI-SDV 2022: Embedding-based Search Vs. Relevancy Search: comparing the new w...
AI-SDV 2022: Embedding-based Search Vs. Relevancy Search: comparing the new w...AI-SDV 2022: Embedding-based Search Vs. Relevancy Search: comparing the new w...
AI-SDV 2022: Embedding-based Search Vs. Relevancy Search: comparing the new w...
 
AI-SDV 2022: Rolling out web crawling at Boehringer Ingelheim - 10 years of e...
AI-SDV 2022: Rolling out web crawling at Boehringer Ingelheim - 10 years of e...AI-SDV 2022: Rolling out web crawling at Boehringer Ingelheim - 10 years of e...
AI-SDV 2022: Rolling out web crawling at Boehringer Ingelheim - 10 years of e...
 
AI-SDV 2022: Machine learning based patent categorization: A success story in...
AI-SDV 2022: Machine learning based patent categorization: A success story in...AI-SDV 2022: Machine learning based patent categorization: A success story in...
AI-SDV 2022: Machine learning based patent categorization: A success story in...
 
AI-SDV 2022: Machine learning based patent categorization: A success story in...
AI-SDV 2022: Machine learning based patent categorization: A success story in...AI-SDV 2022: Machine learning based patent categorization: A success story in...
AI-SDV 2022: Machine learning based patent categorization: A success story in...
 
AI-SDV 2022: Finding the WHAT – Will AI help? Nils Newman (Search Technology,...
AI-SDV 2022: Finding the WHAT – Will AI help? Nils Newman (Search Technology,...AI-SDV 2022: Finding the WHAT – Will AI help? Nils Newman (Search Technology,...
AI-SDV 2022: Finding the WHAT – Will AI help? Nils Newman (Search Technology,...
 
AI-SDV 2022: New Insights from Trademarks with Natural Language Processing Al...
AI-SDV 2022: New Insights from Trademarks with Natural Language Processing Al...AI-SDV 2022: New Insights from Trademarks with Natural Language Processing Al...
AI-SDV 2022: New Insights from Trademarks with Natural Language Processing Al...
 
AI-SDV 2022: Extracting information from tables in documents Holger Keibel (K...
AI-SDV 2022: Extracting information from tables in documents Holger Keibel (K...AI-SDV 2022: Extracting information from tables in documents Holger Keibel (K...
AI-SDV 2022: Extracting information from tables in documents Holger Keibel (K...
 
AI-SDV 2022: Scientific publishing in the age of data mining and artificial i...
AI-SDV 2022: Scientific publishing in the age of data mining and artificial i...AI-SDV 2022: Scientific publishing in the age of data mining and artificial i...
AI-SDV 2022: Scientific publishing in the age of data mining and artificial i...
 
AI-SDV 2022: AI developments and usability Linus Wretblad (IPscreener / Uppdr...
AI-SDV 2022: AI developments and usability Linus Wretblad (IPscreener / Uppdr...AI-SDV 2022: AI developments and usability Linus Wretblad (IPscreener / Uppdr...
AI-SDV 2022: AI developments and usability Linus Wretblad (IPscreener / Uppdr...
 
AI-SDV 2022: Where’s the one about…? Looney Tunes® Revisited Jay Ven Eman (CE...
AI-SDV 2022: Where’s the one about…? Looney Tunes® Revisited Jay Ven Eman (CE...AI-SDV 2022: Where’s the one about…? Looney Tunes® Revisited Jay Ven Eman (CE...
AI-SDV 2022: Where’s the one about…? Looney Tunes® Revisited Jay Ven Eman (CE...
 
AI-SDV 2022: Copyright Clearance Center
AI-SDV 2022: Copyright Clearance CenterAI-SDV 2022: Copyright Clearance Center
AI-SDV 2022: Copyright Clearance Center
 
AI-SDV 2022: Lighthouse IP
AI-SDV 2022: Lighthouse IPAI-SDV 2022: Lighthouse IP
AI-SDV 2022: Lighthouse IP
 
AI-SDV 2022: New Product Introductions: CENTREDOC
AI-SDV 2022: New Product Introductions: CENTREDOCAI-SDV 2022: New Product Introductions: CENTREDOC
AI-SDV 2022: New Product Introductions: CENTREDOC
 
AI-SDV 2022: Possibilities and limitations of AI-boosted multi-categorization...
AI-SDV 2022: Possibilities and limitations of AI-boosted multi-categorization...AI-SDV 2022: Possibilities and limitations of AI-boosted multi-categorization...
AI-SDV 2022: Possibilities and limitations of AI-boosted multi-categorization...
 
AI-SDV 2022: Big data analytics platform at Bayer – Turning bits into insight...
AI-SDV 2022: Big data analytics platform at Bayer – Turning bits into insight...AI-SDV 2022: Big data analytics platform at Bayer – Turning bits into insight...
AI-SDV 2022: Big data analytics platform at Bayer – Turning bits into insight...
 

Dernier

%in Harare+277-882-255-28 abortion pills for sale in Harare
%in Harare+277-882-255-28 abortion pills for sale in Harare%in Harare+277-882-255-28 abortion pills for sale in Harare
%in Harare+277-882-255-28 abortion pills for sale in Hararemasabamasaba
 
Crypto Cloud Review - How To Earn Up To $500 Per DAY Of Bitcoin 100% On AutoP...
Crypto Cloud Review - How To Earn Up To $500 Per DAY Of Bitcoin 100% On AutoP...Crypto Cloud Review - How To Earn Up To $500 Per DAY Of Bitcoin 100% On AutoP...
Crypto Cloud Review - How To Earn Up To $500 Per DAY Of Bitcoin 100% On AutoP...SelfMade bd
 
%in Midrand+277-882-255-28 abortion pills for sale in midrand
%in Midrand+277-882-255-28 abortion pills for sale in midrand%in Midrand+277-882-255-28 abortion pills for sale in midrand
%in Midrand+277-882-255-28 abortion pills for sale in midrandmasabamasaba
 
Unlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language ModelsUnlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language Modelsaagamshah0812
 
%in ivory park+277-882-255-28 abortion pills for sale in ivory park
%in ivory park+277-882-255-28 abortion pills for sale in ivory park %in ivory park+277-882-255-28 abortion pills for sale in ivory park
%in ivory park+277-882-255-28 abortion pills for sale in ivory park masabamasaba
 
8257 interfacing 2 in microprocessor for btech students
8257 interfacing 2 in microprocessor for btech students8257 interfacing 2 in microprocessor for btech students
8257 interfacing 2 in microprocessor for btech studentsHimanshiGarg82
 
%in tembisa+277-882-255-28 abortion pills for sale in tembisa
%in tembisa+277-882-255-28 abortion pills for sale in tembisa%in tembisa+277-882-255-28 abortion pills for sale in tembisa
%in tembisa+277-882-255-28 abortion pills for sale in tembisamasabamasaba
 
Payment Gateway Testing Simplified_ A Step-by-Step Guide for Beginners.pdf
Payment Gateway Testing Simplified_ A Step-by-Step Guide for Beginners.pdfPayment Gateway Testing Simplified_ A Step-by-Step Guide for Beginners.pdf
Payment Gateway Testing Simplified_ A Step-by-Step Guide for Beginners.pdfkalichargn70th171
 
Chinsurah Escorts ☎️8617697112 Starting From 5K to 15K High Profile Escorts ...
Chinsurah Escorts ☎️8617697112  Starting From 5K to 15K High Profile Escorts ...Chinsurah Escorts ☎️8617697112  Starting From 5K to 15K High Profile Escorts ...
Chinsurah Escorts ☎️8617697112 Starting From 5K to 15K High Profile Escorts ...Nitya salvi
 
%in Durban+277-882-255-28 abortion pills for sale in Durban
%in Durban+277-882-255-28 abortion pills for sale in Durban%in Durban+277-882-255-28 abortion pills for sale in Durban
%in Durban+277-882-255-28 abortion pills for sale in Durbanmasabamasaba
 
%in Bahrain+277-882-255-28 abortion pills for sale in Bahrain
%in Bahrain+277-882-255-28 abortion pills for sale in Bahrain%in Bahrain+277-882-255-28 abortion pills for sale in Bahrain
%in Bahrain+277-882-255-28 abortion pills for sale in Bahrainmasabamasaba
 
The title is not connected to what is inside
The title is not connected to what is insideThe title is not connected to what is inside
The title is not connected to what is insideshinachiaurasa2
 
AI & Machine Learning Presentation Template
AI & Machine Learning Presentation TemplateAI & Machine Learning Presentation Template
AI & Machine Learning Presentation TemplatePresentation.STUDIO
 
AI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
AI Mastery 201: Elevating Your Workflow with Advanced LLM TechniquesAI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
AI Mastery 201: Elevating Your Workflow with Advanced LLM TechniquesVictorSzoltysek
 
%in Lydenburg+277-882-255-28 abortion pills for sale in Lydenburg
%in Lydenburg+277-882-255-28 abortion pills for sale in Lydenburg%in Lydenburg+277-882-255-28 abortion pills for sale in Lydenburg
%in Lydenburg+277-882-255-28 abortion pills for sale in Lydenburgmasabamasaba
 
%+27788225528 love spells in Colorado Springs Psychic Readings, Attraction sp...
%+27788225528 love spells in Colorado Springs Psychic Readings, Attraction sp...%+27788225528 love spells in Colorado Springs Psychic Readings, Attraction sp...
%+27788225528 love spells in Colorado Springs Psychic Readings, Attraction sp...masabamasaba
 
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...harshavardhanraghave
 
The Ultimate Test Automation Guide_ Best Practices and Tips.pdf
The Ultimate Test Automation Guide_ Best Practices and Tips.pdfThe Ultimate Test Automation Guide_ Best Practices and Tips.pdf
The Ultimate Test Automation Guide_ Best Practices and Tips.pdfkalichargn70th171
 

Dernier (20)

%in Harare+277-882-255-28 abortion pills for sale in Harare
%in Harare+277-882-255-28 abortion pills for sale in Harare%in Harare+277-882-255-28 abortion pills for sale in Harare
%in Harare+277-882-255-28 abortion pills for sale in Harare
 
Crypto Cloud Review - How To Earn Up To $500 Per DAY Of Bitcoin 100% On AutoP...
Crypto Cloud Review - How To Earn Up To $500 Per DAY Of Bitcoin 100% On AutoP...Crypto Cloud Review - How To Earn Up To $500 Per DAY Of Bitcoin 100% On AutoP...
Crypto Cloud Review - How To Earn Up To $500 Per DAY Of Bitcoin 100% On AutoP...
 
%in Midrand+277-882-255-28 abortion pills for sale in midrand
%in Midrand+277-882-255-28 abortion pills for sale in midrand%in Midrand+277-882-255-28 abortion pills for sale in midrand
%in Midrand+277-882-255-28 abortion pills for sale in midrand
 
Unlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language ModelsUnlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language Models
 
%in ivory park+277-882-255-28 abortion pills for sale in ivory park
%in ivory park+277-882-255-28 abortion pills for sale in ivory park %in ivory park+277-882-255-28 abortion pills for sale in ivory park
%in ivory park+277-882-255-28 abortion pills for sale in ivory park
 
8257 interfacing 2 in microprocessor for btech students
8257 interfacing 2 in microprocessor for btech students8257 interfacing 2 in microprocessor for btech students
8257 interfacing 2 in microprocessor for btech students
 
%in tembisa+277-882-255-28 abortion pills for sale in tembisa
%in tembisa+277-882-255-28 abortion pills for sale in tembisa%in tembisa+277-882-255-28 abortion pills for sale in tembisa
%in tembisa+277-882-255-28 abortion pills for sale in tembisa
 
Payment Gateway Testing Simplified_ A Step-by-Step Guide for Beginners.pdf
Payment Gateway Testing Simplified_ A Step-by-Step Guide for Beginners.pdfPayment Gateway Testing Simplified_ A Step-by-Step Guide for Beginners.pdf
Payment Gateway Testing Simplified_ A Step-by-Step Guide for Beginners.pdf
 
Chinsurah Escorts ☎️8617697112 Starting From 5K to 15K High Profile Escorts ...
Chinsurah Escorts ☎️8617697112  Starting From 5K to 15K High Profile Escorts ...Chinsurah Escorts ☎️8617697112  Starting From 5K to 15K High Profile Escorts ...
Chinsurah Escorts ☎️8617697112 Starting From 5K to 15K High Profile Escorts ...
 
%in Durban+277-882-255-28 abortion pills for sale in Durban
%in Durban+277-882-255-28 abortion pills for sale in Durban%in Durban+277-882-255-28 abortion pills for sale in Durban
%in Durban+277-882-255-28 abortion pills for sale in Durban
 
%in Bahrain+277-882-255-28 abortion pills for sale in Bahrain
%in Bahrain+277-882-255-28 abortion pills for sale in Bahrain%in Bahrain+277-882-255-28 abortion pills for sale in Bahrain
%in Bahrain+277-882-255-28 abortion pills for sale in Bahrain
 
The title is not connected to what is inside
The title is not connected to what is insideThe title is not connected to what is inside
The title is not connected to what is inside
 
CHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
 
AI & Machine Learning Presentation Template
AI & Machine Learning Presentation TemplateAI & Machine Learning Presentation Template
AI & Machine Learning Presentation Template
 
AI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
AI Mastery 201: Elevating Your Workflow with Advanced LLM TechniquesAI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
AI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
 
%in Lydenburg+277-882-255-28 abortion pills for sale in Lydenburg
%in Lydenburg+277-882-255-28 abortion pills for sale in Lydenburg%in Lydenburg+277-882-255-28 abortion pills for sale in Lydenburg
%in Lydenburg+277-882-255-28 abortion pills for sale in Lydenburg
 
Microsoft AI Transformation Partner Playbook.pdf
Microsoft AI Transformation Partner Playbook.pdfMicrosoft AI Transformation Partner Playbook.pdf
Microsoft AI Transformation Partner Playbook.pdf
 
%+27788225528 love spells in Colorado Springs Psychic Readings, Attraction sp...
%+27788225528 love spells in Colorado Springs Psychic Readings, Attraction sp...%+27788225528 love spells in Colorado Springs Psychic Readings, Attraction sp...
%+27788225528 love spells in Colorado Springs Psychic Readings, Attraction sp...
 
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...
 
The Ultimate Test Automation Guide_ Best Practices and Tips.pdf
The Ultimate Test Automation Guide_ Best Practices and Tips.pdfThe Ultimate Test Automation Guide_ Best Practices and Tips.pdf
The Ultimate Test Automation Guide_ Best Practices and Tips.pdf
 

II-SDV 2014 The Challenges of Managing “Big Data” in the Patent Field: Patents for business (Olivier Huc, Minesoft, UK)

  • 1. The Challenges of Managing “Big Data” in the Patent Field 14-15 April 2014, Nice Olivier Huc
  • 2. Specialists in Patent Information Building Intelligent Patent Information Solutions since 1996 What we do Trusted by IP experts Worldwide Corporations, National Patent Offices, Patent Attorneys and Patent Search Firms worldwide International Customer Support Global client base With Offices and Support across Europe North America, and Asia
  • 3. Patent Families Analytics Quality Control Fast Search Legal Status Review Alerts • 23 Full Text Collections • 48 Million Families • 103 Issuing Authorities • IPC, CPC US and JP classes • Quality Controlled content • Normalised data
  • 4. 3 Patent Data Myths • Myth #1: Patent data is just another type of “Big Data” • Myth #2: Patent Data is handled automatically • Myth #3: Patent Data is consistent worldwide
  • 5. • Patent Data volume might be smaller, data is more complex (languages, text, fields) • Patent data is not retrieved on the fly, it is hosted, indexed and optimized • There are multiple sources with overlap • Data quality is a major issue • Users have a low tolerance for errors The reality
  • 6. • Total data volume exceeds 35 Tb • 49 million families and 103 publishing bodies • 95 million publications • 47 million full-texts including over 23 million non-Latin into English machine translations • 54 million clipped images and 45 million complete sets of drawings Database Facts
  • 7. • Minesoft and RWS host their own data center, located just outside of London • Control • Confidentiality • Reactivity • Speed • Distributed search engine • Continuous data update and indexing => no need to interrupt or restart the online services, + new data immediately searchable Hardware & Search Engine
  • 8. • Multiple data sources: • DOCDB weekly feeds (EPO) • National Patent Offices • Commercial collections • External information (such as National Registers) • Despite the complexity, having multiple sources for the same country is a great advantage: • Complementarity • Improved quality • Security • Speed Sources
  • 9. • We perform stringent quality checks • Human • Programmatic • Manual checks on some source data collections as they arrive: e.g. Indian (IN), Thai (TH) and The Philippines (PH) • Errors in data are identified programmatically by strict pre-set parameters which are then manually corrected by our data team • e.g. IC8=AO1G1/00 • Although we follow EPO’s INPADOC rules for families (extended), we recreate all our families to ensure consistency Data Quality
  • 10. Adding extra value to PatBase data: • Families are automatically reviewed and, then if necessary, rebuilt when we receive new and/or corrected information (e.g. priority) • Tagging of examples, paragraphs and claims is done in order to facilitate searching specific sections of text • Machine translation: when a family gets new text, the family is reassessed to see if a machine translation needs to be added/replaced/deleted. Data Quality
  • 11. TW AN/PR inputs TW AN/PR outputs 083303675 Emperor year conversion & Type of application TW19940303675F 092128911 TW20030128911 092128911 TW20040201682U US AN/PR inputs US AN/PR outputs US29/356,858 20100303 Type of application & Year US20100356858F 1301618611 A US20110016186 AT AN/PR inputs AT AN/PR outputs A 709/95 Type of application & Year AT19950000709 GM647/96 AT19960000647U Standardisation of patent data Formatting application and priority information
  • 12. • Formatting patent numbers and kind codes • Formatting dates Thailand use Buddhist years (Gregorian calendar year plus 543) US date format - 2011/09/02 (9 February 2011) European date format – 2011/09/02 (2 September 2011) 2007 Standardisation of patent data
  • 13. The EPO standardize names to assist searching. PatBase contains both standard and non-standard names. Standard name assigned by the EPO Non-standard name consists of whatever is filed or published on the patent Standard Non-standard PIRELLI IND PIRELI SPA PIRELLI IND PIRELLA SPA PIRELLI IND PIRELLE S P A PIRELLI IND PIRELLI DPA PIRELLI IND PIRELLI S p A PIRELLI IND PIRELLI S A PIRELLI IND PIRELLI S P A PIRELLI IND PIRELLI S P A FIRMA PIRELLI IND PIRELLI S P A IT PIRELLI IND PIRELLI S P CA PIRELLI IND PIRELLI SPA IT PIRELLI IND PIRELLI SPP PIRELLI IND PIRELLU SPA PIRELLI IND PIRELLY SPA This is a small example set of the non-standard names that The EPO assign the standard name ‘Pirelli’ There are currently 188 non-standard names for the standard name ‘Pirelli’ Standardization of patent data
  • 14. • Date Formats • All fields, e.g. patent classifications, assignees, text etc. have set parameters. Where these are not matched data errors are identified for manual editing. • If a text is illegible (we have programmatic systems in place measuring this) it will not be allowed into the database and be identified as requiring manual attention (often manual typing). • Character conversions We have thousands of symbol / letter conversions in our programs: • & is replaced by and • œ is replaced by oe • β is replaced by ss Data Improvements
  • 15. Insertion of paragraph breaks and paragraph numbers Data Improvements Output in PatBase Source text
  • 16. • Errors appear in source data so manual checks are essential • Example – Granted patent information from the Indian Patent Office Journal. Three different inventions have incorrectly been given the same publication number Manual checks IN000008
  • 17. Data quality issues On the Thai patent office website - the same publication number is used for two different applications Patent copy for TH48405 A In PatBase Application number: TH19981004295 Publication number: TH48406 A Application number: TH19981002185 Publication number: TH48405 A Wrong number Correct number Manual checks
  • 18. • Acquiring data from multiple sources enables us to supplement records, but also alerts us to errors thus ensuring accuracy KR20010012826 A – Glial Cell Line- Derived Neurotrophic Factor Receptors KR20010112826 A – Single phase six pole DC brushless axial fan motor of transistor type Source EPO – Error in information This EPO record is a combination of two inventions. The publication number does not match with the invention. Identifying data errors
  • 19. Incorrect data received from source In cases such as these we correct the error in PatBase and inform the EPO NULL values were supplied in the EPO’s DOCDB file as Applicants Identifying data errors
  • 20. Example of an incorrect assignment from the USPTO PatBase family 41683901 Excerpt from USPTO assignments database Identifying data errors
  • 21. Translations • Principle: the English text of an equivalent is always better than the Machine Translation • All non-latin Texts are machine translated into English and indexed when added to PatBase • On a rolling basis we re-translate texts to benefit from the continuous improvements of translation engines
  • 22. Machine translation • Machine translations are made as data is added, removed / rebuilt. This is all done before indexing. • We run a rolling re-translate and re-index program to optimize the quality of our machine translated full-text Original translation, Thai into English Re-translation, Thai into English Original translation, Thai into English Re-translation, Thai into English Translations
  • 23. Re-translation Korean into EnglishOriginal translation, Korean into English Translations
  • 24. Assignee translations • Non-latin assignees are indexed • Non-latin assignees are also translated • First 10,000 CN and JP assignees have been manually translated by RWS • All others are Machine Translated until an “official” Latin names appear in the family
  • 25. Cross-lingual Tool • Initially developed by WIPO, CLIR (Cross Lingual Information Retrieval) allows our users to generate multilingual searches • Using an advanced statistical text analysis system based on the PCT corpus, the cross-lingual search tool identifies variants in multiple languages for search terms entered by the user. => Better translation – translated words originate from PCT applications
  • 26. • Source: INPADOC • All legal status events are categorised with a PRS code • Challenge: 2628 different PRS codes, some no longer in use • Solution: Grouping similar legal events together: Legal Status Reassignment Deemed Withdrawn/Abandoned Examined Renewal Fees Paid Granted Lapsed/Expired/Ceased/Dead Licence Non-Entry into National Phase National Phase Entry Opposition Filed/Request for revocation Published Restored/Reinstated/Amended Revoked/Rejected/Annuled/Invalid Withdrawn/Abandoned/Terminated/Void
  • 28. • Most patent databases are structured and optimized for Patent Searching, not for Analytics • At Minesoft, we developed a special database with proprietary meta tags dedicated to the analytics • Coverage is important – Beware of data gaps • Importance of a web service (API) • Importance of incorporating your own custom data or legal status information in your analysis Analytics
  • 29.
  • 30. Thank you PatBase celebrates its 10th anniversary Olivier Huc – olivier@minesoft.com