SlideShare une entreprise Scribd logo
1  sur  44
Télécharger pour lire hors ligne
Analyzing Patent Full-Text
A Study
1 April 7, 2014
Analysing Patent Full Text
Richard Gynn - LexisNexis
Analyzing Patent Full-Text
A Study
2 April 7, 2014
Agenda
1) Full Text Availability
2) Analyzing full text
- Discussion/considerations
- Big picture analysis
- Detailed analysis - Study
3) Conclusions
Full Text content available from vendors has evolved to a point
where most of the top publishing authorities are readily available.
Analysing Patent Full Text. Availability
Full Text Availability – Top 10 Publishing Authorities (available from most big vendors)
April 7, 2014
Analyzing Patent Full-Text
A Study
4
China, Korea, Japan are not
the big deal they used to be!
Text can be available to analyse in English
Full Text Availability – Authorities available from at least one vendor
April 7, 2014
Analyzing Patent Full-Text
A Study
5
Full Text Availability by volume- > 100k publications
April 7, 2014
Analyzing Patent Full-Text
A Study
6
0
5
10
15
20
25
JP
US
CN
DE
EP
KR
GB
FR
WO
CA
AU
TW
SU
ES
AT
SE
IT
RU
CH
NL
BE
FI
BR
DK
IN
NO
PL
IL
DD
ZA
MX
HU
PT
CS
AR
IE
NZ
CZ
GR
Millions
Full Text Availability by volume- > 100k publications
April 7, 2014
Analyzing Patent Full-Text
A Study
7
0
5
10
15
20
25
JP
US
CN
DE
EP
KR
GB
FR
WO
CA
AU
TW
SU
ES
AT
SE
IT
RU
CH
NL
BE
FI
BR
DK
IN
NO
PL
IL
DD
ZA
MX
HU
PT
CS
AR
IE
NZ
CZ
GR
Millions
31 of these 39 are currently
available from vendors
Account for vast majority of total volume
Full Text Availability by volume - < 100k publications
April 7, 2014
Analyzing Patent Full-Text
A Study
8
0
10,000
20,000
30,000
40,000
50,000
60,000
70,000
80,000
90,000
100,000
HK
YU
RO
SG
TR
MY
LU
BG
PH
UA
TH
CL
EA
ID
HR
SK
CO
SI
VN
PE
UY
OA
EG
IS
EC
Full Text Availability by volume - < 100k publications
April 7, 2014
Analyzing Patent Full-Text
A Study
9
0
10,000
20,000
30,000
40,000
50,000
60,000
70,000
80,000
90,000
100,000
HK
YU
RO
SG
TR
MY
LU
BG
PH
UA
TH
CL
EA
ID
HR
SK
CO
SI
VN
PE
UY
OA
EG
IS
EC
Much smaller amounts currently
available from vendors ~ 300,000
If all were to become available would add about 1.5% to full text
that is currently available, e.g. equivalent to Spain or Taiwan
Full Text Availability by volume - < 10k publications
April 7, 2014
Analyzing Patent Full-Text
A Study
10
0
1,000
2,000
3,000
4,000
5,000
6,000
7,000
8,000
9,000
10,000
MA
AP
VE
EE
LV
GT
CU
LT
MD
CR
PA
CY
DO
MC
ZM
ZW
SV
SM
JO
PY
GE
DZ
KE
MT
HN
MW
NI
ME
TJ
GC
BO
MN
BA
KZ
BY
TT
Full Text Availability by volume - < 10k publications
April 7, 2014
Analyzing Patent Full-Text
A Study
11
0
1,000
2,000
3,000
4,000
5,000
6,000
7,000
8,000
9,000
10,000
MA
AP
VE
EE
LV
GT
CU
LT
MD
CR
PA
CY
DO
MC
ZM
ZW
SV
SM
JO
PY
GE
DZ
KE
MT
HN
MW
NI
ME
TJ
GC
BO
MN
BA
KZ
BY
TT
One currently available from vendors
In total these would add about 0.1% to full text that is currently available
Analyzing Patent Full-Text
A Study
12 April 7, 2014
• Are we nearly there yet?
• There’s a lot of full text available to make use
• Most vendors have a significant volumes
• Rapidly diminishing returns for each authority added
Full Text Availability
Bringing You The World
• We are already in a good place
• In terms of % availability at least
Analysing Patent Full Text. Discussion/considerations
Analyzing Patent Full-Text
A Study
14 April 7, 2014
Full Text – What Is It?
Full-text – what is it?
• Everything of course?!
― …will concentrate on:
Considerations
April 7, 2014
Analyzing Patent Full-Text
A Study
15
There’s clearly a lot out
there, so why don’t we see
so much analysis of patent
full text?
Analyzing Patent Full-Text
A Study
16 April 7, 2014
Considerations - Language
• Can only compare like for like in same language
…non-Latin character issues too
• Noise – Patent full-text likes to state things like
…the complete opposite of what it’s about!
Considerations - Language
How I might introduce myself
…If I was a patent!
나는 사람들이 밥, 앤드류, 데이브 앨런 같은
이름이, 이름이. 나는 밥, 앤드류, 데이브 나
앨런 아니에요. 내 이름은 리처드입니다
I have a name, people have names like
Bob, Andrew, Dave and Alan. I’m not
Bob, Andrew, Dave or Alan.
My name is Richard
私は人々がボブ、アンドリュー、デイブとアラ私は人々がボブ、アンドリュー、デイブとアラ私は人々がボブ、アンドリュー、デイブとアラ私は人々がボブ、アンドリュー、デイブとアラ
ンのような名前を持っている、名前を持っていンのような名前を持っている、名前を持っていンのような名前を持っている、名前を持っていンのような名前を持っている、名前を持ってい
ます。私はボブ、アンドリュー、デイブかアランます。私はボブ、アンドリュー、デイブかアランます。私はボブ、アンドリュー、デイブかアランます。私はボブ、アンドリュー、デイブかアラン
ないよ。私ないよ。私ないよ。私ないよ。私の名の名の名の名前はリチャードです前はリチャードです前はリチャードです前はリチャードです
Considerations
Other Considerations:
• Massive amounts of data
– Time?
– How deal with ?
• Will it contain anything useful?
/benefit outweigh effort?
April 7, 2014
Analyzing Patent Full-Text
A Study
17
• Tools
– Big picture?
– Details?
Big Picture - Landscape Analysis
April 7, 2014
Analyzing Patent Full-Text
A Study
18
Big picture, topographic mapping (Discussion)
Here more full text could provide:
• Broader country analysis (often full-text not available)
• More consistency across authorities – e.g. more claims
― Compare like for like, e.g. not claims, title & abstract against title
• Full text more useful for details
• Themes/commonalities easier to
find using claims, title, abstract
• Whilst useful, vast majority of
landscape analysis done elsewhere,
…i.e. details rather than big picture
Analysing Patent Full Text. Study
The Details - Study
Detailed analysis – looking for what?
• New/emerging, different
• Competitive/market comparisons
• Strength, weakness, opportunity, threat
April 7, 2014
Analyzing Patent Full-Text
A Study
20
What can I find using the full
text that I couldn’t using title,
abstract and bibliography?
The Details - The Technology
April 7, 2014
Analyzing Patent Full-Text
A Study
21
Terahertz analysis, e.g. imaging, spectroscopy?
Terahertz radiation - between Infra-red and microwave
The Details - The Search
April 7, 2014
Analyzing Patent Full-Text
A Study
22
• Broad Strategy
― Analysis IPCs + Terahertz
Radiation Synonyms
― Keyword Terahertz
Imaging & Spectroscopy
5,955 documents/3,365 families
Study - PatentOptimizer
Analyzing Patent Full-Text
A Study
23 April 7, 2014
Analysis Details:
• Small/emerging areas of 6-7 families
• Look at terms & phrases, parts, claim
elements (all numbers represent families)
PatentOptimizer™ Analysis of EP, PCT & US results
• English Translations
PatentOptimizer – Terms & Phrases
April 7, 2014
Analyzing Patent Full-Text
A Study
24
Diagnosis - General
PatentOptimizer – Terms & Phrases
April 7, 2014
Analyzing Patent Full-Text
A Study
25
Not found in Title, Abstract (or claims) –
All From Spectral Image Inc
Learned – Something seemingly unique to them
SAME DOCUMENTS
PatentOptimizer – Terms & Phrases
April 7, 2014
Analyzing Patent Full-Text
A Study
26
Not found in Title, Abstract (or claims) – All
monitoring vitamin K concentration in blood
Learned – A more recent (emerging?) use
Diagnosis - General
PatentOptimizer – Parts
April 7, 2014
Analyzing Patent Full-Text
A Study
27
Remote monitoring, e.g. of Bluetooth® headset user
Learned – Interesting, but not massively relevant result, would like to
investigate applications further
Diagnosis - general
PatentOptimizer – Claim Elements
April 7, 2014
Analyzing Patent Full-Text
A Study
28
Looking for infiltration or extravasation
during intravenous infusion
Learned – New possibly interesting area, seemingly
dominated by one organisation
Diagnosis – general
A61M – introducing remedies
Study - VantagePoint
Analyzing Patent Full-Text
A Study
29 April 7, 2014
Analysis Details:
• Data Statistics
• Terms uniquely appearing in full text
• Highly occurring terms used in small
numbers of documents
• Investigate terms unique to 2013
priority onward
Vantage Point Analysis of TotalPatent full text results
• English Translations
Vantage Point - Statistics
Very low percent of terms and words, available for
analysis are actually in the title and abstract
Title &
Abstract
• 42,614 words &
phrases
• 16,251 words
Claims
• ~132k words and
phrases not in
Title or Abstract
• ~44k words in
Title or Abstract
Full-text
• ~1.3M unique
words & phrases
• ~650k unique
words
April 7, 2014
Analyzing Patent Full-Text
A Study
30
Vantage Point – Terms only appearing in full text 2013 onwards
April 7, 2014
Analyzing Patent Full-Text
A Study
31
Vantage Point – Terms only appearing in full text 2013 onwards
April 7, 2014
Analyzing Patent Full-Text
A Study
32
Detection of tetracycline drug –
concern in resistance to antibiotics
Learned – New area (clearer language in full-text)
optical investigation
Vantage Point – Terms only appearing in full text 2013 onwards
April 7, 2014
Analyzing Patent Full-Text
A Study
33
Looking for gas hydrates (fracking)
Learned – New area (uncovered by more consistent
repetition in full text)
general investigation,
sampling
Analysing Patent Full Text. Conclusions
Findings
April 7, 2014
Analyzing Patent Full-Text
A Study
35
• Full text useful
• Claims less so (in this case)
Most words and phrases in the “full text”,
did not appear in Abstract & Title
• Text mined wasn’t necessarily applications, but pointed towards
• More consistent repetition in full text
Helped mainly find new/niche applications
• Probably wouldn’t have found other ways
Interesting companies & technologies to
look at further
Conclusions
Conclusions (Noise and huge amounts of info):
• Background did not really come in as an issue
• Used English translations to avoid language issues
• Most noise was from search results
• My judgement – about 50% proved somewhat
interesting upon further investigation
• Can this be automated/put into a process?
• 4/5+ family groupings seems to be about the
sweet spot
April 7, 2014
Analyzing Patent Full-Text
A Study
36
What More?
What more?
Further this:
• Life Sciences
• Define processes
Dedicated
machine?
• Detailed full-
text analysis
Study analysis
of parts
• Sellers,
inventors,
manufacturers
etc.
April 7, 2014
Analyzing Patent Full-Text
A Study
37
Easier than expected
More possible & better timescales
Questions
April 7, 2014
Analyzing Patent Full-Text
A Study
38
Analysing Patent Full Text. Study – Additional Examples
PatentOptimizer – Terms & Phrases
April 7, 2014
Analyzing Patent Full-Text
A Study
40
2 of 6 have tattoo in Abstract OR Title
(same if include claims)
Learned – THz radiation can be used for tattoo removal
Diagnosis, surgery - General
PatentOptimizer – Terms & Phrases
April 7, 2014
Analyzing Patent Full-Text
A Study
41
Not found in Abstract & Title
(One claimed -Optical Diagnostics)
Determining microorganism
presence/kind
PatentOptimizer – Claim Elements
April 7, 2014
Analyzing Patent Full-Text
A Study
42
SAME DOCUMENTS
Identifying/determining antimocrobial
resistance of Burkholderia Cepacia
Learned – Smaller more niche areas?
PatentOptimizer – Terms & Phrases
April 7, 2014
Analyzing Patent Full-Text
A Study
43
Not found in Title, Abstract (or claims) – All Some
detectors, some looking for heavy metal contamination
Learned – Some areas to investigate further?
PatentOptimizer – Claim Elements
April 7, 2014
Analyzing Patent Full-Text
A Study
44
Glucose Monitoring – Far-IR (5/7 have in Abstract & Title)
Learned – Not much more than from Title & Abstract
Blood measurement

Contenu connexe

Similaire à II-SDV 2014 Analysing Patent Full Text – Comparison against analysis of abstract and bibliographic data, and lessons learned (Richard Gynn - LexisNexis, UK)

Planning your assignment or essay
Planning your assignment or essayPlanning your assignment or essay
Planning your assignment or essayBIM Myanmar
 
21.02.22 Conducting your research - Preparation
21.02.22 Conducting your research - Preparation21.02.22 Conducting your research - Preparation
21.02.22 Conducting your research - PreparationLouise Douse
 
Lesson 4 secondary research 2
Lesson 4   secondary research 2Lesson 4   secondary research 2
Lesson 4 secondary research 2Kavita Parwani
 
Conducting a Literature Search & Writing Review Paper, Part 1: Systematic Rev...
Conducting a Literature Search & Writing Review Paper, Part 1: Systematic Rev...Conducting a Literature Search & Writing Review Paper, Part 1: Systematic Rev...
Conducting a Literature Search & Writing Review Paper, Part 1: Systematic Rev...Nader Ale Ebrahim
 
II-SDV 2013 Tweet Mining: Is it Useful and Should we Bother?
II-SDV 2013 Tweet Mining: Is it Useful and Should we Bother? II-SDV 2013 Tweet Mining: Is it Useful and Should we Bother?
II-SDV 2013 Tweet Mining: Is it Useful and Should we Bother? Dr. Haxel Consult
 
8 Elements In A Research Proposal
8 Elements In A Research Proposal8 Elements In A Research Proposal
8 Elements In A Research ProposalAzmi Latiff
 
INTRODUCTION-TO-RESEARCH-METHODOLOGY-2020.ppt
INTRODUCTION-TO-RESEARCH-METHODOLOGY-2020.pptINTRODUCTION-TO-RESEARCH-METHODOLOGY-2020.ppt
INTRODUCTION-TO-RESEARCH-METHODOLOGY-2020.pptfayan1
 
22.02.21 Conducting your research - Preparation
22.02.21 Conducting your research - Preparation22.02.21 Conducting your research - Preparation
22.02.21 Conducting your research - PreparationLouise Douse
 
How to Conduct a Literature Review (ISRAPM 2014)
How to Conduct a Literature Review  (ISRAPM 2014)How to Conduct a Literature Review  (ISRAPM 2014)
How to Conduct a Literature Review (ISRAPM 2014)Saeid Safari
 
ICIC 2016: Examining Funding Data to Predict the Future of Research
ICIC 2016: Examining Funding Data to Predict the Future of ResearchICIC 2016: Examining Funding Data to Predict the Future of Research
ICIC 2016: Examining Funding Data to Predict the Future of ResearchDr. Haxel Consult
 
Trial and error - student responses to different approaches of embedding info...
Trial and error - student responses to different approaches of embedding info...Trial and error - student responses to different approaches of embedding info...
Trial and error - student responses to different approaches of embedding info...IL Group (CILIP Information Literacy Group)
 
chapter_02_-_what_are_the_major_types_of_social_research_-_7 (1).ppt
chapter_02_-_what_are_the_major_types_of_social_research_-_7 (1).pptchapter_02_-_what_are_the_major_types_of_social_research_-_7 (1).ppt
chapter_02_-_what_are_the_major_types_of_social_research_-_7 (1).pptChandraBagasAlfian
 
Ms3 lesson 2_research and referencing
Ms3 lesson 2_research and referencingMs3 lesson 2_research and referencing
Ms3 lesson 2_research and referencinghowardeffinghammedia
 
Research and Knowledge Utilization. Symposium Training
Research and Knowledge Utilization. Symposium TrainingResearch and Knowledge Utilization. Symposium Training
Research and Knowledge Utilization. Symposium TrainingInternationalJournal24
 
Communities of interpretation2014
Communities of interpretation2014Communities of interpretation2014
Communities of interpretation2014John Griffiths
 
Taylor & Francis: Open Access Update
Taylor & Francis: Open Access UpdateTaylor & Francis: Open Access Update
Taylor & Francis: Open Access UpdateSIBiUSP
 

Similaire à II-SDV 2014 Analysing Patent Full Text – Comparison against analysis of abstract and bibliographic data, and lessons learned (Richard Gynn - LexisNexis, UK) (20)

Research proposal
Research proposalResearch proposal
Research proposal
 
Planning your assignment or essay
Planning your assignment or essayPlanning your assignment or essay
Planning your assignment or essay
 
defining research
defining researchdefining research
defining research
 
21.02.22 Conducting your research - Preparation
21.02.22 Conducting your research - Preparation21.02.22 Conducting your research - Preparation
21.02.22 Conducting your research - Preparation
 
Lesson 4 secondary research 2
Lesson 4   secondary research 2Lesson 4   secondary research 2
Lesson 4 secondary research 2
 
Conducting a Literature Search & Writing Review Paper, Part 1: Systematic Rev...
Conducting a Literature Search & Writing Review Paper, Part 1: Systematic Rev...Conducting a Literature Search & Writing Review Paper, Part 1: Systematic Rev...
Conducting a Literature Search & Writing Review Paper, Part 1: Systematic Rev...
 
II-SDV 2013 Tweet Mining: Is it Useful and Should we Bother?
II-SDV 2013 Tweet Mining: Is it Useful and Should we Bother? II-SDV 2013 Tweet Mining: Is it Useful and Should we Bother?
II-SDV 2013 Tweet Mining: Is it Useful and Should we Bother?
 
8 Elements In A Research Proposal
8 Elements In A Research Proposal8 Elements In A Research Proposal
8 Elements In A Research Proposal
 
INTRODUCTION-TO-RESEARCH-METHODOLOGY-2020.ppt
INTRODUCTION-TO-RESEARCH-METHODOLOGY-2020.pptINTRODUCTION-TO-RESEARCH-METHODOLOGY-2020.ppt
INTRODUCTION-TO-RESEARCH-METHODOLOGY-2020.ppt
 
22.02.21 Conducting your research - Preparation
22.02.21 Conducting your research - Preparation22.02.21 Conducting your research - Preparation
22.02.21 Conducting your research - Preparation
 
How to Conduct a Literature Review (ISRAPM 2014)
How to Conduct a Literature Review  (ISRAPM 2014)How to Conduct a Literature Review  (ISRAPM 2014)
How to Conduct a Literature Review (ISRAPM 2014)
 
ICIC 2016: Examining Funding Data to Predict the Future of Research
ICIC 2016: Examining Funding Data to Predict the Future of ResearchICIC 2016: Examining Funding Data to Predict the Future of Research
ICIC 2016: Examining Funding Data to Predict the Future of Research
 
Trial and error - student responses to different approaches of embedding info...
Trial and error - student responses to different approaches of embedding info...Trial and error - student responses to different approaches of embedding info...
Trial and error - student responses to different approaches of embedding info...
 
Fundamentals of Knowledge Translation and Exchange
Fundamentals of Knowledge Translation and ExchangeFundamentals of Knowledge Translation and Exchange
Fundamentals of Knowledge Translation and Exchange
 
chapter_02_-_what_are_the_major_types_of_social_research_-_7 (1).ppt
chapter_02_-_what_are_the_major_types_of_social_research_-_7 (1).pptchapter_02_-_what_are_the_major_types_of_social_research_-_7 (1).ppt
chapter_02_-_what_are_the_major_types_of_social_research_-_7 (1).ppt
 
Ms3 lesson 2_research and referencing
Ms3 lesson 2_research and referencingMs3 lesson 2_research and referencing
Ms3 lesson 2_research and referencing
 
Research and Knowledge Utilization. Symposium Training
Research and Knowledge Utilization. Symposium TrainingResearch and Knowledge Utilization. Symposium Training
Research and Knowledge Utilization. Symposium Training
 
Communities of interpretation2014
Communities of interpretation2014Communities of interpretation2014
Communities of interpretation2014
 
Research methods l1 3
Research methods l1 3Research methods l1 3
Research methods l1 3
 
Taylor & Francis: Open Access Update
Taylor & Francis: Open Access UpdateTaylor & Francis: Open Access Update
Taylor & Francis: Open Access Update
 

Plus de Dr. Haxel Consult

AI-SDV 2022: Henry Chang Patent Intelligence and Engineering Management
AI-SDV 2022: Henry Chang Patent Intelligence and Engineering ManagementAI-SDV 2022: Henry Chang Patent Intelligence and Engineering Management
AI-SDV 2022: Henry Chang Patent Intelligence and Engineering ManagementDr. Haxel Consult
 
AI-SDV 2022: Creation and updating of large Knowledge Graphs through NLP Anal...
AI-SDV 2022: Creation and updating of large Knowledge Graphs through NLP Anal...AI-SDV 2022: Creation and updating of large Knowledge Graphs through NLP Anal...
AI-SDV 2022: Creation and updating of large Knowledge Graphs through NLP Anal...Dr. Haxel Consult
 
AI-SDV 2022: The race to net zero: Tracking the green industrial revolution t...
AI-SDV 2022: The race to net zero: Tracking the green industrial revolution t...AI-SDV 2022: The race to net zero: Tracking the green industrial revolution t...
AI-SDV 2022: The race to net zero: Tracking the green industrial revolution t...Dr. Haxel Consult
 
AI-SDV 2022: Accommodating the Deep Learning Revolution by a Development Proc...
AI-SDV 2022: Accommodating the Deep Learning Revolution by a Development Proc...AI-SDV 2022: Accommodating the Deep Learning Revolution by a Development Proc...
AI-SDV 2022: Accommodating the Deep Learning Revolution by a Development Proc...Dr. Haxel Consult
 
AI-SDV 2022: Domain Knowledge makes Artificial Intelligence Smart Linda Ander...
AI-SDV 2022: Domain Knowledge makes Artificial Intelligence Smart Linda Ander...AI-SDV 2022: Domain Knowledge makes Artificial Intelligence Smart Linda Ander...
AI-SDV 2022: Domain Knowledge makes Artificial Intelligence Smart Linda Ander...Dr. Haxel Consult
 
AI-SDV 2022: Embedding-based Search Vs. Relevancy Search: comparing the new w...
AI-SDV 2022: Embedding-based Search Vs. Relevancy Search: comparing the new w...AI-SDV 2022: Embedding-based Search Vs. Relevancy Search: comparing the new w...
AI-SDV 2022: Embedding-based Search Vs. Relevancy Search: comparing the new w...Dr. Haxel Consult
 
AI-SDV 2022: Rolling out web crawling at Boehringer Ingelheim - 10 years of e...
AI-SDV 2022: Rolling out web crawling at Boehringer Ingelheim - 10 years of e...AI-SDV 2022: Rolling out web crawling at Boehringer Ingelheim - 10 years of e...
AI-SDV 2022: Rolling out web crawling at Boehringer Ingelheim - 10 years of e...Dr. Haxel Consult
 
AI-SDV 2022: Machine learning based patent categorization: A success story in...
AI-SDV 2022: Machine learning based patent categorization: A success story in...AI-SDV 2022: Machine learning based patent categorization: A success story in...
AI-SDV 2022: Machine learning based patent categorization: A success story in...Dr. Haxel Consult
 
AI-SDV 2022: Machine learning based patent categorization: A success story in...
AI-SDV 2022: Machine learning based patent categorization: A success story in...AI-SDV 2022: Machine learning based patent categorization: A success story in...
AI-SDV 2022: Machine learning based patent categorization: A success story in...Dr. Haxel Consult
 
AI-SDV 2022: Finding the WHAT – Will AI help? Nils Newman (Search Technology,...
AI-SDV 2022: Finding the WHAT – Will AI help? Nils Newman (Search Technology,...AI-SDV 2022: Finding the WHAT – Will AI help? Nils Newman (Search Technology,...
AI-SDV 2022: Finding the WHAT – Will AI help? Nils Newman (Search Technology,...Dr. Haxel Consult
 
AI-SDV 2022: New Insights from Trademarks with Natural Language Processing Al...
AI-SDV 2022: New Insights from Trademarks with Natural Language Processing Al...AI-SDV 2022: New Insights from Trademarks with Natural Language Processing Al...
AI-SDV 2022: New Insights from Trademarks with Natural Language Processing Al...Dr. Haxel Consult
 
AI-SDV 2022: Extracting information from tables in documents Holger Keibel (K...
AI-SDV 2022: Extracting information from tables in documents Holger Keibel (K...AI-SDV 2022: Extracting information from tables in documents Holger Keibel (K...
AI-SDV 2022: Extracting information from tables in documents Holger Keibel (K...Dr. Haxel Consult
 
AI-SDV 2022: Scientific publishing in the age of data mining and artificial i...
AI-SDV 2022: Scientific publishing in the age of data mining and artificial i...AI-SDV 2022: Scientific publishing in the age of data mining and artificial i...
AI-SDV 2022: Scientific publishing in the age of data mining and artificial i...Dr. Haxel Consult
 
AI-SDV 2022: AI developments and usability Linus Wretblad (IPscreener / Uppdr...
AI-SDV 2022: AI developments and usability Linus Wretblad (IPscreener / Uppdr...AI-SDV 2022: AI developments and usability Linus Wretblad (IPscreener / Uppdr...
AI-SDV 2022: AI developments and usability Linus Wretblad (IPscreener / Uppdr...Dr. Haxel Consult
 
AI-SDV 2022: Where’s the one about…? Looney Tunes® Revisited Jay Ven Eman (CE...
AI-SDV 2022: Where’s the one about…? Looney Tunes® Revisited Jay Ven Eman (CE...AI-SDV 2022: Where’s the one about…? Looney Tunes® Revisited Jay Ven Eman (CE...
AI-SDV 2022: Where’s the one about…? Looney Tunes® Revisited Jay Ven Eman (CE...Dr. Haxel Consult
 
AI-SDV 2022: Copyright Clearance Center
AI-SDV 2022: Copyright Clearance CenterAI-SDV 2022: Copyright Clearance Center
AI-SDV 2022: Copyright Clearance CenterDr. Haxel Consult
 
AI-SDV 2022: New Product Introductions: CENTREDOC
AI-SDV 2022: New Product Introductions: CENTREDOCAI-SDV 2022: New Product Introductions: CENTREDOC
AI-SDV 2022: New Product Introductions: CENTREDOCDr. Haxel Consult
 
AI-SDV 2022: Possibilities and limitations of AI-boosted multi-categorization...
AI-SDV 2022: Possibilities and limitations of AI-boosted multi-categorization...AI-SDV 2022: Possibilities and limitations of AI-boosted multi-categorization...
AI-SDV 2022: Possibilities and limitations of AI-boosted multi-categorization...Dr. Haxel Consult
 
AI-SDV 2022: Big data analytics platform at Bayer – Turning bits into insight...
AI-SDV 2022: Big data analytics platform at Bayer – Turning bits into insight...AI-SDV 2022: Big data analytics platform at Bayer – Turning bits into insight...
AI-SDV 2022: Big data analytics platform at Bayer – Turning bits into insight...Dr. Haxel Consult
 

Plus de Dr. Haxel Consult (20)

AI-SDV 2022: Henry Chang Patent Intelligence and Engineering Management
AI-SDV 2022: Henry Chang Patent Intelligence and Engineering ManagementAI-SDV 2022: Henry Chang Patent Intelligence and Engineering Management
AI-SDV 2022: Henry Chang Patent Intelligence and Engineering Management
 
AI-SDV 2022: Creation and updating of large Knowledge Graphs through NLP Anal...
AI-SDV 2022: Creation and updating of large Knowledge Graphs through NLP Anal...AI-SDV 2022: Creation and updating of large Knowledge Graphs through NLP Anal...
AI-SDV 2022: Creation and updating of large Knowledge Graphs through NLP Anal...
 
AI-SDV 2022: The race to net zero: Tracking the green industrial revolution t...
AI-SDV 2022: The race to net zero: Tracking the green industrial revolution t...AI-SDV 2022: The race to net zero: Tracking the green industrial revolution t...
AI-SDV 2022: The race to net zero: Tracking the green industrial revolution t...
 
AI-SDV 2022: Accommodating the Deep Learning Revolution by a Development Proc...
AI-SDV 2022: Accommodating the Deep Learning Revolution by a Development Proc...AI-SDV 2022: Accommodating the Deep Learning Revolution by a Development Proc...
AI-SDV 2022: Accommodating the Deep Learning Revolution by a Development Proc...
 
AI-SDV 2022: Domain Knowledge makes Artificial Intelligence Smart Linda Ander...
AI-SDV 2022: Domain Knowledge makes Artificial Intelligence Smart Linda Ander...AI-SDV 2022: Domain Knowledge makes Artificial Intelligence Smart Linda Ander...
AI-SDV 2022: Domain Knowledge makes Artificial Intelligence Smart Linda Ander...
 
AI-SDV 2022: Embedding-based Search Vs. Relevancy Search: comparing the new w...
AI-SDV 2022: Embedding-based Search Vs. Relevancy Search: comparing the new w...AI-SDV 2022: Embedding-based Search Vs. Relevancy Search: comparing the new w...
AI-SDV 2022: Embedding-based Search Vs. Relevancy Search: comparing the new w...
 
AI-SDV 2022: Rolling out web crawling at Boehringer Ingelheim - 10 years of e...
AI-SDV 2022: Rolling out web crawling at Boehringer Ingelheim - 10 years of e...AI-SDV 2022: Rolling out web crawling at Boehringer Ingelheim - 10 years of e...
AI-SDV 2022: Rolling out web crawling at Boehringer Ingelheim - 10 years of e...
 
AI-SDV 2022: Machine learning based patent categorization: A success story in...
AI-SDV 2022: Machine learning based patent categorization: A success story in...AI-SDV 2022: Machine learning based patent categorization: A success story in...
AI-SDV 2022: Machine learning based patent categorization: A success story in...
 
AI-SDV 2022: Machine learning based patent categorization: A success story in...
AI-SDV 2022: Machine learning based patent categorization: A success story in...AI-SDV 2022: Machine learning based patent categorization: A success story in...
AI-SDV 2022: Machine learning based patent categorization: A success story in...
 
AI-SDV 2022: Finding the WHAT – Will AI help? Nils Newman (Search Technology,...
AI-SDV 2022: Finding the WHAT – Will AI help? Nils Newman (Search Technology,...AI-SDV 2022: Finding the WHAT – Will AI help? Nils Newman (Search Technology,...
AI-SDV 2022: Finding the WHAT – Will AI help? Nils Newman (Search Technology,...
 
AI-SDV 2022: New Insights from Trademarks with Natural Language Processing Al...
AI-SDV 2022: New Insights from Trademarks with Natural Language Processing Al...AI-SDV 2022: New Insights from Trademarks with Natural Language Processing Al...
AI-SDV 2022: New Insights from Trademarks with Natural Language Processing Al...
 
AI-SDV 2022: Extracting information from tables in documents Holger Keibel (K...
AI-SDV 2022: Extracting information from tables in documents Holger Keibel (K...AI-SDV 2022: Extracting information from tables in documents Holger Keibel (K...
AI-SDV 2022: Extracting information from tables in documents Holger Keibel (K...
 
AI-SDV 2022: Scientific publishing in the age of data mining and artificial i...
AI-SDV 2022: Scientific publishing in the age of data mining and artificial i...AI-SDV 2022: Scientific publishing in the age of data mining and artificial i...
AI-SDV 2022: Scientific publishing in the age of data mining and artificial i...
 
AI-SDV 2022: AI developments and usability Linus Wretblad (IPscreener / Uppdr...
AI-SDV 2022: AI developments and usability Linus Wretblad (IPscreener / Uppdr...AI-SDV 2022: AI developments and usability Linus Wretblad (IPscreener / Uppdr...
AI-SDV 2022: AI developments and usability Linus Wretblad (IPscreener / Uppdr...
 
AI-SDV 2022: Where’s the one about…? Looney Tunes® Revisited Jay Ven Eman (CE...
AI-SDV 2022: Where’s the one about…? Looney Tunes® Revisited Jay Ven Eman (CE...AI-SDV 2022: Where’s the one about…? Looney Tunes® Revisited Jay Ven Eman (CE...
AI-SDV 2022: Where’s the one about…? Looney Tunes® Revisited Jay Ven Eman (CE...
 
AI-SDV 2022: Copyright Clearance Center
AI-SDV 2022: Copyright Clearance CenterAI-SDV 2022: Copyright Clearance Center
AI-SDV 2022: Copyright Clearance Center
 
AI-SDV 2022: Lighthouse IP
AI-SDV 2022: Lighthouse IPAI-SDV 2022: Lighthouse IP
AI-SDV 2022: Lighthouse IP
 
AI-SDV 2022: New Product Introductions: CENTREDOC
AI-SDV 2022: New Product Introductions: CENTREDOCAI-SDV 2022: New Product Introductions: CENTREDOC
AI-SDV 2022: New Product Introductions: CENTREDOC
 
AI-SDV 2022: Possibilities and limitations of AI-boosted multi-categorization...
AI-SDV 2022: Possibilities and limitations of AI-boosted multi-categorization...AI-SDV 2022: Possibilities and limitations of AI-boosted multi-categorization...
AI-SDV 2022: Possibilities and limitations of AI-boosted multi-categorization...
 
AI-SDV 2022: Big data analytics platform at Bayer – Turning bits into insight...
AI-SDV 2022: Big data analytics platform at Bayer – Turning bits into insight...AI-SDV 2022: Big data analytics platform at Bayer – Turning bits into insight...
AI-SDV 2022: Big data analytics platform at Bayer – Turning bits into insight...
 

Dernier

Test Automation Strategy for Frontend and Backend
Test Automation Strategy for Frontend and BackendTest Automation Strategy for Frontend and Backend
Test Automation Strategy for Frontend and BackendArshad QA
 
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...MyIntelliSource, Inc.
 
TECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service providerTECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service providermohitmore19
 
5 Signs You Need a Fashion PLM Software.pdf
5 Signs You Need a Fashion PLM Software.pdf5 Signs You Need a Fashion PLM Software.pdf
5 Signs You Need a Fashion PLM Software.pdfWave PLM
 
Active Directory Penetration Testing, cionsystems.com.pdf
Active Directory Penetration Testing, cionsystems.com.pdfActive Directory Penetration Testing, cionsystems.com.pdf
Active Directory Penetration Testing, cionsystems.com.pdfCionsystems
 
What is Binary Language? Computer Number Systems
What is Binary Language?  Computer Number SystemsWhat is Binary Language?  Computer Number Systems
What is Binary Language? Computer Number SystemsJheuzeDellosa
 
Right Money Management App For Your Financial Goals
Right Money Management App For Your Financial GoalsRight Money Management App For Your Financial Goals
Right Money Management App For Your Financial GoalsJhone kinadey
 
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdfLearn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdfkalichargn70th171
 
A Secure and Reliable Document Management System is Essential.docx
A Secure and Reliable Document Management System is Essential.docxA Secure and Reliable Document Management System is Essential.docx
A Secure and Reliable Document Management System is Essential.docxComplianceQuest1
 
Building Real-Time Data Pipelines: Stream & Batch Processing workshop Slide
Building Real-Time Data Pipelines: Stream & Batch Processing workshop SlideBuilding Real-Time Data Pipelines: Stream & Batch Processing workshop Slide
Building Real-Time Data Pipelines: Stream & Batch Processing workshop SlideChristina Lin
 
How To Troubleshoot Collaboration Apps for the Modern Connected Worker
How To Troubleshoot Collaboration Apps for the Modern Connected WorkerHow To Troubleshoot Collaboration Apps for the Modern Connected Worker
How To Troubleshoot Collaboration Apps for the Modern Connected WorkerThousandEyes
 
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...stazi3110
 
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...soniya singh
 
Advancing Engineering with AI through the Next Generation of Strategic Projec...
Advancing Engineering with AI through the Next Generation of Strategic Projec...Advancing Engineering with AI through the Next Generation of Strategic Projec...
Advancing Engineering with AI through the Next Generation of Strategic Projec...OnePlan Solutions
 
The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdf
The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdfThe Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdf
The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdfkalichargn70th171
 
Professional Resume Template for Software Developers
Professional Resume Template for Software DevelopersProfessional Resume Template for Software Developers
Professional Resume Template for Software DevelopersVinodh Ram
 
Unveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time ApplicationsUnveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time ApplicationsAlberto González Trastoy
 
How To Use Server-Side Rendering with Nuxt.js
How To Use Server-Side Rendering with Nuxt.jsHow To Use Server-Side Rendering with Nuxt.js
How To Use Server-Side Rendering with Nuxt.jsAndolasoft Inc
 

Dernier (20)

Test Automation Strategy for Frontend and Backend
Test Automation Strategy for Frontend and BackendTest Automation Strategy for Frontend and Backend
Test Automation Strategy for Frontend and Backend
 
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
 
TECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service providerTECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service provider
 
5 Signs You Need a Fashion PLM Software.pdf
5 Signs You Need a Fashion PLM Software.pdf5 Signs You Need a Fashion PLM Software.pdf
5 Signs You Need a Fashion PLM Software.pdf
 
Call Girls In Mukherjee Nagar 📱 9999965857 🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SE...
Call Girls In Mukherjee Nagar 📱  9999965857  🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SE...Call Girls In Mukherjee Nagar 📱  9999965857  🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SE...
Call Girls In Mukherjee Nagar 📱 9999965857 🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SE...
 
Active Directory Penetration Testing, cionsystems.com.pdf
Active Directory Penetration Testing, cionsystems.com.pdfActive Directory Penetration Testing, cionsystems.com.pdf
Active Directory Penetration Testing, cionsystems.com.pdf
 
What is Binary Language? Computer Number Systems
What is Binary Language?  Computer Number SystemsWhat is Binary Language?  Computer Number Systems
What is Binary Language? Computer Number Systems
 
Right Money Management App For Your Financial Goals
Right Money Management App For Your Financial GoalsRight Money Management App For Your Financial Goals
Right Money Management App For Your Financial Goals
 
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdfLearn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
 
A Secure and Reliable Document Management System is Essential.docx
A Secure and Reliable Document Management System is Essential.docxA Secure and Reliable Document Management System is Essential.docx
A Secure and Reliable Document Management System is Essential.docx
 
Building Real-Time Data Pipelines: Stream & Batch Processing workshop Slide
Building Real-Time Data Pipelines: Stream & Batch Processing workshop SlideBuilding Real-Time Data Pipelines: Stream & Batch Processing workshop Slide
Building Real-Time Data Pipelines: Stream & Batch Processing workshop Slide
 
How To Troubleshoot Collaboration Apps for the Modern Connected Worker
How To Troubleshoot Collaboration Apps for the Modern Connected WorkerHow To Troubleshoot Collaboration Apps for the Modern Connected Worker
How To Troubleshoot Collaboration Apps for the Modern Connected Worker
 
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
 
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
 
Advancing Engineering with AI through the Next Generation of Strategic Projec...
Advancing Engineering with AI through the Next Generation of Strategic Projec...Advancing Engineering with AI through the Next Generation of Strategic Projec...
Advancing Engineering with AI through the Next Generation of Strategic Projec...
 
The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdf
The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdfThe Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdf
The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdf
 
Professional Resume Template for Software Developers
Professional Resume Template for Software DevelopersProfessional Resume Template for Software Developers
Professional Resume Template for Software Developers
 
Unveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time ApplicationsUnveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
 
How To Use Server-Side Rendering with Nuxt.js
How To Use Server-Side Rendering with Nuxt.jsHow To Use Server-Side Rendering with Nuxt.js
How To Use Server-Side Rendering with Nuxt.js
 
Exploring iOS App Development: Simplifying the Process
Exploring iOS App Development: Simplifying the ProcessExploring iOS App Development: Simplifying the Process
Exploring iOS App Development: Simplifying the Process
 

II-SDV 2014 Analysing Patent Full Text – Comparison against analysis of abstract and bibliographic data, and lessons learned (Richard Gynn - LexisNexis, UK)

  • 1. Analyzing Patent Full-Text A Study 1 April 7, 2014 Analysing Patent Full Text Richard Gynn - LexisNexis
  • 2. Analyzing Patent Full-Text A Study 2 April 7, 2014 Agenda 1) Full Text Availability 2) Analyzing full text - Discussion/considerations - Big picture analysis - Detailed analysis - Study 3) Conclusions Full Text content available from vendors has evolved to a point where most of the top publishing authorities are readily available.
  • 3. Analysing Patent Full Text. Availability
  • 4. Full Text Availability – Top 10 Publishing Authorities (available from most big vendors) April 7, 2014 Analyzing Patent Full-Text A Study 4 China, Korea, Japan are not the big deal they used to be! Text can be available to analyse in English
  • 5. Full Text Availability – Authorities available from at least one vendor April 7, 2014 Analyzing Patent Full-Text A Study 5
  • 6. Full Text Availability by volume- > 100k publications April 7, 2014 Analyzing Patent Full-Text A Study 6 0 5 10 15 20 25 JP US CN DE EP KR GB FR WO CA AU TW SU ES AT SE IT RU CH NL BE FI BR DK IN NO PL IL DD ZA MX HU PT CS AR IE NZ CZ GR Millions
  • 7. Full Text Availability by volume- > 100k publications April 7, 2014 Analyzing Patent Full-Text A Study 7 0 5 10 15 20 25 JP US CN DE EP KR GB FR WO CA AU TW SU ES AT SE IT RU CH NL BE FI BR DK IN NO PL IL DD ZA MX HU PT CS AR IE NZ CZ GR Millions 31 of these 39 are currently available from vendors Account for vast majority of total volume
  • 8. Full Text Availability by volume - < 100k publications April 7, 2014 Analyzing Patent Full-Text A Study 8 0 10,000 20,000 30,000 40,000 50,000 60,000 70,000 80,000 90,000 100,000 HK YU RO SG TR MY LU BG PH UA TH CL EA ID HR SK CO SI VN PE UY OA EG IS EC
  • 9. Full Text Availability by volume - < 100k publications April 7, 2014 Analyzing Patent Full-Text A Study 9 0 10,000 20,000 30,000 40,000 50,000 60,000 70,000 80,000 90,000 100,000 HK YU RO SG TR MY LU BG PH UA TH CL EA ID HR SK CO SI VN PE UY OA EG IS EC Much smaller amounts currently available from vendors ~ 300,000 If all were to become available would add about 1.5% to full text that is currently available, e.g. equivalent to Spain or Taiwan
  • 10. Full Text Availability by volume - < 10k publications April 7, 2014 Analyzing Patent Full-Text A Study 10 0 1,000 2,000 3,000 4,000 5,000 6,000 7,000 8,000 9,000 10,000 MA AP VE EE LV GT CU LT MD CR PA CY DO MC ZM ZW SV SM JO PY GE DZ KE MT HN MW NI ME TJ GC BO MN BA KZ BY TT
  • 11. Full Text Availability by volume - < 10k publications April 7, 2014 Analyzing Patent Full-Text A Study 11 0 1,000 2,000 3,000 4,000 5,000 6,000 7,000 8,000 9,000 10,000 MA AP VE EE LV GT CU LT MD CR PA CY DO MC ZM ZW SV SM JO PY GE DZ KE MT HN MW NI ME TJ GC BO MN BA KZ BY TT One currently available from vendors In total these would add about 0.1% to full text that is currently available
  • 12. Analyzing Patent Full-Text A Study 12 April 7, 2014 • Are we nearly there yet? • There’s a lot of full text available to make use • Most vendors have a significant volumes • Rapidly diminishing returns for each authority added Full Text Availability Bringing You The World • We are already in a good place • In terms of % availability at least
  • 13. Analysing Patent Full Text. Discussion/considerations
  • 14. Analyzing Patent Full-Text A Study 14 April 7, 2014 Full Text – What Is It? Full-text – what is it? • Everything of course?! ― …will concentrate on:
  • 15. Considerations April 7, 2014 Analyzing Patent Full-Text A Study 15 There’s clearly a lot out there, so why don’t we see so much analysis of patent full text?
  • 16. Analyzing Patent Full-Text A Study 16 April 7, 2014 Considerations - Language • Can only compare like for like in same language …non-Latin character issues too • Noise – Patent full-text likes to state things like …the complete opposite of what it’s about! Considerations - Language How I might introduce myself …If I was a patent! 나는 사람들이 밥, 앤드류, 데이브 앨런 같은 이름이, 이름이. 나는 밥, 앤드류, 데이브 나 앨런 아니에요. 내 이름은 리처드입니다 I have a name, people have names like Bob, Andrew, Dave and Alan. I’m not Bob, Andrew, Dave or Alan. My name is Richard 私は人々がボブ、アンドリュー、デイブとアラ私は人々がボブ、アンドリュー、デイブとアラ私は人々がボブ、アンドリュー、デイブとアラ私は人々がボブ、アンドリュー、デイブとアラ ンのような名前を持っている、名前を持っていンのような名前を持っている、名前を持っていンのような名前を持っている、名前を持っていンのような名前を持っている、名前を持ってい ます。私はボブ、アンドリュー、デイブかアランます。私はボブ、アンドリュー、デイブかアランます。私はボブ、アンドリュー、デイブかアランます。私はボブ、アンドリュー、デイブかアラン ないよ。私ないよ。私ないよ。私ないよ。私の名の名の名の名前はリチャードです前はリチャードです前はリチャードです前はリチャードです
  • 17. Considerations Other Considerations: • Massive amounts of data – Time? – How deal with ? • Will it contain anything useful? /benefit outweigh effort? April 7, 2014 Analyzing Patent Full-Text A Study 17 • Tools – Big picture? – Details?
  • 18. Big Picture - Landscape Analysis April 7, 2014 Analyzing Patent Full-Text A Study 18 Big picture, topographic mapping (Discussion) Here more full text could provide: • Broader country analysis (often full-text not available) • More consistency across authorities – e.g. more claims ― Compare like for like, e.g. not claims, title & abstract against title • Full text more useful for details • Themes/commonalities easier to find using claims, title, abstract • Whilst useful, vast majority of landscape analysis done elsewhere, …i.e. details rather than big picture
  • 19. Analysing Patent Full Text. Study
  • 20. The Details - Study Detailed analysis – looking for what? • New/emerging, different • Competitive/market comparisons • Strength, weakness, opportunity, threat April 7, 2014 Analyzing Patent Full-Text A Study 20 What can I find using the full text that I couldn’t using title, abstract and bibliography?
  • 21. The Details - The Technology April 7, 2014 Analyzing Patent Full-Text A Study 21 Terahertz analysis, e.g. imaging, spectroscopy? Terahertz radiation - between Infra-red and microwave
  • 22. The Details - The Search April 7, 2014 Analyzing Patent Full-Text A Study 22 • Broad Strategy ― Analysis IPCs + Terahertz Radiation Synonyms ― Keyword Terahertz Imaging & Spectroscopy 5,955 documents/3,365 families
  • 23. Study - PatentOptimizer Analyzing Patent Full-Text A Study 23 April 7, 2014 Analysis Details: • Small/emerging areas of 6-7 families • Look at terms & phrases, parts, claim elements (all numbers represent families) PatentOptimizer™ Analysis of EP, PCT & US results • English Translations
  • 24. PatentOptimizer – Terms & Phrases April 7, 2014 Analyzing Patent Full-Text A Study 24 Diagnosis - General
  • 25. PatentOptimizer – Terms & Phrases April 7, 2014 Analyzing Patent Full-Text A Study 25 Not found in Title, Abstract (or claims) – All From Spectral Image Inc Learned – Something seemingly unique to them SAME DOCUMENTS
  • 26. PatentOptimizer – Terms & Phrases April 7, 2014 Analyzing Patent Full-Text A Study 26 Not found in Title, Abstract (or claims) – All monitoring vitamin K concentration in blood Learned – A more recent (emerging?) use Diagnosis - General
  • 27. PatentOptimizer – Parts April 7, 2014 Analyzing Patent Full-Text A Study 27 Remote monitoring, e.g. of Bluetooth® headset user Learned – Interesting, but not massively relevant result, would like to investigate applications further Diagnosis - general
  • 28. PatentOptimizer – Claim Elements April 7, 2014 Analyzing Patent Full-Text A Study 28 Looking for infiltration or extravasation during intravenous infusion Learned – New possibly interesting area, seemingly dominated by one organisation Diagnosis – general A61M – introducing remedies
  • 29. Study - VantagePoint Analyzing Patent Full-Text A Study 29 April 7, 2014 Analysis Details: • Data Statistics • Terms uniquely appearing in full text • Highly occurring terms used in small numbers of documents • Investigate terms unique to 2013 priority onward Vantage Point Analysis of TotalPatent full text results • English Translations
  • 30. Vantage Point - Statistics Very low percent of terms and words, available for analysis are actually in the title and abstract Title & Abstract • 42,614 words & phrases • 16,251 words Claims • ~132k words and phrases not in Title or Abstract • ~44k words in Title or Abstract Full-text • ~1.3M unique words & phrases • ~650k unique words April 7, 2014 Analyzing Patent Full-Text A Study 30
  • 31. Vantage Point – Terms only appearing in full text 2013 onwards April 7, 2014 Analyzing Patent Full-Text A Study 31
  • 32. Vantage Point – Terms only appearing in full text 2013 onwards April 7, 2014 Analyzing Patent Full-Text A Study 32 Detection of tetracycline drug – concern in resistance to antibiotics Learned – New area (clearer language in full-text) optical investigation
  • 33. Vantage Point – Terms only appearing in full text 2013 onwards April 7, 2014 Analyzing Patent Full-Text A Study 33 Looking for gas hydrates (fracking) Learned – New area (uncovered by more consistent repetition in full text) general investigation, sampling
  • 34. Analysing Patent Full Text. Conclusions
  • 35. Findings April 7, 2014 Analyzing Patent Full-Text A Study 35 • Full text useful • Claims less so (in this case) Most words and phrases in the “full text”, did not appear in Abstract & Title • Text mined wasn’t necessarily applications, but pointed towards • More consistent repetition in full text Helped mainly find new/niche applications • Probably wouldn’t have found other ways Interesting companies & technologies to look at further
  • 36. Conclusions Conclusions (Noise and huge amounts of info): • Background did not really come in as an issue • Used English translations to avoid language issues • Most noise was from search results • My judgement – about 50% proved somewhat interesting upon further investigation • Can this be automated/put into a process? • 4/5+ family groupings seems to be about the sweet spot April 7, 2014 Analyzing Patent Full-Text A Study 36
  • 37. What More? What more? Further this: • Life Sciences • Define processes Dedicated machine? • Detailed full- text analysis Study analysis of parts • Sellers, inventors, manufacturers etc. April 7, 2014 Analyzing Patent Full-Text A Study 37 Easier than expected More possible & better timescales
  • 38. Questions April 7, 2014 Analyzing Patent Full-Text A Study 38
  • 39. Analysing Patent Full Text. Study – Additional Examples
  • 40. PatentOptimizer – Terms & Phrases April 7, 2014 Analyzing Patent Full-Text A Study 40 2 of 6 have tattoo in Abstract OR Title (same if include claims) Learned – THz radiation can be used for tattoo removal Diagnosis, surgery - General
  • 41. PatentOptimizer – Terms & Phrases April 7, 2014 Analyzing Patent Full-Text A Study 41 Not found in Abstract & Title (One claimed -Optical Diagnostics) Determining microorganism presence/kind
  • 42. PatentOptimizer – Claim Elements April 7, 2014 Analyzing Patent Full-Text A Study 42 SAME DOCUMENTS Identifying/determining antimocrobial resistance of Burkholderia Cepacia Learned – Smaller more niche areas?
  • 43. PatentOptimizer – Terms & Phrases April 7, 2014 Analyzing Patent Full-Text A Study 43 Not found in Title, Abstract (or claims) – All Some detectors, some looking for heavy metal contamination Learned – Some areas to investigate further?
  • 44. PatentOptimizer – Claim Elements April 7, 2014 Analyzing Patent Full-Text A Study 44 Glucose Monitoring – Far-IR (5/7 have in Abstract & Title) Learned – Not much more than from Title & Abstract Blood measurement