1
Using Text Mining to Explore 
Concept Complexity in Obesity
through Concept Maps
George Karystianis
School of Computer Science
Supervisors: Goran Nenadic, Iain Buchan
Advisor: Andrea Schalk
2
Motivation
● Complex nature of obesity.
● Wide range of biomedical data sources available.
– implementation of biomedical text/data mining.
● Possible to reveal hidden links between obesity and other
diseases.
● Partially completed knowledge representation models of obesity.
● A systematic approach required for:
– analysis and interpretation of clinical knowledge.
3
Concept Maps
● Knowledge representation models.
● Consist of:
– nodes (concepts).
– links (relationships between the nodes).
● Aim: gather, understand, explore knowledge.
● Variety of users.
● No explicit detail.
● Implemented primarily in education.
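A concept map of this kind can be sketched as a small labelled graph. The Python sketch below is illustrative only: the class name `ConceptMap`, its methods and the example concepts are assumptions made for this example, not part of the presented framework.

```python
from collections import defaultdict


class ConceptMap:
    """Minimal concept map: nodes are concepts, links are labelled relationships."""

    def __init__(self):
        self.nodes = set()
        # Maps a source concept to a list of (relationship label, target concept).
        self.links = defaultdict(list)

    def add_link(self, source, label, target):
        """Add both concepts (if new) and a directed, labelled link between them."""
        self.nodes.update((source, target))
        self.links[source].append((label, target))

    def neighbours(self, concept):
        """Return the (label, target) pairs leaving a concept."""
        return list(self.links.get(concept, []))


# Hypothetical example content, for illustration only.
cmap = ConceptMap()
cmap.add_link("obesity", "is associated with", "type II diabetes")
cmap.add_link("physical inactivity", "is a risk factor for", "obesity")
print(sorted(cmap.nodes))
print(cmap.neighbours("obesity"))
```

Keeping the relationship label on each link is what separates a concept map from a plain co-occurrence graph; a graph library could replace this hand-rolled structure.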
4
Concept Map Example
5
Aim
● To design a framework to build/enhance medical concept maps.
● To improve the understanding of health care concept
complexity.
● To assist medical professionals in the representation, exploration
and validation of their expert knowledge.
● To improve clinical health care.
6
Objectives
● Design and implement methods for health care concept
detection.
● Concept organisation in a concept map form.
● Method generation for concept map updates.
● Build a framework for the design/enhancement/validation of
medical concept maps.
● Methodology evaluation through the health problem of obesity:
– validation of obesity-related concepts against the structured obesity
information currently available.
– identification of gaps in clinical knowledge.
7
Research Hypothesis &
Questions
- The analysis required to extract health care concepts.
- The approach to build and enhance a concept map.
- The contribution of the concept map to the representation/validation of knowledge.
- How the text mining results help to understand/explore clinical problems.
[Diagram labels: Biomedical text mining; Scientific literature; Concept map; Improvement of health care; Framework]
8
Obesity
● Worldwide problem.
● Epidemic proportions:
– WHO rates (2005): 1.6 billion overweight, 400 million obese.
● Associations with various diseases.
● Complex risk factors and complications.
● Various aspects.
● Lots of research.
9
10
Biomedical Text Mining
● Extraction of information from unstructured data of biomedical
nature.
● Discovery of new, previously unknown knowledge.
● Performed on documents with complex/specific terminology and
expressions.
● Challenges:
– language ambiguity.
– variation of language expression.
● Various tools and applications (Termine, Whatizit, GATE).
● Adaptation to user's tasks and requirements.
11
What are we looking for?
● Risk Factors
● Causal Factors
● Confounding Factors
● Outcomes
● Complications
● Interventions
● ...
12
Methodology Overview
1. Document retrieval.
2. Term/concept extraction.
3. Feature engineering and Information extraction:
- application of classification/clustering techniques.
4. Concept map design.
13
Evaluation: Obesity Case Study
● Comparison:
– What?
● biomedical text mining results.
● concept map information.
– How?
● concepts and relationships.
● new ones.
● Examination/manipulation/validation of new knowledge by experts.
● Enhancement of the concept map.
14
Progress so far (1)
● Corpus collection.
● Application of Automated Term Recognition (ATR).
● C-value method.
● Single word ATR:
– terminological head identification.
– word of a multi-word term that defines the term class.
– example:
● “Childhood diabetes type II”.
● Terminological head: “diabetes”.
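The slides name the C-value method without giving its formula. The sketch below uses a simplified version of the commonly published form: score(a) = log2|a| × f(a) for a candidate that is not nested in a longer candidate, and log2|a| × (f(a) − mean frequency of the longer candidates containing a) otherwise, where |a| is the length in words and f(a) the corpus frequency. The function names and the frequencies are invented for illustration; they are assumptions, not the presenter's implementation or data.

```python
import math


def is_nested(short, long):
    """True if the word sequence `short` occurs contiguously inside `long`."""
    n, m = len(short), len(long)
    return n < m and any(long[i:i + n] == short for i in range(m - n + 1))


def c_value(freqs):
    """Compute a simplified C-value score for each candidate term.

    `freqs` maps a candidate term (space-separated words) to its corpus
    frequency. Returns a dict mapping each term to its score.
    """
    tokens = {term: tuple(term.split()) for term in freqs}
    scores = {}
    for term, words in tokens.items():
        # Frequencies of the longer candidates that contain this one.
        containing = [freqs[other] for other, other_words in tokens.items()
                      if is_nested(words, other_words)]
        adjusted = freqs[term]
        if containing:
            adjusted -= sum(containing) / len(containing)
        scores[term] = math.log2(len(words)) * adjusted
    return scores


# Made-up frequencies, for illustration only.
example = {
    "diabetes": 50,
    "childhood diabetes": 12,
    "childhood diabetes type II": 4,
}
scores = c_value(example)
for term, score in scores.items():
    print(f"{term}: {score:.2f}")
```

In this form the factor log2|a| is zero for any one-word candidate, so single words cannot be ranked by the score directly. That is consistent with treating single-word ATR separately through terminological heads.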
15
Progress so far (2)
● Ranking head measures:
– total head frequency,
– single head frequency,
– maximum and average C-value,
– abstract frequency,
– ratio of single head frequency/total head frequency,
– tf*idf (term frequency*inverse document frequency).
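Of the measures listed, tf*idf is the easiest to show in a few lines. The slides do not say which tf*idf variant was used; the sketch below assumes raw term counts multiplied by log(N/df), and the function name and toy abstracts are invented for illustration.

```python
import math
from collections import Counter


def tf_idf(documents):
    """Return one {word: tf*idf} dict per tokenised document.

    tf is the raw count of the word in the document; idf is log(N / df),
    where N is the number of documents and df the number containing the word.
    """
    n_docs = len(documents)
    doc_freq = Counter()
    for doc in documents:
        doc_freq.update(set(doc))  # count each word once per document
    weights = []
    for doc in documents:
        counts = Counter(doc)
        weights.append({word: count * math.log(n_docs / doc_freq[word])
                        for word, count in counts.items()})
    return weights


# Toy "abstracts", invented for illustration.
abstracts = [
    "obesity increases diabetes risk".split(),
    "obesity and hypertension".split(),
    "diet reduces obesity".split(),
]
weights = tf_idf(abstracts)
print(weights[0])
```

A word that occurs in every document, such as "obesity" in this toy corpus, gets a weight of zero. This may be one reason to combine tf*idf with the frequency-based measures rather than rely on it alone.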
16
Results
[Figure: bar chart of the number of keywords per statistical measure (tf*idf, total freq, single freq, abstract freq, word freq, max_c, aver_c, ratio).]
17
Progress so far (3)
● Pattern extraction from abstracts for:
– risk, confounding and causal factors,
– interventions,
– complications,
– outcomes.
Example: "Obesity risk is increased among women with psychiatric disorders" (potential risk factor).
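A lexical pattern for sentences like the example above could be sketched with a regular expression. The pattern below is hypothetical: the slides do not list the actual patterns, and treating the phrase after "with" as the candidate risk factor is an assumption made for this example.

```python
import re

# Hypothetical pattern: "obesity risk is increased among/in <group> with <factor>".
RISK_PATTERN = re.compile(
    r"\bobesity risk is increased (?:among|in) (?P<group>.+?) with (?P<factor>[^.,;]+)",
    re.IGNORECASE,
)


def find_risk_factors(sentence):
    """Return the candidate risk-factor phrases matched in a sentence."""
    return [m.group("factor").strip() for m in RISK_PATTERN.finditer(sentence)]


sentence = "Obesity risk is increased among women with psychiatric disorders"
print(find_risk_factors(sentence))
```

A single surface pattern like this is brittle. A realistic rule set would need many variants (passive and active voice, different trigger verbs) and would probably run over tagged or parsed text rather than raw strings.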
18
Example
[Figure: extracted examples grouped as potential risk factors, potential interventions and potential complications.]
19
Future plan
Species identification in obesity corpus (LINNAEUS)
Exploration of single word terms ATR
Calculation of z-score
Integration of single and multi-word terms
Lexical/semantic analysis of the existing concept map
Paper preparation for the extraction of single terms in text
Pattern extraction from manual analysis
Pattern rule design with MinorThird
Feature engineering
Clustering
Classification
Paper preparation for the classification of disease descriptors
Paper preparation for the clustering of health care concepts
Integration of the results
Preparation of the second year interview/report
Design of concept map relationships (exploration)
Application of visual mapping tools
Update of the new concept map
Comparison and validation of knowledge
Exploration of concept complexity in obesity
Paper preparation for the automatic design of clinical concept maps
Produced generic framework of the methodology
Writing the thesis
[Gantt chart time axis: October 2010, April 2011, November 2011, May 2012, spanning Year 2 and Year 3]
Year 2 (1/2): Concept extraction
20
Future plan
Year 2 (2/2): Concept structuring
21
Future plan
Year 3: Design of the medical concept map
22
Summary
● Framework creation for clinical concept map building and
enhancement.
● Improved understanding of health care concept complexity.
● So far:
– comprehensive literature review.
– methodology design.
– single-word ATR.
– pattern design.
23
End
Acknowledgements
1. Medical Research Council
2. School of Computer Science, University of Manchester
