SlideShare une entreprise Scribd logo
1  sur  15
Extracting static and dynamic
model elements from textual
specifications in humanities
Patricia Martín-Rodilla
César González-Pérez
Institute of Heritage Sciences, Spanish National Research Council
Santiago de Compostela, Spain.
Index
Research Context & Problem
Goal(s)
Related Work
Proposal:
oProposal Overview
oProposal Phases
Case Study in Cultural Heritage Information Systems
Discussion & Open Issues
Research Context
Information Systems are composed
of different information dimensions:
…
Structural (STATIC)
Architectural
Behavioral
Methodological (DYNAMIC)
…
BUT, IS support humans activities
SOFTWARE ANALYST
Software Textual Specifications
Documents about practices
…
Structural (STATIC) MODEL
Architectural MODEL
Behavioral MODEL
Methodological (DYNAMIC) MODEL
…
BUT, in Humanities information…
Narrative-based domains
Importance about the methodological context of information
(Static and dynamic link very pronounced)
Software analysts require hard information dimension effort
Software analysts are far from DH expertise
• To study how other works deal with the different information dimensions
from an holistic point of view, also:
• For humanities IS
• Directly from textual specifications in early stages software conception
• To propose a pipeline method as a tentatively semiautomatic approach for
our needs in humanities domains
Goal(s)
Related work
Works in modelling and automatic
extraction of DIFFERENT information
dimensions
Methods (Domain rules)
Processes (Process Mining)
Notations (BPMN, Topic maps,
Mind Maps, Concept Maps, i*,…)
Practices (Scenarios)
Works in HOLISTIC modelling and
automatic extraction of information
dimensions
Open/METIS
ISO/IEC 24744
Requirements: Cross-cutting concerns
NEED: From early stages textual specifications?
NEED: More than a conceptual bridge…Semi-supervised?
Pipeline approach: based on previous works: TextProcessMiner tool (Epure, Martin-Rodilla et al. 2015)
Initial dynamic information -> Process Mining Algorithms: Activity Logs
Initial static information -> Identification of domain key concepts: Concept map
Proposal
Phase I: TextProcessMiner
• Natural Language Processing approach
• TextProcessMiner extracts activities from
historical and archaeological official reports.
• Previously tested at CSIC, ADS…: in different
languages, validated by report’s authors.
• Locality principle in the activity identification:
tree-based syntactic structure.
TextCleaner
(Lemmatization, Automatic
cleaning, activities
recognition)
ActivityMiner
ActivityRelationshipMiner
Phase II: Preliminary Concept Map
Historical and Archaeological
Methodological Textual
Specifications
Discovered Log
(DYNAMIC INFO
DIMENSION)
Discovered Log
(DYNAMIC INFO
DIMENSION)
• Automatic identification of domain key concepts
• Part of Speech (POS) tagging techniques: decoupling
action verbs (activities candidates)
countable nouns (key concepts candidates)
• Why concepts maps?:
Intermediate formalization degree
Learning potential
Iterative methodology in concept map creation
• Why semi-automatic?
Better results in annotation approaches in humanities
Entities Decoupling (POS tagg.)
Activities Decoupling (POS tagg.)
Cross-links matching
(tree-based syntactic structure)
Preliminary Concept Map
Phase III: Supervised Phase
Preliminary Concept Map
Iterative Phase
Concepts and activity names verification: terminology, synonyms
Order and dependence cross links verification
Domain key concepts learning
Pipeline offers to Software Analysts:
- Most important concepts identification in the domain in a learning environment
- Activities identification and logs
- Static and dynamic preliminary link in domains’ terminology
Pipeline is current used:
- As a preliminary tool for extracting an holistic information view from early stages
textual specifications.
- As a tool for improving the model quality in terms of humanities terminology.
Supervised Concept Map
+
Activity Log sequence
Case Study: Extracting models in Cultural Heritage IS
Phase I: Extracting models in Cultural Heritage IS
“The trench was excavated using a toothed bucket using the
back actor of a small excavating machine. The watching brief
archaeologist inspected the sides of the trench for any past
cultural remains below the overburden. The removed spoil was
inspected in order to recover any past cultural artefacts.
Where archaeological deposits were revealed, each layer, fill
and cut was individually numbered and described in terms of soil
detail, stratigraphic position, dimensions, artefact content,
environmental samples and interpretation. The context system
was cross-referenced to other records. Registers were maintained
for all photographs, levels, plans, section, finds and samples
taken, made or gathered in the field.”
(From ADS Archaeological Report, Gerry Martin Associates Ltd. Glasgow)
- excavate trench -take photograph
- use bucket -take level
- use back_actor_of_machine -take plan
- inspect side_of_trench -take section
- inspect spoil -take find
- recover artefact -make photograph
- reveal deposit -make level
- number layer -make plan
- number fill -make section
- number cut -make find
- describe layer -gather photograph
- describe fill -gather level
- describe cut -gather plan
- cross_referenced context_system -gather section
- maintain register -gather find
Discovered LogTextual Specification
Phase II: Extracting models in Cultural Heritage IS
Trench
Bucket
Back_actor_of
_machine
LayerSide_of_trench
Level
Cut Find
Plan
SectionFill
Deposit
Artefact
Spoil Photograph
Register
Context
system
- excavate trench -take photograph
- use bucket -take level
- use back_actor_of_machine -take plan
- inspect side_of_trench -take section
- inspect spoil -take find
- recover artefact -make photograph
- reveal deposit -make level
- number layer -make plan
- number fill -make section
- number cut -make find
- describe layer -gather photograph
- describe fill -gather level
- describe cut -gather plan
- cross_referenced context_system -gather section
- maintain register -gather find
Discovered Log String Concept Map
+
Activity List
Phase II: Extracting models in Cultural Heritage IS
Preliminary Concept Map
Eastgate,
Hexham
Bucket
Back_actor_
of_machine
Layer
Side_of_trench
Level
Cut
Find PlanSection
Fill
DepositArtefact
Spoil
Photograph
Register
Context
system
USE
INSPECT
REVEAL
RECOVER
NUMBER
DESCRIBE
CROSS-
REFERENCE
GATHER
MANTAIN
EXCAVATE
Trench
Phase III: Extracting models in Cultural Heritage IS
Supervised Concept Map
Eastgate,
Hexham
BucketBack_actor_
of_machine
Layer
Side_of_trench
Level
Cut
Find PlanSection
Fill
DepositArtefact
Spoil
Photograph
Register
Context
system
USES
INSPECTS
ALLOWS
REVEALING
ALLOWS
RECOVERING
NUMBERS
DESCRIBES
CROSS-
REFERENCE
TO GATHER
HAS TO
MANTAIN
EXCAVATES
Trench
Discussion & Open Issues
 Work-in-progress proposal: holistic static and dynamic approach in information modelling
 Software analysts do not need previous domain knowledge to start creating models
 Maintenance of the semantic static and dynamic link in humanities domains’ terminology
 Semi-supervised approach: Software analysts can gradually learn domains’ key concepts and
practices
 Iterative pipeline: incremental improvement of the outputs
 Tested and evaluated by experts at historical and archaeological textual specifications
 Technological dependences: TextProcessMiner (NLP toolkit by Standford) -> TOWARDS A METAMODEL
 Locality principle and synonyms limitations-> WordNet, CILI INTEGRATION
 Humanities sub-domains’ adaptation: CH thesauri's, ontologies
 Need for rigorous validation with a vast CH textual specifications corpus
 From activity list to Process Models (Process Mining tools integration: DISCO, etc.)
Extracting static and dynamic
model elements from textual
specifications in humanities
Thank you for your attention
Patricia Martín-Rodilla
patricia.martin-rodilla@incipit.csic.es
Institute of Heritage Sciences
Spanish National Research Council
Santiago de Compostela, Spain.

Contenu connexe

En vedette

En vedette (20)

Analyzing navigation logs in MOOC: the Coursera case
Analyzing navigation logs in MOOC: the Coursera caseAnalyzing navigation logs in MOOC: the Coursera case
Analyzing navigation logs in MOOC: the Coursera case
 
Perceived Risks in Social Media Use – A Longitudinal Study Among University S...
Perceived Risks in Social Media Use – A Longitudinal Study Among University S...Perceived Risks in Social Media Use – A Longitudinal Study Among University S...
Perceived Risks in Social Media Use – A Longitudinal Study Among University S...
 
Assessing Engagement in an Emotionally-Adaptive Applied Game
Assessing Engagement in an Emotionally-Adaptive Applied GameAssessing Engagement in an Emotionally-Adaptive Applied Game
Assessing Engagement in an Emotionally-Adaptive Applied Game
 
Systematic mapping of the literature: social innovation laboratories for the ...
Systematic mapping of the literature: social innovation laboratories for the ...Systematic mapping of the literature: social innovation laboratories for the ...
Systematic mapping of the literature: social innovation laboratories for the ...
 
A spatio-temporal visual analysis tool for historical dictionaries.
A spatio-temporal visual analysis tool for historical dictionaries. A spatio-temporal visual analysis tool for historical dictionaries.
A spatio-temporal visual analysis tool for historical dictionaries.
 
Designing game-like activities to engage adult learners in higher education
Designing game-like activities to engage adult learners in higher educationDesigning game-like activities to engage adult learners in higher education
Designing game-like activities to engage adult learners in higher education
 
How Wiki-based Tasks, and Forums Favor University Students' Writing Skills an...
How Wiki-based Tasks, and Forums Favor University Students' Writing Skills an...How Wiki-based Tasks, and Forums Favor University Students' Writing Skills an...
How Wiki-based Tasks, and Forums Favor University Students' Writing Skills an...
 
Recognition of an optimal study modality in a continuous education program in...
Recognition of an optimal study modality in a continuous education program in...Recognition of an optimal study modality in a continuous education program in...
Recognition of an optimal study modality in a continuous education program in...
 
Study of the Methodologies used by the Teaching staff of Graphic Expression f...
Study of the Methodologies used by the Teaching staff of Graphic Expression f...Study of the Methodologies used by the Teaching staff of Graphic Expression f...
Study of the Methodologies used by the Teaching staff of Graphic Expression f...
 
Incidence of Hearing Training in Musical Reading at First Sight. An explorato...
Incidence of Hearing Training in Musical Reading at First Sight. An explorato...Incidence of Hearing Training in Musical Reading at First Sight. An explorato...
Incidence of Hearing Training in Musical Reading at First Sight. An explorato...
 
A cross-platform interoperable component for course analytics.
A cross-platform interoperable component for course analytics. A cross-platform interoperable component for course analytics.
A cross-platform interoperable component for course analytics.
 
A strategy to reduce the blank answers on math tests at first engineering cou...
A strategy to reduce the blank answers on math tests at first engineering cou...A strategy to reduce the blank answers on math tests at first engineering cou...
A strategy to reduce the blank answers on math tests at first engineering cou...
 
A3bycomp: a software tool to help social organizations to manage skills
A3bycomp: a software tool to help social organizations to manage skillsA3bycomp: a software tool to help social organizations to manage skills
A3bycomp: a software tool to help social organizations to manage skills
 
Leveraging chatbots to improve self-guided learning through conversational qu...
Leveraging chatbots to improve self-guided learning through conversational qu...Leveraging chatbots to improve self-guided learning through conversational qu...
Leveraging chatbots to improve self-guided learning through conversational qu...
 
Training to capture software requirements by role playing
Training to capture software requirements by role playingTraining to capture software requirements by role playing
Training to capture software requirements by role playing
 
Motivating students of Chemical Engineering through a cooperative work record...
Motivating students of Chemical Engineering through a cooperative work record...Motivating students of Chemical Engineering through a cooperative work record...
Motivating students of Chemical Engineering through a cooperative work record...
 
Enjoyment, Resistance to Change and mLearning Acceptance Among Pre-Service Te...
Enjoyment, Resistance to Change and mLearning Acceptance Among Pre-Service Te...Enjoyment, Resistance to Change and mLearning Acceptance Among Pre-Service Te...
Enjoyment, Resistance to Change and mLearning Acceptance Among Pre-Service Te...
 
Dynamic visualization of the relative position of straight lines on the plane...
Dynamic visualization of the relative position of straight lines on the plane...Dynamic visualization of the relative position of straight lines on the plane...
Dynamic visualization of the relative position of straight lines on the plane...
 
Gamifying Research Activity Support System
Gamifying Research Activity Support SystemGamifying Research Activity Support System
Gamifying Research Activity Support System
 
Ap for b.tech. (mechanical) Assignment Problem
Ap for b.tech. (mechanical) Assignment Problem Ap for b.tech. (mechanical) Assignment Problem
Ap for b.tech. (mechanical) Assignment Problem
 

Similaire à Extracting static and dynamic model elements from textual specifications in humanities

The Matrix: connecting and re-using digital records of archaeological investi...
The Matrix: connecting and re-using digital records of archaeological investi...The Matrix: connecting and re-using digital records of archaeological investi...
The Matrix: connecting and re-using digital records of archaeological investi...
Keith.May
 
UHDMML.pps
UHDMML.ppsUHDMML.pps
UHDMML.pps
butest
 
Geographical Map Annotation With Social Metadata In a Surveillance Environment
Geographical Map Annotation With Social Metadata In a Surveillance EnvironmentGeographical Map Annotation With Social Metadata In a Surveillance Environment
Geographical Map Annotation With Social Metadata In a Surveillance Environment
Elena Roglia
 
Chapter 1. Introduction
Chapter 1. IntroductionChapter 1. Introduction
Chapter 1. Introduction
butest
 
Extracting City Traffic Events from Social Streams
 Extracting City Traffic Events from Social Streams Extracting City Traffic Events from Social Streams
Extracting City Traffic Events from Social Streams
Pramod Anantharam
 

Similaire à Extracting static and dynamic model elements from textual specifications in humanities (20)

The Matrix: connecting and re-using digital records of archaeological investi...
The Matrix: connecting and re-using digital records of archaeological investi...The Matrix: connecting and re-using digital records of archaeological investi...
The Matrix: connecting and re-using digital records of archaeological investi...
 
Keith_May_S12_CAA2023_Amsterdam.pptx
Keith_May_S12_CAA2023_Amsterdam.pptxKeith_May_S12_CAA2023_Amsterdam.pptx
Keith_May_S12_CAA2023_Amsterdam.pptx
 
Introducing Spatial Coverage in a Semantic Repository Model - Phd defence
Introducing Spatial Coverage in a Semantic Repository Model - Phd defence Introducing Spatial Coverage in a Semantic Repository Model - Phd defence
Introducing Spatial Coverage in a Semantic Repository Model - Phd defence
 
UHDMML.pps
UHDMML.ppsUHDMML.pps
UHDMML.pps
 
On the Management, Analysis and Simulation of our LifeSteps
On the Management, Analysis and Simulation of our LifeStepsOn the Management, Analysis and Simulation of our LifeSteps
On the Management, Analysis and Simulation of our LifeSteps
 
Semantics-enhanced Geoscience Interoperability, Analytics, and Applications
Semantics-enhanced Geoscience Interoperability, Analytics, and ApplicationsSemantics-enhanced Geoscience Interoperability, Analytics, and Applications
Semantics-enhanced Geoscience Interoperability, Analytics, and Applications
 
Strasbourg2010 bozzano-resa
Strasbourg2010 bozzano-resaStrasbourg2010 bozzano-resa
Strasbourg2010 bozzano-resa
 
AAG_2011
AAG_2011AAG_2011
AAG_2011
 
ArchAIDE kickoff meeting introduction
ArchAIDE kickoff meeting introductionArchAIDE kickoff meeting introduction
ArchAIDE kickoff meeting introduction
 
Developing common European archaeological concepts through extending the CIDO...
Developing common European archaeological concepts through extending the CIDO...Developing common European archaeological concepts through extending the CIDO...
Developing common European archaeological concepts through extending the CIDO...
 
Geographical Map Annotation With Social Metadata In a Surveillance Environment
Geographical Map Annotation With Social Metadata In a Surveillance EnvironmentGeographical Map Annotation With Social Metadata In a Surveillance Environment
Geographical Map Annotation With Social Metadata In a Surveillance Environment
 
Chapter 1. Introduction
Chapter 1. IntroductionChapter 1. Introduction
Chapter 1. Introduction
 
An Open and Shut Case? Shared Standards for Stratigraphic Data and Heritage L...
An Open and Shut Case? Shared Standards for Stratigraphic Data and Heritage L...An Open and Shut Case? Shared Standards for Stratigraphic Data and Heritage L...
An Open and Shut Case? Shared Standards for Stratigraphic Data and Heritage L...
 
Moving forward data centric sciences weaving AI, Big Data & HPC
Moving forward data centric sciences  weaving AI, Big Data & HPCMoving forward data centric sciences  weaving AI, Big Data & HPC
Moving forward data centric sciences weaving AI, Big Data & HPC
 
20IT501_DWDM_PPT_Unit_V.ppt
20IT501_DWDM_PPT_Unit_V.ppt20IT501_DWDM_PPT_Unit_V.ppt
20IT501_DWDM_PPT_Unit_V.ppt
 
Extracting City Traffic Events from Social Streams
 Extracting City Traffic Events from Social Streams Extracting City Traffic Events from Social Streams
Extracting City Traffic Events from Social Streams
 
Exploration – A Serious Game
Exploration – A Serious GameExploration – A Serious Game
Exploration – A Serious Game
 
Classification of Big Data Use Cases by different Facets
Classification of Big Data Use Cases by different FacetsClassification of Big Data Use Cases by different Facets
Classification of Big Data Use Cases by different Facets
 
Mind the Gap: Another look at the problem of the semantic gap in image retrieval
Mind the Gap: Another look at the problem of the semantic gap in image retrievalMind the Gap: Another look at the problem of the semantic gap in image retrieval
Mind the Gap: Another look at the problem of the semantic gap in image retrieval
 
Innovative design methods for data science - beyond brainstorming
Innovative design methods for data science - beyond brainstormingInnovative design methods for data science - beyond brainstorming
Innovative design methods for data science - beyond brainstorming
 

Plus de Technological Ecosystems for Enhancing Multiculturality

Plus de Technological Ecosystems for Enhancing Multiculturality (20)

A Preliminary Study of Proof of Concept Practices and their connection with I...
A Preliminary Study of Proof of Concept Practices and their connection with I...A Preliminary Study of Proof of Concept Practices and their connection with I...
A Preliminary Study of Proof of Concept Practices and their connection with I...
 
Social networks as a promotional space for Spanish radio content. The case st...
Social networks as a promotional space for Spanish radio content. The case st...Social networks as a promotional space for Spanish radio content. The case st...
Social networks as a promotional space for Spanish radio content. The case st...
 
Towards the study of sentiment in the public opinion of science in Spanish
Towards the study of sentiment in the public opinion of science in SpanishTowards the study of sentiment in the public opinion of science in Spanish
Towards the study of sentiment in the public opinion of science in Spanish
 
A Three-Step Data-Mining Analysis of Top-Ranked Higher Education Institutions...
A Three-Step Data-Mining Analysis of Top-Ranked Higher Education Institutions...A Three-Step Data-Mining Analysis of Top-Ranked Higher Education Institutions...
A Three-Step Data-Mining Analysis of Top-Ranked Higher Education Institutions...
 
Specifics of multimedia texts in the context of social networks media aesthetics
Specifics of multimedia texts in the context of social networks media aestheticsSpecifics of multimedia texts in the context of social networks media aesthetics
Specifics of multimedia texts in the context of social networks media aesthetics
 
Combined Effects of Similarity and Imagined Contact on First-Person Testimoni...
Combined Effects of Similarity and Imagined Contact on First-Person Testimoni...Combined Effects of Similarity and Imagined Contact on First-Person Testimoni...
Combined Effects of Similarity and Imagined Contact on First-Person Testimoni...
 
Direct online political communication effects on civil participation in spain...
Direct online political communication effects on civil participation in spain...Direct online political communication effects on civil participation in spain...
Direct online political communication effects on civil participation in spain...
 
University Media in Ecuador: Types, Functions and Self-determination
University Media in Ecuador: Types, Functions and Self-determinationUniversity Media in Ecuador: Types, Functions and Self-determination
University Media in Ecuador: Types, Functions and Self-determination
 
Like it or die: using social networks to improve collaborative learning in hi...
Like it or die: using social networks to improve collaborative learning in hi...Like it or die: using social networks to improve collaborative learning in hi...
Like it or die: using social networks to improve collaborative learning in hi...
 
Framing theory in studies of environmental information in press
Framing theory in studies of environmental information in pressFraming theory in studies of environmental information in press
Framing theory in studies of environmental information in press
 
Domain engineering for generating dashboards to analyze employment and employ...
Domain engineering for generating dashboards to analyze employment and employ...Domain engineering for generating dashboards to analyze employment and employ...
Domain engineering for generating dashboards to analyze employment and employ...
 
Mapping the systematic literature studies about software ecosystems
Mapping the systematic literature studies about software ecosystemsMapping the systematic literature studies about software ecosystems
Mapping the systematic literature studies about software ecosystems
 
Tag-Based Browsing of Digital Collections with Inverted Indexes and Browsing ...
Tag-Based Browsing of Digital Collections with Inverted Indexes and Browsing ...Tag-Based Browsing of Digital Collections with Inverted Indexes and Browsing ...
Tag-Based Browsing of Digital Collections with Inverted Indexes and Browsing ...
 
A Multivocal Literature Review on the use of DevOps for e-learning systems
A Multivocal Literature Review on the use of DevOps for e-learning systemsA Multivocal Literature Review on the use of DevOps for e-learning systems
A Multivocal Literature Review on the use of DevOps for e-learning systems
 
Document Annotation Tools: Annotation Classification Mechanisms
Document Annotation Tools: Annotation Classification MechanismsDocument Annotation Tools: Annotation Classification Mechanisms
Document Annotation Tools: Annotation Classification Mechanisms
 
Toward supporting decision-making under uncertainty in digital humanities wit...
Toward supporting decision-making under uncertainty in digital humanities wit...Toward supporting decision-making under uncertainty in digital humanities wit...
Toward supporting decision-making under uncertainty in digital humanities wit...
 
Managing Uncertainty in the Humanities: Digital and Analogue Approaches
Managing Uncertainty in the Humanities: Digital and Analogue ApproachesManaging Uncertainty in the Humanities: Digital and Analogue Approaches
Managing Uncertainty in the Humanities: Digital and Analogue Approaches
 
Representing Imprecise and Uncertain Knowledge in Digital Humanities: A Theor...
Representing Imprecise and Uncertain Knowledge in Digital Humanities: A Theor...Representing Imprecise and Uncertain Knowledge in Digital Humanities: A Theor...
Representing Imprecise and Uncertain Knowledge in Digital Humanities: A Theor...
 
Dotmocracy and Planning Poker for Uncertainty Management in Collaborative Res...
Dotmocracy and Planning Poker for Uncertainty Management in Collaborative Res...Dotmocracy and Planning Poker for Uncertainty Management in Collaborative Res...
Dotmocracy and Planning Poker for Uncertainty Management in Collaborative Res...
 
Applying Commercial Computer Vision Tools to Cope with Uncertainties in a Cit...
Applying Commercial Computer Vision Tools to Cope with Uncertainties in a Cit...Applying Commercial Computer Vision Tools to Cope with Uncertainties in a Cit...
Applying Commercial Computer Vision Tools to Cope with Uncertainties in a Cit...
 

Dernier

Salient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functionsSalient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functions
KarakKing
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
QucHHunhnh
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
ciinovamais
 
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
ZurliaSoop
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
heathfieldcps1
 

Dernier (20)

Salient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functionsSalient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functions
 
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17  How to Extend Models Using Mixin ClassesMixin Classes in Odoo 17  How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
 
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
 
Python Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxPython Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docx
 
How to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POSHow to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POS
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
 
Unit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptxUnit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptx
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdf
 
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
 
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptxHMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
 
Sociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning ExhibitSociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning Exhibit
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
 
Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)
 
Spatium Project Simulation student brief
Spatium Project Simulation student briefSpatium Project Simulation student brief
Spatium Project Simulation student brief
 
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptxSKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
 
Making communications land - Are they received and understood as intended? we...
Making communications land - Are they received and understood as intended? we...Making communications land - Are they received and understood as intended? we...
Making communications land - Are they received and understood as intended? we...
 
ICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptxICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptx
 
SOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning PresentationSOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning Presentation
 

Extracting static and dynamic model elements from textual specifications in humanities

  • 1. Extracting static and dynamic model elements from textual specifications in humanities Patricia Martín-Rodilla César González-Pérez Institute of Heritage Sciences, Spanish National Research Council Santiago de Compostela, Spain.
  • 2. Index Research Context & Problem Goal(s) Related Work Proposal: oProposal Overview oProposal Phases Case Study in Cultural Heritage Information Systems Discussion & Open Issues
  • 3. Research Context Information Systems are composed of different information dimensions: … Structural (STATIC) Architectural Behavioral Methodological (DYNAMIC) … BUT, IS support humans activities SOFTWARE ANALYST Software Textual Specifications Documents about practices … Structural (STATIC) MODEL Architectural MODEL Behavioral MODEL Methodological (DYNAMIC) MODEL … BUT, in Humanities information… Narrative-based domains Importance about the methodological context of information (Static and dynamic link very pronounced) Software analysts require hard information dimension effort Software analysts are far from DH expertise
  • 4. • To study how other works deal with the different information dimensions from an holistic point of view, also: • For humanities IS • Directly from textual specifications in early stages software conception • To propose a pipeline method as a tentatively semiautomatic approach for our needs in humanities domains Goal(s)
  • 5. Related work Works in modelling and automatic extraction of DIFFERENT information dimensions Methods (Domain rules) Processes (Process Mining) Notations (BPMN, Topic maps, Mind Maps, Concept Maps, i*,…) Practices (Scenarios) Works in HOLISTIC modelling and automatic extraction of information dimensions Open/METIS ISO/IEC 24744 Requirements: Cross-cutting concerns NEED: From early stages textual specifications? NEED: More than a conceptual bridge…Semi-supervised?
  • 6. Pipeline approach: based on previous works: TextProcessMiner tool (Epure, Martin-Rodilla et al. 2015) Initial dynamic information -> Process Mining Algorithms: Activity Logs Initial static information -> Identification of domain key concepts: Concept map Proposal
  • 7. Phase I: TextProcessMiner • Natural Language Processing approach • TextProcessMiner extracts activities from historical and archaeological official reports. • Previously tested at CSIC, ADS…: in different languages, validated by report’s authors. • Locality principle in the activity identification: tree-based syntactic structure. TextCleaner (Lemmatization, Automatic cleaning, activities recognition) ActivityMiner ActivityRelationshipMiner Phase II: Preliminary Concept Map Historical and Archaeological Methodological Textual Specifications Discovered Log (DYNAMIC INFO DIMENSION) Discovered Log (DYNAMIC INFO DIMENSION) • Automatic identification of domain key concepts • Part of Speech (POS) tagging techniques: decoupling action verbs (activities candidates) countable nouns (key concepts candidates) • Why concepts maps?: Intermediate formalization degree Learning potential Iterative methodology in concept map creation • Why semi-automatic? Better results in annotation approaches in humanities Entities Decoupling (POS tagg.) Activities Decoupling (POS tagg.) Cross-links matching (tree-based syntactic structure) Preliminary Concept Map
  • 8. Phase III: Supervised Phase Preliminary Concept Map Iterative Phase Concepts and activity names verification: terminology, synonyms Order and dependence cross links verification Domain key concepts learning Pipeline offers to Software Analysts: - Most important concepts identification in the domain in a learning environment - Activities identification and logs - Static and dynamic preliminary link in domains’ terminology Pipeline is current used: - As a preliminary tool for extracting an holistic information view from early stages textual specifications. - As a tool for improving the model quality in terms of humanities terminology. Supervised Concept Map + Activity Log sequence
  • 9. Case Study: Extracting models in Cultural Heritage IS
  • 10. Phase I: Extracting models in Cultural Heritage IS “The trench was excavated using a toothed bucket using the back actor of a small excavating machine. The watching brief archaeologist inspected the sides of the trench for any past cultural remains below the overburden. The removed spoil was inspected in order to recover any past cultural artefacts. Where archaeological deposits were revealed, each layer, fill and cut was individually numbered and described in terms of soil detail, stratigraphic position, dimensions, artefact content, environmental samples and interpretation. The context system was cross-referenced to other records. Registers were maintained for all photographs, levels, plans, section, finds and samples taken, made or gathered in the field.” (From ADS Archaeological Report, Gerry Martin Associates Ltd. Glasgow) - excavate trench -take photograph - use bucket -take level - use back_actor_of_machine -take plan - inspect side_of_trench -take section - inspect spoil -take find - recover artefact -make photograph - reveal deposit -make level - number layer -make plan - number fill -make section - number cut -make find - describe layer -gather photograph - describe fill -gather level - describe cut -gather plan - cross_referenced context_system -gather section - maintain register -gather find Discovered LogTextual Specification
  • 11. Phase II: Extracting models in Cultural Heritage IS Trench Bucket Back_actor_of _machine LayerSide_of_trench Level Cut Find Plan SectionFill Deposit Artefact Spoil Photograph Register Context system - excavate trench -take photograph - use bucket -take level - use back_actor_of_machine -take plan - inspect side_of_trench -take section - inspect spoil -take find - recover artefact -make photograph - reveal deposit -make level - number layer -make plan - number fill -make section - number cut -make find - describe layer -gather photograph - describe fill -gather level - describe cut -gather plan - cross_referenced context_system -gather section - maintain register -gather find Discovered Log String Concept Map + Activity List
  • 12. Phase II: Extracting models in Cultural Heritage IS Preliminary Concept Map Eastgate, Hexham Bucket Back_actor_ of_machine Layer Side_of_trench Level Cut Find PlanSection Fill DepositArtefact Spoil Photograph Register Context system USE INSPECT REVEAL RECOVER NUMBER DESCRIBE CROSS- REFERENCE GATHER MANTAIN EXCAVATE Trench
  • 13. Phase III: Extracting models in Cultural Heritage IS Supervised Concept Map Eastgate, Hexham BucketBack_actor_ of_machine Layer Side_of_trench Level Cut Find PlanSection Fill DepositArtefact Spoil Photograph Register Context system USES INSPECTS ALLOWS REVEALING ALLOWS RECOVERING NUMBERS DESCRIBES CROSS- REFERENCE TO GATHER HAS TO MANTAIN EXCAVATES Trench
  • 14. Discussion & Open Issues  Work-in-progress proposal: holistic static and dynamic approach in information modelling  Software analysts do not need previous domain knowledge to start creating models  Maintenance of the semantic static and dynamic link in humanities domains’ terminology  Semi-supervised approach: Software analysts can gradually learn domains’ key concepts and practices  Iterative pipeline: incremental improvement of the outputs  Tested and evaluated by experts at historical and archaeological textual specifications  Technological dependences: TextProcessMiner (NLP toolkit by Standford) -> TOWARDS A METAMODEL  Locality principle and synonyms limitations-> WordNet, CILI INTEGRATION  Humanities sub-domains’ adaptation: CH thesauri's, ontologies  Need for rigorous validation with a vast CH textual specifications corpus  From activity list to Process Models (Process Mining tools integration: DISCO, etc.)
  • 15. Extracting static and dynamic model elements from textual specifications in humanities Thank you for your attention Patricia Martín-Rodilla patricia.martin-rodilla@incipit.csic.es Institute of Heritage Sciences Spanish National Research Council Santiago de Compostela, Spain.