SlideShare une entreprise Scribd logo
1  sur  21
Knowledge Discovery in Environmental Impact Report’s summary texts: an exploratory analysis of four case studies Cláudia Viviane  Viegas 1,2 , Roseli  Búrigo 1,3 , José Leomar  Todesco 1 , Fernando Alvaro Ostuni  Gauthier 1 , Paulo Maurício  Selig 1 1 Federal University of Santa Catarina,  UFSC , Engineering and Knowledge Management Post-graduation Program, Florianópolis (SC), BRAZIL;  2 Feevale  University, Novo Hamburgo (RS), BRAZIL; 3 Santa Catarina Extrem South University,  Unesc , Criciúma (SC), BRAZIL
What? This paper analyses four summary texts from Environmental Impact Reports (EIRs) prepared for hydroelectric facilities, built in Brazil between 1997 and 2005. Documents’ brochures are: Jirau’s (1), Ipueiras’(2), Paulistas’(3), and Barra Grande’s (4) dams. 1 2 3 4
How?   Knowledge Discovery Texts techniques (KDT), namely stopwords and stemming, are employed. EIRs summarise Environmental Impact Assessment (EIAs) outcomes, which are mandatory by law in order to identify and measure effects from entrepreneurship with high levels of environmental change, and to formulate mitigation measures.  A thesaurus is elaborated from the Reference Term (RT) - a document provided by governmental environmental institutions to guide EIAs-EIRs construction.  A contextual approach is employed in order to cover the most number of words and expressions which bear similarities with each other. The thesaurus' words and expressions are classified into 22 groups according to the similarities of their meanings. Such words and expressions are compared to EIRs summaries' words and expressions, after these summaries have undergone data preprocessing.
Why? Brazilian EIRs are often criticised as incomplete and superficial, but this criticism suffers from a lack of objective support.  Major findings Comparison of the results of the thesaurus versus words and expressions acquired from summaries allows us to conclude that the EIRs emphasise: placement issue; impacts; environmental alternatives; and mitigation/compensation procedures. Expressions such as technological resources; financial resources; social and economic context; economic alternatives; impact size; impact relevance; environmental effects; and harm prevention, listed in thesaurus, are not mentioned in the summaries.
EIAs-EIRs guidelines - problem’s approach In Brazil, EIAs-EIRs are required by law, which originates generic Reference Term (RT) as a guideline to this kind of study. Zilberman (1995) highlights five generic steps of an EIA-EIR, and we can consider the first three as more relevant to the thesauru’s building: -  Step I : identification - Information about project site, technological and financial resources to control project environmental effects, socioeconomic context, objectives of land use and occupation policies, legislation, and size and alternatives for these impacts.  -  Step II : environmental diagnostic - Evaluation of each impact identified in the previous step. Physical, biological (or biotic), and socieoeconomic environments are evaluated. -  Step III : impacts' prognosis -  Environmental effects of business are identified and analysed, as well as technological and economic possibilities of prevention and control, mitigation and repair. An alternative is chosen as the basis of the EIA-EIR.
Theoretic framework - IR and KDT To better understand the content of EIRs summaries, Information Retrieval (IR) studies can be worthwhile. IR is "(...) an activity which involves aspects of information description (indexation, pattern building) and it encompasses specification for searching, including any technique, system or machine employed to do or support such tasks” (WIVES, 2002). IR is the process or method where a potential information user can change your information necessity in a real list of stored documents' citations which contain useful information to him (SARACEVIC, 1995). Indexation is the first step of IR. It refers to the selection of relevant words in document, and can be done through controlled vocabulary techniques. It has the aim to build access points to a document. It is possible through the use of key words and identification of expressions (WIVES, 2002).
Relationship between EIRs and KDT The creation of a thesaurus containing key words and expressions from stages of EIA-EIR, following Zilberman's (1995) guidelines, is a first step in establishing a relationship between EIRs and KDT. It is a necessary precursor to the further process of relevant information identification, called matching. It identifies similarities between relevant information to user query and information stored in the system.  EIAs-EIRS major guidelines Semantic treatment thesaurus
KDT techniques - stopwords and stemming   Semantic analysis was employed in order to deal with EIRs summaries, using techniques such as  stopwords  and  stemming .  Stopwords are irrelevant words, and include prepositions, conjunctions, pronouns and others with no meaning in a specific context. It includes "words with no relevant semantic content in their context and irrelevant words in the text analysis” (LOPES, 2004).  Morphologic normalisation, called stemming, takes word's radical as being relevant, without taking in account desinences. “With this technique, user does not need to worry with the orthographic shape of a written word in a text. So, an idea, independent of being written as substantive, adjective or verb, is identified by the same (and single) radical” (WIVES, 2002).
EIRs summaries targeted Jirau Ipueiras Paulistas Barra Grande
Matching and weighting   After the analysis of the texts' summaries, supported by tools such as stopwords and stemming, the matching technique is employed taking in account words and expressions of the texts' summaries and thesaurus' words and expressions. It means considering the relevance of each key word and expression, which is given by the relative frequency of indexed words - by the number of times they appear in comparison with the number of document's words. This is a weighting process. In order to understand the weights' meaning, a clustering technique is employed. Instead of investigation hypothesis, a proactive approach is used to acquire information, designing an exploratory research, which “(...) is useful to detect potential problems and opportunities (Loh et al., 2000)
Thesaurus’ building (I) Following Zilberman's guide to elaborate EIAs-EIRs (1995) as RT, we listed the following steps with respective set of key words and expressions:  - Step I: A - placement, place(s), locational alternative(s), area, area(s) of influence area, influenced area(s) , affected area, region, region of influence, where; B - technologic resources, technology; C - financial resources; financing; D- socioeconomic context, socio economic aspect(s) socioeconomic(s), socioeconomy; E - soil using policy, soil use; F - legislation, law(s), resolution(s), legal aspects.  - Step II: G - environmental diagnostic; H- environmental impact(s), environmental change(s); I- physical media; J- biological media, biotic media; K- physical-biotic media, physical and biotic media; L- socioeconomic media, socioeconomic aspect(s); M- impacts' dimension; N- impacts' relevance.
Thesaurus’ building (II) - Step III: O - impacts prognostic, environmental prognostic(s); P- environmental effects; Q- (environmental) alternative, (environmental) plans, (environmental) projects, (environmental) programs; R- technological alternative; S- economic alternative; T-mitigation, measures, attenuation measures, compensatory measures, corrective measures, compensation, compensate, correction, correct, repair; U- control, monitoring; V- prevention.   thesaurus weighting matching thesaurus
Semantic treatment results (I) Semantic classification results of EIRs’ summaries texts weighted and compared with thesaurus terms
Semantic treatment results (II) Matching and weighting analysis’ aspects according to each facility summary
Findings and discussion (I) More common words and expressions   More common key words and expressions identified belong to the A, H, Q, and T groups. They represent all steps described by Zilberman (1995): A (I), H (II) e Q, and T (III). More important summary items are placement, impact, alternatives, plans, projects or environmental programs, and mitigation measures.  Relevance Considering the total number of key words and expressions for each summary and matching them with the thesaurus' list, we find that the Barra Grande EIR has the best match, as it contains the highest relative proportion of key words and expressions (11,2%) compared with the thesaurus' words and expressions. EIRs summaries of Paulistas (11%), Ipueiras (9,7%) and Jirau (8,4%) facilities perform less well.
Findings and discussion (II) Number of words and expressions selected in each summary compared with thesaurus' sets of words and expressions   In this analysis, we find that the Ipueiras' summary has the best representativeness: 11 words, or 50% of the whole thesaurus. Barra Grande (45,4%), Paulistas (27,2%), and Jirau (18,1%) all match fewer words in the thesaurus. So, we can conclude the Jirau's summary has the poorest overall match with the thesaurus in terms of both number and relevance of words.
Conclusions   The most important items of summaries, compared to the thesaurus, are placement, impact, alternatives, plans, environmental projects or programs, and mitigation measures. Regarding the thesaurus’ words or expressions frequency in each summary, and sets of words and expressions – we listed 22 groups –, EIRs with more summaries’ fitness are Barra Grande   and Ipueiras, and Jirau’s   has the least fitness.  This conclusion, even related to summaries with few words – between 119 and 354 –, indicates on which issues must be focused further studies related to EIRs texts’ semantic analysis. The analysed summaries are not concerned to bring up technological and economic issues, for example, or subjects as dimensioning and environmental impacts' relevance. We recommend the analysis of more documents in order to confirm or refute these results, which we consider as primary.
References (I) Campos, P.M.P. (org.). 1986. Usinas hidrelétricas de Santo Antônio e Jirau -  RIMA . Furnas e Odebrecht, Rio de Janeiro (RJ), 82p. Castro, T.L.C. (org.). 1997.  UHE Barra Grande  -  Relatório de Impacto ao Meio Ambiente . Sumário. Engevix, São Paulo (SP), 59p.  Ferneda, E. 2003.  Recuperação de Informação: Análise sobre a Contribuição da Ciência da Computação para a Ciência da Informação . Escola de Comunicação e Artes da Universidade de São Paulo/ USP (Tese de Doutorado).  Jensen, P.D. (org.). 2005.  Usina Hidrelétrica Ipueiras  -  Relatório de Impacto Ambiental - RIMA . Rede Ipueiras Empresas de Energia Elétrica e Themag Engenharia. São Paulo (SP), 97p.
References (II) Loh, S.; Wives, L.K.; Oliveira, J.P.M. 2000.  Descoberta proativa de conhecimento em coleções textuais: iniciando sem hipóteses .  In: IV Oficina de Inteligência Artificial, Pelotas (RS), p. 143-154. Available in <http://www.inf.ufrgs.br/~palazzo/OAI/00%20OIA.pdf.> Accessed in April 20 th  2006. Lopes, M.C.S. 2004.  Mineração de Dados Textuais Utilizando Técnicas de Clustering para o Idioma Português .  Universidade Federal do Rio de Janeiro/ UFRJ (Tese ), 191 p. Montano, C.F.B.; Pithan, R.O. (org.). 2005.  Relatório de Impacto Ambiental - RIMA   - AHE Paulistas , Rio São Marcos (GO/MG). Biodinâmica Engenharia. Rio de Janeiro (RJ), 54p. Moreira, R. 2002. Para que o EIA-RIMA Quase Vinte Anos Depois? In: Verdum, R. e Medeiros, R. M. (org.).  RIMA - Relatório de Impacto Ambiental . Ed. UFRGS (4ª edição): Porto Alegre, p.11-21.
References (III) Rohde, G. M. 2002. Estudos de Impacto Ambiental: A Situação Brasileira em 2000. In: Verdum, R. e Medeiros, R. M. (org.).  RIMA - Relatório de Impacto Ambiental . Ed. UFRGS (4ª edição): Porto Alegre, p. 41-65. Saracevic, T. 1995. Evaluation of Evaluation in Information Retrieval. In: Conference on Research and Development in Information Retrieval. 18th Annual International SIGIR, Seattle, USA. (Proceedings).  ACM Press ,  p. 137-146. Wives, L.K. 2002.  Tecnologias de Descoberta de Conhecimento em Textos Aplicadas à Inteligência Competitiva . Programa de Pós-graduação em Computação (Exame de Qualificação), 116 p. Porto Alegre (RS), Universidade Federal do Rio Grande do Sul (UFRGS). Zilberman, Isaac. 1995.  Conceitos e Metodologias para Estudos de  Impacto Ambiental . Ed. Ulbra: Canoas (RS).
Author’s contact Cláudia V. Viegas – claudiav@egc.ufsc.br  Roseli Búrigo – rbc@unesc.net J. Leomar Todesco – tite@stela.ufsc.br Fernando O. Gauthier – gauthier@inf.ufsc.br Paulo M. Selig – selig@egc.ufsc.br Acknowledgement We thank to advices coming from Dr.  Alan J. Bond , senior lecturer, Environmental Sciences School,  University of East Anglia  (UEA), Norwich, UK.

Contenu connexe

En vedette

Sérgio Gonçalves
Sérgio Gonçalves Sérgio Gonçalves
Sérgio Gonçalves ProjetoBr
 
Relatório Funcafe 2009
Relatório Funcafe 2009 Relatório Funcafe 2009
Relatório Funcafe 2009 Sergio Pereira
 
Research yearbook 2014 - 2015
Research yearbook 2014 - 2015Research yearbook 2014 - 2015
Research yearbook 2014 - 2015FGV Brazil
 
WCM 2009-TT20-EXCELLENCE - WORLD CLASS MAINTENANCE-MANUTENÇÃO CLASSE MUNDIAL
WCM 2009-TT20-EXCELLENCE - WORLD CLASS MAINTENANCE-MANUTENÇÃO CLASSE MUNDIALWCM 2009-TT20-EXCELLENCE - WORLD CLASS MAINTENANCE-MANUTENÇÃO CLASSE MUNDIAL
WCM 2009-TT20-EXCELLENCE - WORLD CLASS MAINTENANCE-MANUTENÇÃO CLASSE MUNDIALEXCELLENCE CONSULTING
 
Manejo del correo
Manejo del correoManejo del correo
Manejo del correosenajulian
 
Proposta comercial
Proposta comercialProposta comercial
Proposta comercialDenis Katko
 
Química preparatoria tec m
Química preparatoria tec mQuímica preparatoria tec m
Química preparatoria tec mMaestros Online
 
Construccion de concreto
Construccion de concretoConstruccion de concreto
Construccion de concretowilliams
 
WCM-WORLD CLASS MAINTENANCE-BEST PRACTICES-MANUTENÇÃO CLASSE MUNDIAL - MELHOR...
WCM-WORLD CLASS MAINTENANCE-BEST PRACTICES-MANUTENÇÃO CLASSE MUNDIAL - MELHOR...WCM-WORLD CLASS MAINTENANCE-BEST PRACTICES-MANUTENÇÃO CLASSE MUNDIAL - MELHOR...
WCM-WORLD CLASS MAINTENANCE-BEST PRACTICES-MANUTENÇÃO CLASSE MUNDIAL - MELHOR...EXCELLENCE CONSULTING
 
Filacap on line 075
Filacap on line 075Filacap on line 075
Filacap on line 075mgermina
 
Exercícios power point selma soares
Exercícios power point selma soaresExercícios power point selma soares
Exercícios power point selma soaresselmasoares
 

En vedette (20)

Sergio Lobato CV
Sergio Lobato CVSergio Lobato CV
Sergio Lobato CV
 
Sérgio Gonçalves
Sérgio Gonçalves Sérgio Gonçalves
Sérgio Gonçalves
 
Em Movimento nº14
Em Movimento nº14Em Movimento nº14
Em Movimento nº14
 
Eletronica senai
Eletronica senaiEletronica senai
Eletronica senai
 
Relatório Funcafe 2009
Relatório Funcafe 2009 Relatório Funcafe 2009
Relatório Funcafe 2009
 
Research yearbook 2014 - 2015
Research yearbook 2014 - 2015Research yearbook 2014 - 2015
Research yearbook 2014 - 2015
 
Cap6 smds
Cap6 smdsCap6 smds
Cap6 smds
 
WCM 2009-TT20-EXCELLENCE - WORLD CLASS MAINTENANCE-MANUTENÇÃO CLASSE MUNDIAL
WCM 2009-TT20-EXCELLENCE - WORLD CLASS MAINTENANCE-MANUTENÇÃO CLASSE MUNDIALWCM 2009-TT20-EXCELLENCE - WORLD CLASS MAINTENANCE-MANUTENÇÃO CLASSE MUNDIAL
WCM 2009-TT20-EXCELLENCE - WORLD CLASS MAINTENANCE-MANUTENÇÃO CLASSE MUNDIAL
 
Consenso TEP 2010
Consenso TEP 2010Consenso TEP 2010
Consenso TEP 2010
 
Manejo del correo
Manejo del correoManejo del correo
Manejo del correo
 
Conceito de factoring
Conceito de factoringConceito de factoring
Conceito de factoring
 
Proposta comercial
Proposta comercialProposta comercial
Proposta comercial
 
Química preparatoria tec m
Química preparatoria tec mQuímica preparatoria tec m
Química preparatoria tec m
 
SEMED
SEMEDSEMED
SEMED
 
Construccion de concreto
Construccion de concretoConstruccion de concreto
Construccion de concreto
 
Caderno tarefas dd-1824_geral
Caderno tarefas dd-1824_geralCaderno tarefas dd-1824_geral
Caderno tarefas dd-1824_geral
 
WCM-WORLD CLASS MAINTENANCE-BEST PRACTICES-MANUTENÇÃO CLASSE MUNDIAL - MELHOR...
WCM-WORLD CLASS MAINTENANCE-BEST PRACTICES-MANUTENÇÃO CLASSE MUNDIAL - MELHOR...WCM-WORLD CLASS MAINTENANCE-BEST PRACTICES-MANUTENÇÃO CLASSE MUNDIAL - MELHOR...
WCM-WORLD CLASS MAINTENANCE-BEST PRACTICES-MANUTENÇÃO CLASSE MUNDIAL - MELHOR...
 
Filacap on line 075
Filacap on line 075Filacap on line 075
Filacap on line 075
 
Exercícios power point selma soares
Exercícios power point selma soaresExercícios power point selma soares
Exercícios power point selma soares
 
Sistema financeiro Moçambicano
Sistema financeiro MoçambicanoSistema financeiro Moçambicano
Sistema financeiro Moçambicano
 

Similaire à Knowledge Discovery in Environmental Impact Report’s summary texts: an exploratory analysis of four case studies

Developing reverse logistics programs: a resource based view.
Developing reverse logistics programs: a resource based view.Developing reverse logistics programs: a resource based view.
Developing reverse logistics programs: a resource based view.tenderboyfriend96
 
Heather_Spitzer-_Resume & CHMM & ISO Cert
Heather_Spitzer-_Resume & CHMM & ISO CertHeather_Spitzer-_Resume & CHMM & ISO Cert
Heather_Spitzer-_Resume & CHMM & ISO CertHeather Spitzer, CHMM
 
Comparative Study of the Quality of Life, Quality of Work Life and Organisati...
Comparative Study of the Quality of Life, Quality of Work Life and Organisati...Comparative Study of the Quality of Life, Quality of Work Life and Organisati...
Comparative Study of the Quality of Life, Quality of Work Life and Organisati...inventy
 
A Case Study Exploring Field-Level Risk Assessments As A Leading Safety Indic...
A Case Study Exploring Field-Level Risk Assessments As A Leading Safety Indic...A Case Study Exploring Field-Level Risk Assessments As A Leading Safety Indic...
A Case Study Exploring Field-Level Risk Assessments As A Leading Safety Indic...Heather Strinden
 
A Review Article On Quot Environmental Impact Assessment (Eia)
A Review Article On Quot Environmental Impact Assessment (Eia)A Review Article On Quot Environmental Impact Assessment (Eia)
A Review Article On Quot Environmental Impact Assessment (Eia)Robin Beregovska
 
Strategic thinking Model for SEA (Aplikasi di Indonesia)
Strategic thinking Model for SEA (Aplikasi di Indonesia)Strategic thinking Model for SEA (Aplikasi di Indonesia)
Strategic thinking Model for SEA (Aplikasi di Indonesia)praswaskita2
 
Situational analysis of the subjective well-being of university software deve...
Situational analysis of the subjective well-being of university software deve...Situational analysis of the subjective well-being of university software deve...
Situational analysis of the subjective well-being of university software deve...IJAEMSJORNAL
 
A GROUNDED THEORY OF THE REQUIREMENTS ENGINEERING PROCESS
A GROUNDED THEORY OF THE REQUIREMENTS ENGINEERING PROCESSA GROUNDED THEORY OF THE REQUIREMENTS ENGINEERING PROCESS
A GROUNDED THEORY OF THE REQUIREMENTS ENGINEERING PROCESSijseajournal
 
Visual Analytics and the Language of Web Query Logs - A Terminology Perspective
Visual Analytics and the Language of Web Query Logs - A Terminology PerspectiveVisual Analytics and the Language of Web Query Logs - A Terminology Perspective
Visual Analytics and the Language of Web Query Logs - A Terminology PerspectiveFindwise
 
Environmental scanning
Environmental scanningEnvironmental scanning
Environmental scanningchengcampoop
 
Environmental scanning
Environmental scanningEnvironmental scanning
Environmental scanningchengcampoop
 
Environmental scanning
Environmental scanningEnvironmental scanning
Environmental scanningchengcampoop
 
Great model a model for the automatic generation of semantic relations betwee...
Great model a model for the automatic generation of semantic relations betwee...Great model a model for the automatic generation of semantic relations betwee...
Great model a model for the automatic generation of semantic relations betwee...ijcsity
 
Implementing Intervention Research into PublicPolicy—the BI3
Implementing Intervention Research into PublicPolicy—the BI3Implementing Intervention Research into PublicPolicy—the BI3
Implementing Intervention Research into PublicPolicy—the BI3MalikPinckney86
 

Similaire à Knowledge Discovery in Environmental Impact Report’s summary texts: an exploratory analysis of four case studies (20)

Developing reverse logistics programs: a resource based view.
Developing reverse logistics programs: a resource based view.Developing reverse logistics programs: a resource based view.
Developing reverse logistics programs: a resource based view.
 
Heather_Spitzer-_Resume & CHMM & ISO Cert
Heather_Spitzer-_Resume & CHMM & ISO CertHeather_Spitzer-_Resume & CHMM & ISO Cert
Heather_Spitzer-_Resume & CHMM & ISO Cert
 
Guidance notoneia (1)
Guidance notoneia (1)Guidance notoneia (1)
Guidance notoneia (1)
 
Comparative Study of the Quality of Life, Quality of Work Life and Organisati...
Comparative Study of the Quality of Life, Quality of Work Life and Organisati...Comparative Study of the Quality of Life, Quality of Work Life and Organisati...
Comparative Study of the Quality of Life, Quality of Work Life and Organisati...
 
A Case Study Exploring Field-Level Risk Assessments As A Leading Safety Indic...
A Case Study Exploring Field-Level Risk Assessments As A Leading Safety Indic...A Case Study Exploring Field-Level Risk Assessments As A Leading Safety Indic...
A Case Study Exploring Field-Level Risk Assessments As A Leading Safety Indic...
 
SPM_Metrics_WhitePaper_3
SPM_Metrics_WhitePaper_3SPM_Metrics_WhitePaper_3
SPM_Metrics_WhitePaper_3
 
A Review Article On Quot Environmental Impact Assessment (Eia)
A Review Article On Quot Environmental Impact Assessment (Eia)A Review Article On Quot Environmental Impact Assessment (Eia)
A Review Article On Quot Environmental Impact Assessment (Eia)
 
Ny3424442448
Ny3424442448Ny3424442448
Ny3424442448
 
Operations Research
Operations ResearchOperations Research
Operations Research
 
Sustainability
SustainabilitySustainability
Sustainability
 
Strategic thinking Model for SEA (Aplikasi di Indonesia)
Strategic thinking Model for SEA (Aplikasi di Indonesia)Strategic thinking Model for SEA (Aplikasi di Indonesia)
Strategic thinking Model for SEA (Aplikasi di Indonesia)
 
Situational analysis of the subjective well-being of university software deve...
Situational analysis of the subjective well-being of university software deve...Situational analysis of the subjective well-being of university software deve...
Situational analysis of the subjective well-being of university software deve...
 
A GROUNDED THEORY OF THE REQUIREMENTS ENGINEERING PROCESS
A GROUNDED THEORY OF THE REQUIREMENTS ENGINEERING PROCESSA GROUNDED THEORY OF THE REQUIREMENTS ENGINEERING PROCESS
A GROUNDED THEORY OF THE REQUIREMENTS ENGINEERING PROCESS
 
Visual Analytics and the Language of Web Query Logs - A Terminology Perspective
Visual Analytics and the Language of Web Query Logs - A Terminology PerspectiveVisual Analytics and the Language of Web Query Logs - A Terminology Perspective
Visual Analytics and the Language of Web Query Logs - A Terminology Perspective
 
الواجججج
الواججججالواجججج
الواجججج
 
Environmental scanning
Environmental scanningEnvironmental scanning
Environmental scanning
 
Environmental scanning
Environmental scanningEnvironmental scanning
Environmental scanning
 
Environmental scanning
Environmental scanningEnvironmental scanning
Environmental scanning
 
Great model a model for the automatic generation of semantic relations betwee...
Great model a model for the automatic generation of semantic relations betwee...Great model a model for the automatic generation of semantic relations betwee...
Great model a model for the automatic generation of semantic relations betwee...
 
Implementing Intervention Research into PublicPolicy—the BI3
Implementing Intervention Research into PublicPolicy—the BI3Implementing Intervention Research into PublicPolicy—the BI3
Implementing Intervention Research into PublicPolicy—the BI3
 

Plus de inscit2006

Information Searcher-Provider Fit through Information Presentation and Visual...
Information Searcher-Provider Fit through Information Presentation and Visual...Information Searcher-Provider Fit through Information Presentation and Visual...
Information Searcher-Provider Fit through Information Presentation and Visual...inscit2006
 
Difference of application of fuzzy rough sets and probability random on targe...
Difference of application of fuzzy rough sets and probability random on targe...Difference of application of fuzzy rough sets and probability random on targe...
Difference of application of fuzzy rough sets and probability random on targe...inscit2006
 
The Interaction of Navigation Instructions and Visual Attention in Dynamic Au...
The Interaction of Navigation Instructions and Visual Attention in Dynamic Au...The Interaction of Navigation Instructions and Visual Attention in Dynamic Au...
The Interaction of Navigation Instructions and Visual Attention in Dynamic Au...inscit2006
 
Weighted Naïve Bayes Model for Semi-Structured Document Categorization
Weighted Naïve Bayes Model for Semi-Structured Document CategorizationWeighted Naïve Bayes Model for Semi-Structured Document Categorization
Weighted Naïve Bayes Model for Semi-Structured Document Categorizationinscit2006
 
The role of education within the framework of information sciences and techno...
The role of education within the framework of information sciences and techno...The role of education within the framework of information sciences and techno...
The role of education within the framework of information sciences and techno...inscit2006
 
A Metadata-Driven Approach to Computing Financial Analytics in a Relational D...
A Metadata-Driven Approach to Computing Financial Analytics in a Relational D...A Metadata-Driven Approach to Computing Financial Analytics in a Relational D...
A Metadata-Driven Approach to Computing Financial Analytics in a Relational D...inscit2006
 
A Comparative Study of RDBMs and OODBMs in Relation to Security of Data
A Comparative Study of RDBMs and OODBMs in Relation to Security of DataA Comparative Study of RDBMs and OODBMs in Relation to Security of Data
A Comparative Study of RDBMs and OODBMs in Relation to Security of Datainscit2006
 
iCE- Interactive Co-innovation Environment Software, Spatial Mapping Tools fo...
iCE- Interactive Co-innovation Environment Software, Spatial Mapping Tools fo...iCE- Interactive Co-innovation Environment Software, Spatial Mapping Tools fo...
iCE- Interactive Co-innovation Environment Software, Spatial Mapping Tools fo...inscit2006
 
Designing People’s Interconnections in Mobile Social Networks
Designing People’s Interconnections in Mobile Social NetworksDesigning People’s Interconnections in Mobile Social Networks
Designing People’s Interconnections in Mobile Social Networksinscit2006
 
Visual Literacy: A Semiotic Analysis of Icons as Visual Information Represent...
Visual Literacy: A Semiotic Analysis of Icons as Visual Information Represent...Visual Literacy: A Semiotic Analysis of Icons as Visual Information Represent...
Visual Literacy: A Semiotic Analysis of Icons as Visual Information Represent...inscit2006
 
Visualizing Search Term Relevance, Boolean Operators, and Phrases using the C...
Visualizing Search Term Relevance, Boolean Operators, and Phrases using the C...Visualizing Search Term Relevance, Boolean Operators, and Phrases using the C...
Visualizing Search Term Relevance, Boolean Operators, and Phrases using the C...inscit2006
 
Visualization of Multidimensional Information from Scientific Computations
Visualization of Multidimensional Information from Scientific ComputationsVisualization of Multidimensional Information from Scientific Computations
Visualization of Multidimensional Information from Scientific Computationsinscit2006
 
High dimensional Data Visualization using Star Coordinates on Three Dimensions
High dimensional Data Visualization using Star Coordinates on Three DimensionsHigh dimensional Data Visualization using Star Coordinates on Three Dimensions
High dimensional Data Visualization using Star Coordinates on Three Dimensionsinscit2006
 
Improvement in Quality of Speech associated with Braille codes - A Review
Improvement in Quality of Speech associated with Braille codes - A ReviewImprovement in Quality of Speech associated with Braille codes - A Review
Improvement in Quality of Speech associated with Braille codes - A Reviewinscit2006
 
Identificación de Nombres de Genes en la Literatura Biomédica
Identificación de Nombres de Genes en la Literatura BiomédicaIdentificación de Nombres de Genes en la Literatura Biomédica
Identificación de Nombres de Genes en la Literatura Biomédicainscit2006
 
An Intuitive Natural Language Understanding System
An Intuitive Natural Language Understanding SystemAn Intuitive Natural Language Understanding System
An Intuitive Natural Language Understanding Systeminscit2006
 
Requirement analysis for mobile information exchange in the police using a ti...
Requirement analysis for mobile information exchange in the police using a ti...Requirement analysis for mobile information exchange in the police using a ti...
Requirement analysis for mobile information exchange in the police using a ti...inscit2006
 
Parametric Study to Enhance Genetic Algorithm's Performance using Ranked base...
Parametric Study to Enhance Genetic Algorithm's Performance using Ranked base...Parametric Study to Enhance Genetic Algorithm's Performance using Ranked base...
Parametric Study to Enhance Genetic Algorithm's Performance using Ranked base...inscit2006
 
Information updated and conveyed by the neural network systems
Information updated and conveyed by the neural network systemsInformation updated and conveyed by the neural network systems
Information updated and conveyed by the neural network systemsinscit2006
 
Mensajería instantánea: una puerta para una nueva percepción del mundo para n...
Mensajería instantánea: una puerta para una nueva percepción del mundo para n...Mensajería instantánea: una puerta para una nueva percepción del mundo para n...
Mensajería instantánea: una puerta para una nueva percepción del mundo para n...inscit2006
 

Plus de inscit2006 (20)

Information Searcher-Provider Fit through Information Presentation and Visual...
Information Searcher-Provider Fit through Information Presentation and Visual...Information Searcher-Provider Fit through Information Presentation and Visual...
Information Searcher-Provider Fit through Information Presentation and Visual...
 
Difference of application of fuzzy rough sets and probability random on targe...
Difference of application of fuzzy rough sets and probability random on targe...Difference of application of fuzzy rough sets and probability random on targe...
Difference of application of fuzzy rough sets and probability random on targe...
 
The Interaction of Navigation Instructions and Visual Attention in Dynamic Au...
The Interaction of Navigation Instructions and Visual Attention in Dynamic Au...The Interaction of Navigation Instructions and Visual Attention in Dynamic Au...
The Interaction of Navigation Instructions and Visual Attention in Dynamic Au...
 
Weighted Naïve Bayes Model for Semi-Structured Document Categorization
Weighted Naïve Bayes Model for Semi-Structured Document CategorizationWeighted Naïve Bayes Model for Semi-Structured Document Categorization
Weighted Naïve Bayes Model for Semi-Structured Document Categorization
 
The role of education within the framework of information sciences and techno...
The role of education within the framework of information sciences and techno...The role of education within the framework of information sciences and techno...
The role of education within the framework of information sciences and techno...
 
A Metadata-Driven Approach to Computing Financial Analytics in a Relational D...
A Metadata-Driven Approach to Computing Financial Analytics in a Relational D...A Metadata-Driven Approach to Computing Financial Analytics in a Relational D...
A Metadata-Driven Approach to Computing Financial Analytics in a Relational D...
 
A Comparative Study of RDBMs and OODBMs in Relation to Security of Data
A Comparative Study of RDBMs and OODBMs in Relation to Security of DataA Comparative Study of RDBMs and OODBMs in Relation to Security of Data
A Comparative Study of RDBMs and OODBMs in Relation to Security of Data
 
iCE- Interactive Co-innovation Environment Software, Spatial Mapping Tools fo...
iCE- Interactive Co-innovation Environment Software, Spatial Mapping Tools fo...iCE- Interactive Co-innovation Environment Software, Spatial Mapping Tools fo...
iCE- Interactive Co-innovation Environment Software, Spatial Mapping Tools fo...
 
Designing People’s Interconnections in Mobile Social Networks
Designing People’s Interconnections in Mobile Social NetworksDesigning People’s Interconnections in Mobile Social Networks
Designing People’s Interconnections in Mobile Social Networks
 
Visual Literacy: A Semiotic Analysis of Icons as Visual Information Represent...
Visual Literacy: A Semiotic Analysis of Icons as Visual Information Represent...Visual Literacy: A Semiotic Analysis of Icons as Visual Information Represent...
Visual Literacy: A Semiotic Analysis of Icons as Visual Information Represent...
 
Visualizing Search Term Relevance, Boolean Operators, and Phrases using the C...
Visualizing Search Term Relevance, Boolean Operators, and Phrases using the C...Visualizing Search Term Relevance, Boolean Operators, and Phrases using the C...
Visualizing Search Term Relevance, Boolean Operators, and Phrases using the C...
 
Visualization of Multidimensional Information from Scientific Computations
Visualization of Multidimensional Information from Scientific ComputationsVisualization of Multidimensional Information from Scientific Computations
Visualization of Multidimensional Information from Scientific Computations
 
High dimensional Data Visualization using Star Coordinates on Three Dimensions
High dimensional Data Visualization using Star Coordinates on Three DimensionsHigh dimensional Data Visualization using Star Coordinates on Three Dimensions
High dimensional Data Visualization using Star Coordinates on Three Dimensions
 
Improvement in Quality of Speech associated with Braille codes - A Review
Improvement in Quality of Speech associated with Braille codes - A ReviewImprovement in Quality of Speech associated with Braille codes - A Review
Improvement in Quality of Speech associated with Braille codes - A Review
 
Identificación de Nombres de Genes en la Literatura Biomédica
Identificación de Nombres de Genes en la Literatura BiomédicaIdentificación de Nombres de Genes en la Literatura Biomédica
Identificación de Nombres de Genes en la Literatura Biomédica
 
An Intuitive Natural Language Understanding System
An Intuitive Natural Language Understanding SystemAn Intuitive Natural Language Understanding System
An Intuitive Natural Language Understanding System
 
Requirement analysis for mobile information exchange in the police using a ti...
Requirement analysis for mobile information exchange in the police using a ti...Requirement analysis for mobile information exchange in the police using a ti...
Requirement analysis for mobile information exchange in the police using a ti...
 
Parametric Study to Enhance Genetic Algorithm's Performance using Ranked base...
Parametric Study to Enhance Genetic Algorithm's Performance using Ranked base...Parametric Study to Enhance Genetic Algorithm's Performance using Ranked base...
Parametric Study to Enhance Genetic Algorithm's Performance using Ranked base...
 
Information updated and conveyed by the neural network systems
Information updated and conveyed by the neural network systemsInformation updated and conveyed by the neural network systems
Information updated and conveyed by the neural network systems
 
Mensajería instantánea: una puerta para una nueva percepción del mundo para n...
Mensajería instantánea: una puerta para una nueva percepción del mundo para n...Mensajería instantánea: una puerta para una nueva percepción del mundo para n...
Mensajería instantánea: una puerta para una nueva percepción del mundo para n...
 

Dernier

Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?Antenna Manufacturer Coco
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Enterprise Knowledge
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUK Journal
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessPixlogix Infotech
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slidevu2urc
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CVKhem
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxKatpro Technologies
 

Dernier (20)

Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your Business
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 

Knowledge Discovery in Environmental Impact Report’s summary texts: an exploratory analysis of four case studies

  • 1. Knowledge Discovery in Environmental Impact Report’s summary texts: an exploratory analysis of four case studies Cláudia Viviane Viegas 1,2 , Roseli Búrigo 1,3 , José Leomar Todesco 1 , Fernando Alvaro Ostuni Gauthier 1 , Paulo Maurício Selig 1 1 Federal University of Santa Catarina, UFSC , Engineering and Knowledge Management Post-graduation Program, Florianópolis (SC), BRAZIL; 2 Feevale University, Novo Hamburgo (RS), BRAZIL; 3 Santa Catarina Extrem South University, Unesc , Criciúma (SC), BRAZIL
  • 2. What? This paper analyses four summary texts from Environmental Impact Reports (EIRs) prepared for hydroelectric facilities, built in Brazil between 1997 and 2005. Documents’ brochures are: Jirau’s (1), Ipueiras’(2), Paulistas’(3), and Barra Grande’s (4) dams. 1 2 3 4
  • 3. How? Knowledge Discovery Texts techniques (KDT), namely stopwords and stemming, are employed. EIRs summarise Environmental Impact Assessment (EIAs) outcomes, which are mandatory by law in order to identify and measure effects from entrepreneurship with high levels of environmental change, and to formulate mitigation measures. A thesaurus is elaborated from the Reference Term (RT) - a document provided by governmental environmental institutions to guide EIAs-EIRs construction. A contextual approach is employed in order to cover the most number of words and expressions which bear similarities with each other. The thesaurus' words and expressions are classified into 22 groups according to the similarities of their meanings. Such words and expressions are compared to EIRs summaries' words and expressions, after these summaries have undergone data preprocessing.
  • 4. Why? Brazilian EIRs are often criticised as incomplete and superficial, but this criticism suffers from a lack of objective support. Major findings Comparison of the results of the thesaurus versus words and expressions acquired from summaries allows us to conclude that the EIRs emphasise: placement issue; impacts; environmental alternatives; and mitigation/compensation procedures. Expressions such as technological resources; financial resources; social and economic context; economic alternatives; impact size; impact relevance; environmental effects; and harm prevention, listed in thesaurus, are not mentioned in the summaries.
  • 5. EIAs-EIRs guidelines - problem’s approach In Brazil, EIAs-EIRs are required by law, which originates generic Reference Term (RT) as a guideline to this kind of study. Zilberman (1995) highlights five generic steps of an EIA-EIR, and we can consider the first three as more relevant to the thesauru’s building: - Step I : identification - Information about project site, technological and financial resources to control project environmental effects, socioeconomic context, objectives of land use and occupation policies, legislation, and size and alternatives for these impacts. - Step II : environmental diagnostic - Evaluation of each impact identified in the previous step. Physical, biological (or biotic), and socieoeconomic environments are evaluated. - Step III : impacts' prognosis - Environmental effects of business are identified and analysed, as well as technological and economic possibilities of prevention and control, mitigation and repair. An alternative is chosen as the basis of the EIA-EIR.
  • 6. Theoretic framework - IR and KDT To better understand the content of EIRs summaries, Information Retrieval (IR) studies can be worthwhile. IR is &quot;(...) an activity which involves aspects of information description (indexation, pattern building) and it encompasses specification for searching, including any technique, system or machine employed to do or support such tasks” (WIVES, 2002). IR is the process or method where a potential information user can change your information necessity in a real list of stored documents' citations which contain useful information to him (SARACEVIC, 1995). Indexation is the first step of IR. It refers to the selection of relevant words in document, and can be done through controlled vocabulary techniques. It has the aim to build access points to a document. It is possible through the use of key words and identification of expressions (WIVES, 2002).
  • 7. Relationship between EIRs and KDT The creation of a thesaurus containing key words and expressions from stages of EIA-EIR, following Zilberman's (1995) guidelines, is a first step in establishing a relationship between EIRs and KDT. It is a necessary precursor to the further process of relevant information identification, called matching. It identifies similarities between relevant information to user query and information stored in the system. EIAs-EIRS major guidelines Semantic treatment thesaurus
  • 8. KDT techniques - stopwords and stemming Semantic analysis was employed in order to deal with EIRs summaries, using techniques such as stopwords and stemming . Stopwords are irrelevant words, and include prepositions, conjunctions, pronouns and others with no meaning in a specific context. It includes &quot;words with no relevant semantic content in their context and irrelevant words in the text analysis” (LOPES, 2004). Morphologic normalisation, called stemming, takes word's radical as being relevant, without taking in account desinences. “With this technique, user does not need to worry with the orthographic shape of a written word in a text. So, an idea, independent of being written as substantive, adjective or verb, is identified by the same (and single) radical” (WIVES, 2002).
  • 9. EIRs summaries targeted Jirau Ipueiras Paulistas Barra Grande
  • 10. Matching and weighting After the analysis of the texts' summaries, supported by tools such as stopwords and stemming, the matching technique is employed taking in account words and expressions of the texts' summaries and thesaurus' words and expressions. It means considering the relevance of each key word and expression, which is given by the relative frequency of indexed words - by the number of times they appear in comparison with the number of document's words. This is a weighting process. In order to understand the weights' meaning, a clustering technique is employed. Instead of investigation hypothesis, a proactive approach is used to acquire information, designing an exploratory research, which “(...) is useful to detect potential problems and opportunities (Loh et al., 2000)
  • 11. Thesaurus’ building (I) Following Zilberman's guide to elaborate EIAs-EIRs (1995) as RT, we listed the following steps with respective set of key words and expressions: - Step I: A - placement, place(s), locational alternative(s), area, area(s) of influence area, influenced area(s) , affected area, region, region of influence, where; B - technologic resources, technology; C - financial resources; financing; D- socioeconomic context, socio economic aspect(s) socioeconomic(s), socioeconomy; E - soil using policy, soil use; F - legislation, law(s), resolution(s), legal aspects. - Step II: G - environmental diagnostic; H- environmental impact(s), environmental change(s); I- physical media; J- biological media, biotic media; K- physical-biotic media, physical and biotic media; L- socioeconomic media, socioeconomic aspect(s); M- impacts' dimension; N- impacts' relevance.
  • 12. Thesaurus’ building (II) - Step III: O - impacts prognostic, environmental prognostic(s); P- environmental effects; Q- (environmental) alternative, (environmental) plans, (environmental) projects, (environmental) programs; R- technological alternative; S- economic alternative; T-mitigation, measures, attenuation measures, compensatory measures, corrective measures, compensation, compensate, correction, correct, repair; U- control, monitoring; V- prevention. thesaurus weighting matching thesaurus
  • 13. Semantic treatment results (I) Semantic classification results of EIRs’ summaries texts weighted and compared with thesaurus terms
  • 14. Semantic treatment results (II) Matching and weighting analysis’ aspects according to each facility summary
  • 15. Findings and discussion (I) More common words and expressions More common key words and expressions identified belong to the A, H, Q, and T groups. They represent all steps described by Zilberman (1995): A (I), H (II) e Q, and T (III). More important summary items are placement, impact, alternatives, plans, projects or environmental programs, and mitigation measures. Relevance Considering the total number of key words and expressions for each summary and matching them with the thesaurus' list, we find that the Barra Grande EIR has the best match, as it contains the highest relative proportion of key words and expressions (11,2%) compared with the thesaurus' words and expressions. EIRs summaries of Paulistas (11%), Ipueiras (9,7%) and Jirau (8,4%) facilities perform less well.
  • 16. Findings and discussion (II) Number of words and expressions selected in each summary compared with thesaurus' sets of words and expressions In this analysis, we find that the Ipueiras' summary has the best representativeness: 11 words, or 50% of the whole thesaurus. Barra Grande (45,4%), Paulistas (27,2%), and Jirau (18,1%) all match fewer words in the thesaurus. So, we can conclude the Jirau's summary has the poorest overall match with the thesaurus in terms of both number and relevance of words.
  • 17. Conclusions The most important items of summaries, compared to the thesaurus, are placement, impact, alternatives, plans, environmental projects or programs, and mitigation measures. Regarding the thesaurus’ words or expressions frequency in each summary, and sets of words and expressions – we listed 22 groups –, EIRs with more summaries’ fitness are Barra Grande and Ipueiras, and Jirau’s has the least fitness. This conclusion, even related to summaries with few words – between 119 and 354 –, indicates on which issues must be focused further studies related to EIRs texts’ semantic analysis. The analysed summaries are not concerned to bring up technological and economic issues, for example, or subjects as dimensioning and environmental impacts' relevance. We recommend the analysis of more documents in order to confirm or refute these results, which we consider as primary.
  • 18. References (I) Campos, P.M.P. (org.). 1986. Usinas hidrelétricas de Santo Antônio e Jirau - RIMA . Furnas e Odebrecht, Rio de Janeiro (RJ), 82p. Castro, T.L.C. (org.). 1997. UHE Barra Grande - Relatório de Impacto ao Meio Ambiente . Sumário. Engevix, São Paulo (SP), 59p. Ferneda, E. 2003. Recuperação de Informação: Análise sobre a Contribuição da Ciência da Computação para a Ciência da Informação . Escola de Comunicação e Artes da Universidade de São Paulo/ USP (Tese de Doutorado). Jensen, P.D. (org.). 2005. Usina Hidrelétrica Ipueiras - Relatório de Impacto Ambiental - RIMA . Rede Ipueiras Empresas de Energia Elétrica e Themag Engenharia. São Paulo (SP), 97p.
  • 19. References (II) Loh, S.; Wives, L.K.; Oliveira, J.P.M. 2000. Descoberta proativa de conhecimento em coleções textuais: iniciando sem hipóteses . In: IV Oficina de Inteligência Artificial, Pelotas (RS), p. 143-154. Available in <http://www.inf.ufrgs.br/~palazzo/OAI/00%20OIA.pdf.> Accessed in April 20 th 2006. Lopes, M.C.S. 2004. Mineração de Dados Textuais Utilizando Técnicas de Clustering para o Idioma Português . Universidade Federal do Rio de Janeiro/ UFRJ (Tese ), 191 p. Montano, C.F.B.; Pithan, R.O. (org.). 2005. Relatório de Impacto Ambiental - RIMA - AHE Paulistas , Rio São Marcos (GO/MG). Biodinâmica Engenharia. Rio de Janeiro (RJ), 54p. Moreira, R. 2002. Para que o EIA-RIMA Quase Vinte Anos Depois? In: Verdum, R. e Medeiros, R. M. (org.). RIMA - Relatório de Impacto Ambiental . Ed. UFRGS (4ª edição): Porto Alegre, p.11-21.
  • 20. References (III) Rohde, G. M. 2002. Estudos de Impacto Ambiental: A Situação Brasileira em 2000. In: Verdum, R. e Medeiros, R. M. (org.). RIMA - Relatório de Impacto Ambiental . Ed. UFRGS (4ª edição): Porto Alegre, p. 41-65. Saracevic, T. 1995. Evaluation of Evaluation in Information Retrieval. In: Conference on Research and Development in Information Retrieval. 18th Annual International SIGIR, Seattle, USA. (Proceedings). ACM Press , p. 137-146. Wives, L.K. 2002. Tecnologias de Descoberta de Conhecimento em Textos Aplicadas à Inteligência Competitiva . Programa de Pós-graduação em Computação (Exame de Qualificação), 116 p. Porto Alegre (RS), Universidade Federal do Rio Grande do Sul (UFRGS). Zilberman, Isaac. 1995. Conceitos e Metodologias para Estudos de Impacto Ambiental . Ed. Ulbra: Canoas (RS).
  • 21. Author’s contact Cláudia V. Viegas – claudiav@egc.ufsc.br Roseli Búrigo – rbc@unesc.net J. Leomar Todesco – tite@stela.ufsc.br Fernando O. Gauthier – gauthier@inf.ufsc.br Paulo M. Selig – selig@egc.ufsc.br Acknowledgement We thank to advices coming from Dr. Alan J. Bond , senior lecturer, Environmental Sciences School, University of East Anglia (UEA), Norwich, UK.