SlideShare a Scribd company logo
1 of 3
Download to read offline
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 07 Issue: 01 | Jan 2020 www.irjet.net p-ISSN: 2395-0072
© 2020, IRJET | Impact Factor value: 7.34 | ISO 9001:2008 Certified Journal | Page 2122
Semantic Question Matching
Ruchit Mody1, Yesha Sanghavi2, Sejal D’Mello3
1B.E. student, Dept. of Information and Technology, Atharva college of Engineering, Maharashtra, India
2B.E. student, Dept. of Information and Technology, Atharva college of Engineering, Maharashtra, India
3Assistant Professor, Dept. of Information and Technology, Atharva college of Engineering, Maharashtra, India
---------------------------------------------------------------------***----------------------------------------------------------------------
Abstract – Quora is a very popular and known tool used by
developers all over the world. Owingtothispopularity millions
of questions of are asked by users everyday in the hope of
getting expert solution. Hence there is a high possibility that
numerous users pose questions that might having same intent
or purpose. The methods used in this paper will help people
discover questions that have already been discussed on the
forum previously and keep individuals from replying to a
similar question time and time again thereby improving
improving user experience for all. The main aim of thisproject
is to allow a user to see all questions that are semantically
similar to his/her query so that the most optimal solution is
made easily available and thus limiting the amount of
duplicate questions being generated on the platform. This
paper contributes to build up a proficient Semantic Question
Detection system by experimenting with significant kinds of
Natural Language Processing techniques used for intent
identification
Key Words: Natural Language Processing, Semantic
Question Matching, Word Movers Distance, Intent
Classification, Google News Vector.
1. INTRODUCTION
Quora is a very popular social media platform where people
ask questions and can connect to the actual experts who
contribute unique insights and quality answers. But since it
has such an enormous user basesoit’snosurprisethatmany
people ask similarly worded questions. Among the
numerous questions posted in forums, two or more of them
may express the same point and thus are duplicates of one
another. Duplicate questions make site maintenanceharder,
waste resources that could have been used to answer other
questions, and cause developers to unnecessarily wait for
answers that are already available. Question duplication is
the primary trouble encountered by using Q&A boards like
Quora, Stack-overflow, Reddit, and so on. Answers get
fragmented throughout distinct versions of the identical
query due to the redundancy of questions in those forums.
Eventually, this consequences in loss of a practical search,
answer fatigue and segregation of information. This paper
contributes to develop an efficient question duplication
detection system using the power of Machine Learning and
Natural Language Processing. The meaning of a sentence
does not only depend on the words in it, but also intheorder
with which they are combined. The semantic similarity can
have several dimensions, and sentences may be similar in
one aspect but opposite in the other. For example if a person
asks, “Did Linked-in acquire Microsoft?” while in the same
platform if another user asks, “Did MicrosoftacquireLinked-
in?” the words used in both cases are same but the order
completely changes the meaning of the sentence. In this
paper we propose to identify questions with similar intent
and then classify them as duplicate or original and to do so
we first automatically generate tags, using NLP, for all
available questions and after comparing the tags using
various methodologies we calculate similarity between
them.
2. LITERATURE SURVEY
Early work to distinguish the similitude between sentences
utilized physically designed features like word overlap
alongside conventional Artificial intelligence and Machine
learning algorithms like Support Vector Machines. Semantic
analysis of sentences has traditionally concentrated on
logical inference based on the Inference Corpus of Stanford
Natural Language. The paper by Tim Rocktaschel, Edward
Grefenstette, Karl Moritz Hermann, TomasKociskya andPhil
Blunsom used LSTMs to concentrate on word-by-word
recognition methods [1].Neural Network methodologies
have been the in a broader selection of NLP tasks. Siamese
neural network which contains two sub-networks joined at
their outputs was proposed. In spite of the fact that the
Siamese architecture is lightweight and easily trainable,
there is clearly a lesser impact of correlation between the
parameters due to whichinformationlossmayoccur[2].The
Compare-Aggregate model [3] which indeed captures the
interaction between two sentenceswasproposedinorderto
ensure that the limitations of the Siamese framework were
overcomed.
The recent study by Zhang et al shows the effect of word
embedding. Their methodology, PCQADup, relies on
extracted features of question pairs. They announced a
noteworthy improvement in results contrasted with
DupPredictor and Dupe [4],[5].In the paper “Mining
Duplicate Questions in Stack Overflow” by Ahasanuzzaman
Muhammad, Chanchal K. Roy and Kevin A. Schneider , for
detecting duplicate questions, they used a discriminative
model classifier together with BM25scoringfunction. Witha
wide numberof questionsconcerningmultiple programming
languages, they tested their strategy. As per the results of
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 07 Issue: 01 | Jan 2020 www.irjet.net p-ISSN: 2395-0072
© 2020, IRJET | Impact Factor value: 7.34 | ISO 9001:2008 Certified Journal | Page 2123
their proposed strategy Dupe, it was found that Dupe
outperforms Dup-Predictor, Stack Overflow's only usable
duplicate query tracker [6]. To test the similarities between
two interrogative segments in a set, a Duplicate question
detection approach with a Jaccard coefficientinthepaper by
Y. Wu, Q. Zhang, and X. Huang , “Efficient near- duplicate
detection for Q&A forum,” was utilized. A Duplicatequestion
detection dataset resorting to the Baidu Zhidao, a Chinese
question and answer platform supported by the search
engine Baidu, was developed. The program achieved a f-
score of 60.29 by practicing with 3M pairs and checking on
3k pairs [7],[8].
3. PROPOSED SYSTEM AND ARCHITECTURE
3.1 Architecture Description
The proposed approach whichwewill developistomatch an
input question with all its possibleformsof duplicates.We
first perform dataset preprocessing by removing all the
stopwords, question marks and other unwanted characters.
We need our model to understand that the words "ask" and
"asked" are simply various tenses of a similar action word
and Stemming and LemmatizationareconceptsofNLPthat’s
helps us to do just that. It helps us to convertvarioustypesof
a word to a center root. After preprocessing we generate
tags or keywords to every question. For every question, if
50% tags of that question match with the tags of another we
run it through a Natural LanguageProcessingalgorithmused
for classification.
Figure 3.1: Architecture
3.2 IMPLEMENTATION
Our Research is carried out in the following stages:
A. Data Preprocessing
All the available questions are first cleaned by converting all
to lowercase, removing stopwords, question marks and
other unwanted characters. After which we usestemmingto
convert the words into its equivalent core words.
B. Generation of Tags
For all the available questions we automatically generate
keywords/tags using TFIDF(Term Frequency Inverse-
Document Frequency).This is a method used to evaluate the
importance that a particular word has as comparedtoall the
other words in the document. Words having the highest
TFIDF score are chosen as the tags for that question. We
developed a python code that takes Questionasaninputand
finds out the number of words in that question. Using this it
generates a particular number of tags which is equal to30%
of the length of question. For Example if a question of 11
words is asked such that “What are the different steps in
becoming a freelance datascientist?”The3tagsgenerated by
the model are “steps”,”freelance”,”datascientist”
C. Calculation of Similarity
Since the machine cannot understand human language in
this step we first convert all the words of the question into
vectors. To do so we use Google News Vectors. Google News
Vector includes word vectors for a vocabulary of 3 million
words and phrases that are trained on around 100 billion
words from a Google News dataset [9]. For every question if
50% tags of that question match with the tags of another we
run it through a NLP algorithm used for classification.Inthat
model we use Word Movers Distance to calculate similarity
between the questions. WMD calculates the dissimilarity
between two texts as the amount of distance that the
embedded words of one text needs to travel to reach the
embedded words of another text
D. Developing a Front End
To make our model more user friendly we developed a
webpage using HTML5 and CSS. Using this web page any
layman can work with our Python Model. We have used a
flask framework that helps us to connect our python code to
an HTML web page. Using Flask we can now create a web
app that will trigger our python model on a button click.
HTML5 and CSS will make our web page responsive and
mobile friendly and accessible from devices of all sizes
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 07 Issue: 01 | Jan 2020 www.irjet.net p-ISSN: 2395-0072
© 2020, IRJET | Impact Factor value: 7.34 | ISO 9001:2008 Certified Journal | Page 2124
4. OUTPUT
Figure 4.1: Sample result for no match
Figure 4.2: Sample result for matching questions
5. CONCLUSION
In this paper we try to identify the problem, of duplicate
questions. We propose an unsupervised approach to
duplication detection by developing a model that is able to
identify the base intent of the question. Our paper is able to
reflect the various advantages pertaining to identifying
duplicate questions on question and answer forums which
will lead to cheaper data storage aslessdata wouldbestored
and the user experience would be improved as they can
expect faster response to the questions. With the help of our
project the clients will be able to find great answers without
investing more time and energy searching for best answer
among its similar questions. Our approachincludescleaning
and tokenizing the questions in the initial phase to prepare
the data set of questions. In our second stage we develop a
model that is able to automatically generate tags/keywords
for all the Questions. The third stage is thestageinwhichour
model is prepared based on Natural Language Processing. It
uses Google News vectors to covert words into vectors after
which we run our similarity calculation model. WMD is a
method that allows us to assess the “distance” between two
documents in a meaningful way, no mattertheyhaveor have
no words in common[10]. The final stage includes the
implementation of the GUI where in aninteractive websiteis
created using Flask, HTML5 and CSS. Hence, this research
work adopts Natural Language Processing to address the
problem of question duplicationin Q&A forumslikeQuora to
classify whether questions are duplicatesornon-duplicates.
ACKNOWLEDGEMENT
We would like to thank our project guide Prof. Sejal D’mello
for her enormous cooperation and guidance. We have no
words to express our gratitude for a person who
wholeheartedly supported the project and gave freelyof her
valuable time while making this project. All the inputs given
by her found a place in the project. She has always been a
source of inspiration for us.
REFERENCES
[1] Tim Rocktaschel, Edward Grefenstette, Karl Moritz
Hermann, Tomas Kocisky, Phil Blunsom.” Reasoning about
entailment with neural attention”, In ICLR 2016
[2] Wang, Zhiguo, Wael Hamza, and Radu Florian. "Bilateral
Multi-Perspective Matching for Natural Language
Sentences.", In arXiv.org ,2017.
[3] Wang, Shuohang, and Jing Jiang. "A Compare-Aggregate
Model for Matching Text Sequences.", In arXiv.org ,2016
[4] Rodrigo F. G. Silva, Klérisson Paixão, Marcelo de Almeida
Maia “ Duplicate Question Detection in Stack Overflow: A
Reproducibility Study “ , 978-1-5386-4969-5/18/$31.00 ,
2018 IEEE SANER 2018, Campobasso, Italy RENE Track.
[5] W. E. Zhang, Q. Z. Sheng, J. H. Lau, and E. Abebe,
“Detecting duplicate posts in programming qa communities
via latent semantics and association rules,” in 26th
International Conference on World Wide Web (WWW),
Geneva, Switzerland, 2017, pp. 1221–1229.
[6] Muhammad Asaduzzaman , Chanchal K. Roy , Kevin A.
Schneider “Mining Duplicate Questions of Stack Overflow”,
IEEE/ACM 13th Working Conference on Mining Software
Repositories , September 2017.
[7] Chakaveh Saedi, Joao Rodrigues, Joao Silva, Antonio
Branco, Vladislav Maraev, “Learning Profiles in Duplicate
Question Detection”, IEEE International Conference on
Information Reuse and Integration (IRI), 2017 .
[8] Y. Wu, Q. Zhang, and X. Huang, “Efficient near- duplicate
detection for Q&A forum,” in Proceedings of the 5th
International Joint Conference on Natural Language
Processing (IJCNLP), 2011, pp. 1001–1009.
[9] Ashwin Dhakal,Arpan Poudel ,Sagar Pandey ” Exploring
Deep Learning in Semantic Question Matching”, IEEE 3rd
International Conferenceon Computing,Communication and
Security (ICCCS), Kathmandu (Nepal) , June 2018
[10] Naveen Saini, Pushpak Bhattacharyya, Sriparna Saha,
Dhiraj Chakraborty,” Extractive single document
summarization using binary differential evolution:
Optimization of different sentencequalitymeasures”,InPLoS
ONENovember2019

More Related Content

What's hot

Improving Sentiment Analysis of Short Informal Indonesian Product Reviews usi...
Improving Sentiment Analysis of Short Informal Indonesian Product Reviews usi...Improving Sentiment Analysis of Short Informal Indonesian Product Reviews usi...
Improving Sentiment Analysis of Short Informal Indonesian Product Reviews usi...TELKOMNIKA JOURNAL
 
295B_Report_Sentiment_analysis
295B_Report_Sentiment_analysis295B_Report_Sentiment_analysis
295B_Report_Sentiment_analysisZahid Azam
 
Survey of Various Approaches of Emotion Detection Via Multimodal Approach
Survey of Various Approaches of Emotion Detection Via Multimodal ApproachSurvey of Various Approaches of Emotion Detection Via Multimodal Approach
Survey of Various Approaches of Emotion Detection Via Multimodal ApproachIRJET Journal
 
IRJET- Vernacular Language Spell Checker & Autocorrection
IRJET- Vernacular Language Spell Checker & AutocorrectionIRJET- Vernacular Language Spell Checker & Autocorrection
IRJET- Vernacular Language Spell Checker & AutocorrectionIRJET Journal
 
Methods for Sentiment Analysis: A Literature Study
Methods for Sentiment Analysis: A Literature StudyMethods for Sentiment Analysis: A Literature Study
Methods for Sentiment Analysis: A Literature Studyvivatechijri
 
NLP BASED INTERVIEW ASSESSMENT SYSTEM
NLP BASED INTERVIEW ASSESSMENT SYSTEMNLP BASED INTERVIEW ASSESSMENT SYSTEM
NLP BASED INTERVIEW ASSESSMENT SYSTEMvivatechijri
 
2. an efficient approach for web query preprocessing edit sat
2. an efficient approach for web query preprocessing edit sat2. an efficient approach for web query preprocessing edit sat
2. an efficient approach for web query preprocessing edit satIAESIJEECS
 
Bba q&a study final white
Bba q&a study final whiteBba q&a study final white
Bba q&a study final whiteGreg Sterling
 
THE EFFECTS OF THE LDA TOPIC MODEL ON SENTIMENT CLASSIFICATION
THE EFFECTS OF THE LDA TOPIC MODEL ON SENTIMENT CLASSIFICATIONTHE EFFECTS OF THE LDA TOPIC MODEL ON SENTIMENT CLASSIFICATION
THE EFFECTS OF THE LDA TOPIC MODEL ON SENTIMENT CLASSIFICATIONijscai
 
IRJET - E-Assistant: An Interactive Bot for Banking Sector using NLP Process
IRJET -  	  E-Assistant: An Interactive Bot for Banking Sector using NLP ProcessIRJET -  	  E-Assistant: An Interactive Bot for Banking Sector using NLP Process
IRJET - E-Assistant: An Interactive Bot for Banking Sector using NLP ProcessIRJET Journal
 
A Novel Approach for Keyword extraction in learning objects using text mining
A Novel Approach for Keyword extraction in learning objects using text miningA Novel Approach for Keyword extraction in learning objects using text mining
A Novel Approach for Keyword extraction in learning objects using text miningIJSRD
 
Programmer information needs after memory failure
Programmer information needs after memory failureProgrammer information needs after memory failure
Programmer information needs after memory failureBhagyashree Deokar
 
IRJET - Chat-Bot for College Information System using AI
IRJET -  	  Chat-Bot for College Information System using AIIRJET -  	  Chat-Bot for College Information System using AI
IRJET - Chat-Bot for College Information System using AIIRJET Journal
 
Zomato eda report
Zomato eda reportZomato eda report
Zomato eda reportvidit jain
 
Lexical Analysis to Effectively Detect User's Opinion
Lexical Analysis to Effectively Detect User's Opinion   Lexical Analysis to Effectively Detect User's Opinion
Lexical Analysis to Effectively Detect User's Opinion dannyijwest
 
Generating domain specific sentiment lexicons using the Web Directory
Generating domain specific sentiment lexicons using the Web Directory Generating domain specific sentiment lexicons using the Web Directory
Generating domain specific sentiment lexicons using the Web Directory acijjournal
 
Analysis of Opinionated Text for Opinion Mining
Analysis of Opinionated Text for Opinion MiningAnalysis of Opinionated Text for Opinion Mining
Analysis of Opinionated Text for Opinion Miningmlaij
 

What's hot (19)

Improving Sentiment Analysis of Short Informal Indonesian Product Reviews usi...
Improving Sentiment Analysis of Short Informal Indonesian Product Reviews usi...Improving Sentiment Analysis of Short Informal Indonesian Product Reviews usi...
Improving Sentiment Analysis of Short Informal Indonesian Product Reviews usi...
 
295B_Report_Sentiment_analysis
295B_Report_Sentiment_analysis295B_Report_Sentiment_analysis
295B_Report_Sentiment_analysis
 
Survey of Various Approaches of Emotion Detection Via Multimodal Approach
Survey of Various Approaches of Emotion Detection Via Multimodal ApproachSurvey of Various Approaches of Emotion Detection Via Multimodal Approach
Survey of Various Approaches of Emotion Detection Via Multimodal Approach
 
A044080105
A044080105A044080105
A044080105
 
Supervised Approach to Extract Sentiments from Unstructured Text
Supervised Approach to Extract Sentiments from Unstructured TextSupervised Approach to Extract Sentiments from Unstructured Text
Supervised Approach to Extract Sentiments from Unstructured Text
 
IRJET- Vernacular Language Spell Checker & Autocorrection
IRJET- Vernacular Language Spell Checker & AutocorrectionIRJET- Vernacular Language Spell Checker & Autocorrection
IRJET- Vernacular Language Spell Checker & Autocorrection
 
Methods for Sentiment Analysis: A Literature Study
Methods for Sentiment Analysis: A Literature StudyMethods for Sentiment Analysis: A Literature Study
Methods for Sentiment Analysis: A Literature Study
 
NLP BASED INTERVIEW ASSESSMENT SYSTEM
NLP BASED INTERVIEW ASSESSMENT SYSTEMNLP BASED INTERVIEW ASSESSMENT SYSTEM
NLP BASED INTERVIEW ASSESSMENT SYSTEM
 
2. an efficient approach for web query preprocessing edit sat
2. an efficient approach for web query preprocessing edit sat2. an efficient approach for web query preprocessing edit sat
2. an efficient approach for web query preprocessing edit sat
 
Bba q&a study final white
Bba q&a study final whiteBba q&a study final white
Bba q&a study final white
 
THE EFFECTS OF THE LDA TOPIC MODEL ON SENTIMENT CLASSIFICATION
THE EFFECTS OF THE LDA TOPIC MODEL ON SENTIMENT CLASSIFICATIONTHE EFFECTS OF THE LDA TOPIC MODEL ON SENTIMENT CLASSIFICATION
THE EFFECTS OF THE LDA TOPIC MODEL ON SENTIMENT CLASSIFICATION
 
IRJET - E-Assistant: An Interactive Bot for Banking Sector using NLP Process
IRJET -  	  E-Assistant: An Interactive Bot for Banking Sector using NLP ProcessIRJET -  	  E-Assistant: An Interactive Bot for Banking Sector using NLP Process
IRJET - E-Assistant: An Interactive Bot for Banking Sector using NLP Process
 
A Novel Approach for Keyword extraction in learning objects using text mining
A Novel Approach for Keyword extraction in learning objects using text miningA Novel Approach for Keyword extraction in learning objects using text mining
A Novel Approach for Keyword extraction in learning objects using text mining
 
Programmer information needs after memory failure
Programmer information needs after memory failureProgrammer information needs after memory failure
Programmer information needs after memory failure
 
IRJET - Chat-Bot for College Information System using AI
IRJET -  	  Chat-Bot for College Information System using AIIRJET -  	  Chat-Bot for College Information System using AI
IRJET - Chat-Bot for College Information System using AI
 
Zomato eda report
Zomato eda reportZomato eda report
Zomato eda report
 
Lexical Analysis to Effectively Detect User's Opinion
Lexical Analysis to Effectively Detect User's Opinion   Lexical Analysis to Effectively Detect User's Opinion
Lexical Analysis to Effectively Detect User's Opinion
 
Generating domain specific sentiment lexicons using the Web Directory
Generating domain specific sentiment lexicons using the Web Directory Generating domain specific sentiment lexicons using the Web Directory
Generating domain specific sentiment lexicons using the Web Directory
 
Analysis of Opinionated Text for Opinion Mining
Analysis of Opinionated Text for Opinion MiningAnalysis of Opinionated Text for Opinion Mining
Analysis of Opinionated Text for Opinion Mining
 

Similar to IRJET- Semantic Question Matching

IRJET- Factoid Question and Answering System
IRJET-  	  Factoid Question and Answering SystemIRJET-  	  Factoid Question and Answering System
IRJET- Factoid Question and Answering SystemIRJET Journal
 
Twitter Sentiment Analysis: An Unsupervised Approach
Twitter Sentiment Analysis: An Unsupervised ApproachTwitter Sentiment Analysis: An Unsupervised Approach
Twitter Sentiment Analysis: An Unsupervised ApproachIRJET Journal
 
IRJET- QUEZARD : Question Wizard using Machine Learning and Artificial Intell...
IRJET- QUEZARD : Question Wizard using Machine Learning and Artificial Intell...IRJET- QUEZARD : Question Wizard using Machine Learning and Artificial Intell...
IRJET- QUEZARD : Question Wizard using Machine Learning and Artificial Intell...IRJET Journal
 
EVALUATION OF SINGLE-SPAN MODELS ON EXTRACTIVE MULTI-SPAN QUESTION-ANSWERING
EVALUATION OF SINGLE-SPAN MODELS ON EXTRACTIVE MULTI-SPAN QUESTION-ANSWERINGEVALUATION OF SINGLE-SPAN MODELS ON EXTRACTIVE MULTI-SPAN QUESTION-ANSWERING
EVALUATION OF SINGLE-SPAN MODELS ON EXTRACTIVE MULTI-SPAN QUESTION-ANSWERINGIJwest
 
IRJET - Conversion of Unsupervised Data to Supervised Data using Topic Mo...
IRJET -  	  Conversion of Unsupervised Data to Supervised Data using Topic Mo...IRJET -  	  Conversion of Unsupervised Data to Supervised Data using Topic Mo...
IRJET - Conversion of Unsupervised Data to Supervised Data using Topic Mo...IRJET Journal
 
SentimentAnalysisofTwitterProductReviewsDocument.pdf
SentimentAnalysisofTwitterProductReviewsDocument.pdfSentimentAnalysisofTwitterProductReviewsDocument.pdf
SentimentAnalysisofTwitterProductReviewsDocument.pdfDevinSohi
 
IRJET - Automated Essay Grading System using Deep Learning
IRJET -  	  Automated Essay Grading System using Deep LearningIRJET -  	  Automated Essay Grading System using Deep Learning
IRJET - Automated Essay Grading System using Deep LearningIRJET Journal
 
Derogatory Comment Classification
Derogatory Comment ClassificationDerogatory Comment Classification
Derogatory Comment ClassificationIRJET Journal
 
Knowledge Graph and Similarity Based Retrieval Method for Query Answering System
Knowledge Graph and Similarity Based Retrieval Method for Query Answering SystemKnowledge Graph and Similarity Based Retrieval Method for Query Answering System
Knowledge Graph and Similarity Based Retrieval Method for Query Answering SystemIRJET Journal
 
IRJET- A Novel Approch Automatically Categorizing Software Technologies
IRJET- A Novel Approch Automatically Categorizing Software TechnologiesIRJET- A Novel Approch Automatically Categorizing Software Technologies
IRJET- A Novel Approch Automatically Categorizing Software TechnologiesIRJET Journal
 
Text Summarization and Conversion of Speech to Text
Text Summarization and Conversion of Speech to TextText Summarization and Conversion of Speech to Text
Text Summarization and Conversion of Speech to TextIRJET Journal
 
IRJET - Fake News Detection using Machine Learning
IRJET -  	  Fake News Detection using Machine LearningIRJET -  	  Fake News Detection using Machine Learning
IRJET - Fake News Detection using Machine LearningIRJET Journal
 
professional fuzzy type-ahead rummage around in xml type-ahead search techni...
professional fuzzy type-ahead rummage around in xml  type-ahead search techni...professional fuzzy type-ahead rummage around in xml  type-ahead search techni...
professional fuzzy type-ahead rummage around in xml type-ahead search techni...Kumar Goud
 
Automatic Grading of Handwritten Answers
Automatic Grading of Handwritten AnswersAutomatic Grading of Handwritten Answers
Automatic Grading of Handwritten AnswersIRJET Journal
 
Language and Offensive Word Detection
Language and Offensive Word DetectionLanguage and Offensive Word Detection
Language and Offensive Word DetectionIRJET Journal
 
QUESTION ANSWERING MODULE LEVERAGING HETEROGENEOUS DATASETS
QUESTION ANSWERING MODULE LEVERAGING HETEROGENEOUS DATASETSQUESTION ANSWERING MODULE LEVERAGING HETEROGENEOUS DATASETS
QUESTION ANSWERING MODULE LEVERAGING HETEROGENEOUS DATASETSkevig
 
QUESTION ANSWERING MODULE LEVERAGING HETEROGENEOUS DATASETS
QUESTION ANSWERING MODULE LEVERAGING HETEROGENEOUS DATASETSQUESTION ANSWERING MODULE LEVERAGING HETEROGENEOUS DATASETS
QUESTION ANSWERING MODULE LEVERAGING HETEROGENEOUS DATASETSijnlc
 
Image Based Tool for Level 1 and Level 2 Autistic People
Image Based Tool for Level 1 and Level 2 Autistic PeopleImage Based Tool for Level 1 and Level 2 Autistic People
Image Based Tool for Level 1 and Level 2 Autistic PeopleIRJET Journal
 
AUTOMATIC QUESTION GENERATION USING NATURAL LANGUAGE PROCESSING
AUTOMATIC QUESTION GENERATION USING NATURAL LANGUAGE PROCESSINGAUTOMATIC QUESTION GENERATION USING NATURAL LANGUAGE PROCESSING
AUTOMATIC QUESTION GENERATION USING NATURAL LANGUAGE PROCESSINGIRJET Journal
 
IDENTIFYING THE DAMAGE ASSESSMENT TWEETS DURING DISASTER
IDENTIFYING THE DAMAGE ASSESSMENT TWEETS DURING DISASTERIDENTIFYING THE DAMAGE ASSESSMENT TWEETS DURING DISASTER
IDENTIFYING THE DAMAGE ASSESSMENT TWEETS DURING DISASTERIRJET Journal
 

Similar to IRJET- Semantic Question Matching (20)

IRJET- Factoid Question and Answering System
IRJET-  	  Factoid Question and Answering SystemIRJET-  	  Factoid Question and Answering System
IRJET- Factoid Question and Answering System
 
Twitter Sentiment Analysis: An Unsupervised Approach
Twitter Sentiment Analysis: An Unsupervised ApproachTwitter Sentiment Analysis: An Unsupervised Approach
Twitter Sentiment Analysis: An Unsupervised Approach
 
IRJET- QUEZARD : Question Wizard using Machine Learning and Artificial Intell...
IRJET- QUEZARD : Question Wizard using Machine Learning and Artificial Intell...IRJET- QUEZARD : Question Wizard using Machine Learning and Artificial Intell...
IRJET- QUEZARD : Question Wizard using Machine Learning and Artificial Intell...
 
EVALUATION OF SINGLE-SPAN MODELS ON EXTRACTIVE MULTI-SPAN QUESTION-ANSWERING
EVALUATION OF SINGLE-SPAN MODELS ON EXTRACTIVE MULTI-SPAN QUESTION-ANSWERINGEVALUATION OF SINGLE-SPAN MODELS ON EXTRACTIVE MULTI-SPAN QUESTION-ANSWERING
EVALUATION OF SINGLE-SPAN MODELS ON EXTRACTIVE MULTI-SPAN QUESTION-ANSWERING
 
IRJET - Conversion of Unsupervised Data to Supervised Data using Topic Mo...
IRJET -  	  Conversion of Unsupervised Data to Supervised Data using Topic Mo...IRJET -  	  Conversion of Unsupervised Data to Supervised Data using Topic Mo...
IRJET - Conversion of Unsupervised Data to Supervised Data using Topic Mo...
 
SentimentAnalysisofTwitterProductReviewsDocument.pdf
SentimentAnalysisofTwitterProductReviewsDocument.pdfSentimentAnalysisofTwitterProductReviewsDocument.pdf
SentimentAnalysisofTwitterProductReviewsDocument.pdf
 
IRJET - Automated Essay Grading System using Deep Learning
IRJET -  	  Automated Essay Grading System using Deep LearningIRJET -  	  Automated Essay Grading System using Deep Learning
IRJET - Automated Essay Grading System using Deep Learning
 
Derogatory Comment Classification
Derogatory Comment ClassificationDerogatory Comment Classification
Derogatory Comment Classification
 
Knowledge Graph and Similarity Based Retrieval Method for Query Answering System
Knowledge Graph and Similarity Based Retrieval Method for Query Answering SystemKnowledge Graph and Similarity Based Retrieval Method for Query Answering System
Knowledge Graph and Similarity Based Retrieval Method for Query Answering System
 
IRJET- A Novel Approch Automatically Categorizing Software Technologies
IRJET- A Novel Approch Automatically Categorizing Software TechnologiesIRJET- A Novel Approch Automatically Categorizing Software Technologies
IRJET- A Novel Approch Automatically Categorizing Software Technologies
 
Text Summarization and Conversion of Speech to Text
Text Summarization and Conversion of Speech to TextText Summarization and Conversion of Speech to Text
Text Summarization and Conversion of Speech to Text
 
IRJET - Fake News Detection using Machine Learning
IRJET -  	  Fake News Detection using Machine LearningIRJET -  	  Fake News Detection using Machine Learning
IRJET - Fake News Detection using Machine Learning
 
professional fuzzy type-ahead rummage around in xml type-ahead search techni...
professional fuzzy type-ahead rummage around in xml  type-ahead search techni...professional fuzzy type-ahead rummage around in xml  type-ahead search techni...
professional fuzzy type-ahead rummage around in xml type-ahead search techni...
 
Automatic Grading of Handwritten Answers
Automatic Grading of Handwritten AnswersAutomatic Grading of Handwritten Answers
Automatic Grading of Handwritten Answers
 
Language and Offensive Word Detection
Language and Offensive Word DetectionLanguage and Offensive Word Detection
Language and Offensive Word Detection
 
QUESTION ANSWERING MODULE LEVERAGING HETEROGENEOUS DATASETS
QUESTION ANSWERING MODULE LEVERAGING HETEROGENEOUS DATASETSQUESTION ANSWERING MODULE LEVERAGING HETEROGENEOUS DATASETS
QUESTION ANSWERING MODULE LEVERAGING HETEROGENEOUS DATASETS
 
QUESTION ANSWERING MODULE LEVERAGING HETEROGENEOUS DATASETS
QUESTION ANSWERING MODULE LEVERAGING HETEROGENEOUS DATASETSQUESTION ANSWERING MODULE LEVERAGING HETEROGENEOUS DATASETS
QUESTION ANSWERING MODULE LEVERAGING HETEROGENEOUS DATASETS
 
Image Based Tool for Level 1 and Level 2 Autistic People
Image Based Tool for Level 1 and Level 2 Autistic PeopleImage Based Tool for Level 1 and Level 2 Autistic People
Image Based Tool for Level 1 and Level 2 Autistic People
 
AUTOMATIC QUESTION GENERATION USING NATURAL LANGUAGE PROCESSING
AUTOMATIC QUESTION GENERATION USING NATURAL LANGUAGE PROCESSINGAUTOMATIC QUESTION GENERATION USING NATURAL LANGUAGE PROCESSING
AUTOMATIC QUESTION GENERATION USING NATURAL LANGUAGE PROCESSING
 
IDENTIFYING THE DAMAGE ASSESSMENT TWEETS DURING DISASTER
IDENTIFYING THE DAMAGE ASSESSMENT TWEETS DURING DISASTERIDENTIFYING THE DAMAGE ASSESSMENT TWEETS DURING DISASTER
IDENTIFYING THE DAMAGE ASSESSMENT TWEETS DURING DISASTER
 

More from IRJET Journal

TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...IRJET Journal
 
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURESTUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTUREIRJET Journal
 
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...IRJET Journal
 
Effect of Camber and Angles of Attack on Airfoil Characteristics
Effect of Camber and Angles of Attack on Airfoil CharacteristicsEffect of Camber and Angles of Attack on Airfoil Characteristics
Effect of Camber and Angles of Attack on Airfoil CharacteristicsIRJET Journal
 
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...IRJET Journal
 
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...IRJET Journal
 
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...IRJET Journal
 
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...IRJET Journal
 
A REVIEW ON MACHINE LEARNING IN ADAS
A REVIEW ON MACHINE LEARNING IN ADASA REVIEW ON MACHINE LEARNING IN ADAS
A REVIEW ON MACHINE LEARNING IN ADASIRJET Journal
 
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...IRJET Journal
 
P.E.B. Framed Structure Design and Analysis Using STAAD Pro
P.E.B. Framed Structure Design and Analysis Using STAAD ProP.E.B. Framed Structure Design and Analysis Using STAAD Pro
P.E.B. Framed Structure Design and Analysis Using STAAD ProIRJET Journal
 
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...IRJET Journal
 
Survey Paper on Cloud-Based Secured Healthcare System
Survey Paper on Cloud-Based Secured Healthcare SystemSurvey Paper on Cloud-Based Secured Healthcare System
Survey Paper on Cloud-Based Secured Healthcare SystemIRJET Journal
 
Review on studies and research on widening of existing concrete bridges
Review on studies and research on widening of existing concrete bridgesReview on studies and research on widening of existing concrete bridges
Review on studies and research on widening of existing concrete bridgesIRJET Journal
 
React based fullstack edtech web application
React based fullstack edtech web applicationReact based fullstack edtech web application
React based fullstack edtech web applicationIRJET Journal
 
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...IRJET Journal
 
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.IRJET Journal
 
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...IRJET Journal
 
Multistoried and Multi Bay Steel Building Frame by using Seismic Design
Multistoried and Multi Bay Steel Building Frame by using Seismic DesignMultistoried and Multi Bay Steel Building Frame by using Seismic Design
Multistoried and Multi Bay Steel Building Frame by using Seismic DesignIRJET Journal
 
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...IRJET Journal
 

More from IRJET Journal (20)

TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
 
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURESTUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
 
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
 
Effect of Camber and Angles of Attack on Airfoil Characteristics
Effect of Camber and Angles of Attack on Airfoil CharacteristicsEffect of Camber and Angles of Attack on Airfoil Characteristics
Effect of Camber and Angles of Attack on Airfoil Characteristics
 
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
 
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
 
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
 
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
 
A REVIEW ON MACHINE LEARNING IN ADAS
A REVIEW ON MACHINE LEARNING IN ADASA REVIEW ON MACHINE LEARNING IN ADAS
A REVIEW ON MACHINE LEARNING IN ADAS
 
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
 
P.E.B. Framed Structure Design and Analysis Using STAAD Pro
P.E.B. Framed Structure Design and Analysis Using STAAD ProP.E.B. Framed Structure Design and Analysis Using STAAD Pro
P.E.B. Framed Structure Design and Analysis Using STAAD Pro
 
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
 
Survey Paper on Cloud-Based Secured Healthcare System
Survey Paper on Cloud-Based Secured Healthcare SystemSurvey Paper on Cloud-Based Secured Healthcare System
Survey Paper on Cloud-Based Secured Healthcare System
 
Review on studies and research on widening of existing concrete bridges
Review on studies and research on widening of existing concrete bridgesReview on studies and research on widening of existing concrete bridges
Review on studies and research on widening of existing concrete bridges
 
React based fullstack edtech web application
React based fullstack edtech web applicationReact based fullstack edtech web application
React based fullstack edtech web application
 
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
 
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
 
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
 
Multistoried and Multi Bay Steel Building Frame by using Seismic Design
Multistoried and Multi Bay Steel Building Frame by using Seismic DesignMultistoried and Multi Bay Steel Building Frame by using Seismic Design
Multistoried and Multi Bay Steel Building Frame by using Seismic Design
 
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
 

Recently uploaded

Extrusion Processes and Their Limitations
Extrusion Processes and Their LimitationsExtrusion Processes and Their Limitations
Extrusion Processes and Their Limitations120cr0395
 
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINEMANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINESIVASHANKAR N
 
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escortsranjana rawat
 
AKTU Computer Networks notes --- Unit 3.pdf
AKTU Computer Networks notes ---  Unit 3.pdfAKTU Computer Networks notes ---  Unit 3.pdf
AKTU Computer Networks notes --- Unit 3.pdfankushspencer015
 
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130Suhani Kapoor
 
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete RecordCCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete RecordAsst.prof M.Gokilavani
 
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur EscortsCall Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur High Profile
 
KubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghlyKubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghlysanyuktamishra911
 
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...Soham Mondal
 
Java Programming :Event Handling(Types of Events)
Java Programming :Event Handling(Types of Events)Java Programming :Event Handling(Types of Events)
Java Programming :Event Handling(Types of Events)simmis5
 
Microscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptxMicroscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptxpurnimasatapathy1234
 
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).pptssuser5c9d4b1
 
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130Suhani Kapoor
 
The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...
The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...
The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...ranjana rawat
 
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur High Profile
 
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICSAPPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICSKurinjimalarL3
 
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service NashikCall Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service NashikCall Girls in Nagpur High Profile
 
Software Development Life Cycle By Team Orange (Dept. of Pharmacy)
Software Development Life Cycle By  Team Orange (Dept. of Pharmacy)Software Development Life Cycle By  Team Orange (Dept. of Pharmacy)
Software Development Life Cycle By Team Orange (Dept. of Pharmacy)Suman Mia
 
SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )Tsuyoshi Horigome
 

Recently uploaded (20)

DJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINE
DJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINEDJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINE
DJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINE
 
Extrusion Processes and Their Limitations
Extrusion Processes and Their LimitationsExtrusion Processes and Their Limitations
Extrusion Processes and Their Limitations
 
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINEMANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
 
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
 
AKTU Computer Networks notes --- Unit 3.pdf
AKTU Computer Networks notes ---  Unit 3.pdfAKTU Computer Networks notes ---  Unit 3.pdf
AKTU Computer Networks notes --- Unit 3.pdf
 
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
 
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete RecordCCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
 
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur EscortsCall Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
 
KubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghlyKubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghly
 
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
 
Java Programming :Event Handling(Types of Events)
Java Programming :Event Handling(Types of Events)Java Programming :Event Handling(Types of Events)
Java Programming :Event Handling(Types of Events)
 
Microscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptxMicroscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptx
 
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
 
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
 
The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...
The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...
The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...
 
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
 
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICSAPPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
 
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service NashikCall Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik
 
Software Development Life Cycle By Team Orange (Dept. of Pharmacy)
Software Development Life Cycle By  Team Orange (Dept. of Pharmacy)Software Development Life Cycle By  Team Orange (Dept. of Pharmacy)
Software Development Life Cycle By Team Orange (Dept. of Pharmacy)
 
SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )
 

IRJET- Semantic Question Matching

  • 1. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 07 Issue: 01 | Jan 2020 www.irjet.net p-ISSN: 2395-0072 © 2020, IRJET | Impact Factor value: 7.34 | ISO 9001:2008 Certified Journal | Page 2122 Semantic Question Matching Ruchit Mody1, Yesha Sanghavi2, Sejal D’Mello3 1B.E. student, Dept. of Information and Technology, Atharva college of Engineering, Maharashtra, India 2B.E. student, Dept. of Information and Technology, Atharva college of Engineering, Maharashtra, India 3Assistant Professor, Dept. of Information and Technology, Atharva college of Engineering, Maharashtra, India ---------------------------------------------------------------------***---------------------------------------------------------------------- Abstract – Quora is a very popular and known tool used by developers all over the world. Owingtothispopularity millions of questions of are asked by users everyday in the hope of getting expert solution. Hence there is a high possibility that numerous users pose questions that might having same intent or purpose. The methods used in this paper will help people discover questions that have already been discussed on the forum previously and keep individuals from replying to a similar question time and time again thereby improving improving user experience for all. The main aim of thisproject is to allow a user to see all questions that are semantically similar to his/her query so that the most optimal solution is made easily available and thus limiting the amount of duplicate questions being generated on the platform. This paper contributes to build up a proficient Semantic Question Detection system by experimenting with significant kinds of Natural Language Processing techniques used for intent identification Key Words: Natural Language Processing, Semantic Question Matching, Word Movers Distance, Intent Classification, Google News Vector. 1. INTRODUCTION Quora is a very popular social media platform where people ask questions and can connect to the actual experts who contribute unique insights and quality answers. But since it has such an enormous user basesoit’snosurprisethatmany people ask similarly worded questions. Among the numerous questions posted in forums, two or more of them may express the same point and thus are duplicates of one another. Duplicate questions make site maintenanceharder, waste resources that could have been used to answer other questions, and cause developers to unnecessarily wait for answers that are already available. Question duplication is the primary trouble encountered by using Q&A boards like Quora, Stack-overflow, Reddit, and so on. Answers get fragmented throughout distinct versions of the identical query due to the redundancy of questions in those forums. Eventually, this consequences in loss of a practical search, answer fatigue and segregation of information. This paper contributes to develop an efficient question duplication detection system using the power of Machine Learning and Natural Language Processing. The meaning of a sentence does not only depend on the words in it, but also intheorder with which they are combined. The semantic similarity can have several dimensions, and sentences may be similar in one aspect but opposite in the other. For example if a person asks, “Did Linked-in acquire Microsoft?” while in the same platform if another user asks, “Did MicrosoftacquireLinked- in?” the words used in both cases are same but the order completely changes the meaning of the sentence. In this paper we propose to identify questions with similar intent and then classify them as duplicate or original and to do so we first automatically generate tags, using NLP, for all available questions and after comparing the tags using various methodologies we calculate similarity between them. 2. LITERATURE SURVEY Early work to distinguish the similitude between sentences utilized physically designed features like word overlap alongside conventional Artificial intelligence and Machine learning algorithms like Support Vector Machines. Semantic analysis of sentences has traditionally concentrated on logical inference based on the Inference Corpus of Stanford Natural Language. The paper by Tim Rocktaschel, Edward Grefenstette, Karl Moritz Hermann, TomasKociskya andPhil Blunsom used LSTMs to concentrate on word-by-word recognition methods [1].Neural Network methodologies have been the in a broader selection of NLP tasks. Siamese neural network which contains two sub-networks joined at their outputs was proposed. In spite of the fact that the Siamese architecture is lightweight and easily trainable, there is clearly a lesser impact of correlation between the parameters due to whichinformationlossmayoccur[2].The Compare-Aggregate model [3] which indeed captures the interaction between two sentenceswasproposedinorderto ensure that the limitations of the Siamese framework were overcomed. The recent study by Zhang et al shows the effect of word embedding. Their methodology, PCQADup, relies on extracted features of question pairs. They announced a noteworthy improvement in results contrasted with DupPredictor and Dupe [4],[5].In the paper “Mining Duplicate Questions in Stack Overflow” by Ahasanuzzaman Muhammad, Chanchal K. Roy and Kevin A. Schneider , for detecting duplicate questions, they used a discriminative model classifier together with BM25scoringfunction. Witha wide numberof questionsconcerningmultiple programming languages, they tested their strategy. As per the results of
  • 2. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 07 Issue: 01 | Jan 2020 www.irjet.net p-ISSN: 2395-0072 © 2020, IRJET | Impact Factor value: 7.34 | ISO 9001:2008 Certified Journal | Page 2123 their proposed strategy Dupe, it was found that Dupe outperforms Dup-Predictor, Stack Overflow's only usable duplicate query tracker [6]. To test the similarities between two interrogative segments in a set, a Duplicate question detection approach with a Jaccard coefficientinthepaper by Y. Wu, Q. Zhang, and X. Huang , “Efficient near- duplicate detection for Q&A forum,” was utilized. A Duplicatequestion detection dataset resorting to the Baidu Zhidao, a Chinese question and answer platform supported by the search engine Baidu, was developed. The program achieved a f- score of 60.29 by practicing with 3M pairs and checking on 3k pairs [7],[8]. 3. PROPOSED SYSTEM AND ARCHITECTURE 3.1 Architecture Description The proposed approach whichwewill developistomatch an input question with all its possibleformsof duplicates.We first perform dataset preprocessing by removing all the stopwords, question marks and other unwanted characters. We need our model to understand that the words "ask" and "asked" are simply various tenses of a similar action word and Stemming and LemmatizationareconceptsofNLPthat’s helps us to do just that. It helps us to convertvarioustypesof a word to a center root. After preprocessing we generate tags or keywords to every question. For every question, if 50% tags of that question match with the tags of another we run it through a Natural LanguageProcessingalgorithmused for classification. Figure 3.1: Architecture 3.2 IMPLEMENTATION Our Research is carried out in the following stages: A. Data Preprocessing All the available questions are first cleaned by converting all to lowercase, removing stopwords, question marks and other unwanted characters. After which we usestemmingto convert the words into its equivalent core words. B. Generation of Tags For all the available questions we automatically generate keywords/tags using TFIDF(Term Frequency Inverse- Document Frequency).This is a method used to evaluate the importance that a particular word has as comparedtoall the other words in the document. Words having the highest TFIDF score are chosen as the tags for that question. We developed a python code that takes Questionasaninputand finds out the number of words in that question. Using this it generates a particular number of tags which is equal to30% of the length of question. For Example if a question of 11 words is asked such that “What are the different steps in becoming a freelance datascientist?”The3tagsgenerated by the model are “steps”,”freelance”,”datascientist” C. Calculation of Similarity Since the machine cannot understand human language in this step we first convert all the words of the question into vectors. To do so we use Google News Vectors. Google News Vector includes word vectors for a vocabulary of 3 million words and phrases that are trained on around 100 billion words from a Google News dataset [9]. For every question if 50% tags of that question match with the tags of another we run it through a NLP algorithm used for classification.Inthat model we use Word Movers Distance to calculate similarity between the questions. WMD calculates the dissimilarity between two texts as the amount of distance that the embedded words of one text needs to travel to reach the embedded words of another text D. Developing a Front End To make our model more user friendly we developed a webpage using HTML5 and CSS. Using this web page any layman can work with our Python Model. We have used a flask framework that helps us to connect our python code to an HTML web page. Using Flask we can now create a web app that will trigger our python model on a button click. HTML5 and CSS will make our web page responsive and mobile friendly and accessible from devices of all sizes
  • 3. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 07 Issue: 01 | Jan 2020 www.irjet.net p-ISSN: 2395-0072 © 2020, IRJET | Impact Factor value: 7.34 | ISO 9001:2008 Certified Journal | Page 2124 4. OUTPUT Figure 4.1: Sample result for no match Figure 4.2: Sample result for matching questions 5. CONCLUSION In this paper we try to identify the problem, of duplicate questions. We propose an unsupervised approach to duplication detection by developing a model that is able to identify the base intent of the question. Our paper is able to reflect the various advantages pertaining to identifying duplicate questions on question and answer forums which will lead to cheaper data storage aslessdata wouldbestored and the user experience would be improved as they can expect faster response to the questions. With the help of our project the clients will be able to find great answers without investing more time and energy searching for best answer among its similar questions. Our approachincludescleaning and tokenizing the questions in the initial phase to prepare the data set of questions. In our second stage we develop a model that is able to automatically generate tags/keywords for all the Questions. The third stage is thestageinwhichour model is prepared based on Natural Language Processing. It uses Google News vectors to covert words into vectors after which we run our similarity calculation model. WMD is a method that allows us to assess the “distance” between two documents in a meaningful way, no mattertheyhaveor have no words in common[10]. The final stage includes the implementation of the GUI where in aninteractive websiteis created using Flask, HTML5 and CSS. Hence, this research work adopts Natural Language Processing to address the problem of question duplicationin Q&A forumslikeQuora to classify whether questions are duplicatesornon-duplicates. ACKNOWLEDGEMENT We would like to thank our project guide Prof. Sejal D’mello for her enormous cooperation and guidance. We have no words to express our gratitude for a person who wholeheartedly supported the project and gave freelyof her valuable time while making this project. All the inputs given by her found a place in the project. She has always been a source of inspiration for us. REFERENCES [1] Tim Rocktaschel, Edward Grefenstette, Karl Moritz Hermann, Tomas Kocisky, Phil Blunsom.” Reasoning about entailment with neural attention”, In ICLR 2016 [2] Wang, Zhiguo, Wael Hamza, and Radu Florian. "Bilateral Multi-Perspective Matching for Natural Language Sentences.", In arXiv.org ,2017. [3] Wang, Shuohang, and Jing Jiang. "A Compare-Aggregate Model for Matching Text Sequences.", In arXiv.org ,2016 [4] Rodrigo F. G. Silva, Klérisson Paixão, Marcelo de Almeida Maia “ Duplicate Question Detection in Stack Overflow: A Reproducibility Study “ , 978-1-5386-4969-5/18/$31.00 , 2018 IEEE SANER 2018, Campobasso, Italy RENE Track. [5] W. E. Zhang, Q. Z. Sheng, J. H. Lau, and E. Abebe, “Detecting duplicate posts in programming qa communities via latent semantics and association rules,” in 26th International Conference on World Wide Web (WWW), Geneva, Switzerland, 2017, pp. 1221–1229. [6] Muhammad Asaduzzaman , Chanchal K. Roy , Kevin A. Schneider “Mining Duplicate Questions of Stack Overflow”, IEEE/ACM 13th Working Conference on Mining Software Repositories , September 2017. [7] Chakaveh Saedi, Joao Rodrigues, Joao Silva, Antonio Branco, Vladislav Maraev, “Learning Profiles in Duplicate Question Detection”, IEEE International Conference on Information Reuse and Integration (IRI), 2017 . [8] Y. Wu, Q. Zhang, and X. Huang, “Efficient near- duplicate detection for Q&A forum,” in Proceedings of the 5th International Joint Conference on Natural Language Processing (IJCNLP), 2011, pp. 1001–1009. [9] Ashwin Dhakal,Arpan Poudel ,Sagar Pandey ” Exploring Deep Learning in Semantic Question Matching”, IEEE 3rd International Conferenceon Computing,Communication and Security (ICCCS), Kathmandu (Nepal) , June 2018 [10] Naveen Saini, Pushpak Bhattacharyya, Sriparna Saha, Dhiraj Chakraborty,” Extractive single document summarization using binary differential evolution: Optimization of different sentencequalitymeasures”,InPLoS ONENovember2019