SlideShare une entreprise Scribd logo
1  sur  5
Télécharger pour lire hors ligne
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 06 Issue: 11 | Nov 2019 www.irjet.net p-ISSN: 2395-0072
© 2019, IRJET | Impact Factor value: 7.34 | ISO 9001:2008 Certified Journal | Page 1859
FINANCIAL ANALYSIS USING DATA MINING
Akshaya Rajagopalan1, Vidhya. S2, P.R Bhavya3
1Akshaya Rajagopalan, Dept. of Computer Application, M.O.P Vaishnav College for women, TamilNadu, India
2Vidhya.S, Dept. of Computer Application, M.O.P Vaishnav College for women, TamilNadu, India
3P.R Bhavya, Dept. of Computer Application, M.O.P Vaishnav College for women, TamilNadu, India
---------------------------------------------------------------------***----------------------------------------------------------------------
Abstract - Data mining refers to extracting or mining knowledge from large amounts of data. Data mining isused inpredictionof
various diseases, stock market trends and weather conditions. The paper uses data mining algorithms to predict whether the
NASDAQ share market will increase every month. The prediction is done using KNN classification, Rule based classification and
Deep Learning technique. The three algorithms are compared with each other for determining the one with higher accuracy.
Key Words: (Size 10 & Bold) Key word1, Key word2, Key word3, etc (Minimum 5 to 8 key words)…
1. INTRODUCTION
Data mining refers to extracting information from huge sets of data. In general, we can say that data mining is the
process of mining knowledge from data. The information or knowledge extracted can be used for variousapplicationssuchas
market analysis, fraud detection, customer retention and many more.
Stock market trends are very unstable. This leads to high risk in stock returns. The stock returns for an investor is
important as they invest their money on stocks to gain profit. In order to gain high stock returns,knowledgeonstock trends is
necessary. There are market experts who predict when an investor can buy, sell or hold shares. The prediction however is
difficult due to complex and unstable nature of stock market. On the other hand, there ishugeamountofdata generatedbythe
stock market which can be used for prediction. There are various ways to predict stock market trends.
There are various algorithms for classification and clustering data. The paper uses the data set of NASDAQ share
market of 10 years for every month from Jan-2009 to Jan-2019 from it official website. The dataset was given label as
increasing which has either yes or no as its value based on previous month and current month price of stock.
1.1 Literature Review
[1]Examining the stability and persistence of the long-short performance of fundamental signals over time. This analysis is
important because previous studies (e.g., Sullivan, Timmermann, and White 2001) argue that the analysis of sub period
stability is a remedy against data mining.
[2]Tokenization: Each news article or financial report document is split into meaningful words called tokens. They are: Data
standardization, Stop-word-removal, Stemming, Abbreviation processing, Filter tokens.
[3]To develop the new models, two methods were used: DA and decision trees.
Using these methods, we identified the most significant variables from the basic sets in terms of prediction bankruptcy/non
bankruptcy of companies within the Slovak business environment.
[4]There are often inconsistencies in how entity names and addresses are entered, in addition to outright errors and typos.
There is implicit semantic knowledge included in a name, e.g., a name may contain National association or State Bank of in its
name. This complicates matching based on a similarity score that is obtained using some edit distance metric.
[5]Mentioning that the final purpose behind text classification processes is to facilitate text analyzing in order to recognize
hidden textual patterns with reduced manual efforts. To do this, we focus on utilization of classified financial footnotes in the
last phase of the research, which consists of representation of classified output in each category.
[6]If for denser structure trees all effective features in first prediction are selected by the proposed hybrid model, results in
better accuracy as ‘‘BF Tree’’, ‘‘LAD Tree’’, and ‘‘FT Tree’’. Otherwise, it is possible that accuracy drops, like ‘‘CART and Rep’’
TREEs. The selected features have different effect on the accuracy of forecasting. Some trees with large structure, such as J48
Graph and J48 Tree.
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 06 Issue: 11 | Nov 2019 www.irjet.net p-ISSN: 2395-0072
© 2019, IRJET | Impact Factor value: 7.34 | ISO 9001:2008 Certified Journal | Page 1860
[7]The LDA results from the previous section show that Comcast has serious customer service issues. We want to further
explore these issues with sentiment analysis, which is a supervised learning method and can also be used to visualize and
summarize big data. Sentiment analysis can be carried out in various models, such as maximum entropy.
[8]As cryptocurrencies continue to develop they have merited the attention of policy-makers and regulators for a host of
differing reasons. However, there have been three specific situations that have been quite unique to cryptocurrency markets.
[9] Always aiming to increase credit volume while reducing default risks. Therefore, credit scoring analyses are crucial to aid
faster decision making, reduce the costs of loan analysis, monitor existing accounts more closely, predict default risks, and
ensure that institutions can detect possible risks while developing their competitiveness
2. ANALYSIS
The analysis is done using Data mining tool named RapidMiner Studio. Itisabouttheusageofthreemainalgorithms:KNearest
Neighbour, Rule Based Classification and Deep Learning for predictingwhethertheNASDAQmarketwill increase everymonth
or not and with higher accuracy. The label for prediction is increasing with values yes or no. This data was then imported to
RapidMiner and many algorithms were applied. In the end only three algorithms had significant accuracy. These three are
compared and the one with higher accuracy is best suited for this prediction. The working, results and performance of each
algorithm is given below.
K NEAREST NEIGHBOURS
K Nearest Neighbours is a simple algorithm that stores all availablecasesandclassifiesnewcasesbasedona similarity
measure (e.g. distance functions). It belongs to the supervised learning domain and finds intense application in pattern
recognition, data mining and intrusion detection. In KNN-classification,theoutputisa classmembership.Anobjectisclassified
by a plurality vote of its neighbours, with the object being assigned to the class most common among its k- nearestneighbours
(k is a positive integer, typically small). If k = 1, then the object is simply assigned to the class of that single nearest neighbour.
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 06 Issue: 11 | Nov 2019 www.irjet.net p-ISSN: 2395-0072
© 2019, IRJET | Impact Factor value: 7.34 | ISO 9001:2008 Certified Journal | Page 1861
The table above shows the how accurate the result is. The first parameteriswith respecttopredictionnowhichistrue
no: 86 and true yes: 31. The second parameter is prediction yeswhichistrueno:4.Therefore,thetotal accuracyis71.28% with
error of +/- 13.89%.
RULE BASED CLASSIFICATION
The term rule-based classification can be used to refer to any classification scheme that makes use of IF-THEN rules
for class prediction. The IF part of the rule is called rule antecedent or precondition. The THEN part of the rule is called rule
consequent. The antecedent part the condition consists of one or more attribute tests andthesetestsarelogicallyANDED. The
consequent part consists of class prediction.
The table above shows the how accurate the result is. The first parameteriswith respecttopredictionnowhichistrue
no: 90and true yes: 31. The second parameter is prediction yes which is true no: 0 and trueyes:0.Therefore,thetotal accuracy
is 74.49% with error of +/- 11.64%.
DEEP LEARNING
Deep Learning is a set of Machine Learning algorithmswhichhaveoneormorehiddenlayersinNeural Networks.Now
Deep Learning systems are used for almost any tasks other Machine Learning algorithms are used for. These can be
Classification, Dimensionality Reduction, Object Recognition, Clustering etc.
Model Metrics Type: Binomial
Description: Metrics reported on full training frame
model id: rm-h2o-model-deep_learning-847701
frame id: rm-h2o-frame-deep_learning-797658
MSE: 0.01230316
R^2: 0.9354371
AUC: 1.0
logloss: 0.089967564
CM: Confusion Matrix (vertical: actual; across: predicted):
no yes Error Rate
no 90 0 0.0000 = 0 / 90
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 06 Issue: 11 | Nov 2019 www.irjet.net p-ISSN: 2395-0072
© 2019, IRJET | Impact Factor value: 7.34 | ISO 9001:2008 Certified Journal | Page 1862
yes 0 31 0.0000 = 0 / 31
Totals 90 31 0.0000 = 0 / 121
Gains/Lift Table (Avg response rate: 25.62 %):
Group Cumulative Data Fraction Lower Threshold Lift Cumulative Lift ResponseRate CumulativeResponseRate Capture
Rate Cumulative Capture Rate Gain Cumulative Gain
1 0.01652893 0.949065 3.903226 3.903226 1.000000 1.000000 0.064516 0.064516 290.322581 290.322581
2 0.02479339 0.938766 3.903226 3.903226 1.000000 1.000000 0.032258 0.096774 290.322581 290.322581
3 0.03305785 0.936683 3.903226 3.903226 1.000000 1.000000 0.032258 0.129032 290.322581 290.322581
4 0.04132231 0.934789 3.903226 3.903226 1.000000 1.000000 0.032258 0.161290 290.322581 290.322581
5 0.05785124 0.903776 3.903226 3.903226 1.000000 1.000000 0.064516 0.225806 290.322581 290.322581
6 0.10743802 0.876076 3.903226 3.903226 1.000000 1.000000 0.193548 0.419355 290.322581 290.322581
7 0.15702479 0.839690 3.903226 3.903226 1.000000 1.000000 0.193548 0.612903 290.322581 290.322581
8 0.20661157 0.801775 3.903226 3.903226 1.000000 1.000000 0.193548 0.806452 290.322581 290.322581
9 0.30578512 0.151490 1.951613 3.270270 0.500000 0.837838 0.193548 1.000000 95.161290 227.027027
10 0.40495868 0.087053 0.000000 2.469388 0.000000 0.632653 0.000000 1.000000 -100.000000 146.938776
11 0.50413223 0.052242 0.000000 1.983607 0.000000 0.508197 0.000000 1.000000 -100.000000 98.360656
12 0.60330579 0.044419 0.000000 1.657534 0.000000 0.424658 0.000000 1.000000 -100.000000 65.753425
13 0.70247934 0.035038 0.000000 1.423529 0.000000 0.364706 0.000000 1.000000 -100.000000 42.352941
14 0.80165289 0.029978 0.000000 1.247423 0.000000 0.319588 0.000000 1.000000 -100.000000 24.742268
15 0.90082645 0.021018 0.000000 1.110092 0.000000 0.284404 0.000000 1.000000 -100.000000 11.009174
16 1.00000000 0.005519 0.000000 1.000000 0.000000 0.256198 0.000000 1.000000 -100.000000 0.000000
Status of Neuron Layers (predicting increasing, 2-class classification, bernoulli distribution, CrossEntropy loss, 44,752
weights/biases, 529.2 KB, 1,210 training samples, mini-batch size 1):
Layer Units Type Dropout L1 L2 Mean Rate Rate RMS Momentum Mean Weight Weight RMS Mean Bias Bias RMS
1 841 Input 0.00 %
2 50 Rectifier 0.00 % 0.000010 0.000000 0.026497 0.097324 0.000000 0.000063 0.048879 0.485324 0.071063
3 50 Rectifier 0.00 % 0.000010 0.000000 0.013653 0.026169 0.000000 -0.000503 0.143624 0.997198 0.013364
4 2 Softmax 0.000010 0.000000 0.002090 0.001478 0.000000 -0.079601 0.434169 -0.000000 0.007881
Scoring History:
Timestamp Duration Training Speed Epochs Iterations Samples Training MSE Training R^2 Training LogLoss
Training AUC Training Lift Training Classification Error
2019-09-22 15:33:36 0.000 sec 0.00000 0 0.000000 NaN NaN NaN NaN NaN NaN
2019-09-22 15:33:36 0.453 sec 703 rows/sec 1.00000 1 121.000000 0.18056 0.05247 0.53647 0.76416 1.95161 0.19835
2019-09-22 15:33:37 1.078 sec 1549 rows/sec 10.00000 10 1210.000000 0.01230 0.93544 0.08997 1.00000 3.90323 0.00000
H2O version: 3.8.2.6-rm9.0.0
The above table shows the how accurate the result is. The first parameteriswith respecttopredictionnowhichistrue
no: 90 and true yes: 29. The second parameter is prediction yes which is true yes: 2.Therefore, the total accuracy is 76.03%
with error of +/- 10.72%.
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 06 Issue: 11 | Nov 2019 www.irjet.net p-ISSN: 2395-0072
© 2019, IRJET | Impact Factor value: 7.34 | ISO 9001:2008 Certified Journal | Page 1863
3. CONCLUSION
In this paper, dataset of NASDAQ share market has been analysed to predict whetherthepricewill increaseor notinupcoming
month. The three algorithms- KNN, Rule model and Deep Learning are applied and their performances are compared. On
comparing these three , Deep Learning has highest accuracy of 76.03% +/-10.72%. Rule based classifier comes next with
74.49% +/-11.64%. The least among the three is KNN with 71.28 +/-13.89%. Therefore wecanconcludethatDeep Learningis
best suited for predicting whether the NASDAQ share market will increase each month or not.
REFERENCES
[1.]Fundamental Analysis and the Cross-Section of Stock: A Data-Mining Approach,Lingling Zheng,2019
[2].Predicting Stock Market Behavior using Data Mining Technique, Ayman Elsayed Khedr ,2017
[3]. Analysis of Impact of Using the Trend Variables on Bankruptcy Prediction Models Performance, Beáta GAVUROVÁ,2017
[4]. Research Challenges in Financial Data Modeling and Analysis,Lewis Alexander ,Sanjiv R. Das ,2017
[5].Financial Footnote Analysis: Developing a Text Mining Approach,Maryam Heidari, Carsten Felden ,2018
[6].Developing an approach to evaluate stocks by forecasting effective features with data mining ,Sasan Barak,2018.
[7.] Analyzing the impact of user-generated content on B2B Firms' stockperformance: Big dataanalysiswithmachinelearning
methods,Xia Liu,2019
[8.]Cryptocurrencies as a Financial Asset: A systematic analysis,Shaen Corbet,2019
[9]. -Comparison of Data Mining Classification Algorithms Determining the Default Risk, Begüm Çığşar and Deniz Ünal,2017
BIOGRAPHIES
Akshaya Rajagopalan a student of
M.O.P Vaishnav College currently
in her final semesterpursuingBCA.
S.Vidhya a student of M.O.P
Vaishnav College currently in her
final semester pursuing BCA.
Bhavya.P.R a student of M.O.P
Vaishnav College currently in her
final semester pursuing BCA.

Contenu connexe

Tendances

PERFORMANCE ANALYSIS and PREDICTION of NEPAL STOCK MARKET (NEPSE) for INVESTM...
PERFORMANCE ANALYSIS and PREDICTION of NEPAL STOCK MARKET (NEPSE) for INVESTM...PERFORMANCE ANALYSIS and PREDICTION of NEPAL STOCK MARKET (NEPSE) for INVESTM...
PERFORMANCE ANALYSIS and PREDICTION of NEPAL STOCK MARKET (NEPSE) for INVESTM...
Hari KC
 
Poonam ahluwalia
Poonam ahluwaliaPoonam ahluwalia
Poonam ahluwalia
PMI2011
 

Tendances (20)

PERFORMANCE ANALYSIS and PREDICTION of NEPAL STOCK MARKET (NEPSE) for INVESTM...
PERFORMANCE ANALYSIS and PREDICTION of NEPAL STOCK MARKET (NEPSE) for INVESTM...PERFORMANCE ANALYSIS and PREDICTION of NEPAL STOCK MARKET (NEPSE) for INVESTM...
PERFORMANCE ANALYSIS and PREDICTION of NEPAL STOCK MARKET (NEPSE) for INVESTM...
 
IRJET- Stock Market Prediction using ANN
IRJET- Stock Market Prediction using ANNIRJET- Stock Market Prediction using ANN
IRJET- Stock Market Prediction using ANN
 
IRJET - Stock Market Prediction using Machine Learning Algorithm
IRJET - Stock Market Prediction using Machine Learning AlgorithmIRJET - Stock Market Prediction using Machine Learning Algorithm
IRJET - Stock Market Prediction using Machine Learning Algorithm
 
Dmml report final
Dmml report finalDmml report final
Dmml report final
 
Prediction of customer propensity to churn - Telecom Industry
Prediction of customer propensity to churn - Telecom IndustryPrediction of customer propensity to churn - Telecom Industry
Prediction of customer propensity to churn - Telecom Industry
 
IRJET- Stock Price Prediction using Long Short Term Memory
IRJET-  	  Stock Price Prediction using Long Short Term MemoryIRJET-  	  Stock Price Prediction using Long Short Term Memory
IRJET- Stock Price Prediction using Long Short Term Memory
 
IRJET- Stock Price Prediction using combination of LSTM Neural Networks, ARIM...
IRJET- Stock Price Prediction using combination of LSTM Neural Networks, ARIM...IRJET- Stock Price Prediction using combination of LSTM Neural Networks, ARIM...
IRJET- Stock Price Prediction using combination of LSTM Neural Networks, ARIM...
 
International Journal of Computational Engineering Research (IJCER)
International Journal of Computational Engineering Research (IJCER) International Journal of Computational Engineering Research (IJCER)
International Journal of Computational Engineering Research (IJCER)
 
Q04602106117
Q04602106117Q04602106117
Q04602106117
 
Poonam ahluwalia
Poonam ahluwaliaPoonam ahluwalia
Poonam ahluwalia
 
IRJET- A Comprehensive way of finding Top-K Competitors using C-Miner Algorithm
IRJET- A Comprehensive way of finding Top-K Competitors using C-Miner AlgorithmIRJET- A Comprehensive way of finding Top-K Competitors using C-Miner Algorithm
IRJET- A Comprehensive way of finding Top-K Competitors using C-Miner Algorithm
 
Wp0030
Wp0030Wp0030
Wp0030
 
Stock price prediction using Neural Net
Stock price prediction using Neural NetStock price prediction using Neural Net
Stock price prediction using Neural Net
 
Stock market analysis
Stock market analysisStock market analysis
Stock market analysis
 
Presentation1
Presentation1Presentation1
Presentation1
 
Statistics And Probability Tutorial | Statistics And Probability for Data Sci...
Statistics And Probability Tutorial | Statistics And Probability for Data Sci...Statistics And Probability Tutorial | Statistics And Probability for Data Sci...
Statistics And Probability Tutorial | Statistics And Probability for Data Sci...
 
L3 1b
L3 1bL3 1b
L3 1b
 
Opinion mining framework using proposed RB-bayes model for text classication
Opinion mining framework using proposed RB-bayes model for text classicationOpinion mining framework using proposed RB-bayes model for text classication
Opinion mining framework using proposed RB-bayes model for text classication
 
13F_working_paper
13F_working_paper13F_working_paper
13F_working_paper
 
stock market prediction
stock market predictionstock market prediction
stock market prediction
 

Similaire à IRJET- Financial Analysis using Data Mining

Similaire à IRJET- Financial Analysis using Data Mining (20)

Bank Customer Segmentation & Insurance Claim Prediction
Bank Customer Segmentation & Insurance Claim PredictionBank Customer Segmentation & Insurance Claim Prediction
Bank Customer Segmentation & Insurance Claim Prediction
 
IRJET - An Overview of Machine Learning Algorithms for Data Science
IRJET - An Overview of Machine Learning Algorithms for Data ScienceIRJET - An Overview of Machine Learning Algorithms for Data Science
IRJET - An Overview of Machine Learning Algorithms for Data Science
 
IRJET- Improving Prediction of Potential Clients for Bank Term Deposits using...
IRJET- Improving Prediction of Potential Clients for Bank Term Deposits using...IRJET- Improving Prediction of Potential Clients for Bank Term Deposits using...
IRJET- Improving Prediction of Potential Clients for Bank Term Deposits using...
 
IRJET- Future Stock Price Prediction using LSTM Machine Learning Algorithm
IRJET-  	  Future Stock Price Prediction using LSTM Machine Learning AlgorithmIRJET-  	  Future Stock Price Prediction using LSTM Machine Learning Algorithm
IRJET- Future Stock Price Prediction using LSTM Machine Learning Algorithm
 
STOCK MARKET ANALYZING AND PREDICTION USING MACHINE LEARNING TECHNIQUES
STOCK MARKET ANALYZING AND PREDICTION USING MACHINE LEARNING TECHNIQUESSTOCK MARKET ANALYZING AND PREDICTION USING MACHINE LEARNING TECHNIQUES
STOCK MARKET ANALYZING AND PREDICTION USING MACHINE LEARNING TECHNIQUES
 
MUTUAL FUND RECOMMENDATION SYSTEM WITH PERSONALIZED EXPLANATIONS
MUTUAL FUND RECOMMENDATION SYSTEM WITH PERSONALIZED EXPLANATIONSMUTUAL FUND RECOMMENDATION SYSTEM WITH PERSONALIZED EXPLANATIONS
MUTUAL FUND RECOMMENDATION SYSTEM WITH PERSONALIZED EXPLANATIONS
 
MACHINE LEARNING CLASSIFIERS TO ANALYZE CREDIT RISK
MACHINE LEARNING CLASSIFIERS TO ANALYZE CREDIT RISKMACHINE LEARNING CLASSIFIERS TO ANALYZE CREDIT RISK
MACHINE LEARNING CLASSIFIERS TO ANALYZE CREDIT RISK
 
Investment Portfolio Risk Manager using Machine Learning and Deep-Learning.
Investment Portfolio Risk Manager using Machine Learning and Deep-Learning.Investment Portfolio Risk Manager using Machine Learning and Deep-Learning.
Investment Portfolio Risk Manager using Machine Learning and Deep-Learning.
 
Business Analysis using Machine Learning
Business Analysis using Machine LearningBusiness Analysis using Machine Learning
Business Analysis using Machine Learning
 
IRJET - Forecasting Stock Market Movement Direction using Sentiment Analysis ...
IRJET - Forecasting Stock Market Movement Direction using Sentiment Analysis ...IRJET - Forecasting Stock Market Movement Direction using Sentiment Analysis ...
IRJET - Forecasting Stock Market Movement Direction using Sentiment Analysis ...
 
IRJET- Comparative Study of Classification Algorithms for Sentiment Analy...
IRJET-  	  Comparative Study of Classification Algorithms for Sentiment Analy...IRJET-  	  Comparative Study of Classification Algorithms for Sentiment Analy...
IRJET- Comparative Study of Classification Algorithms for Sentiment Analy...
 
IRJET- Information Reterival of Text-based Deep Stock Prediction
IRJET-  	  Information Reterival of Text-based Deep Stock PredictionIRJET-  	  Information Reterival of Text-based Deep Stock Prediction
IRJET- Information Reterival of Text-based Deep Stock Prediction
 
IRJET-Financial Distress Prediction of a Company using Data Mining
IRJET-Financial Distress Prediction of a Company using Data MiningIRJET-Financial Distress Prediction of a Company using Data Mining
IRJET-Financial Distress Prediction of a Company using Data Mining
 
Quant Foundry Labs - Low Probability Defaults
Quant Foundry Labs - Low Probability DefaultsQuant Foundry Labs - Low Probability Defaults
Quant Foundry Labs - Low Probability Defaults
 
STOCK PRICE PREDICTION USING TIME SERIES
STOCK PRICE PREDICTION USING TIME SERIESSTOCK PRICE PREDICTION USING TIME SERIES
STOCK PRICE PREDICTION USING TIME SERIES
 
STOCK PRICE PREDICTION USING TIME SERIES
STOCK PRICE PREDICTION USING TIME SERIESSTOCK PRICE PREDICTION USING TIME SERIES
STOCK PRICE PREDICTION USING TIME SERIES
 
Comparative Analysis of Machine Learning Algorithms for their Effectiveness i...
Comparative Analysis of Machine Learning Algorithms for their Effectiveness i...Comparative Analysis of Machine Learning Algorithms for their Effectiveness i...
Comparative Analysis of Machine Learning Algorithms for their Effectiveness i...
 
Loan Analysis Predicting Defaulters
Loan Analysis Predicting DefaultersLoan Analysis Predicting Defaulters
Loan Analysis Predicting Defaulters
 
ANNUAL REPORT ANALYSIS WITH ADVANCED LANGUAGE MODELS: A STOCK INVESTMENT STRA...
ANNUAL REPORT ANALYSIS WITH ADVANCED LANGUAGE MODELS: A STOCK INVESTMENT STRA...ANNUAL REPORT ANALYSIS WITH ADVANCED LANGUAGE MODELS: A STOCK INVESTMENT STRA...
ANNUAL REPORT ANALYSIS WITH ADVANCED LANGUAGE MODELS: A STOCK INVESTMENT STRA...
 
A Compendium of Various Applications of Machine Learning
A Compendium of Various Applications of Machine LearningA Compendium of Various Applications of Machine Learning
A Compendium of Various Applications of Machine Learning
 

Plus de IRJET Journal

Plus de IRJET Journal (20)

TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
 
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURESTUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
 
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
 
Effect of Camber and Angles of Attack on Airfoil Characteristics
Effect of Camber and Angles of Attack on Airfoil CharacteristicsEffect of Camber and Angles of Attack on Airfoil Characteristics
Effect of Camber and Angles of Attack on Airfoil Characteristics
 
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
 
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
 
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
 
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
 
A REVIEW ON MACHINE LEARNING IN ADAS
A REVIEW ON MACHINE LEARNING IN ADASA REVIEW ON MACHINE LEARNING IN ADAS
A REVIEW ON MACHINE LEARNING IN ADAS
 
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
 
P.E.B. Framed Structure Design and Analysis Using STAAD Pro
P.E.B. Framed Structure Design and Analysis Using STAAD ProP.E.B. Framed Structure Design and Analysis Using STAAD Pro
P.E.B. Framed Structure Design and Analysis Using STAAD Pro
 
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
 
Survey Paper on Cloud-Based Secured Healthcare System
Survey Paper on Cloud-Based Secured Healthcare SystemSurvey Paper on Cloud-Based Secured Healthcare System
Survey Paper on Cloud-Based Secured Healthcare System
 
Review on studies and research on widening of existing concrete bridges
Review on studies and research on widening of existing concrete bridgesReview on studies and research on widening of existing concrete bridges
Review on studies and research on widening of existing concrete bridges
 
React based fullstack edtech web application
React based fullstack edtech web applicationReact based fullstack edtech web application
React based fullstack edtech web application
 
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
 
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
 
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
 
Multistoried and Multi Bay Steel Building Frame by using Seismic Design
Multistoried and Multi Bay Steel Building Frame by using Seismic DesignMultistoried and Multi Bay Steel Building Frame by using Seismic Design
Multistoried and Multi Bay Steel Building Frame by using Seismic Design
 
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
 

Dernier

VIP Call Girls Palanpur 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Palanpur 7001035870 Whatsapp Number, 24/07 BookingVIP Call Girls Palanpur 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Palanpur 7001035870 Whatsapp Number, 24/07 Booking
dharasingh5698
 
Cara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak Hamil
Cara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak HamilCara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak Hamil
Cara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak Hamil
Cara Menggugurkan Kandungan 087776558899
 
Top Rated Call Girls In chittoor 📱 {7001035870} VIP Escorts chittoor
Top Rated Call Girls In chittoor 📱 {7001035870} VIP Escorts chittoorTop Rated Call Girls In chittoor 📱 {7001035870} VIP Escorts chittoor
Top Rated Call Girls In chittoor 📱 {7001035870} VIP Escorts chittoor
dharasingh5698
 
Call Girls in Ramesh Nagar Delhi 💯 Call Us 🔝9953056974 🔝 Escort Service
Call Girls in Ramesh Nagar Delhi 💯 Call Us 🔝9953056974 🔝 Escort ServiceCall Girls in Ramesh Nagar Delhi 💯 Call Us 🔝9953056974 🔝 Escort Service
Call Girls in Ramesh Nagar Delhi 💯 Call Us 🔝9953056974 🔝 Escort Service
9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
ssuser89054b
 
AKTU Computer Networks notes --- Unit 3.pdf
AKTU Computer Networks notes ---  Unit 3.pdfAKTU Computer Networks notes ---  Unit 3.pdf
AKTU Computer Networks notes --- Unit 3.pdf
ankushspencer015
 

Dernier (20)

Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
 
Unleashing the Power of the SORA AI lastest leap
Unleashing the Power of the SORA AI lastest leapUnleashing the Power of the SORA AI lastest leap
Unleashing the Power of the SORA AI lastest leap
 
Block diagram reduction techniques in control systems.ppt
Block diagram reduction techniques in control systems.pptBlock diagram reduction techniques in control systems.ppt
Block diagram reduction techniques in control systems.ppt
 
FEA Based Level 3 Assessment of Deformed Tanks with Fluid Induced Loads
FEA Based Level 3 Assessment of Deformed Tanks with Fluid Induced LoadsFEA Based Level 3 Assessment of Deformed Tanks with Fluid Induced Loads
FEA Based Level 3 Assessment of Deformed Tanks with Fluid Induced Loads
 
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdfONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf
 
Design For Accessibility: Getting it right from the start
Design For Accessibility: Getting it right from the startDesign For Accessibility: Getting it right from the start
Design For Accessibility: Getting it right from the start
 
VIP Call Girls Palanpur 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Palanpur 7001035870 Whatsapp Number, 24/07 BookingVIP Call Girls Palanpur 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Palanpur 7001035870 Whatsapp Number, 24/07 Booking
 
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete RecordCCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
 
Cara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak Hamil
Cara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak HamilCara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak Hamil
Cara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak Hamil
 
Top Rated Call Girls In chittoor 📱 {7001035870} VIP Escorts chittoor
Top Rated Call Girls In chittoor 📱 {7001035870} VIP Escorts chittoorTop Rated Call Girls In chittoor 📱 {7001035870} VIP Escorts chittoor
Top Rated Call Girls In chittoor 📱 {7001035870} VIP Escorts chittoor
 
NFPA 5000 2024 standard .
NFPA 5000 2024 standard                                  .NFPA 5000 2024 standard                                  .
NFPA 5000 2024 standard .
 
VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...
VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...
VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...
 
Call Girls in Ramesh Nagar Delhi 💯 Call Us 🔝9953056974 🔝 Escort Service
Call Girls in Ramesh Nagar Delhi 💯 Call Us 🔝9953056974 🔝 Escort ServiceCall Girls in Ramesh Nagar Delhi 💯 Call Us 🔝9953056974 🔝 Escort Service
Call Girls in Ramesh Nagar Delhi 💯 Call Us 🔝9953056974 🔝 Escort Service
 
(INDIRA) Call Girl Bhosari Call Now 8617697112 Bhosari Escorts 24x7
(INDIRA) Call Girl Bhosari Call Now 8617697112 Bhosari Escorts 24x7(INDIRA) Call Girl Bhosari Call Now 8617697112 Bhosari Escorts 24x7
(INDIRA) Call Girl Bhosari Call Now 8617697112 Bhosari Escorts 24x7
 
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
 
University management System project report..pdf
University management System project report..pdfUniversity management System project report..pdf
University management System project report..pdf
 
Water Industry Process Automation & Control Monthly - April 2024
Water Industry Process Automation & Control Monthly - April 2024Water Industry Process Automation & Control Monthly - April 2024
Water Industry Process Automation & Control Monthly - April 2024
 
data_management_and _data_science_cheat_sheet.pdf
data_management_and _data_science_cheat_sheet.pdfdata_management_and _data_science_cheat_sheet.pdf
data_management_and _data_science_cheat_sheet.pdf
 
AKTU Computer Networks notes --- Unit 3.pdf
AKTU Computer Networks notes ---  Unit 3.pdfAKTU Computer Networks notes ---  Unit 3.pdf
AKTU Computer Networks notes --- Unit 3.pdf
 
Unit 1 - Soil Classification and Compaction.pdf
Unit 1 - Soil Classification and Compaction.pdfUnit 1 - Soil Classification and Compaction.pdf
Unit 1 - Soil Classification and Compaction.pdf
 

IRJET- Financial Analysis using Data Mining

  • 1. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 06 Issue: 11 | Nov 2019 www.irjet.net p-ISSN: 2395-0072 © 2019, IRJET | Impact Factor value: 7.34 | ISO 9001:2008 Certified Journal | Page 1859 FINANCIAL ANALYSIS USING DATA MINING Akshaya Rajagopalan1, Vidhya. S2, P.R Bhavya3 1Akshaya Rajagopalan, Dept. of Computer Application, M.O.P Vaishnav College for women, TamilNadu, India 2Vidhya.S, Dept. of Computer Application, M.O.P Vaishnav College for women, TamilNadu, India 3P.R Bhavya, Dept. of Computer Application, M.O.P Vaishnav College for women, TamilNadu, India ---------------------------------------------------------------------***---------------------------------------------------------------------- Abstract - Data mining refers to extracting or mining knowledge from large amounts of data. Data mining isused inpredictionof various diseases, stock market trends and weather conditions. The paper uses data mining algorithms to predict whether the NASDAQ share market will increase every month. The prediction is done using KNN classification, Rule based classification and Deep Learning technique. The three algorithms are compared with each other for determining the one with higher accuracy. Key Words: (Size 10 & Bold) Key word1, Key word2, Key word3, etc (Minimum 5 to 8 key words)… 1. INTRODUCTION Data mining refers to extracting information from huge sets of data. In general, we can say that data mining is the process of mining knowledge from data. The information or knowledge extracted can be used for variousapplicationssuchas market analysis, fraud detection, customer retention and many more. Stock market trends are very unstable. This leads to high risk in stock returns. The stock returns for an investor is important as they invest their money on stocks to gain profit. In order to gain high stock returns,knowledgeonstock trends is necessary. There are market experts who predict when an investor can buy, sell or hold shares. The prediction however is difficult due to complex and unstable nature of stock market. On the other hand, there ishugeamountofdata generatedbythe stock market which can be used for prediction. There are various ways to predict stock market trends. There are various algorithms for classification and clustering data. The paper uses the data set of NASDAQ share market of 10 years for every month from Jan-2009 to Jan-2019 from it official website. The dataset was given label as increasing which has either yes or no as its value based on previous month and current month price of stock. 1.1 Literature Review [1]Examining the stability and persistence of the long-short performance of fundamental signals over time. This analysis is important because previous studies (e.g., Sullivan, Timmermann, and White 2001) argue that the analysis of sub period stability is a remedy against data mining. [2]Tokenization: Each news article or financial report document is split into meaningful words called tokens. They are: Data standardization, Stop-word-removal, Stemming, Abbreviation processing, Filter tokens. [3]To develop the new models, two methods were used: DA and decision trees. Using these methods, we identified the most significant variables from the basic sets in terms of prediction bankruptcy/non bankruptcy of companies within the Slovak business environment. [4]There are often inconsistencies in how entity names and addresses are entered, in addition to outright errors and typos. There is implicit semantic knowledge included in a name, e.g., a name may contain National association or State Bank of in its name. This complicates matching based on a similarity score that is obtained using some edit distance metric. [5]Mentioning that the final purpose behind text classification processes is to facilitate text analyzing in order to recognize hidden textual patterns with reduced manual efforts. To do this, we focus on utilization of classified financial footnotes in the last phase of the research, which consists of representation of classified output in each category. [6]If for denser structure trees all effective features in first prediction are selected by the proposed hybrid model, results in better accuracy as ‘‘BF Tree’’, ‘‘LAD Tree’’, and ‘‘FT Tree’’. Otherwise, it is possible that accuracy drops, like ‘‘CART and Rep’’ TREEs. The selected features have different effect on the accuracy of forecasting. Some trees with large structure, such as J48 Graph and J48 Tree.
  • 2. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 06 Issue: 11 | Nov 2019 www.irjet.net p-ISSN: 2395-0072 © 2019, IRJET | Impact Factor value: 7.34 | ISO 9001:2008 Certified Journal | Page 1860 [7]The LDA results from the previous section show that Comcast has serious customer service issues. We want to further explore these issues with sentiment analysis, which is a supervised learning method and can also be used to visualize and summarize big data. Sentiment analysis can be carried out in various models, such as maximum entropy. [8]As cryptocurrencies continue to develop they have merited the attention of policy-makers and regulators for a host of differing reasons. However, there have been three specific situations that have been quite unique to cryptocurrency markets. [9] Always aiming to increase credit volume while reducing default risks. Therefore, credit scoring analyses are crucial to aid faster decision making, reduce the costs of loan analysis, monitor existing accounts more closely, predict default risks, and ensure that institutions can detect possible risks while developing their competitiveness 2. ANALYSIS The analysis is done using Data mining tool named RapidMiner Studio. Itisabouttheusageofthreemainalgorithms:KNearest Neighbour, Rule Based Classification and Deep Learning for predictingwhethertheNASDAQmarketwill increase everymonth or not and with higher accuracy. The label for prediction is increasing with values yes or no. This data was then imported to RapidMiner and many algorithms were applied. In the end only three algorithms had significant accuracy. These three are compared and the one with higher accuracy is best suited for this prediction. The working, results and performance of each algorithm is given below. K NEAREST NEIGHBOURS K Nearest Neighbours is a simple algorithm that stores all availablecasesandclassifiesnewcasesbasedona similarity measure (e.g. distance functions). It belongs to the supervised learning domain and finds intense application in pattern recognition, data mining and intrusion detection. In KNN-classification,theoutputisa classmembership.Anobjectisclassified by a plurality vote of its neighbours, with the object being assigned to the class most common among its k- nearestneighbours (k is a positive integer, typically small). If k = 1, then the object is simply assigned to the class of that single nearest neighbour.
  • 3. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 06 Issue: 11 | Nov 2019 www.irjet.net p-ISSN: 2395-0072 © 2019, IRJET | Impact Factor value: 7.34 | ISO 9001:2008 Certified Journal | Page 1861 The table above shows the how accurate the result is. The first parameteriswith respecttopredictionnowhichistrue no: 86 and true yes: 31. The second parameter is prediction yeswhichistrueno:4.Therefore,thetotal accuracyis71.28% with error of +/- 13.89%. RULE BASED CLASSIFICATION The term rule-based classification can be used to refer to any classification scheme that makes use of IF-THEN rules for class prediction. The IF part of the rule is called rule antecedent or precondition. The THEN part of the rule is called rule consequent. The antecedent part the condition consists of one or more attribute tests andthesetestsarelogicallyANDED. The consequent part consists of class prediction. The table above shows the how accurate the result is. The first parameteriswith respecttopredictionnowhichistrue no: 90and true yes: 31. The second parameter is prediction yes which is true no: 0 and trueyes:0.Therefore,thetotal accuracy is 74.49% with error of +/- 11.64%. DEEP LEARNING Deep Learning is a set of Machine Learning algorithmswhichhaveoneormorehiddenlayersinNeural Networks.Now Deep Learning systems are used for almost any tasks other Machine Learning algorithms are used for. These can be Classification, Dimensionality Reduction, Object Recognition, Clustering etc. Model Metrics Type: Binomial Description: Metrics reported on full training frame model id: rm-h2o-model-deep_learning-847701 frame id: rm-h2o-frame-deep_learning-797658 MSE: 0.01230316 R^2: 0.9354371 AUC: 1.0 logloss: 0.089967564 CM: Confusion Matrix (vertical: actual; across: predicted): no yes Error Rate no 90 0 0.0000 = 0 / 90
  • 4. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 06 Issue: 11 | Nov 2019 www.irjet.net p-ISSN: 2395-0072 © 2019, IRJET | Impact Factor value: 7.34 | ISO 9001:2008 Certified Journal | Page 1862 yes 0 31 0.0000 = 0 / 31 Totals 90 31 0.0000 = 0 / 121 Gains/Lift Table (Avg response rate: 25.62 %): Group Cumulative Data Fraction Lower Threshold Lift Cumulative Lift ResponseRate CumulativeResponseRate Capture Rate Cumulative Capture Rate Gain Cumulative Gain 1 0.01652893 0.949065 3.903226 3.903226 1.000000 1.000000 0.064516 0.064516 290.322581 290.322581 2 0.02479339 0.938766 3.903226 3.903226 1.000000 1.000000 0.032258 0.096774 290.322581 290.322581 3 0.03305785 0.936683 3.903226 3.903226 1.000000 1.000000 0.032258 0.129032 290.322581 290.322581 4 0.04132231 0.934789 3.903226 3.903226 1.000000 1.000000 0.032258 0.161290 290.322581 290.322581 5 0.05785124 0.903776 3.903226 3.903226 1.000000 1.000000 0.064516 0.225806 290.322581 290.322581 6 0.10743802 0.876076 3.903226 3.903226 1.000000 1.000000 0.193548 0.419355 290.322581 290.322581 7 0.15702479 0.839690 3.903226 3.903226 1.000000 1.000000 0.193548 0.612903 290.322581 290.322581 8 0.20661157 0.801775 3.903226 3.903226 1.000000 1.000000 0.193548 0.806452 290.322581 290.322581 9 0.30578512 0.151490 1.951613 3.270270 0.500000 0.837838 0.193548 1.000000 95.161290 227.027027 10 0.40495868 0.087053 0.000000 2.469388 0.000000 0.632653 0.000000 1.000000 -100.000000 146.938776 11 0.50413223 0.052242 0.000000 1.983607 0.000000 0.508197 0.000000 1.000000 -100.000000 98.360656 12 0.60330579 0.044419 0.000000 1.657534 0.000000 0.424658 0.000000 1.000000 -100.000000 65.753425 13 0.70247934 0.035038 0.000000 1.423529 0.000000 0.364706 0.000000 1.000000 -100.000000 42.352941 14 0.80165289 0.029978 0.000000 1.247423 0.000000 0.319588 0.000000 1.000000 -100.000000 24.742268 15 0.90082645 0.021018 0.000000 1.110092 0.000000 0.284404 0.000000 1.000000 -100.000000 11.009174 16 1.00000000 0.005519 0.000000 1.000000 0.000000 0.256198 0.000000 1.000000 -100.000000 0.000000 Status of Neuron Layers (predicting increasing, 2-class classification, bernoulli distribution, CrossEntropy loss, 44,752 weights/biases, 529.2 KB, 1,210 training samples, mini-batch size 1): Layer Units Type Dropout L1 L2 Mean Rate Rate RMS Momentum Mean Weight Weight RMS Mean Bias Bias RMS 1 841 Input 0.00 % 2 50 Rectifier 0.00 % 0.000010 0.000000 0.026497 0.097324 0.000000 0.000063 0.048879 0.485324 0.071063 3 50 Rectifier 0.00 % 0.000010 0.000000 0.013653 0.026169 0.000000 -0.000503 0.143624 0.997198 0.013364 4 2 Softmax 0.000010 0.000000 0.002090 0.001478 0.000000 -0.079601 0.434169 -0.000000 0.007881 Scoring History: Timestamp Duration Training Speed Epochs Iterations Samples Training MSE Training R^2 Training LogLoss Training AUC Training Lift Training Classification Error 2019-09-22 15:33:36 0.000 sec 0.00000 0 0.000000 NaN NaN NaN NaN NaN NaN 2019-09-22 15:33:36 0.453 sec 703 rows/sec 1.00000 1 121.000000 0.18056 0.05247 0.53647 0.76416 1.95161 0.19835 2019-09-22 15:33:37 1.078 sec 1549 rows/sec 10.00000 10 1210.000000 0.01230 0.93544 0.08997 1.00000 3.90323 0.00000 H2O version: 3.8.2.6-rm9.0.0 The above table shows the how accurate the result is. The first parameteriswith respecttopredictionnowhichistrue no: 90 and true yes: 29. The second parameter is prediction yes which is true yes: 2.Therefore, the total accuracy is 76.03% with error of +/- 10.72%.
  • 5. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 06 Issue: 11 | Nov 2019 www.irjet.net p-ISSN: 2395-0072 © 2019, IRJET | Impact Factor value: 7.34 | ISO 9001:2008 Certified Journal | Page 1863 3. CONCLUSION In this paper, dataset of NASDAQ share market has been analysed to predict whetherthepricewill increaseor notinupcoming month. The three algorithms- KNN, Rule model and Deep Learning are applied and their performances are compared. On comparing these three , Deep Learning has highest accuracy of 76.03% +/-10.72%. Rule based classifier comes next with 74.49% +/-11.64%. The least among the three is KNN with 71.28 +/-13.89%. Therefore wecanconcludethatDeep Learningis best suited for predicting whether the NASDAQ share market will increase each month or not. REFERENCES [1.]Fundamental Analysis and the Cross-Section of Stock: A Data-Mining Approach,Lingling Zheng,2019 [2].Predicting Stock Market Behavior using Data Mining Technique, Ayman Elsayed Khedr ,2017 [3]. Analysis of Impact of Using the Trend Variables on Bankruptcy Prediction Models Performance, Beáta GAVUROVÁ,2017 [4]. Research Challenges in Financial Data Modeling and Analysis,Lewis Alexander ,Sanjiv R. Das ,2017 [5].Financial Footnote Analysis: Developing a Text Mining Approach,Maryam Heidari, Carsten Felden ,2018 [6].Developing an approach to evaluate stocks by forecasting effective features with data mining ,Sasan Barak,2018. [7.] Analyzing the impact of user-generated content on B2B Firms' stockperformance: Big dataanalysiswithmachinelearning methods,Xia Liu,2019 [8.]Cryptocurrencies as a Financial Asset: A systematic analysis,Shaen Corbet,2019 [9]. -Comparison of Data Mining Classification Algorithms Determining the Default Risk, Begüm Çığşar and Deniz Ünal,2017 BIOGRAPHIES Akshaya Rajagopalan a student of M.O.P Vaishnav College currently in her final semesterpursuingBCA. S.Vidhya a student of M.O.P Vaishnav College currently in her final semester pursuing BCA. Bhavya.P.R a student of M.O.P Vaishnav College currently in her final semester pursuing BCA.