Big Data & Data Science | Bogotá | Colombia | October 27, 2016
Whose Balance Sheet is this?
Neural Networks for Banks’ Pattern Recognition
Carlos León
Banco de la República (Colombia)
& Tilburg University
cleonrin@banrep.gov.co
Jose Fernando Moreno
Barcelona Grad. School of Economics
jose.moreno@barcelonagse.eu
Jorge Cely
Banco de la República (Colombia)
jcelyfe@banrep.gov.co
Cats No cats
A bank’s balance sheet as a 27x1-pixel image
Assets (9)
Liabilities (9)
Equity (9)
[Figure: 27x1-pixel balance sheet images, repeated across slides; horizontal axis ticks 0.5 to 1.5, vertical axis ticks 5 to 25]
Disclaimer
The opinions and statements in this article are the sole responsibility of the authors
and represent neither those of Banco de la República nor those of its Board of
Directors. Comments and suggestions from Hernando Vargas, Clara Machado,
Freddy Cepeda, Fabio Ortega, and other members of the technical staff of Banco
de la República are appreciated. Any remaining errors are the authors’ own.
http://www.banrep.gov.co/sites/default/files/publicaciones/archivos/be_959.pdf
Contents
1. Introduction
2. Related literature
3. Artificial neural networks and pattern recognition
3.1. Artificial neural network models
3.2. Training the artificial neural network
3.3. Post-training analysis
4. Data and methodology
5. Main results
6. Final remarks
Introduction
• Balance sheets’ overall informational content …
– … information about the past performance of a firm, and a starting point for
forecasts of future performance (Chisholm, 2002)
– … assess the overall composition of resources, the constriction of external
obligations, and the firm’s flexibility and ability to change to meet new
requirements (Kaliski, 2001)
• In the banking industry …
– … among the minimum periodic reports that banks should provide to
supervisors to conduct effective supervision and to evaluate the condition of
the local banking market (BCBS, 1997 & 1998)
– … traditional supervisory examination has focused on the assessment of
banks’ balance sheets (see Mishkin, 2004)
– … and they have been related to bank lending, investment spending,
economic activity, and the advent of financial crises (see Mishkin, 2004)
Introduction
• Therefore, the balance sheet may be considered …
– A unique and characteristic combination of financial accounts (i.e. the
elements of financial statements) that not only allows for assessing a bank’s
financial stance, but also differentiates it from its peers
– A snapshot of a bank
• Question: can we train a model that treats balance sheets as
snapshots and recognizes their owners with fair accuracy?
• Why? Because it is the first step towards training a model to
– Detect important changes in banks’ financial accounts
– Classify banks (fragility, riskiness, … )
– Build state-of-the-art early-warning systems (e.g. Fioramanti (2008), Sarlin (2014),
and Holopainen & Sarlin (2016))
Introduction
• How? Artificial Neural Networks (ANN)
– Effective classifiers, better than classical statistical methods (Wu (1997),
Zhang et al. (1999), McNelis (2005), and Han & Kamber (2006))
– No assumptions about the statistical properties of the data (Zhang et al.
(1999), McNelis (2005), Demyanyk & Hasan (2009), and Nazari & Alidadi
(2013))
– Able to deal with non-linear relationships between factors in the data
(Bishop (1995), Han & Kamber (2006), Demyanyk & Hasan (2009), Eletter
et al. (2010), and Hagan et al. (2014))
• But… ANN have been criticized because their results are opaque and
lack interpretability, the “black box” criticism (Han & Kamber (2006),
Angelini et al. (2008), and Witten et al. (2011)) … do we care?
Introduction
• The black box criticism comes from a desire to tie down empirical estimation
with an underlying economic theory (McNelis, 2005)
• We do not care about the black box criticism because we have no
underlying economic theory to test
• This is predictive modeling, not explanatory modeling (see Shmueli, 2010)
Explanatory Modeling (Econometrics)
• The aim is to test a causal theory (traditional
econometrics)
• Requires building an underlying causal
structure (a theoretical prior)
• Need to work on the expected role of the variables
Predictive Modeling (Machine Learning)
• The aim is to predict or classify successfully
• No need to build an underlying causal
structure (a theoretical prior)
• No need to delve into the expected role of
the variables
Varian, H. (2014):
• […] econometrics is concerned with detecting and summarizing relationships in data,
with regression analysis as its prevalent tool.
• […] machine learning methods, such as artificial neural networks, are concerned with
developing high-performance computer systems that can provide useful predictions,
namely out-of-sample predictions.
Related literature
• Pattern recognition
• ANN on financial data
Related literature
• Pattern recognition (classification)
– Aims at classifying inputs into a set of target categories (Hagan et al., 2014)
– Mainly a supervised machine learning problem: for training, each example
pertains to a known category
– Wide spectrum: facial recognition, image classification, voice recognition, text
translation, fraud detection, classification of handwritten characters, and
medical diagnosis
– Contemporary success due to:
• Big data is now available for successful training
• Great computational power is now available for ANN
• Deep learning for particularly complex ANN (Schmidhuber (2015))
Related literature
• ANN on financial data (financial ratios)
– Bankruptcy/failure prediction based on classification of firms
• Non-financial (Rudorfer (1995), Zhang et al. (1999), Atiya (2001), Brédart (2014))
• Financial (Tam & Kiang (1990), Tam (1991), Olmeda & Fernández (1997))
– Loan decisions in retail and corporate banking (Angelini et al. (2008),
Eletter et al. (2010), Nazari & Alidadi (2013), Bekhet & Eletter (2014))
– Local/foreign bank classification (Turkan et al. (2011))
– Islamic/conventional bank classification (Khediri et al. (2015))
– Auditing/no auditing firms for tax evasion (Wu (1997))
– State-of-the-art early warning systems
• Sovereign debt crises prediction (Fioramanti (2008))
• Country-specific financial crises (Sarlin (2014), Holopainen & Sarlin (2016))
Related literature
• ANNs play an increasingly important role in financial applications for
such tasks as pattern recognition, classification, and time series
forecasting (Nazari & Alidadi (2013) and Eletter & Yaseen
(2010))
• In our case…
– Instead of selecting the “appropriate” set of financial ratios…
– We work on raw balance sheets (the input for financial ratios)
– Beware: when working on selected financial ratios we discard potentially
useful information due to our cognitive bias (or plain ignorance).
– To the best of our knowledge, this is the first time raw balance sheets are
encoded as inputs for a pattern recognition problem
ANNs and pattern recognition
• ANNs are networks of interconnected artificial neurons, with the
weights of those connections resulting from a learning process that
attempts to minimize the prediction/classification error of the input-
output function
• The central idea of ANNs is to extract linear combinations of the
inputs as derived features, and then model the output (i.e. the target)
as a nonlinear function of these features. (Hastie et al., 2013)
• The simplest case is the feed-forward ANN (our choice for what
follows).
• Other types of ANN are more complex, but may open new ways to
solve more complex problems (e.g. recurrent ANNs, convolutional
ANNs, reinforcement ANNs). We do not describe them.
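The "linear combinations, then a nonlinear function" idea can be illustrated with a minimal feed-forward pass. This is only a sketch under stated assumptions: it uses NumPy, a log-sigmoid hidden layer and a softmax output layer (the two activation functions the slides name), and random weights. The layer sizes (25 inputs, 15 hidden neurons, 21 classes) mirror the figures quoted elsewhere in the deck, while the function names and the four random examples are hypothetical.

```python
import numpy as np

def logsig(z):
    """Log-sigmoid activation: squashes each value into (0, 1)."""
    return 1.0 / (1.0 + np.exp(-z))

def softmax(z):
    """Column-wise softmax; subtracting the column max keeps exp() stable."""
    e = np.exp(z - z.max(axis=0, keepdims=True))
    return e / e.sum(axis=0, keepdims=True)

def forward(P, W1, b1, W2, b2):
    """Forward pass for inputs P of shape (features, examples)."""
    A1 = logsig(W1 @ P + b1)       # derived features: nonlinearity of linear combinations
    return softmax(W2 @ A1 + b2)   # class scores that sum to 1 for each example

# Hypothetical sizes and random (untrained) parameters, for illustration only
rng = np.random.default_rng(0)
n_features, n_hidden, n_classes, n_examples = 25, 15, 21, 4
P = rng.normal(size=(n_features, n_examples))
W1 = rng.normal(scale=0.1, size=(n_hidden, n_features))
b1 = np.zeros((n_hidden, 1))
W2 = rng.normal(scale=0.1, size=(n_classes, n_hidden))
b2 = np.zeros((n_classes, 1))

Y = forward(P, W1, b1, W2, b2)
print(Y.shape)  # one column of class scores per example
```

With untrained weights the output columns are close to uniform; training is what moves the weights so that the largest score lands on the correct class.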
ANN models
Activation function:
• Prediction
• Classification
Log-sigmoid function
ANN models
Activation function:
• Classification
Log-sigmoid function
Softmax function*
(*) According to G. Hinton, it is convenient as 1) it may be interpreted as a probability, and 2) it provides additional knowledge to the training process.
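The formulas for the two activation functions did not survive in the slide text. Their standard textbook definitions are given here for reference, on the assumption that the slides used the usual forms; the symbols n (net input) and K (number of classes) are notation chosen for this note.

```latex
\[
\operatorname{logsig}(n) = \frac{1}{1 + e^{-n}},
\qquad
\operatorname{softmax}(n)_i = \frac{e^{n_i}}{\sum_{j=1}^{K} e^{n_j}},
\quad i = 1, \dots, K
\]
```

Each softmax output lies in (0, 1) and the K outputs sum to 1, which is what allows the probability interpretation mentioned in the footnote.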
ANN models
Training the ANN
• Training: Adjusting parameters in W and b in order to attain an
input-output relationship target under the chosen transfer
functions for a set of observations (i.e. examples)
• Backpropagation:
– Backpropagation learns by iteratively processing a dataset of training
examples (i.e. observations), comparing the network’s prediction (i.e.
output) for each example with the actual target value
– Parameters in W and b are modified in the backward direction, from the
output layer, through each hidden layer, down to the first hidden layer;
hence its name (Han & Kamber, 2006)
Training the ANN
• Backpropagation (cont.):
– Backpropagation usually employs some type of gradient descent method
to minimize the error between the prediction and the actual target value
– Error measures: sum (or mean) of squared errors, for prediction or classification; cross-entropy, for classification
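A minimal sketch of backpropagation with plain batch gradient descent may help make the backward direction concrete. It assumes NumPy, a single log-sigmoid hidden layer, a softmax output layer, and cross-entropy as the error measure. The toy data, layer sizes, learning rate, and function names are hypothetical, and the optimizer the authors actually used is not stated in the slides.

```python
import numpy as np

def logsig(z):
    return 1.0 / (1.0 + np.exp(-z))

def softmax(z):
    e = np.exp(z - z.max(axis=0, keepdims=True))
    return e / e.sum(axis=0, keepdims=True)

def cross_entropy(Y, T):
    """Mean cross-entropy between predictions Y and one-hot targets T."""
    return -np.mean(np.sum(T * np.log(Y + 1e-12), axis=0))

def train_step(P, T, params, lr=0.2):
    """One batch gradient-descent step; returns the loss before the update."""
    W1, b1, W2, b2 = params
    n = P.shape[1]
    # Forward pass
    A1 = logsig(W1 @ P + b1)
    Y = softmax(W2 @ A1 + b2)
    loss = cross_entropy(Y, T)
    # Backward pass: output layer first, then the hidden layer
    dZ2 = (Y - T) / n                       # gradient for softmax + cross-entropy
    dW2, db2 = dZ2 @ A1.T, dZ2.sum(axis=1, keepdims=True)
    dZ1 = (W2.T @ dZ2) * A1 * (1.0 - A1)    # chain rule through the log-sigmoid
    dW1, db1 = dZ1 @ P.T, dZ1.sum(axis=1, keepdims=True)
    # Gradient-descent update, applied in place
    for param, grad in zip(params, (dW1, db1, dW2, db2)):
        param -= lr * grad
    return loss

# Hypothetical toy problem: 3 classes, 4 features, 10 examples per class
rng = np.random.default_rng(0)
labels = np.repeat(np.arange(3), 10)
T = np.eye(3)[:, labels]                                   # one-hot targets, shape (3, 30)
centers = rng.normal(size=(4, 3))
P = centers[:, labels] + 0.1 * rng.normal(size=(4, 30))    # inputs, shape (4, 30)

params = [rng.normal(scale=0.5, size=(5, 4)), np.zeros((5, 1)),
          rng.normal(scale=0.5, size=(3, 5)), np.zeros((3, 1))]
losses = [train_step(P, T, params) for _ in range(300)]
print(losses[0], losses[-1])
```

The gradients are computed from the output layer back to the hidden layer before any parameter is changed, which is the ordering the slide describes.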
Training the ANN
• Unlike typical applications of regression models in econometrics, the
goal of training an artificial neural network is not limited to
minimizing in-sample errors.
• The overfitting problem may be described as the model’s ability to
succeed at fitting in-sample but to fail at fitting out-of-sample (see
Shmueli (2010), Varian (2014))
• The goal is not to memorize the training data, but to model the
underlying generator of the data (Bishop, 1995)
• Early stopping:
– Halt the minimization process before the complexity of the solution inhibits
its generalization capability
– If training is stopped before the in-sample minimum is reached, then the
network will effectively be using fewer parameters and will be less likely to
overfit (Hagan et al., 2014)
Training the ANN
• Early stopping with cross-validation (Hagan et al., 2014):
The (large) dataset is split into:
– Training dataset (70%): used to minimize the error between the
prediction and the actual target value.
– Validation dataset (15%): used simultaneously (as the neural
network is trained) to check how the estimated parameters fit
out-of-sample data. When the validation error starts to increase (i.e.
overfitting starts), training stops.
– Test dataset (15%): the error obtained on it is used to check the future
performance of the artificial neural network on out-of-sample
data, i.e. its generalization capability.
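The split and the stopping rule can be sketched as follows. This is an illustration under assumptions, not the authors' implementation: the random permutation, the rounding rule, and the `patience` parameter (how many consecutive non-improving epochs are tolerated before stopping) are all hypothetical choices, and the validation curve is made-up data. The example count of 3,237 is the one reported in the deck.

```python
import numpy as np

def split_indices(n_examples, val_frac=0.15, test_frac=0.15, seed=0):
    """Randomly partition example indices into training/validation/test sets."""
    rng = np.random.default_rng(seed)
    idx = rng.permutation(n_examples)
    n_val = round(val_frac * n_examples)
    n_test = round(test_frac * n_examples)
    n_train = n_examples - n_val - n_test
    return idx[:n_train], idx[n_train:n_train + n_val], idx[n_train + n_val:]

def early_stopping_epoch(val_errors, patience=6):
    """Return the epoch with the lowest validation error, scanning until the
    error has failed to improve for `patience` consecutive epochs."""
    best_epoch, best_err, waited = 0, float("inf"), 0
    for epoch, err in enumerate(val_errors):
        if err < best_err:
            best_epoch, best_err, waited = epoch, err, 0
        else:
            waited += 1
            if waited >= patience:
                break
    return best_epoch

train_idx, val_idx, test_idx = split_indices(3237)
print(len(train_idx), len(val_idx), len(test_idx))  # 2265 486 486

# Made-up validation errors: improving, then rising as overfitting sets in
val_curve = [0.9, 0.5, 0.3, 0.25, 0.27, 0.30, 0.34]
print(early_stopping_epoch(val_curve, patience=3))  # 3
```

In practice the parameters saved at the returned epoch are the ones kept, and the test set is touched only once, after training has stopped.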
Post-training analysis
• To assess the quality of the in-sample and out-of-sample fit:
– For prediction: R², scatter plots
– For classification:
• Confusion matrix: square table that relates the actual
target class (on the x-axis) to the predicted class (on the y-axis)
• Receiver operating characteristic (ROC) curve: shows the
trade-off between the true positive rate (on the y-axis) and the
false positive rate (on the x-axis) for a given model (Han &
Kamber, 2006)
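A small sketch of these classification diagnostics, assuming NumPy and the axis convention stated above (predicted class on rows, actual class on columns). The labels are made-up and the function names are hypothetical. The one-vs-rest rates computed here give a single point per class; a full ROC curve would trace such points over a range of decision thresholds.

```python
import numpy as np

def confusion_matrix(actual, predicted, n_classes):
    """Counts with predicted class on rows (y-axis) and actual class on columns (x-axis)."""
    cm = np.zeros((n_classes, n_classes), dtype=int)
    for a, p in zip(actual, predicted):
        cm[p, a] += 1
    return cm

def misclassification_rate(cm):
    """Share of examples that fall off the main diagonal."""
    return 1.0 - np.trace(cm) / cm.sum()

def one_vs_rest_rates(cm, k):
    """True positive rate and false positive rate for class k against all others."""
    tp = cm[k, k]
    fn = cm[:, k].sum() - tp      # actual k, predicted as another class
    fp = cm[k, :].sum() - tp      # predicted k, actually another class
    tn = cm.sum() - tp - fn - fp
    return tp / (tp + fn), fp / (fp + tn)

# Made-up labels for eight examples and three classes
actual    = [0, 0, 1, 1, 2, 2, 2, 1]
predicted = [0, 1, 1, 1, 2, 2, 0, 1]
cm = confusion_matrix(actual, predicted, n_classes=3)
print(cm)
print(misclassification_rate(cm))   # 0.25
print(one_vs_rest_rates(cm, 0))
```

A perfect classifier puts every count on the diagonal, so its misclassification rate is zero and every class has a true positive rate of 1 with a false positive rate of 0.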
Data and methodology
• Balance sheets
– 25 financial accounts (i.e. features)
– Monthly, from January 2000 to December 2014*
– 21 banks available (out of 41 that report)
• The ANN
– We implement a standard two-layer network, with one
hidden layer and one output layer; often a single hidden
layer is all that is necessary (see Zhang et al., (1999), Witten
et al. (2011))
– A base case scenario with a 15-neuron hidden layer
– Other scenarios for robustness (5, 10, 20, and 25 neurons)
(*) From January 2015, balance sheets are reported under International Financial Reporting Standards (IFRS-NIIF)
instead of COLGAAP. The two standards are not consistent.
3,237 examples
Figure 12. Evolution of Colombian banks (2000-2014). Only banks active as of
December 2014 are presented. The name and type of credit institution (e.g. bank,
financial corporation, financial cooperative) of some institutions may have changed
during the sample period; the most recent name and type (i.e. bank) is preserved.
Some names were shortened.
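Input matrix P: rows are features (25 accounts), columns are examples (3237 balance sheets)
\[
P =
\begin{bmatrix}
p_{1,1} & p_{1,2} & \cdots & p_{1,3237} \\
p_{2,1} & \ddots & & \vdots \\
\vdots & & & \vdots \\
p_{25,1} & p_{25,2} & \cdots & p_{25,3237}
\end{bmatrix}
\]
Target matrix T: rows are classes (21 banks), columns are examples (3237 balance sheets)
\[
T =
\begin{bmatrix}
t_{1,1} = 1 & t_{1,2} = 0 & \cdots & t_{1,3237} = 0 \\
t_{2,1} = 0 & \ddots & & \vdots \\
\vdots & & & \vdots \\
t_{21,1} = 0 & t_{21,2} = 0 & \cdots & t_{21,3237} = 1
\end{bmatrix}
\]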
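The target matrix T is a one-hot encoding: each column has a single 1 in the row of the bank that owns that balance sheet. A minimal sketch of building such a matrix, assuming NumPy and integer bank labels starting at 0; the function name and the four example labels are hypothetical.

```python
import numpy as np

def one_hot_targets(bank_ids, n_classes):
    """Build T with one row per class and one column per example."""
    bank_ids = np.asarray(bank_ids)
    T = np.zeros((n_classes, bank_ids.size), dtype=int)
    T[bank_ids, np.arange(bank_ids.size)] = 1
    return T

bank_ids = [0, 2, 1, 2]   # hypothetical owners of four balance sheets
T = one_hot_targets(bank_ids, n_classes=3)
print(T)
```

For the dataset described in the slides the same call would use `n_classes=21` and 3,237 labels, giving a 21 x 3237 matrix to pair with the 25 x 3237 input matrix.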
Main results
• After training with early-stopping (1 hidden layer, 15 neurons)
Set                  | Samples (balance sheets) | Performance (cross-entropy) | Misclassification (%)
Training (in-sample) | 2,265                    | 0.0012                      | 0.35%
Validation           | 486                      | 0.0044                      | 1.65%
Test                 | 486                      | 0.0019                      | 1.03%
Table 1. Overall results of the artificial neural network after training with cross-validation early-stopping.
Main results
• After training with early-stopping (1 hidden layer, 15 neurons)
• In-sample
• Out-of-sample #1
• Out-of-sample #2
Figure 7. Training dataset confusion matrix (in-sample). Annotations on the figure: M&A, Sample, ???
Figure 10. Test dataset confusion matrix (out-of-sample #2). Annotations on the figure: M&A, Sample, ???
Main results
None of the classes (i.e. banks) displays an ROC curve close to the
diagonal. All classes show a high ratio of true positives to false positives.
[Figure: ROC curves; panels: In-sample, Out-of-sample #2]
Misclassification (average and standard deviation, %)
Set        | 5 neurons        | 10 neurons     | 15 neurons     | 20 neurons     | 25 neurons
Training   | 19.75% [15.37%]  | 3.41% [9.84%]  | 0.61% [0.43%]  | 0.15% [0.29%]  | 0.10% [0.23%]
Validation | 20.99% [15.23%]  | 4.86% [9.87%]  | 1.64% [0.81%]  | 1.00% [0.70%]  | 0.91% [0.72%]
Test       | 21.53% [15.44%]  | 5.19% [9.86%]  | 1.72% [0.80%]  | 1.23% [0.66%]  | 0.94% [0.63%]
Table 2. Overall average results of the artificial neural network after training with cross-validation
early-stopping. The average and standard deviation (in brackets) are estimated on 100 independent training
processes.
Main results
In-sample
Final remarks
• We attained a successful implementation of ANN for pattern
classification of banks’ balance sheets
– Balance sheets are unique and representative snapshots of banks’ financial
position
– ANN is a suitable method for classifying balance sheets
• To the best of our knowledge, this is the first attempt to use balance
sheet data as a comprehensive portrait of financial position of a firm
• Using raw balance sheets instead of arbitrarily chosen financial ratios
may alleviate selection bias problems (i.e. discarding potentially
useful information due to ignorance or reliance on prior research)
• There is a particularly straightforward application…
Final remarks
• Early-warning systems, as in Fioramanti (2008), Sarlin
(2014), and Holopainen & Sarlin (2016), but…
• With raw data (or mixtures of raw data + indicators)
Thermal Engineering-R & A / C - unit - VDineshKumar4165
 
Call Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance BookingCall Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance Bookingroncy bisnoi
 
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...Christo Ananth
 
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...Christo Ananth
 
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...Dr.Costas Sachpazis
 
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...roncy bisnoi
 

Dernier (20)

The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...
The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...
The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...
 
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdfONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf
 
data_management_and _data_science_cheat_sheet.pdf
data_management_and _data_science_cheat_sheet.pdfdata_management_and _data_science_cheat_sheet.pdf
data_management_and _data_science_cheat_sheet.pdf
 
UNIT-II FMM-Flow Through Circular Conduits
UNIT-II FMM-Flow Through Circular ConduitsUNIT-II FMM-Flow Through Circular Conduits
UNIT-II FMM-Flow Through Circular Conduits
 
Thermal Engineering -unit - III & IV.ppt
Thermal Engineering -unit - III & IV.pptThermal Engineering -unit - III & IV.ppt
Thermal Engineering -unit - III & IV.ppt
 
Coefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptxCoefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptx
 
result management system report for college project
result management system report for college projectresult management system report for college project
result management system report for college project
 
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 BookingVIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
 
Unit 1 - Soil Classification and Compaction.pdf
Unit 1 - Soil Classification and Compaction.pdfUnit 1 - Soil Classification and Compaction.pdf
Unit 1 - Soil Classification and Compaction.pdf
 
Online banking management system project.pdf
Online banking management system project.pdfOnline banking management system project.pdf
Online banking management system project.pdf
 
chapter 5.pptx: drainage and irrigation engineering
chapter 5.pptx: drainage and irrigation engineeringchapter 5.pptx: drainage and irrigation engineering
chapter 5.pptx: drainage and irrigation engineering
 
Java Programming :Event Handling(Types of Events)
Java Programming :Event Handling(Types of Events)Java Programming :Event Handling(Types of Events)
Java Programming :Event Handling(Types of Events)
 
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
 
KubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghlyKubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghly
 
Thermal Engineering-R & A / C - unit - V
Thermal Engineering-R & A / C - unit - VThermal Engineering-R & A / C - unit - V
Thermal Engineering-R & A / C - unit - V
 
Call Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance BookingCall Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance Booking
 
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
 
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
 
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
 
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
 

Whose Balance Sheet is this? Neural Networks for Banks’ Pattern Recognition

  • 1. Big Data & Data Science | Bogotá | Colombia | October 27, 2016. Whose Balance Sheet is this? Neural Networks for Banks’ Pattern Recognition. Carlos León, Banco de la República (Colombia) & Tilburg University, cleonrin@banrep.gov.co; Jose Fernando Moreno, Barcelona Grad. School of Economics, jose.moreno@barcelonagse.eu; Jorge Cely, Banco de la República (Colombia), jcelyfe@banrep.gov.co
  • 4. [Figure: a bank’s balance sheet as a 27x1-pixel image: Assets (9), Liabilities (9), Equity (9)]
  • 7. Disclaimer: The opinions and statements in this article are the sole responsibility of the authors and represent neither those of Banco de la República nor those of its Board of Directors. Comments and suggestions from Hernando Vargas, Clara Machado, Freddy Cepeda, Fabio Ortega, and other members of the technical staff of Banco de la República are appreciated. Any remaining errors are the authors’ own. http://www.banrep.gov.co/sites/default/files/publicaciones/archivos/be_959.pdf
  • 8. Contents 1. Introduction 2. Related literature 3. Artificial neural networks and pattern recognition 3.1. Artificial neural network models 3.2. Training the artificial neural network 3.3. Post-training analysis 4. Data and methodology 5. Main results 6. Final remarks
  • 9. Introduction
      • Balance sheets’ overall informational content …
          – … information about the past performance of a firm, and a starting point for forecasts of future performance (Chisholm, 2002)
          – … assess the overall composition of resources, the constriction of external obligations, and the firm’s flexibility and ability to change to meet new requirements (Kaliski, 2001)
      • In the banking industry …
          – … among the minimum periodic reports that banks should provide to supervisors to conduct effective supervision and to evaluate the condition of the local banking market (BCBS, 1997 & 1998)
          – … traditional supervisory examination has focused on the assessment of banks’ balance sheets (see Mishkin, 2004)
          – … and they have been related to bank lending, investment spending, economic activity, and the advent of financial crises (see Mishkin, 2004)
  • 10. Introduction
      • Therefore, the balance sheet may be considered …
          – A unique and characteristic combination of financial accounts (i.e. the elements of financial statements) that not only allows for assessing a bank’s financial stance, but also differentiates it from its peers
          – A snapshot of a bank
      • Question: can we train a model that treats balance sheets as snapshots and recognizes their owners with fair accuracy?
      • Why? Because it is the first step towards training a model to
          – Detect important changes in banks’ financial accounts
          – Classify banks (fragility, riskiness, …)
          – Build state-of-the-art early-warning systems (e.g. Fioramanti (2008), Sarlin (2014), and Holopainen & Sarlin (2016))
  • 11. Introduction
      • How? Artificial Neural Networks (ANN)
          – Effective classifiers, better than classical statistical methods (Wu (1997), Zhang et al. (1999), McNelis (2005), and Han & Kamber (2006))
          – No assumptions about the statistical properties of the data (Zhang et al. (1999), McNelis (2005), Demyanyk & Hasan (2009), and Nazari & Alidadi (2013))
          – Able to deal with non-linear relationships between factors in the data (Bishop (1995), Han & Kamber (2006), Demyanyk & Hasan (2009), Eletter et al. (2010), and Hagan et al. (2014))
      • But… ANN have been criticized because their results are opaque and lack interpretability, the so-called black box criticism (Han & Kamber (2006), Angelini et al. (2008), and Witten et al. (2011)) … do we care?
  • 12. Introduction
      • Black box criticism comes from a desire to tie down empirical estimation with an underlying economic theory (McNelis, 2005)
      • We do not care about the black box criticism because we have no underlying economic theory to test
      • This is predictive modeling, not explanatory modeling (see Shmueli, 2010)
      • Explanatory Modeling (Econometrics)
          – The aim is to test a causal theory (traditional econometrics)
          – Requires building an underlying causal structure (a theoretical prior)
          – Need to work on expected role of variables
      • Predictive Modeling (Machine Learning)
          – The aim is to predict or classify successfully
          – No need to build an underlying causal structure (a theoretical prior)
          – No need to delve into the expected role of the variables
  • 13. Introduction
      • Explanatory modeling (econometrics) versus predictive modeling (machine learning), as in the previous slide
      • Varian, H. (2014):
          – […] econometrics is concerned with detecting and summarizing relationships in data, with regression analysis as its prevalent tool.
          – […] machine learning methods, such as artificial neural networks, are concerned with developing high-performance computer systems that can provide useful predictions, namely out-of-sample predictions.
  • 15. Related literature • Pattern recognition • ANN on financial data
  • 16. Related literature
      • Pattern recognition (classification)
          – Aims at classifying inputs into a set of target categories (Hagan et al., 2014)
          – Mainly a supervised machine learning problem: for training, each example pertains to a known category
          – Wide spectrum: facial recognition, image classification, voice recognition, text translation, fraud detection, classification of handwritten characters, and medical diagnosis
          – Contemporary success due to:
              • Big data is now available for successful training
              • Great computational power is now available for ANN
              • Deep learning for particularly complex ANN (Schmidhuber (2015))
  • 17. Related literature
      • ANN on financial data (financial ratios)
          – Bankruptcy/failure prediction based on classification of firms
              • Non-financial (Rudorfer (1995), Zhang et al. (1999), Atiya (2001), Brédart (2014))
              • Financial (Tam & Kiang (1990), Tam (1991), Olmeda & Fernández (1997))
          – Loan decisions in retail and corporate banking (Angelini et al. (2008), Eletter et al. (2010), Nazari & Alidadi (2013), Bekhet & Eletter (2014))
          – Local/foreign bank classification (Turkan et al. (2011))
          – Islamic/conventional bank classification (Khediri et al. (2015))
          – Auditing/no auditing firms for tax evasion (Wu (1997))
          – State-of-the-art early warning systems
              • Sovereign debt crises prediction (Fioramanti (2008))
              • Country-specific financial crises (Sarlin (2014), Holopainen & Sarlin (2016))
  • 18. Related literature
      • ANNs play an increasingly important role in financial applications for such tasks as pattern recognition, classification, and time series forecasting (Nazari & Alidadi (2013) and Eletter & Yaseen (2010))
      • In our case…
          – Instead of selecting the “appropriate” set of financial ratios…
          – We work on raw balance sheets (the input for financial ratios)
          – Beware: when working on selected financial ratios we discard potentially useful information due to our cognitive bias (or plain ignorance).
          – To the best of our knowledge, this is the first time raw balance sheets are encoded as inputs for a pattern recognition problem
  • 20. ANNs and pattern recognition
      • ANNs are networks of interconnected artificial neurons, with the weights of those connections resulting from a learning process that attempts to minimize the prediction/classification error of the input-output function
      • The central idea of ANNs is to extract linear combinations of the inputs as derived features, and then model the output (i.e. the target) as a nonlinear function of these features (Hastie et al., 2013)
      • The simplest case is the feed-forward ANN (our choice for what follows)
      • Other types of ANN are more complex, but may open new ways to solve more complex problems (e.g. recurrent ANNs, convolutional ANNs, reinforcement ANNs). We do not describe them.
  • 22. ANN models Activation function: • Prediction • Classification Log-sigmoid function
  • 24. ANN models
      • Activation function, for classification:
          – Log-sigmoid function
          – Softmax function*
      (*) According to G. Hinton, it is convenient because 1) it may be interpreted as a probability, and 2) it provides additional knowledge to the training process.
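The two activation functions named above can be combined into a small feed-forward pass. The following is a minimal sketch, assuming one log-sigmoid hidden layer and a softmax output layer; the helper names (`logsig`, `softmax`, `affine`, `forward`) and all numbers are illustrative assumptions, not taken from the slides.

```python
import math

def logsig(x):
    """Log-sigmoid: squashes any real number into (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))

def softmax(z):
    """Softmax: turns a list of scores into probabilities that sum to 1."""
    m = max(z)                        # subtract the max for numerical stability
    e = [math.exp(v - m) for v in z]
    s = sum(e)
    return [v / s for v in e]

def affine(W, b, x):
    """Linear combinations of the inputs: W x + b, with W stored row by row."""
    return [sum(w * xi for w, xi in zip(row, x)) + bi for row, bi in zip(W, b)]

def forward(x, W1, b1, W2, b2):
    """Hidden layer builds derived features; output layer maps them to class probabilities."""
    hidden = [logsig(a) for a in affine(W1, b1, x)]
    return softmax(affine(W2, b2, hidden))

# Toy example with made-up numbers: 3 inputs, 2 hidden neurons, 2 classes.
W1 = [[0.5, -0.2, 0.1], [-0.3, 0.8, 0.4]]
b1 = [0.0, 0.1]
W2 = [[1.0, -1.0], [-1.0, 1.0]]
b2 = [0.0, 0.0]
probs = forward([1.0, 2.0, 3.0], W1, b1, W2, b2)
```

Because the softmax outputs are positive and sum to one, `probs` can be read as class probabilities, which is the first convenience mentioned in the footnote.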
  • 27. Training the ANN
      • Training: adjusting the parameters in W and b in order to attain a target input-output relationship under the chosen transfer functions for a set of observations (i.e. examples)
      • Backpropagation:
          – Backpropagation learns by iteratively processing a dataset of training examples (i.e. observations), comparing the network’s prediction (i.e. output) for each example with the actual target value
          – Parameters in W and b are modified in the backward direction, from the output layer, through each hidden layer, down to the first hidden layer, hence its name (Han & Kamber, 2006)
  • 28. Training the ANN
      • Backpropagation (cont.):
          – Backpropagation usually employs some type of gradient descent method to minimize the error between the prediction and the actual target value
          – Sum (or mean) of squared errors, for prediction or classification
          – Cross-entropy, for classification
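As a rough illustration of the two error measures and of gradient descent, the sketch below computes both errors and takes a few descent steps. It is a simplification under stated assumptions: the steps are applied directly to the output scores rather than to W and b, and the learning rate, step count, and function names are hypothetical. Full backpropagation would carry the same output error signal backwards through the layers.

```python
import math

def mean_squared_error(y, t):
    """Mean of squared differences between outputs y and targets t."""
    return sum((yi - ti) ** 2 for yi, ti in zip(y, t)) / len(y)

def cross_entropy(y, t, eps=1e-12):
    """Cross-entropy between predicted probabilities y and one-hot targets t."""
    return -sum(ti * math.log(max(yi, eps)) for yi, ti in zip(y, t))

def softmax(z):
    m = max(z)
    e = [math.exp(v - m) for v in z]
    s = sum(e)
    return [v / s for v in e]

def gradient_step(z, t, rate=0.5):
    """One gradient-descent step on the output scores z.

    With softmax outputs and cross-entropy error, the gradient of the
    error with respect to the scores is (y - t).
    """
    y = softmax(z)
    return [zi - rate * (yi - ti) for zi, yi, ti in zip(z, y, t)]

target = [0.0, 1.0, 0.0]           # the example belongs to the second of 3 classes
scores = [0.0, 0.0, 0.0]           # untrained scores: all classes equally likely
loss_before = cross_entropy(softmax(scores), target)
for _ in range(20):                # 20 steps is an arbitrary choice
    scores = gradient_step(scores, target)
loss_after = cross_entropy(softmax(scores), target)
```

With three equally likely classes the initial cross-entropy equals ln 3, and each step moves probability mass towards the target class.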
  • 29. Training the ANN
      • Unlike typical applications of regression models in econometrics, the goal of training an artificial neural network is not limited to minimizing in-sample errors.
      • The overfitting problem may be described as the model’s ability to succeed at fitting in-sample but to fail at fitting out-of-sample (see Shmueli (2010), Varian (2014))
      • The goal is not to memorize the training data, but to model the underlying generator of the data (Bishop, 1995)
      • Early stopping:
          – Halt the minimization process before the complexity of the solution inhibits its generalization capability
          – If training is stopped before the in-sample minimum is reached, then the network will effectively be using fewer parameters and will be less likely to overfit (Hagan et al., 2014)
  • 30. Training the ANN
      • Early stopping with cross-validation (Hagan et al., 2014). The (large) dataset is split into:
          – Training dataset (70%): used to minimize the error between the prediction and the actual target value.
          – Validation dataset (15%): used simultaneously (as the neural network is trained) to check how the estimated parameters fit out-of-sample data. When validation error starts to increase (i.e. overfitting starts), the training stops.
          – Test dataset (15%): the error obtained on the test dataset is used to check the future performance of the artificial neural network on out-of-sample data, i.e. its generalization capability.
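The split and the stopping rule can be sketched as follows. This is an illustration only: the rounding rule (validation and test sizes rounded, training takes the remainder), the `patience` parameter, and the validation-error curve are assumptions and are not described in the slides.

```python
import random

def split_indices(n, val=0.15, test=0.15, seed=0):
    """Shuffle 0..n-1 and cut it into training, validation and test index lists."""
    idx = list(range(n))
    random.Random(seed).shuffle(idx)
    n_val = round(val * n)
    n_test = round(test * n)
    n_train = n - n_val - n_test          # training takes the remainder (about 70%)
    return idx[:n_train], idx[n_train:n_train + n_val], idx[n_train + n_val:]

def early_stopping_epoch(val_errors, patience=3):
    """Return the epoch with the lowest validation error seen before stopping.

    Training stops once the validation error has failed to improve
    for `patience` consecutive epochs.
    """
    best_epoch, best_err, waited = 0, float("inf"), 0
    for epoch, err in enumerate(val_errors):
        if err < best_err:
            best_epoch, best_err, waited = epoch, err, 0
        else:
            waited += 1
            if waited >= patience:
                break
    return best_epoch

train_idx, val_idx, test_idx = split_indices(3237)

# Made-up validation curve: improves until epoch 3, then deteriorates.
val_curve = [0.90, 0.50, 0.30, 0.25, 0.27, 0.31, 0.40, 0.10]
stop_at = early_stopping_epoch(val_curve)
```

Requiring several non-improving epochs before stopping is a common guard against noise in the validation error; the price is that a later improvement (the last value in `val_curve`) is never reached.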
  • 32. Post-training analysis
      • To test how good the in-sample and out-of-sample training is:
          – For prediction: R², scatter plots
          – For classification:
              • Confusion matrix: square table that relates the actual target class (on the x-axis) to the predicted class (on the y-axis)
              • Receiver operating characteristic (ROC) curve: shows the trade-off between the true positive rate (on the y-axis) and the false positive rate (on the x-axis) for a given model (Han & Kamber, 2006)
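A confusion matrix with the layout described above (actual class in columns, predicted class in rows) can be built as in this minimal sketch; the labels are made up and the function names are illustrative.

```python
def confusion_matrix(actual, predicted, n_classes):
    """Square table: rows are predicted classes, columns are actual classes."""
    table = [[0] * n_classes for _ in range(n_classes)]
    for a, p in zip(actual, predicted):
        table[p][a] += 1
    return table

def misclassification_rate(table):
    """Share of examples that fall off the main diagonal."""
    total = sum(sum(row) for row in table)
    correct = sum(table[i][i] for i in range(len(table)))
    return (total - correct) / total

# Made-up labels for three classes (0, 1, 2); one example of class 1 is mislabeled as 2.
actual    = [0, 0, 1, 1, 2, 2, 2, 1]
predicted = [0, 0, 1, 2, 2, 2, 2, 1]
cm = confusion_matrix(actual, predicted, 3)   # [[2, 0, 0], [0, 2, 0], [0, 1, 3]]
rate = misclassification_rate(cm)             # 0.125
```

Correct classifications sit on the main diagonal, so the misclassification percentage is simply the off-diagonal share of all examples.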
  • 34. Data and methodology
      • Balance sheets
          – 25 financial accounts (i.e. features)
          – Monthly, from January 2000 to December 2014*
          – 21 banks available (out of 41 that report)
          – 3,237 examples
      • The ANN
          – We implement a standard two-layer network, with one hidden layer and one output layer; often a single hidden layer is all that is necessary (see Zhang et al. (1999), Witten et al. (2011))
          – A base case scenario with a 15-neuron hidden layer
          – Other scenarios for robustness (5, 10, 20, 25 neurons)
      (*) From January 2015, balance sheets are reported under International Financial Reporting Standards (IFRS-NIIF) instead of COLGAAP; the two are not consistent.
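To give a sense of model size, the sketch below counts the parameters of such a network for each hidden-layer size. It assumes a fully connected network with one bias per neuron, which the slides do not state explicitly; `parameter_count` is a hypothetical helper.

```python
def parameter_count(n_inputs, n_hidden, n_outputs):
    """Weights plus biases of a fully connected network with one hidden layer."""
    hidden = n_inputs * n_hidden + n_hidden       # hidden-layer weights and biases
    output = n_hidden * n_outputs + n_outputs     # output-layer weights and biases
    return hidden + output

# 25 accounts in, 21 banks out, for each hidden-layer size considered.
counts = {h: parameter_count(25, h, 21) for h in (5, 10, 15, 20, 25)}
# counts == {5: 256, 10: 491, 15: 726, 20: 961, 25: 1196}
```

Under these assumptions the 15-neuron base case has 726 parameters to be fitted on the 3,237 available examples.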
  • 35. Figure 12. Evolution of Colombian banks (2000-2014). Only banks active as of December 2014 are presented. The name and type of credit institution (e.g. bank, financial corporation, financial cooperative) of some institutions may have changed during the sample period; the most recent name and type (i.e. bank) is preserved. Some names were shortened.
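  • 36. Input matrix P: features (25 accounts) in rows, examples (3,237 balance sheets) in columns. Target matrix T: classes (21 banks) in rows, examples (3,237 balance sheets) in columns.
      $$P = \begin{pmatrix} p_{1,1} & p_{1,2} & \cdots & p_{1,3237} \\ p_{2,1} & \ddots & & \vdots \\ \vdots & & & \\ p_{25,1} & p_{25,2} & \cdots & p_{25,3237} \end{pmatrix}$$
      $$T = \begin{pmatrix} t_{1,1} = 1 & t_{1,2} = 0 & \cdots & t_{1,3237} = 0 \\ t_{2,1} = 0 & \ddots & & \vdots \\ \vdots & & & \\ t_{21,1} = 0 & t_{21,2} = 0 & \cdots & t_{21,3237} = 1 \end{pmatrix}$$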
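The target matrix T is a one-hot encoding: each column holds a single 1 in the row of the bank that owns that balance sheet. A minimal sketch with made-up labels (the helper name and the toy sizes are illustrative):

```python
def one_hot_targets(labels, n_classes):
    """Build T with one row per class and one column per example."""
    T = [[0] * len(labels) for _ in range(n_classes)]
    for col, k in enumerate(labels):
        T[k][col] = 1
    return T

# Hypothetical bank index (0-based) for each of 4 balance sheets, 3 banks in total.
labels = [0, 2, 1, 2]
T = one_hot_targets(labels, 3)
# T == [[1, 0, 0, 0],
#       [0, 0, 1, 0],
#       [0, 1, 0, 1]]
```

In the setting of the slides the same construction would give 21 rows and 3,237 columns.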
  • 38. Main results
      • After training with early-stopping (1 hidden layer, 15 neurons)

      Set                     Samples (balance sheets)   Performance (cross-entropy)   Misclassification (%)
      Training (in-sample)    2,265                      0.0012                        0.35%
      Validation              486                        0.0044                        1.65%
      Test                    486                        0.0019                        1.03%

      Table 1. Overall results of the artificial neural network after training with cross-validation early-stopping.
  • 39. Main results • After training with early-stopping (1 hidden layer, 15 neurons) • In-sample • Out-of-sample #1 • Out-of-sample #2
  • 40. Figure 7. Training dataset confusion matrix (in-sample). Figure annotations: “M&A”; “Sample ???”
  • 41. Figure 10. Test dataset confusion matrix (out-of-sample #2). Figure annotations: “M&A”; “Sample ???”
  • 42. Main results [Figure panels: In-sample; Out-of-sample #2] None of the classes (i.e. banks) displays a ratio of true positives to false positives close to the diagonal. All classes show a high ratio of true positives to false positives.
  • 43. Main results

      Misclassification (average and [standard deviation], %)
      Set                     5 neurons          10 neurons        15 neurons        20 neurons        25 neurons
      Training (in-sample)    19.75% [15.37%]    3.41% [9.84%]     0.61% [0.43%]     0.15% [0.29%]     0.10% [0.23%]
      Validation              20.99% [15.23%]    4.86% [9.87%]     1.64% [0.81%]     1.00% [0.70%]     0.91% [0.72%]
      Test                    21.53% [15.44%]    5.19% [9.86%]     1.72% [0.80%]     1.23% [0.66%]     0.94% [0.63%]

      Table 2. Overall average results of the artificial neural network after training with cross-validation early-stopping. The average and standard deviation (in brackets) are estimated on 100 independent training processes.
  • 45. Final remarks
      • We attained a successful implementation of ANN for pattern classification of banks’ balance sheets
          – Balance sheets are unique and representative snapshots of banks’ financial position
          – ANN is a suitable method for classifying balance sheets
      • To the best of our knowledge, this is the first attempt to use balance sheet data as a comprehensive portrait of the financial position of a firm
      • Using raw balance sheets instead of arbitrarily chosen financial ratios may alleviate selection bias problems (i.e. discarding potentially useful information due to ignorance or reliance on prior research)
      • There is a particularly straightforward application…
  • 46. Final remarks • Early-warning systems, as in Fioramanti (2008), Sarlin (2014), and Holopainen & Sarlin (2016), but… • With raw data (or mixtures of data)
  • 48. Final remarks • Early-warning systems, as in Fioramanti (2008), Sarlin (2014), and Holopainen & Sarlin (2016), but… • With raw data (or mixtures of raw data + indicators)