SlideShare a Scribd company logo
1 of 15
1
Techno India University, WB
−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−
Simplified Knowledge Prediction:
Application of Machine Learning in Real Life
____________________________________________________
____________________________________________________
 Mr. Sayan Adhikary, B.Sc (H) Data Science – 2nd
Year
 Miss Ankita Jash, B.Sc (H) Data Science – 2nd
Year
 Mr. Avishek Das, B.Sc (H) Data Science – 2nd
Year
2
Acknowledgement:
 Techno India University, West Bengal for giving us the
opportunity to do this project.
 Mr Shantanu. P. Chakraborty, Assistant Professor, Techno
India University
 Mrs. Peea Bal, Assistant Professor, Techno India University
3
Contents:
Serial
No.
Topic Page
No.
1 Abstract 4
2 Introduction 5
3 Scrutinized Analysis of Dataset 8
4 Procedure 9
5 Scope 12
6 Challenges and Opportunities 13
7 Conclusion 14
8 References and Links 15
4
Abstract:
The world today is highly dependent on data. In this data-
driven era with fast growing technologies, a huge amount of
data is generated, captured and also maintained for a variety of
purposes. Machine learning models utilize existing data to
derive meaningful insights over how various factors have an
impact on development of different nations and industries and
predict the outcomes accordingly. In the arena of prediction,
machine learning often works together with the data
visualization techniques to make it easier for the user to
understand the inferences.
This paper aims to analyze and predict the outcome of a real
life case-study using various tools of visual analytics and
machine learning. The dataset consists of cell samples of
patients suspected of breast cancer. We will apply machine
learning models to improve the accuracy of cancer
susceptibility by predicting whether the breast cancer is benign
or malignant. The machine learning models used in this paper
includes decision tree algorithm and logistic regression. The
integration of multidimensionalheterogeneous data, combined
with the application of different machine learning techniques
will show a new path in the domain of cancer detection. The
paper also explores the challenges and limitations so that it
provides future research scopes.
Keywords : Machine Learning, Algorithm, Cancer detection, Visual analytics
5
Introduction:
Incidentally, machine learning is the scientific study of
algorithms and statistical models that is used by the machines
to perform a specific task depending on patterns and inference
rather than explicit instructions .The algorithms of machine
learning divides the data into two parts known as the test data
and validation data. Mathematical model based algorithms are
implemented on the test data, in order to make predictions or
decisions without being explicitly programmed to perform the
task.
Machine learning can be of two types that is supervised
machine learning (contains desired input and output) and
unsupervised machine learning (contains only input but not the
desired output labels). The concept of machine learning is very
important for the predictions done in this paper.
This research and analysis aims to observe how precisely a
machine can predict that a patient suspected of breast cancer is
having malignant or benign cancer. Cancer has been
characterized as a heterogeneous disease consisting of various
subtypes. Early detection and prognosis of a cancer type has
become a necessity in cancer research for facilitating
subsequent clinical management of patients. In this paper the
classification of cancer type and prediction of risk levels is done
by variousmodel of machine learning and is pictoriallydepicted
by various tools of visual analytics.
6
The secondary dataset we have collected was created by Dr.
William H. Wolberg, a physician at the University of Wisconsin
Hospital at Madison, Wisconsin, USA. For creating this dataset
Dr. Wolberg have used fluid samples taken from patients with
solid breast masses and an easy to use graphical computer
program called Xcyt, which is capable of performing the
analysis of cytological features based on a digital scan. The
program initially uses a curve-fitting algorithm for computing
ten features from each one of the cells in the sample and then
calculates the mean value, extreme value and the standard
error of each of the ten features for the image, returning a 30
real valued vector.
A person who sufferedfromBreast Cancer
7
We have performed classification and prediction by decision
tree and logistic regression model of machine learning on the
dataset with the help of python and its various library packages
like pandas, matplotlib, seaborn and scikit. The data
visualization has been done through pairplots.
The inferences and predictions from this paper will be helpful
to cancer research and improve the accuracy of cancer
susceptibility, recurrence and survival prediction.
8
Scrutinized Analysis of the dataset :
In this paper, we have used machine learning repository for
breast cancer dataset. The key attributes of the dataset consists
of the ID number and diagnosis ((M = malignant, B = benign)
3–32). And for each cell nucleus ten real valued features are
computed. The real valued features must include radius,
texture, perimeter, area, smoothness, compactness, concavity,
concave points, fractal dimension and symmetry. The radius is
computed by mean of distances from centre to points on the
perimeter, the texture is calculated by standard deviation of
grey scale values, smoothness is computed by local variation in
radius lengths, compactness is obtained by”perimeter² / area
— 1.0”, concavity is referred as severity of concave portions of
the contour, concave points is the number of concave portions
of the contour and fractal dimension is computed by coastline
approximation— 1.
9
Procedure:
 Phase 1: Data Exploration
We have python in anaconda prompt shell to work on this
dataset. In data exploration phase we firstly import the
necessary libraries and our dataset to anaconda prompt shell.
The necessary libraries include scikit, matplotlib, pyplot ,
seaborn and pandas.Then we have provided index numbers for
“x” and “y” values.
Fig 1: Dataset and X set after importing the dataset
After importing the dataset and the required libraries we have
checked if there is any missing or null data points of the data
set using the some pandas function.
10
After checking and removing the nullvalues, the seaborn library
was used for visualization, to find the data distribution of the
features. The visualization below shows a pairplot with all ten
features of the dataset.
Fig 2: Visualization of dataset.
 Phase 2: Categorical Data
In this phase we have used Label Encoder library to label the
categorical data. Then we had split the dataset into training
data and test data with the help of SciKit-Learn library in
Python using train_test_split method.
11
 Phase 3: Feature Scaling
In this phase of analysis we have scaled the data using the
StandardScaler method from SciKit-Learn library of python.
 Phase 4: Model Selection
This is the one of major phase of analysis where we have
applied Machine Learning algorithm to the Dataset. This phase
is also known as Algorithm selection for predicting the best
results as in this phase we have selected the algorithm which
would predict the best result.
In this phase we have used sklearn library to import all the
methods of classification algorithms.
Then we have predicted the test set results and checked the
accuracy with our model. For checking the accuracy we have
imported a confusion_matrix method of metrics class. We have
used Classification Accuracy method to find the accuracy of our
models (where accuracy =number of correct predictions / total
number of predictions).
After checking we have found that our model of logistic
Regression and Decision Tree Algorithm both has 95.8%
accuracy. Hence from this phase we can conclude that both of
the two machine learning models can give us the best result for
our data set.
12
Scope:
 This paper will be helpful for the hospitals having modern
cancer treatment facilities.
 Oncologists can use our procedure and get accurate results
within seconds.
13
Challenges and Opportunities:
 Challenges :
 Any error made while generating the algorithm may result
in inaccurate results.
 Making people aware of the modern machine learning
techniques and earning their faith is one of the biggest
challenges.
 Opportunities :
 If this model works well then we can use machine learning
techniques for analyzing and predicting other fatal
diseases.
 There is also scope for further research in this field.
14
Conclusion:
 Results and Findings: From a dataset of patients suspected
for cancer we can predict whether the cancer is benign or
malignant. If it is malignant then the risk factor is high. If it
is benign then the risk factor is comparatively low.
 Machine Learning models will be the future of cancer
prediction.
 We have identified a number of trends with respect to the
types of machine learning model being used, the types of
training data being integrated, the kinds of endpoint
predictions being made, the types of cancers being studied
and the overall performance of the models in predicting
cancer susceptibility or outcomes.
 Depending upon the analysis of the results, it is evident
that the integration of multidimensional heterogeneous
data, combined with the application of different machine
learning techniques for feature selection, classification and
prediction can provide promising tools for inference in the
cancer domain.
15
References and Link:
 www.towardsdatascience.com
 www.people.com
 www.kaggle.com
 www.wikipedia.com
 www.geeksforgeeks.com

More Related Content

What's hot

Robust Breast Cancer Diagnosis on Four Different Datasets Using Multi-Classif...
Robust Breast Cancer Diagnosis on Four Different Datasets Using Multi-Classif...Robust Breast Cancer Diagnosis on Four Different Datasets Using Multi-Classif...
Robust Breast Cancer Diagnosis on Four Different Datasets Using Multi-Classif...ahmad abdelhafeez
 
When deep learners change their mind learning dynamics for active learning
When deep learners change their mind  learning dynamics for active learningWhen deep learners change their mind  learning dynamics for active learning
When deep learners change their mind learning dynamics for active learningDevansh16
 
Iganfis Data Mining Approach for Forecasting Cancer Threats
Iganfis Data Mining Approach for Forecasting Cancer ThreatsIganfis Data Mining Approach for Forecasting Cancer Threats
Iganfis Data Mining Approach for Forecasting Cancer Threatsijsrd.com
 
PREDICTION OF MALIGNANCY IN SUSPECTED THYROID TUMOUR PATIENTS BY THREE DIFFER...
PREDICTION OF MALIGNANCY IN SUSPECTED THYROID TUMOUR PATIENTS BY THREE DIFFER...PREDICTION OF MALIGNANCY IN SUSPECTED THYROID TUMOUR PATIENTS BY THREE DIFFER...
PREDICTION OF MALIGNANCY IN SUSPECTED THYROID TUMOUR PATIENTS BY THREE DIFFER...cscpconf
 
AN EFFICIENT PSO BASED ENSEMBLE CLASSIFICATION MODEL ON HIGH DIMENSIONAL DATA...
AN EFFICIENT PSO BASED ENSEMBLE CLASSIFICATION MODEL ON HIGH DIMENSIONAL DATA...AN EFFICIENT PSO BASED ENSEMBLE CLASSIFICATION MODEL ON HIGH DIMENSIONAL DATA...
AN EFFICIENT PSO BASED ENSEMBLE CLASSIFICATION MODEL ON HIGH DIMENSIONAL DATA...ijsc
 
Accounting for variance in machine learning benchmarks
Accounting for variance in machine learning benchmarksAccounting for variance in machine learning benchmarks
Accounting for variance in machine learning benchmarksDevansh16
 
Possibilistic Fuzzy C Means Algorithm For Mass classificaion In Digital Mammo...
Possibilistic Fuzzy C Means Algorithm For Mass classificaion In Digital Mammo...Possibilistic Fuzzy C Means Algorithm For Mass classificaion In Digital Mammo...
Possibilistic Fuzzy C Means Algorithm For Mass classificaion In Digital Mammo...IJERA Editor
 
A BINARY BAT INSPIRED ALGORITHM FOR THE CLASSIFICATION OF BREAST CANCER DATA
A BINARY BAT INSPIRED ALGORITHM FOR THE CLASSIFICATION OF BREAST CANCER DATAA BINARY BAT INSPIRED ALGORITHM FOR THE CLASSIFICATION OF BREAST CANCER DATA
A BINARY BAT INSPIRED ALGORITHM FOR THE CLASSIFICATION OF BREAST CANCER DATAIJSCAI Journal
 
Chronic Kidney Disease Prediction Using Machine Learning
Chronic Kidney Disease Prediction Using Machine LearningChronic Kidney Disease Prediction Using Machine Learning
Chronic Kidney Disease Prediction Using Machine LearningIJCSIS Research Publications
 
An approach for breast cancer diagnosis classification using neural network
An approach for breast cancer diagnosis classification using neural networkAn approach for breast cancer diagnosis classification using neural network
An approach for breast cancer diagnosis classification using neural networkacijjournal
 
Twin support vector machine using kernel function for colorectal cancer detec...
Twin support vector machine using kernel function for colorectal cancer detec...Twin support vector machine using kernel function for colorectal cancer detec...
Twin support vector machine using kernel function for colorectal cancer detec...journalBEEI
 
2013: Prototype-based learning and adaptive distances for classification
2013: Prototype-based learning and adaptive distances for classification2013: Prototype-based learning and adaptive distances for classification
2013: Prototype-based learning and adaptive distances for classificationUniversity of Groningen
 
MLTDD : USE OF MACHINE LEARNING TECHNIQUES FOR DIAGNOSIS OF THYROID GLAND DIS...
MLTDD : USE OF MACHINE LEARNING TECHNIQUES FOR DIAGNOSIS OF THYROID GLAND DIS...MLTDD : USE OF MACHINE LEARNING TECHNIQUES FOR DIAGNOSIS OF THYROID GLAND DIS...
MLTDD : USE OF MACHINE LEARNING TECHNIQUES FOR DIAGNOSIS OF THYROID GLAND DIS...cscpconf
 
Srge most important publications 2020
Srge most important  publications 2020Srge most important  publications 2020
Srge most important publications 2020Aboul Ella Hassanien
 
SVM-PSO based Feature Selection for Improving Medical Diagnosis Reliability u...
SVM-PSO based Feature Selection for Improving Medical Diagnosis Reliability u...SVM-PSO based Feature Selection for Improving Medical Diagnosis Reliability u...
SVM-PSO based Feature Selection for Improving Medical Diagnosis Reliability u...cscpconf
 
A chi-square-SVM based pedagogical rule extraction method for microarray data...
A chi-square-SVM based pedagogical rule extraction method for microarray data...A chi-square-SVM based pedagogical rule extraction method for microarray data...
A chi-square-SVM based pedagogical rule extraction method for microarray data...IJAAS Team
 

What's hot (18)

Robust Breast Cancer Diagnosis on Four Different Datasets Using Multi-Classif...
Robust Breast Cancer Diagnosis on Four Different Datasets Using Multi-Classif...Robust Breast Cancer Diagnosis on Four Different Datasets Using Multi-Classif...
Robust Breast Cancer Diagnosis on Four Different Datasets Using Multi-Classif...
 
When deep learners change their mind learning dynamics for active learning
When deep learners change their mind  learning dynamics for active learningWhen deep learners change their mind  learning dynamics for active learning
When deep learners change their mind learning dynamics for active learning
 
Iganfis Data Mining Approach for Forecasting Cancer Threats
Iganfis Data Mining Approach for Forecasting Cancer ThreatsIganfis Data Mining Approach for Forecasting Cancer Threats
Iganfis Data Mining Approach for Forecasting Cancer Threats
 
PREDICTION OF MALIGNANCY IN SUSPECTED THYROID TUMOUR PATIENTS BY THREE DIFFER...
PREDICTION OF MALIGNANCY IN SUSPECTED THYROID TUMOUR PATIENTS BY THREE DIFFER...PREDICTION OF MALIGNANCY IN SUSPECTED THYROID TUMOUR PATIENTS BY THREE DIFFER...
PREDICTION OF MALIGNANCY IN SUSPECTED THYROID TUMOUR PATIENTS BY THREE DIFFER...
 
AN EFFICIENT PSO BASED ENSEMBLE CLASSIFICATION MODEL ON HIGH DIMENSIONAL DATA...
AN EFFICIENT PSO BASED ENSEMBLE CLASSIFICATION MODEL ON HIGH DIMENSIONAL DATA...AN EFFICIENT PSO BASED ENSEMBLE CLASSIFICATION MODEL ON HIGH DIMENSIONAL DATA...
AN EFFICIENT PSO BASED ENSEMBLE CLASSIFICATION MODEL ON HIGH DIMENSIONAL DATA...
 
Accounting for variance in machine learning benchmarks
Accounting for variance in machine learning benchmarksAccounting for variance in machine learning benchmarks
Accounting for variance in machine learning benchmarks
 
Possibilistic Fuzzy C Means Algorithm For Mass classificaion In Digital Mammo...
Possibilistic Fuzzy C Means Algorithm For Mass classificaion In Digital Mammo...Possibilistic Fuzzy C Means Algorithm For Mass classificaion In Digital Mammo...
Possibilistic Fuzzy C Means Algorithm For Mass classificaion In Digital Mammo...
 
IJET-V2I6P32
IJET-V2I6P32IJET-V2I6P32
IJET-V2I6P32
 
A BINARY BAT INSPIRED ALGORITHM FOR THE CLASSIFICATION OF BREAST CANCER DATA
A BINARY BAT INSPIRED ALGORITHM FOR THE CLASSIFICATION OF BREAST CANCER DATAA BINARY BAT INSPIRED ALGORITHM FOR THE CLASSIFICATION OF BREAST CANCER DATA
A BINARY BAT INSPIRED ALGORITHM FOR THE CLASSIFICATION OF BREAST CANCER DATA
 
Chronic Kidney Disease Prediction Using Machine Learning
Chronic Kidney Disease Prediction Using Machine LearningChronic Kidney Disease Prediction Using Machine Learning
Chronic Kidney Disease Prediction Using Machine Learning
 
An approach for breast cancer diagnosis classification using neural network
An approach for breast cancer diagnosis classification using neural networkAn approach for breast cancer diagnosis classification using neural network
An approach for breast cancer diagnosis classification using neural network
 
Twin support vector machine using kernel function for colorectal cancer detec...
Twin support vector machine using kernel function for colorectal cancer detec...Twin support vector machine using kernel function for colorectal cancer detec...
Twin support vector machine using kernel function for colorectal cancer detec...
 
2013: Prototype-based learning and adaptive distances for classification
2013: Prototype-based learning and adaptive distances for classification2013: Prototype-based learning and adaptive distances for classification
2013: Prototype-based learning and adaptive distances for classification
 
MLTDD : USE OF MACHINE LEARNING TECHNIQUES FOR DIAGNOSIS OF THYROID GLAND DIS...
MLTDD : USE OF MACHINE LEARNING TECHNIQUES FOR DIAGNOSIS OF THYROID GLAND DIS...MLTDD : USE OF MACHINE LEARNING TECHNIQUES FOR DIAGNOSIS OF THYROID GLAND DIS...
MLTDD : USE OF MACHINE LEARNING TECHNIQUES FOR DIAGNOSIS OF THYROID GLAND DIS...
 
Srge most important publications 2020
Srge most important  publications 2020Srge most important  publications 2020
Srge most important publications 2020
 
SVM-PSO based Feature Selection for Improving Medical Diagnosis Reliability u...
SVM-PSO based Feature Selection for Improving Medical Diagnosis Reliability u...SVM-PSO based Feature Selection for Improving Medical Diagnosis Reliability u...
SVM-PSO based Feature Selection for Improving Medical Diagnosis Reliability u...
 
[IJCT-V3I2P26] Authors: Sunny Sharma
[IJCT-V3I2P26] Authors: Sunny Sharma[IJCT-V3I2P26] Authors: Sunny Sharma
[IJCT-V3I2P26] Authors: Sunny Sharma
 
A chi-square-SVM based pedagogical rule extraction method for microarray data...
A chi-square-SVM based pedagogical rule extraction method for microarray data...A chi-square-SVM based pedagogical rule extraction method for microarray data...
A chi-square-SVM based pedagogical rule extraction method for microarray data...
 

Similar to Simplified Knowledge Prediction: Application of Machine Learning in Real Life

IRJET- Breast Cancer Disease Prediction : Using Machine Learning Approach
IRJET- Breast Cancer Disease Prediction : Using Machine Learning ApproachIRJET- Breast Cancer Disease Prediction : Using Machine Learning Approach
IRJET- Breast Cancer Disease Prediction : Using Machine Learning ApproachIRJET Journal
 
A Comprehensive Survey On Predictive Analysis Of Breast Cancer
A Comprehensive Survey On Predictive Analysis Of Breast CancerA Comprehensive Survey On Predictive Analysis Of Breast Cancer
A Comprehensive Survey On Predictive Analysis Of Breast CancerAngela Shin
 
Health Care Application using Machine Learning and Deep Learning
Health Care Application using Machine Learning and Deep LearningHealth Care Application using Machine Learning and Deep Learning
Health Care Application using Machine Learning and Deep LearningIRJET Journal
 
Classification AlgorithmBased Analysis of Breast Cancer Data
Classification AlgorithmBased Analysis of Breast Cancer DataClassification AlgorithmBased Analysis of Breast Cancer Data
Classification AlgorithmBased Analysis of Breast Cancer DataIIRindia
 
Breast Cancer Detection Using Machine Learning
Breast Cancer Detection Using Machine LearningBreast Cancer Detection Using Machine Learning
Breast Cancer Detection Using Machine LearningIRJET Journal
 
IRJET- Breast Cancer Prediction using Supervised Machine Learning Algorithms
IRJET- Breast Cancer Prediction using Supervised Machine Learning AlgorithmsIRJET- Breast Cancer Prediction using Supervised Machine Learning Algorithms
IRJET- Breast Cancer Prediction using Supervised Machine Learning AlgorithmsIRJET Journal
 
Comparative Analysis of Early Stage Cancer Detection Methods in Machine Learning
Comparative Analysis of Early Stage Cancer Detection Methods in Machine LearningComparative Analysis of Early Stage Cancer Detection Methods in Machine Learning
Comparative Analysis of Early Stage Cancer Detection Methods in Machine LearningIRJET Journal
 
APPLICATION OF CNN MODEL ON MEDICAL IMAGE
APPLICATION OF CNN MODEL ON MEDICAL IMAGEAPPLICATION OF CNN MODEL ON MEDICAL IMAGE
APPLICATION OF CNN MODEL ON MEDICAL IMAGEIRJET Journal
 
Controlling informative features for improved accuracy and faster predictions...
Controlling informative features for improved accuracy and faster predictions...Controlling informative features for improved accuracy and faster predictions...
Controlling informative features for improved accuracy and faster predictions...Damian R. Mingle, MBA
 
IRJET- Disease Prediction using Machine Learning
IRJET-  Disease Prediction using Machine LearningIRJET-  Disease Prediction using Machine Learning
IRJET- Disease Prediction using Machine LearningIRJET Journal
 
IRJET- Breast Cancer Relapse Prognosis by Classic and Modern Structures o...
IRJET-  	  Breast Cancer Relapse Prognosis by Classic and Modern Structures o...IRJET-  	  Breast Cancer Relapse Prognosis by Classic and Modern Structures o...
IRJET- Breast Cancer Relapse Prognosis by Classic and Modern Structures o...IRJET Journal
 
IRJET- Disease Prediction using Machine Learning
IRJET-  	  Disease Prediction using Machine LearningIRJET-  	  Disease Prediction using Machine Learning
IRJET- Disease Prediction using Machine LearningIRJET Journal
 
HEALTH PREDICTION ANALYSIS USING DATA MINING
HEALTH PREDICTION ANALYSIS USING DATA  MININGHEALTH PREDICTION ANALYSIS USING DATA  MINING
HEALTH PREDICTION ANALYSIS USING DATA MININGAshish Salve
 
IRJET - Classification of Cancer Images using Deep Learning
IRJET -  	  Classification of Cancer Images using Deep LearningIRJET -  	  Classification of Cancer Images using Deep Learning
IRJET - Classification of Cancer Images using Deep LearningIRJET Journal
 
Analysis of Machine Learning Techniques for Breast Cancer Prediction
Analysis of Machine Learning Techniques for Breast Cancer PredictionAnalysis of Machine Learning Techniques for Breast Cancer Prediction
Analysis of Machine Learning Techniques for Breast Cancer PredictionDr. Amarjeet Singh
 
Breast Cancer Prediction
Breast Cancer PredictionBreast Cancer Prediction
Breast Cancer PredictionIRJET Journal
 
Computer Aided System for Detection and Classification of Breast Cancer
Computer Aided System for Detection and Classification of Breast CancerComputer Aided System for Detection and Classification of Breast Cancer
Computer Aided System for Detection and Classification of Breast CancerIJITCA Journal
 
IRJET - Machine Learning Applications on Cancer Prognosis and Prediction
IRJET - Machine Learning Applications on Cancer Prognosis and PredictionIRJET - Machine Learning Applications on Cancer Prognosis and Prediction
IRJET - Machine Learning Applications on Cancer Prognosis and PredictionIRJET Journal
 

Similar to Simplified Knowledge Prediction: Application of Machine Learning in Real Life (20)

IRJET- Breast Cancer Disease Prediction : Using Machine Learning Approach
IRJET- Breast Cancer Disease Prediction : Using Machine Learning ApproachIRJET- Breast Cancer Disease Prediction : Using Machine Learning Approach
IRJET- Breast Cancer Disease Prediction : Using Machine Learning Approach
 
A Comprehensive Survey On Predictive Analysis Of Breast Cancer
A Comprehensive Survey On Predictive Analysis Of Breast CancerA Comprehensive Survey On Predictive Analysis Of Breast Cancer
A Comprehensive Survey On Predictive Analysis Of Breast Cancer
 
Health Care Application using Machine Learning and Deep Learning
Health Care Application using Machine Learning and Deep LearningHealth Care Application using Machine Learning and Deep Learning
Health Care Application using Machine Learning and Deep Learning
 
Classification AlgorithmBased Analysis of Breast Cancer Data
Classification AlgorithmBased Analysis of Breast Cancer DataClassification AlgorithmBased Analysis of Breast Cancer Data
Classification AlgorithmBased Analysis of Breast Cancer Data
 
Breast Cancer Detection Using Machine Learning
Breast Cancer Detection Using Machine LearningBreast Cancer Detection Using Machine Learning
Breast Cancer Detection Using Machine Learning
 
IRJET- Breast Cancer Prediction using Supervised Machine Learning Algorithms
IRJET- Breast Cancer Prediction using Supervised Machine Learning AlgorithmsIRJET- Breast Cancer Prediction using Supervised Machine Learning Algorithms
IRJET- Breast Cancer Prediction using Supervised Machine Learning Algorithms
 
Updated proposal powerpoint.pptx
Updated proposal powerpoint.pptxUpdated proposal powerpoint.pptx
Updated proposal powerpoint.pptx
 
Comparative Analysis of Early Stage Cancer Detection Methods in Machine Learning
Comparative Analysis of Early Stage Cancer Detection Methods in Machine LearningComparative Analysis of Early Stage Cancer Detection Methods in Machine Learning
Comparative Analysis of Early Stage Cancer Detection Methods in Machine Learning
 
APPLICATION OF CNN MODEL ON MEDICAL IMAGE
APPLICATION OF CNN MODEL ON MEDICAL IMAGEAPPLICATION OF CNN MODEL ON MEDICAL IMAGE
APPLICATION OF CNN MODEL ON MEDICAL IMAGE
 
Controlling informative features for improved accuracy and faster predictions...
Controlling informative features for improved accuracy and faster predictions...Controlling informative features for improved accuracy and faster predictions...
Controlling informative features for improved accuracy and faster predictions...
 
IRJET- Disease Prediction using Machine Learning
IRJET-  Disease Prediction using Machine LearningIRJET-  Disease Prediction using Machine Learning
IRJET- Disease Prediction using Machine Learning
 
IRJET- Breast Cancer Relapse Prognosis by Classic and Modern Structures o...
IRJET-  	  Breast Cancer Relapse Prognosis by Classic and Modern Structures o...IRJET-  	  Breast Cancer Relapse Prognosis by Classic and Modern Structures o...
IRJET- Breast Cancer Relapse Prognosis by Classic and Modern Structures o...
 
IRJET- Disease Prediction using Machine Learning
IRJET-  	  Disease Prediction using Machine LearningIRJET-  	  Disease Prediction using Machine Learning
IRJET- Disease Prediction using Machine Learning
 
HEALTH PREDICTION ANALYSIS USING DATA MINING
HEALTH PREDICTION ANALYSIS USING DATA  MININGHEALTH PREDICTION ANALYSIS USING DATA  MINING
HEALTH PREDICTION ANALYSIS USING DATA MINING
 
IRJET - Classification of Cancer Images using Deep Learning
IRJET -  	  Classification of Cancer Images using Deep LearningIRJET -  	  Classification of Cancer Images using Deep Learning
IRJET - Classification of Cancer Images using Deep Learning
 
Analysis of Machine Learning Techniques for Breast Cancer Prediction
Analysis of Machine Learning Techniques for Breast Cancer PredictionAnalysis of Machine Learning Techniques for Breast Cancer Prediction
Analysis of Machine Learning Techniques for Breast Cancer Prediction
 
Breast Cancer Prediction
Breast Cancer PredictionBreast Cancer Prediction
Breast Cancer Prediction
 
Computer Aided System for Detection and Classification of Breast Cancer
Computer Aided System for Detection and Classification of Breast CancerComputer Aided System for Detection and Classification of Breast Cancer
Computer Aided System for Detection and Classification of Breast Cancer
 
Madhavi tippani
Madhavi tippaniMadhavi tippani
Madhavi tippani
 
IRJET - Machine Learning Applications on Cancer Prognosis and Prediction
IRJET - Machine Learning Applications on Cancer Prognosis and PredictionIRJET - Machine Learning Applications on Cancer Prognosis and Prediction
IRJET - Machine Learning Applications on Cancer Prognosis and Prediction
 

More from Peea Bal Chakraborty

Write your email professionally - Writing apology Email
Write your email professionally - Writing apology EmailWrite your email professionally - Writing apology Email
Write your email professionally - Writing apology EmailPeea Bal Chakraborty
 
Write your email professionally- Writing apology Emails
Write your email professionally- Writing apology EmailsWrite your email professionally- Writing apology Emails
Write your email professionally- Writing apology EmailsPeea Bal Chakraborty
 
Write your email professionally - Writing Request E mails
Write your email professionally - Writing Request E mailsWrite your email professionally - Writing Request E mails
Write your email professionally - Writing Request E mailsPeea Bal Chakraborty
 
Supply Chain Management - Its challenges & requirements
Supply Chain Management - Its challenges & requirementsSupply Chain Management - Its challenges & requirements
Supply Chain Management - Its challenges & requirementsPeea Bal Chakraborty
 
Introduction to Supply Chain Management - a brief overview
Introduction to Supply Chain Management - a brief overview Introduction to Supply Chain Management - a brief overview
Introduction to Supply Chain Management - a brief overview Peea Bal Chakraborty
 

More from Peea Bal Chakraborty (10)

ABC analysis
ABC analysisABC analysis
ABC analysis
 
Channels of distribution
Channels of distributionChannels of distribution
Channels of distribution
 
Scm inventory controls
Scm  inventory controlsScm  inventory controls
Scm inventory controls
 
Write your email professionally - Writing apology Email
Write your email professionally - Writing apology EmailWrite your email professionally - Writing apology Email
Write your email professionally - Writing apology Email
 
Write your email professionally- Writing apology Emails
Write your email professionally- Writing apology EmailsWrite your email professionally- Writing apology Emails
Write your email professionally- Writing apology Emails
 
Write your email professionally - Writing Request E mails
Write your email professionally - Writing Request E mailsWrite your email professionally - Writing Request E mails
Write your email professionally - Writing Request E mails
 
Plc
PlcPlc
Plc
 
Plc
PlcPlc
Plc
 
Supply Chain Management - Its challenges & requirements
Supply Chain Management - Its challenges & requirementsSupply Chain Management - Its challenges & requirements
Supply Chain Management - Its challenges & requirements
 
Introduction to Supply Chain Management - a brief overview
Introduction to Supply Chain Management - a brief overview Introduction to Supply Chain Management - a brief overview
Introduction to Supply Chain Management - a brief overview
 

Recently uploaded

RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.natarajan8993
 
Customer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxCustomer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxEmmanuel Dauda
 
Top 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In QueensTop 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In Queensdataanalyticsqueen03
 
INTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTDINTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTDRafezzaman
 
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝soniya singh
 
Call Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts ServiceCall Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts ServiceSapana Sha
 
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPramod Kumar Srivastava
 
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...soniya singh
 
ASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel CanterASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel Cantervoginip
 
Identifying Appropriate Test Statistics Involving Population Mean
Identifying Appropriate Test Statistics Involving Population MeanIdentifying Appropriate Test Statistics Involving Population Mean
Identifying Appropriate Test Statistics Involving Population MeanMYRABACSAFRA2
 
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...dajasot375
 
Predicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdfPredicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdfBoston Institute of Analytics
 
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degreeyuu sss
 
RadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfRadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfgstagge
 
Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024Colleen Farrelly
 
GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]📊 Markus Baersch
 
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...limedy534
 
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)jennyeacort
 
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024thyngster
 
Advanced Machine Learning for Business Professionals
Advanced Machine Learning for Business ProfessionalsAdvanced Machine Learning for Business Professionals
Advanced Machine Learning for Business ProfessionalsVICTOR MAESTRE RAMIREZ
 

Recently uploaded (20)

RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.
 
Customer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxCustomer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptx
 
Top 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In QueensTop 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In Queens
 
INTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTDINTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTD
 
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
 
Call Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts ServiceCall Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts Service
 
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
 
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
 
ASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel CanterASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel Canter
 
Identifying Appropriate Test Statistics Involving Population Mean
Identifying Appropriate Test Statistics Involving Population MeanIdentifying Appropriate Test Statistics Involving Population Mean
Identifying Appropriate Test Statistics Involving Population Mean
 
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
 
Predicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdfPredicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdf
 
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
 
RadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfRadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdf
 
Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024
 
GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]
 
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
 
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
 
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
 
Advanced Machine Learning for Business Professionals
Advanced Machine Learning for Business ProfessionalsAdvanced Machine Learning for Business Professionals
Advanced Machine Learning for Business Professionals
 

Simplified Knowledge Prediction: Application of Machine Learning in Real Life

  • 1. 1 Techno India University, WB −−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−− Simplified Knowledge Prediction: Application of Machine Learning in Real Life ____________________________________________________ ____________________________________________________  Mr. Sayan Adhikary, B.Sc (H) Data Science – 2nd Year  Miss Ankita Jash, B.Sc (H) Data Science – 2nd Year  Mr. Avishek Das, B.Sc (H) Data Science – 2nd Year
  • 2. 2 Acknowledgement:  Techno India University, West Bengal for giving us the opportunity to do this project.  Mr Shantanu. P. Chakraborty, Assistant Professor, Techno India University  Mrs. Peea Bal, Assistant Professor, Techno India University
  • 3. 3 Contents: Serial No. Topic Page No. 1 Abstract 4 2 Introduction 5 3 Scrutinized Analysis of Dataset 8 4 Procedure 9 5 Scope 12 6 Challenges and Opportunities 13 7 Conclusion 14 8 References and Links 15
  • 4. 4 Abstract: The world today is highly dependent on data. In this data- driven era with fast growing technologies, a huge amount of data is generated, captured and also maintained for a variety of purposes. Machine learning models utilize existing data to derive meaningful insights over how various factors have an impact on development of different nations and industries and predict the outcomes accordingly. In the arena of prediction, machine learning often works together with the data visualization techniques to make it easier for the user to understand the inferences. This paper aims to analyze and predict the outcome of a real life case-study using various tools of visual analytics and machine learning. The dataset consists of cell samples of patients suspected of breast cancer. We will apply machine learning models to improve the accuracy of cancer susceptibility by predicting whether the breast cancer is benign or malignant. The machine learning models used in this paper includes decision tree algorithm and logistic regression. The integration of multidimensionalheterogeneous data, combined with the application of different machine learning techniques will show a new path in the domain of cancer detection. The paper also explores the challenges and limitations so that it provides future research scopes. Keywords : Machine Learning, Algorithm, Cancer detection, Visual analytics
  • 5. 5 Introduction: Incidentally, machine learning is the scientific study of algorithms and statistical models that is used by the machines to perform a specific task depending on patterns and inference rather than explicit instructions .The algorithms of machine learning divides the data into two parts known as the test data and validation data. Mathematical model based algorithms are implemented on the test data, in order to make predictions or decisions without being explicitly programmed to perform the task. Machine learning can be of two types that is supervised machine learning (contains desired input and output) and unsupervised machine learning (contains only input but not the desired output labels). The concept of machine learning is very important for the predictions done in this paper. This research and analysis aims to observe how precisely a machine can predict that a patient suspected of breast cancer is having malignant or benign cancer. Cancer has been characterized as a heterogeneous disease consisting of various subtypes. Early detection and prognosis of a cancer type has become a necessity in cancer research for facilitating subsequent clinical management of patients. In this paper the classification of cancer type and prediction of risk levels is done by variousmodel of machine learning and is pictoriallydepicted by various tools of visual analytics.
  • 6. 6 The secondary dataset we have collected was created by Dr. William H. Wolberg, a physician at the University of Wisconsin Hospital at Madison, Wisconsin, USA. For creating this dataset Dr. Wolberg have used fluid samples taken from patients with solid breast masses and an easy to use graphical computer program called Xcyt, which is capable of performing the analysis of cytological features based on a digital scan. The program initially uses a curve-fitting algorithm for computing ten features from each one of the cells in the sample and then calculates the mean value, extreme value and the standard error of each of the ten features for the image, returning a 30 real valued vector. A person who sufferedfromBreast Cancer
  • 7. 7 We have performed classification and prediction by decision tree and logistic regression model of machine learning on the dataset with the help of python and its various library packages like pandas, matplotlib, seaborn and scikit. The data visualization has been done through pairplots. The inferences and predictions from this paper will be helpful to cancer research and improve the accuracy of cancer susceptibility, recurrence and survival prediction.
  • 8. 8 Scrutinized Analysis of the dataset : In this paper, we have used machine learning repository for breast cancer dataset. The key attributes of the dataset consists of the ID number and diagnosis ((M = malignant, B = benign) 3–32). And for each cell nucleus ten real valued features are computed. The real valued features must include radius, texture, perimeter, area, smoothness, compactness, concavity, concave points, fractal dimension and symmetry. The radius is computed by mean of distances from centre to points on the perimeter, the texture is calculated by standard deviation of grey scale values, smoothness is computed by local variation in radius lengths, compactness is obtained by”perimeter² / area — 1.0”, concavity is referred as severity of concave portions of the contour, concave points is the number of concave portions of the contour and fractal dimension is computed by coastline approximation— 1.
  • 9. 9 Procedure:  Phase 1: Data Exploration We have python in anaconda prompt shell to work on this dataset. In data exploration phase we firstly import the necessary libraries and our dataset to anaconda prompt shell. The necessary libraries include scikit, matplotlib, pyplot , seaborn and pandas.Then we have provided index numbers for “x” and “y” values. Fig 1: Dataset and X set after importing the dataset After importing the dataset and the required libraries we have checked if there is any missing or null data points of the data set using the some pandas function.
  • 10. 10 After checking and removing the nullvalues, the seaborn library was used for visualization, to find the data distribution of the features. The visualization below shows a pairplot with all ten features of the dataset. Fig 2: Visualization of dataset.  Phase 2: Categorical Data In this phase we have used Label Encoder library to label the categorical data. Then we had split the dataset into training data and test data with the help of SciKit-Learn library in Python using train_test_split method.
  • 11. 11  Phase 3: Feature Scaling In this phase of analysis we have scaled the data using the StandardScaler method from SciKit-Learn library of python.  Phase 4: Model Selection This is the one of major phase of analysis where we have applied Machine Learning algorithm to the Dataset. This phase is also known as Algorithm selection for predicting the best results as in this phase we have selected the algorithm which would predict the best result. In this phase we have used sklearn library to import all the methods of classification algorithms. Then we have predicted the test set results and checked the accuracy with our model. For checking the accuracy we have imported a confusion_matrix method of metrics class. We have used Classification Accuracy method to find the accuracy of our models (where accuracy =number of correct predictions / total number of predictions). After checking we have found that our model of logistic Regression and Decision Tree Algorithm both has 95.8% accuracy. Hence from this phase we can conclude that both of the two machine learning models can give us the best result for our data set.
  • 12. 12 Scope:  This paper will be helpful for the hospitals having modern cancer treatment facilities.  Oncologists can use our procedure and get accurate results within seconds.
  • 13. 13 Challenges and Opportunities:  Challenges :  Any error made while generating the algorithm may result in inaccurate results.  Making people aware of the modern machine learning techniques and earning their faith is one of the biggest challenges.  Opportunities :  If this model works well then we can use machine learning techniques for analyzing and predicting other fatal diseases.  There is also scope for further research in this field.
  • 14. 14 Conclusion:  Results and Findings: From a dataset of patients suspected for cancer we can predict whether the cancer is benign or malignant. If it is malignant then the risk factor is high. If it is benign then the risk factor is comparatively low.  Machine Learning models will be the future of cancer prediction.  We have identified a number of trends with respect to the types of machine learning model being used, the types of training data being integrated, the kinds of endpoint predictions being made, the types of cancers being studied and the overall performance of the models in predicting cancer susceptibility or outcomes.  Depending upon the analysis of the results, it is evident that the integration of multidimensional heterogeneous data, combined with the application of different machine learning techniques for feature selection, classification and prediction can provide promising tools for inference in the cancer domain.
  • 15. 15 References and Link:  www.towardsdatascience.com  www.people.com  www.kaggle.com  www.wikipedia.com  www.geeksforgeeks.com