SlideShare une entreprise Scribd logo
1  sur  1
Télécharger pour lire hors ligne
Rohit Garg
Email: pgp12091.rohit@iimraipur.ac.in
Mobile: (+91) 97524 46191
Rohit Garg (12pgp091.rohit@iimraipur.ac.in)
METHODOLOGY – Blockbuster Movies
Disclaimer: OSCAR has been compiled based on the data provided for more than 2000
movies. These are the people were part of the movies that made it to IMDB rating of
more than 7. These people may or may not have received the Oscar in real life.
Data Preparation:
1. The data has Movie title, imdbID, Writer, Director, Actors, Genre, Runtime,
and imdbRating. (The data has 2,576 rows).
2. A Movie can have more than one Writer/s, Director/s, Actor/s, and Genre/s.
In Excel ‘text to columns’ option was used to separate the Writer/s,
Director/s, Actor/s, and Genre/s. (Delimiter is ‘,’).
3. Runtime was in hours and minutes. The Runtime was converted to minutes.
Analysis:
1. Analysis is divided into 4 parts: TRENDS (Data Confessions), OSCARS (The
Writer/s, Director/s, and Actor/s who were part of movies that made to
IMDB Rating of more than 7), SORCERY (Prediction), and SIR ORACLE
(Recommendations).
2. Two datasets were created:
a. First dataset is for TRENDS.
b. Second dataset is for OSCARS, SORCERY, and SIR ORACLE.
Methodology (OSCARS, SORCERY, and SIR ORACLE):
(No methodology is employed for TRENDS as it is data observation).
1. This section highlights the methodologies that we employed for data
prediction and recommendations.
2. We employed a rather straight forward method to come up with the
solution. “What should the producer of a movie do to get an IMDB rating of
more than 7?” (IMDB rating of more than 7 is considered successful movie).
3. We prepared a dataset of movies that had IMDB Rating of more than 7 from
2,576 movies given. (Finally we have 396 rows).
4. Then we made a comprehensive list of all the Writers, Directors, and Actors
present in the all the 396 movies.
5. We made a pivot table of Genre vs all the Writers, Directors, and Actors.
Then for each Genre we checked the number and percentage of movies they
were a part. (Of course if they were not a part of the genre the count is 0).
6. The top Writers, Directors, and Actors were selected. The top Writers,
Directors, and Actors are those who had the highest percentage in a
particular Genre. We called them OSCAR winners.
TRENDS (Please refer to info-graphics for output):
1. In this section we have tried to figure out whether IMDB Rating has affect on
Box Office collections, whether month of release has affect on IMDB Rating
and Box Office collections, whether runtime has affect on IMDB Rating and
Box Office collections.
2. Picked only those rows where BoxOffice amount is present and amount is in
Millions of USD. (Dropped those rows where amount is not present).
3. Picked only those rows where IMDB rating and Tomato rating is present.
4. Picked only those rows where run time is given. (Finally we have 840 rows).
OSCARS, SORCERY, and SIR ORACLE (Please refer to info-graphics for output):
1. OSCAR: Based on the Methodology employed we got a comprehensive list of
the best Writers, Directors, and Actors for different Genre.
2. SORCERY: Based on the OSCAR, the list of 50 upcoming movies was checked.
Flag variable was made for each Writer/s, Director/s, or Actor/s. Flag variable
= “Strong” means found in OSCAR list and Flag variable = “Weak” means not
found in OSCAR list. (Of course if a movie has more than 1 Genre we checked
for all the Genres).
a. SORCERY (top movies): At least 1 Flag Variable = “Strong”.
b. SORCERY (blockbusters): More than 1 Flag Variable = “Strong”.
3. SIR ORACLE: Based on the TRENDS and based on the OSCAR we have made
recommendations. Based on TRENDS we can recommend producer on movie
release month, runtime and rating that will help him to earn a jackpot. Based
on the OSCAR we can recommend producer on choosing the Writer/s,
Director/s, and Actor/s for a movie in a particular Genre. (Of course if the
movie falls in more than 1 Genre the options increase).
Failed Methodologies (OSCARS, SORCERY, and SIR ORACLE):
1. This section highlights the methodologies that we employed for data
prediction and recommendations. But the results were not satisfactory.
2. Dataset was divided into smaller datasets based on Genre.
3. For each Genre we did regression analysis. IMDB Rating was taken as
dependent variable and Writer/s, Director/s, and Actor/s as independent
variable. But the R-square was low.
PleaseprintonA4sheet

Contenu connexe

Tendances

Movie distribution
Movie distributionMovie distribution
Movie distribution
raktim_hatai
 
The Dark Knight Presentation
The Dark Knight PresentationThe Dark Knight Presentation
The Dark Knight Presentation
Abel Stafa
 
The Dark Knight Case Study
The Dark Knight Case StudyThe Dark Knight Case Study
The Dark Knight Case Study
Naamah Hill
 

Tendances (20)

Netflix marketing plan presentation
Netflix marketing plan presentationNetflix marketing plan presentation
Netflix marketing plan presentation
 
Marvel Case Presentation
Marvel Case PresentationMarvel Case Presentation
Marvel Case Presentation
 
Marvel Studios Presentation
Marvel Studios PresentationMarvel Studios Presentation
Marvel Studios Presentation
 
Movie distribution
Movie distributionMovie distribution
Movie distribution
 
Company Analysis Marvels
Company Analysis  MarvelsCompany Analysis  Marvels
Company Analysis Marvels
 
Steven spielberg
Steven spielbergSteven spielberg
Steven spielberg
 
Over the-top media service in india
Over the-top media service in indiaOver the-top media service in india
Over the-top media service in india
 
Movie positioning advt project (autosaved)
Movie positioning advt project (autosaved)Movie positioning advt project (autosaved)
Movie positioning advt project (autosaved)
 
Film industry
Film industry Film industry
Film industry
 
Netflix Business Model - Nine Elements
Netflix Business Model - Nine ElementsNetflix Business Model - Nine Elements
Netflix Business Model - Nine Elements
 
The Dark Knight Presentation
The Dark Knight PresentationThe Dark Knight Presentation
The Dark Knight Presentation
 
Netflix - Strategy management
Netflix - Strategy managementNetflix - Strategy management
Netflix - Strategy management
 
Netflix
NetflixNetflix
Netflix
 
Value Chain Analysis of PVR Ltd.
Value Chain Analysis of PVR Ltd.Value Chain Analysis of PVR Ltd.
Value Chain Analysis of PVR Ltd.
 
The Dark Knight Case Study
The Dark Knight Case StudyThe Dark Knight Case Study
The Dark Knight Case Study
 
A history of action movies
A history of action moviesA history of action movies
A history of action movies
 
Nintendo's Disruptive Strategy
Nintendo's Disruptive StrategyNintendo's Disruptive Strategy
Nintendo's Disruptive Strategy
 
Marvel Cinematic Universe Presentation
Marvel Cinematic Universe PresentationMarvel Cinematic Universe Presentation
Marvel Cinematic Universe Presentation
 
Film Budgets Presentation
Film Budgets PresentationFilm Budgets Presentation
Film Budgets Presentation
 
FMCG HR practices
FMCG HR practicesFMCG HR practices
FMCG HR practices
 

Similaire à Rohit garg iim_raipur_method (8)

Data Acquisition Project
Data Acquisition ProjectData Acquisition Project
Data Acquisition Project
 
MongoDB .local San Francisco 2020: Go on a Data Safari with MongoDB Charts!
MongoDB .local San Francisco 2020: Go on a Data Safari with MongoDB Charts!MongoDB .local San Francisco 2020: Go on a Data Safari with MongoDB Charts!
MongoDB .local San Francisco 2020: Go on a Data Safari with MongoDB Charts!
 
Understanding the requirements of pre-production.
Understanding the requirements of pre-production.Understanding the requirements of pre-production.
Understanding the requirements of pre-production.
 
Welcome to film studies gcse homework 1
Welcome to film studies gcse homework 1Welcome to film studies gcse homework 1
Welcome to film studies gcse homework 1
 
ADBMS Project Pearl
ADBMS Project PearlADBMS Project Pearl
ADBMS Project Pearl
 
Aspect identification
Aspect identificationAspect identification
Aspect identification
 
movieRecommendation_FinalReport
movieRecommendation_FinalReportmovieRecommendation_FinalReport
movieRecommendation_FinalReport
 
Modeling Techniques in Predictive Analytics
Modeling Techniques in Predictive AnalyticsModeling Techniques in Predictive Analytics
Modeling Techniques in Predictive Analytics
 

Plus de Rohit Garg

Plus de Rohit Garg (15)

natural language processing
natural language processingnatural language processing
natural language processing
 
Certificate DataBricks Training
Certificate DataBricks TrainingCertificate DataBricks Training
Certificate DataBricks Training
 
Infographics - Welcome to India (Visa)
Infographics - Welcome to India (Visa)Infographics - Welcome to India (Visa)
Infographics - Welcome to India (Visa)
 
Infographics - Bollywood
Infographics - BollywoodInfographics - Bollywood
Infographics - Bollywood
 
Certificate in value at risk 1198
Certificate in value at risk 1198Certificate in value at risk 1198
Certificate in value at risk 1198
 
Certificate in risk management 1195
Certificate in risk management 1195Certificate in risk management 1195
Certificate in risk management 1195
 
Certificate in market risk 1196
Certificate in market risk 1196Certificate in market risk 1196
Certificate in market risk 1196
 
Certificate in credit risk 1197
Certificate in credit risk 1197Certificate in credit risk 1197
Certificate in credit risk 1197
 
Certificate in Macro-economics and Statistics
Certificate in Macro-economics and Statistics Certificate in Macro-economics and Statistics
Certificate in Macro-economics and Statistics
 
Certificate in Business Analysis
Certificate in Business Analysis Certificate in Business Analysis
Certificate in Business Analysis
 
B Plan
B PlanB Plan
B Plan
 
Mis 1
Mis 1Mis 1
Mis 1
 
Mis 2
Mis 2Mis 2
Mis 2
 
VBA
VBAVBA
VBA
 
Excel
ExcelExcel
Excel
 

Dernier

Fuzzy Sets decision making under information of uncertainty
Fuzzy Sets decision making under information of uncertaintyFuzzy Sets decision making under information of uncertainty
Fuzzy Sets decision making under information of uncertainty
RafigAliyev2
 
一比一原版(Monash毕业证书)莫纳什大学毕业证成绩单如何办理
一比一原版(Monash毕业证书)莫纳什大学毕业证成绩单如何办理一比一原版(Monash毕业证书)莫纳什大学毕业证成绩单如何办理
一比一原版(Monash毕业证书)莫纳什大学毕业证成绩单如何办理
pyhepag
 
Data Analytics for Digital Marketing Lecture for Advanced Digital & Social Me...
Data Analytics for Digital Marketing Lecture for Advanced Digital & Social Me...Data Analytics for Digital Marketing Lecture for Advanced Digital & Social Me...
Data Analytics for Digital Marketing Lecture for Advanced Digital & Social Me...
Valters Lauzums
 
一比一原版麦考瑞大学毕业证成绩单如何办理
一比一原版麦考瑞大学毕业证成绩单如何办理一比一原版麦考瑞大学毕业证成绩单如何办理
一比一原版麦考瑞大学毕业证成绩单如何办理
cyebo
 
一比一原版阿德莱德大学毕业证成绩单如何办理
一比一原版阿德莱德大学毕业证成绩单如何办理一比一原版阿德莱德大学毕业证成绩单如何办理
一比一原版阿德莱德大学毕业证成绩单如何办理
pyhepag
 
一比一原版加利福尼亚大学尔湾分校毕业证成绩单如何办理
一比一原版加利福尼亚大学尔湾分校毕业证成绩单如何办理一比一原版加利福尼亚大学尔湾分校毕业证成绩单如何办理
一比一原版加利福尼亚大学尔湾分校毕业证成绩单如何办理
pyhepag
 
一比一原版纽卡斯尔大学毕业证成绩单如何办理
一比一原版纽卡斯尔大学毕业证成绩单如何办理一比一原版纽卡斯尔大学毕业证成绩单如何办理
一比一原版纽卡斯尔大学毕业证成绩单如何办理
cyebo
 
一比一原版西悉尼大学毕业证成绩单如何办理
一比一原版西悉尼大学毕业证成绩单如何办理一比一原版西悉尼大学毕业证成绩单如何办理
一比一原版西悉尼大学毕业证成绩单如何办理
pyhepag
 

Dernier (20)

Fuzzy Sets decision making under information of uncertainty
Fuzzy Sets decision making under information of uncertaintyFuzzy Sets decision making under information of uncertainty
Fuzzy Sets decision making under information of uncertainty
 
一比一原版(Monash毕业证书)莫纳什大学毕业证成绩单如何办理
一比一原版(Monash毕业证书)莫纳什大学毕业证成绩单如何办理一比一原版(Monash毕业证书)莫纳什大学毕业证成绩单如何办理
一比一原版(Monash毕业证书)莫纳什大学毕业证成绩单如何办理
 
basics of data science with application areas.pdf
basics of data science with application areas.pdfbasics of data science with application areas.pdf
basics of data science with application areas.pdf
 
Supply chain analytics to combat the effects of Ukraine-Russia-conflict
Supply chain analytics to combat the effects of Ukraine-Russia-conflictSupply chain analytics to combat the effects of Ukraine-Russia-conflict
Supply chain analytics to combat the effects of Ukraine-Russia-conflict
 
2024 Q1 Tableau User Group Leader Quarterly Call
2024 Q1 Tableau User Group Leader Quarterly Call2024 Q1 Tableau User Group Leader Quarterly Call
2024 Q1 Tableau User Group Leader Quarterly Call
 
Data Analytics for Digital Marketing Lecture for Advanced Digital & Social Me...
Data Analytics for Digital Marketing Lecture for Advanced Digital & Social Me...Data Analytics for Digital Marketing Lecture for Advanced Digital & Social Me...
Data Analytics for Digital Marketing Lecture for Advanced Digital & Social Me...
 
一比一原版麦考瑞大学毕业证成绩单如何办理
一比一原版麦考瑞大学毕业证成绩单如何办理一比一原版麦考瑞大学毕业证成绩单如何办理
一比一原版麦考瑞大学毕业证成绩单如何办理
 
Generative AI for Trailblazers_ Unlock the Future of AI.pdf
Generative AI for Trailblazers_ Unlock the Future of AI.pdfGenerative AI for Trailblazers_ Unlock the Future of AI.pdf
Generative AI for Trailblazers_ Unlock the Future of AI.pdf
 
社内勉強会資料  Mamba - A new era or ephemeral
社内勉強会資料   Mamba - A new era or ephemeral社内勉強会資料   Mamba - A new era or ephemeral
社内勉強会資料  Mamba - A new era or ephemeral
 
2024 Q2 Orange County (CA) Tableau User Group Meeting
2024 Q2 Orange County (CA) Tableau User Group Meeting2024 Q2 Orange County (CA) Tableau User Group Meeting
2024 Q2 Orange County (CA) Tableau User Group Meeting
 
一比一原版阿德莱德大学毕业证成绩单如何办理
一比一原版阿德莱德大学毕业证成绩单如何办理一比一原版阿德莱德大学毕业证成绩单如何办理
一比一原版阿德莱德大学毕业证成绩单如何办理
 
Webinar One View, Multiple Systems No-Code Integration of Salesforce and ERPs
Webinar One View, Multiple Systems No-Code Integration of Salesforce and ERPsWebinar One View, Multiple Systems No-Code Integration of Salesforce and ERPs
Webinar One View, Multiple Systems No-Code Integration of Salesforce and ERPs
 
AI Imagen for data-storytelling Infographics.pdf
AI Imagen for data-storytelling Infographics.pdfAI Imagen for data-storytelling Infographics.pdf
AI Imagen for data-storytelling Infographics.pdf
 
Artificial_General_Intelligence__storm_gen_article.pdf
Artificial_General_Intelligence__storm_gen_article.pdfArtificial_General_Intelligence__storm_gen_article.pdf
Artificial_General_Intelligence__storm_gen_article.pdf
 
Easy and simple project file on mp online
Easy and simple project file on mp onlineEasy and simple project file on mp online
Easy and simple project file on mp online
 
一比一原版加利福尼亚大学尔湾分校毕业证成绩单如何办理
一比一原版加利福尼亚大学尔湾分校毕业证成绩单如何办理一比一原版加利福尼亚大学尔湾分校毕业证成绩单如何办理
一比一原版加利福尼亚大学尔湾分校毕业证成绩单如何办理
 
一比一原版纽卡斯尔大学毕业证成绩单如何办理
一比一原版纽卡斯尔大学毕业证成绩单如何办理一比一原版纽卡斯尔大学毕业证成绩单如何办理
一比一原版纽卡斯尔大学毕业证成绩单如何办理
 
一比一原版西悉尼大学毕业证成绩单如何办理
一比一原版西悉尼大学毕业证成绩单如何办理一比一原版西悉尼大学毕业证成绩单如何办理
一比一原版西悉尼大学毕业证成绩单如何办理
 
Pre-ProductionImproveddsfjgndflghtgg.pptx
Pre-ProductionImproveddsfjgndflghtgg.pptxPre-ProductionImproveddsfjgndflghtgg.pptx
Pre-ProductionImproveddsfjgndflghtgg.pptx
 
Data Visualization Exploring and Explaining with Data 1st Edition by Camm sol...
Data Visualization Exploring and Explaining with Data 1st Edition by Camm sol...Data Visualization Exploring and Explaining with Data 1st Edition by Camm sol...
Data Visualization Exploring and Explaining with Data 1st Edition by Camm sol...
 

Rohit garg iim_raipur_method

  • 1. Rohit Garg Email: pgp12091.rohit@iimraipur.ac.in Mobile: (+91) 97524 46191 Rohit Garg (12pgp091.rohit@iimraipur.ac.in) METHODOLOGY – Blockbuster Movies Disclaimer: OSCAR has been compiled based on the data provided for more than 2000 movies. These are the people were part of the movies that made it to IMDB rating of more than 7. These people may or may not have received the Oscar in real life. Data Preparation: 1. The data has Movie title, imdbID, Writer, Director, Actors, Genre, Runtime, and imdbRating. (The data has 2,576 rows). 2. A Movie can have more than one Writer/s, Director/s, Actor/s, and Genre/s. In Excel ‘text to columns’ option was used to separate the Writer/s, Director/s, Actor/s, and Genre/s. (Delimiter is ‘,’). 3. Runtime was in hours and minutes. The Runtime was converted to minutes. Analysis: 1. Analysis is divided into 4 parts: TRENDS (Data Confessions), OSCARS (The Writer/s, Director/s, and Actor/s who were part of movies that made to IMDB Rating of more than 7), SORCERY (Prediction), and SIR ORACLE (Recommendations). 2. Two datasets were created: a. First dataset is for TRENDS. b. Second dataset is for OSCARS, SORCERY, and SIR ORACLE. Methodology (OSCARS, SORCERY, and SIR ORACLE): (No methodology is employed for TRENDS as it is data observation). 1. This section highlights the methodologies that we employed for data prediction and recommendations. 2. We employed a rather straight forward method to come up with the solution. “What should the producer of a movie do to get an IMDB rating of more than 7?” (IMDB rating of more than 7 is considered successful movie). 3. We prepared a dataset of movies that had IMDB Rating of more than 7 from 2,576 movies given. (Finally we have 396 rows). 4. Then we made a comprehensive list of all the Writers, Directors, and Actors present in the all the 396 movies. 5. We made a pivot table of Genre vs all the Writers, Directors, and Actors. Then for each Genre we checked the number and percentage of movies they were a part. (Of course if they were not a part of the genre the count is 0). 6. The top Writers, Directors, and Actors were selected. The top Writers, Directors, and Actors are those who had the highest percentage in a particular Genre. We called them OSCAR winners. TRENDS (Please refer to info-graphics for output): 1. In this section we have tried to figure out whether IMDB Rating has affect on Box Office collections, whether month of release has affect on IMDB Rating and Box Office collections, whether runtime has affect on IMDB Rating and Box Office collections. 2. Picked only those rows where BoxOffice amount is present and amount is in Millions of USD. (Dropped those rows where amount is not present). 3. Picked only those rows where IMDB rating and Tomato rating is present. 4. Picked only those rows where run time is given. (Finally we have 840 rows). OSCARS, SORCERY, and SIR ORACLE (Please refer to info-graphics for output): 1. OSCAR: Based on the Methodology employed we got a comprehensive list of the best Writers, Directors, and Actors for different Genre. 2. SORCERY: Based on the OSCAR, the list of 50 upcoming movies was checked. Flag variable was made for each Writer/s, Director/s, or Actor/s. Flag variable = “Strong” means found in OSCAR list and Flag variable = “Weak” means not found in OSCAR list. (Of course if a movie has more than 1 Genre we checked for all the Genres). a. SORCERY (top movies): At least 1 Flag Variable = “Strong”. b. SORCERY (blockbusters): More than 1 Flag Variable = “Strong”. 3. SIR ORACLE: Based on the TRENDS and based on the OSCAR we have made recommendations. Based on TRENDS we can recommend producer on movie release month, runtime and rating that will help him to earn a jackpot. Based on the OSCAR we can recommend producer on choosing the Writer/s, Director/s, and Actor/s for a movie in a particular Genre. (Of course if the movie falls in more than 1 Genre the options increase). Failed Methodologies (OSCARS, SORCERY, and SIR ORACLE): 1. This section highlights the methodologies that we employed for data prediction and recommendations. But the results were not satisfactory. 2. Dataset was divided into smaller datasets based on Genre. 3. For each Genre we did regression analysis. IMDB Rating was taken as dependent variable and Writer/s, Director/s, and Actor/s as independent variable. But the R-square was low. PleaseprintonA4sheet