SlideShare une entreprise Scribd logo
1  sur  26
Télécharger pour lire hors ligne
R Language for Business Analytics
Kamakshaiah Musunuru
Dhruva College of Management, Hyderabad

September 1, 2013
Objectives
Pre-history
About R
how to obtain R
Introduction
Speciality
Muenchen’s Survey
O’Conner’s Comparison
Help
Objectives

To know about R
To asertain Characteristics of R
To compare with other proprietary alternatives
To evaluate with the help of an example
The predecessor for R is S.
S was developed by John Chanmbers (earlier versions) along
with Rick Becker and Allan Wilks of Bell Laboratories

the project was started on May, 1976.
in 1979, S was ported to UNIX
S-Plus and R happened to be by-products of S. 1
S was available for academic and commercial purposes from
ATΓT Laboratories.
1

Ironically, R stood at top 26 best software languages, where as S and
S-Plus are observed in 100.
R began as a research project by Ross Ihaka and
Robert Gentleman at University of Aukland in 1990s.
R is programming language, meant for statistical computing.
R is open source software, supported by volunteers all around
the world. But the central control in the hands of a group
called R-core
The base system provides:
interactive language for numerical computing
data management
graphics
a variety of related calculations
Please visit:
http://r-project.org
R is available for Windows, Linux and MacOS.
Some important websites for R:
CRAN
BioConductor
omegahat
RForge
Introduction to R
R is an integrated suite of software facilities for data manipulation,
calculation and graphical display. Among other things it has
an effective data handling and storage facility,
a suite of operators for calculations on arrays, in particular
matrices,
a large, coherent, integrated collection of intermediate tools
for data analysis,
graphical facilities for data analysis and display either directly
at the computer or on hardcopy, and
a well developed, simple and effective programming language
(called ’S’) which includes conditionals, loops, user defined
recursive functions and input and output facilities.
(Indeedmost of the system supplied functions are themselves
written in the S language.)
Introduction - Continued
R is a GNU project which is similar to the S language and
environment which was developed at Bell Laboratories
(formerly ATT, now Lucent Technologies) by John Chambers
and colleagues.(Please visit http://www.r-project.org/)
R provides a wide variety of statistical and and graphical
techniques, and is highly extensible. some of them are:
Linear Modelling
Non-Linear Modelling
Classical Statistical Tests
time-series Analysis
Classification, clustering
Neural Networks
Social Network Analysis
Linear Programming, integer-programming and etc
and many more.............
Introduction - More...

R is Ligua Franca of statistical research
Over all SAS is 11 years behind R (William Ravelle)
Most importantly R is not only free but also open sourcewhich mean much more
R is available under GNU Copy-left
The recent R version 2.15.3 (Security Blanket) has been
released on 2013-03-01
Speciality of R

By Tal Galili (from http://www.kdnuggets.com/), he asserts
that:
R has largest number of email discussions
The number of R packages published on CRAN continue to
grow (than STATA and SAS)
R has more blogs (appox. 170) the second to R is SAS (only
31 blogs)
Even in terms of job opportunities it might not be worse
41 percent SAS
15 percent SPSS
14 percent R
Meunchen Survey - CRAN growth

2

2

Fig. 1: CRAN-growth
Introduction - Speciality of R
By R A Muenchen (from
http://r4stats.com/articles/popularity/), he observes that
R counts for more number of downloads (but it might be
difficult to count)
TIOBE (http://www.tiobe.com, community programming
language index) ranked R ranked as 24th best programming
language (SPSS was out from the list)
Transparent Language Popularity Index (TLPI) ranked R as 12
most wounderful languages on the globe; the SAS as 26th
R observed as most wanted on online discussions
Mean
Mean
1000
Mean
Mean

monthly email disscussions for R are more than 3000
monthly email disscussions for STATA are more than
monthly email disscussions for SAS are less than 1000
monthly email disscussions for SPSS are less than 500

The assumption is being that what you want is that what you
talk
Introduction - downloads

Illustration.1: most important package downloads -Bioconductor
Introduction - Downloads

Fig.2: Downloads of Bioconductor package
Introduction - What Muenchen Said?
His book, ”R for SAS and SPSS users” is a great work for miners and
analyst.
He studied popularity of data analysis software with respect certain
factors(https://sites.google.com/site/r4statistics/popularity):
sales downloads
Language popularity measures
Internet discussions
Competition
Usage
Literature books
Impact on scholarly activity
Website popularity
Growth in pupularity
IT Research firms
Job markets
Meunchen Survey - Number of Users

3

3

Fig. 3: Number of Users and Analytics
Meunchen Survey - Email Discussions

4

4

Fig. 4: Traffic on Email Discussons
Meunchen Survey - Scholers Hits

5

5

Fig. 5: Scholers hits on software
Meunchen Survey - Job Market

6

6

Fig. 6: Jobs for analytics software on Indeed.com
Introduction - Comparison
According to Brendan O’Conner (expert of artificial intelligence
and social science researcher):
there are two big divisions of solutions; they are:
programming oriented solutions like R, Matlab, Python
analytic solutions like Excel, Stata, and SPSS

Python is “immature”
Matlab is certainly “weak”, but might be better for
mathematical algorithms
SPSS and Stata are equal in capabilities; perhaps Stata might
be much cheaper than SPSS
These two are for those who crave for easy ways and
short-cuts.....
SAS is favoured by older crowd....
SAS people complain that that the graphical outputs are poor
Matlab visualization too is in little controversy compared to R
So, why not we try R!
Introduction - O’Conner’s Comparison
Name
R

Advantages
Library support
Visualization

Matlab

Elegant visualization
matrix support
Python

SciPy/
NumPy/
Matplotlib
Excel
SAS

Stata
SPSS

Easy; visual
flexible
Large datasets

Easy statistical
analysis
Like stata but
more expensive and wost
7

7

Disadvantages
Steep
learning
curve
Expensive
Immature

Large
datasets
Expensive
outdated
programming language
Introduction - O’Conner’s Comparison - Continued

Name
R
Matlab
SciPy/
NumPy/
Matplotlib
Excel
SAS
Stata
SPSS

Open Source
Yes
No
Yes

Typical Users
Finance and Statistics
Engineering
Engineering

No
No
No
No

Business
Business;Government
Science
Business; Academics

8

8

Illustration-3: comparison 2
Last but not least......
Ista Zahn 9 says that ....
A
”I am the only person in my department who uses LTEX and
R. Because Sweave simply provides a way to integrate these
two programs, it follows that I am the only Sweave user as
well. Why have I taken the time and eort to learn these
programs instead of following the crowd and sticking with
Word and SPSS? Quite simply, I made the switch because
A
using LTEX and R is actually easier. It took me some time to
become familiar with these programs, but after using them for
a couple of months I am firmly convinced that I am more
productive with these programs than I ever was with Word
and SPSS.

9

Zahn, I. (2008). Learning to Sweave in APA Style. The PracTEX Journal,
No-1
R Help

Manuals
Help Files
Help.Search (?help.search)
The wikis
The mailing lists
Journals
Thanks for Your Patient Listening
Any Questions?

Contenu connexe

Tendances

Introducing The R Software
Introducing The R Software  Introducing The R Software
Introducing The R Software Kamarul Imran
 
2 it unit-1 start learning r
2 it   unit-1 start learning r2 it   unit-1 start learning r
2 it unit-1 start learning rNetaji Gandi
 
R Programming Overview
R Programming Overview R Programming Overview
R Programming Overview dlamb3244
 
Introduction to R
Introduction to RIntroduction to R
Introduction to RAjay Ohri
 
Webinar : Introduction to R Programming and Machine Learning
Webinar : Introduction to R Programming and Machine LearningWebinar : Introduction to R Programming and Machine Learning
Webinar : Introduction to R Programming and Machine LearningEdureka!
 
Resume_10-30-16-general
Resume_10-30-16-generalResume_10-30-16-general
Resume_10-30-16-generalZachary Holmes
 
R Introduction
R IntroductionR Introduction
R Introductionschamber
 
Linked Data Visualization Model - KEG VŠE
Linked Data Visualization Model - KEG VŠELinked Data Visualization Model - KEG VŠE
Linked Data Visualization Model - KEG VŠEJiří Helmich
 
R meet up slides.pptx
R meet up slides.pptxR meet up slides.pptx
R meet up slides.pptxCorey Sparks
 
Compiler_Project_Srikanth_Vanama
Compiler_Project_Srikanth_VanamaCompiler_Project_Srikanth_Vanama
Compiler_Project_Srikanth_VanamaSrikanth Vanama
 

Tendances (17)

Introducing The R Software
Introducing The R Software  Introducing The R Software
Introducing The R Software
 
R programming
R programmingR programming
R programming
 
2 it unit-1 start learning r
2 it   unit-1 start learning r2 it   unit-1 start learning r
2 it unit-1 start learning r
 
R programming
R programmingR programming
R programming
 
Introduction to R
Introduction to RIntroduction to R
Introduction to R
 
R Programming Overview
R Programming Overview R Programming Overview
R Programming Overview
 
LSESU a Taste of R Language Workshop
LSESU a Taste of R Language WorkshopLSESU a Taste of R Language Workshop
LSESU a Taste of R Language Workshop
 
Introduction to R
Introduction to RIntroduction to R
Introduction to R
 
R Course Online
R Course OnlineR Course Online
R Course Online
 
Webinar : Introduction to R Programming and Machine Learning
Webinar : Introduction to R Programming and Machine LearningWebinar : Introduction to R Programming and Machine Learning
Webinar : Introduction to R Programming and Machine Learning
 
Resume_10-30-16-general
Resume_10-30-16-generalResume_10-30-16-general
Resume_10-30-16-general
 
R Introduction
R IntroductionR Introduction
R Introduction
 
Linked Data Visualization Model - KEG VŠE
Linked Data Visualization Model - KEG VŠELinked Data Visualization Model - KEG VŠE
Linked Data Visualization Model - KEG VŠE
 
Resume
ResumeResume
Resume
 
R meet up slides.pptx
R meet up slides.pptxR meet up slides.pptx
R meet up slides.pptx
 
Unit 3
Unit 3Unit 3
Unit 3
 
Compiler_Project_Srikanth_Vanama
Compiler_Project_Srikanth_VanamaCompiler_Project_Srikanth_Vanama
Compiler_Project_Srikanth_Vanama
 

En vedette

Applied spatial data introducing
Applied spatial data introducingApplied spatial data introducing
Applied spatial data introducingHa Hoang
 
Comprehensive R Arichive Network (CRAN)
Comprehensive R Arichive Network (CRAN)Comprehensive R Arichive Network (CRAN)
Comprehensive R Arichive Network (CRAN)Kamakshaiah M
 
Data analysis with R
Data analysis with RData analysis with R
Data analysis with RShareThis
 
R programming Basic & Advanced
R programming Basic & AdvancedR programming Basic & Advanced
R programming Basic & AdvancedSohom Ghosh
 
Introduction to R Programming
Introduction to R ProgrammingIntroduction to R Programming
Introduction to R Programmingizahn
 
R language tutorial
R language tutorialR language tutorial
R language tutorialDavid Chiu
 
10 R Packages to Win Kaggle Competitions
10 R Packages to Win Kaggle Competitions10 R Packages to Win Kaggle Competitions
10 R Packages to Win Kaggle CompetitionsDataRobot
 
An Interactive Introduction To R (Programming Language For Statistics)
An Interactive Introduction To R (Programming Language For Statistics)An Interactive Introduction To R (Programming Language For Statistics)
An Interactive Introduction To R (Programming Language For Statistics)Dataspora
 
Iris data analysis example in R
Iris data analysis example in RIris data analysis example in R
Iris data analysis example in RDuyen Do
 

En vedette (10)

Applied spatial data introducing
Applied spatial data introducingApplied spatial data introducing
Applied spatial data introducing
 
Comprehensive R Arichive Network (CRAN)
Comprehensive R Arichive Network (CRAN)Comprehensive R Arichive Network (CRAN)
Comprehensive R Arichive Network (CRAN)
 
Data analysis with R
Data analysis with RData analysis with R
Data analysis with R
 
R programming Basic & Advanced
R programming Basic & AdvancedR programming Basic & Advanced
R programming Basic & Advanced
 
Introduction to R Programming
Introduction to R ProgrammingIntroduction to R Programming
Introduction to R Programming
 
R language tutorial
R language tutorialR language tutorial
R language tutorial
 
10 R Packages to Win Kaggle Competitions
10 R Packages to Win Kaggle Competitions10 R Packages to Win Kaggle Competitions
10 R Packages to Win Kaggle Competitions
 
An Interactive Introduction To R (Programming Language For Statistics)
An Interactive Introduction To R (Programming Language For Statistics)An Interactive Introduction To R (Programming Language For Statistics)
An Interactive Introduction To R (Programming Language For Statistics)
 
R learning by examples
R learning by examplesR learning by examples
R learning by examples
 
Iris data analysis example in R
Iris data analysis example in RIris data analysis example in R
Iris data analysis example in R
 

Similaire à Introtor

R vs SPSS: Which One is The Best Statistical Language
R vs SPSS: Which One is The Best Statistical LanguageR vs SPSS: Which One is The Best Statistical Language
R vs SPSS: Which One is The Best Statistical LanguageStat Analytica
 
R and Rcmdr Statistical Software
R and Rcmdr Statistical SoftwareR and Rcmdr Statistical Software
R and Rcmdr Statistical Softwarearttan2001
 
R Journal 2009 1 Chambers
R Journal 2009 1  ChambersR Journal 2009 1  Chambers
R Journal 2009 1 ChambersAjay Ohri
 
GNU R in Clinical Research and Evidence-Based Medicine
GNU R in Clinical Research and Evidence-Based MedicineGNU R in Clinical Research and Evidence-Based Medicine
GNU R in Clinical Research and Evidence-Based MedicineAdrian Olszewski
 
R as supporting tool for analytics and simulation
R as supporting tool for analytics and simulationR as supporting tool for analytics and simulation
R as supporting tool for analytics and simulationAlvaro Gil
 
Python vs. r for data science
Python vs. r for data sciencePython vs. r for data science
Python vs. r for data scienceHugo Shi
 
Switching from lispstat to r
Switching from lispstat to rSwitching from lispstat to r
Switching from lispstat to rAjay Ohri
 
Introduction to R Programming
Introduction to R ProgrammingIntroduction to R Programming
Introduction to R Programminghemasri56
 
The Statistical Significance of "R"
The Statistical Significance of "R"The Statistical Significance of "R"
The Statistical Significance of "R"ppvora
 
Introduction to R and R Studio
Introduction to R and R StudioIntroduction to R and R Studio
Introduction to R and R StudioRupak Roy
 
Top 10 statistics tools to get better data insights
Top 10 statistics tools to get better data insightsTop 10 statistics tools to get better data insights
Top 10 statistics tools to get better data insightsStat Analytica
 
R programming for psychometrics
R programming for psychometricsR programming for psychometrics
R programming for psychometricsDiane Talley
 
R vs python. Which one is best for data science
R vs python. Which one is best for data scienceR vs python. Which one is best for data science
R vs python. Which one is best for data scienceStat Analytica
 
An introduction to R is a document useful
An introduction to R is a document usefulAn introduction to R is a document useful
An introduction to R is a document usefulssuser3c3f88
 

Similaire à Introtor (20)

R vs SPSS: Which One is The Best Statistical Language
R vs SPSS: Which One is The Best Statistical LanguageR vs SPSS: Which One is The Best Statistical Language
R vs SPSS: Which One is The Best Statistical Language
 
UNIT-1 Start Learning R.pdf
UNIT-1 Start Learning R.pdfUNIT-1 Start Learning R.pdf
UNIT-1 Start Learning R.pdf
 
R and Rcmdr Statistical Software
R and Rcmdr Statistical SoftwareR and Rcmdr Statistical Software
R and Rcmdr Statistical Software
 
R_L1-Aug-2022.pptx
R_L1-Aug-2022.pptxR_L1-Aug-2022.pptx
R_L1-Aug-2022.pptx
 
R Journal 2009 1 Chambers
R Journal 2009 1  ChambersR Journal 2009 1  Chambers
R Journal 2009 1 Chambers
 
GNU R in Clinical Research and Evidence-Based Medicine
GNU R in Clinical Research and Evidence-Based MedicineGNU R in Clinical Research and Evidence-Based Medicine
GNU R in Clinical Research and Evidence-Based Medicine
 
Reason To learn & use r
Reason To learn & use rReason To learn & use r
Reason To learn & use r
 
R as supporting tool for analytics and simulation
R as supporting tool for analytics and simulationR as supporting tool for analytics and simulation
R as supporting tool for analytics and simulation
 
Python vs. r for data science
Python vs. r for data sciencePython vs. r for data science
Python vs. r for data science
 
Switching from lispstat to r
Switching from lispstat to rSwitching from lispstat to r
Switching from lispstat to r
 
Introduction to R Programming
Introduction to R ProgrammingIntroduction to R Programming
Introduction to R Programming
 
The Statistical Significance of "R"
The Statistical Significance of "R"The Statistical Significance of "R"
The Statistical Significance of "R"
 
R for data analytics
R for data analyticsR for data analytics
R for data analytics
 
R Data Analysis Software
R Data Analysis SoftwareR Data Analysis Software
R Data Analysis Software
 
Introduction to R and R Studio
Introduction to R and R StudioIntroduction to R and R Studio
Introduction to R and R Studio
 
Top 10 statistics tools to get better data insights
Top 10 statistics tools to get better data insightsTop 10 statistics tools to get better data insights
Top 10 statistics tools to get better data insights
 
r merged.pdf
r merged.pdfr merged.pdf
r merged.pdf
 
R programming for psychometrics
R programming for psychometricsR programming for psychometrics
R programming for psychometrics
 
R vs python. Which one is best for data science
R vs python. Which one is best for data scienceR vs python. Which one is best for data science
R vs python. Which one is best for data science
 
An introduction to R is a document useful
An introduction to R is a document usefulAn introduction to R is a document useful
An introduction to R is a document useful
 

Dernier

GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptxLBM Solutions
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Alan Dix
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Paola De la Torre
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...HostedbyConfluent
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
Azure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAzure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAndikSusilo4
 

Dernier (20)

GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptx
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
Azure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAzure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & Application
 

Introtor

  • 1. R Language for Business Analytics Kamakshaiah Musunuru Dhruva College of Management, Hyderabad September 1, 2013
  • 2. Objectives Pre-history About R how to obtain R Introduction Speciality Muenchen’s Survey O’Conner’s Comparison Help
  • 3. Objectives To know about R To asertain Characteristics of R To compare with other proprietary alternatives To evaluate with the help of an example
  • 4. The predecessor for R is S. S was developed by John Chanmbers (earlier versions) along with Rick Becker and Allan Wilks of Bell Laboratories the project was started on May, 1976. in 1979, S was ported to UNIX S-Plus and R happened to be by-products of S. 1 S was available for academic and commercial purposes from ATΓT Laboratories. 1 Ironically, R stood at top 26 best software languages, where as S and S-Plus are observed in 100.
  • 5. R began as a research project by Ross Ihaka and Robert Gentleman at University of Aukland in 1990s. R is programming language, meant for statistical computing. R is open source software, supported by volunteers all around the world. But the central control in the hands of a group called R-core The base system provides: interactive language for numerical computing data management graphics a variety of related calculations
  • 6. Please visit: http://r-project.org R is available for Windows, Linux and MacOS.
  • 7. Some important websites for R: CRAN BioConductor omegahat RForge
  • 8. Introduction to R R is an integrated suite of software facilities for data manipulation, calculation and graphical display. Among other things it has an effective data handling and storage facility, a suite of operators for calculations on arrays, in particular matrices, a large, coherent, integrated collection of intermediate tools for data analysis, graphical facilities for data analysis and display either directly at the computer or on hardcopy, and a well developed, simple and effective programming language (called ’S’) which includes conditionals, loops, user defined recursive functions and input and output facilities. (Indeedmost of the system supplied functions are themselves written in the S language.)
  • 9. Introduction - Continued R is a GNU project which is similar to the S language and environment which was developed at Bell Laboratories (formerly ATT, now Lucent Technologies) by John Chambers and colleagues.(Please visit http://www.r-project.org/) R provides a wide variety of statistical and and graphical techniques, and is highly extensible. some of them are: Linear Modelling Non-Linear Modelling Classical Statistical Tests time-series Analysis Classification, clustering Neural Networks Social Network Analysis Linear Programming, integer-programming and etc and many more.............
  • 10. Introduction - More... R is Ligua Franca of statistical research Over all SAS is 11 years behind R (William Ravelle) Most importantly R is not only free but also open sourcewhich mean much more R is available under GNU Copy-left The recent R version 2.15.3 (Security Blanket) has been released on 2013-03-01
  • 11. Speciality of R By Tal Galili (from http://www.kdnuggets.com/), he asserts that: R has largest number of email discussions The number of R packages published on CRAN continue to grow (than STATA and SAS) R has more blogs (appox. 170) the second to R is SAS (only 31 blogs) Even in terms of job opportunities it might not be worse 41 percent SAS 15 percent SPSS 14 percent R
  • 12. Meunchen Survey - CRAN growth 2 2 Fig. 1: CRAN-growth
  • 13. Introduction - Speciality of R By R A Muenchen (from http://r4stats.com/articles/popularity/), he observes that R counts for more number of downloads (but it might be difficult to count) TIOBE (http://www.tiobe.com, community programming language index) ranked R ranked as 24th best programming language (SPSS was out from the list) Transparent Language Popularity Index (TLPI) ranked R as 12 most wounderful languages on the globe; the SAS as 26th R observed as most wanted on online discussions Mean Mean 1000 Mean Mean monthly email disscussions for R are more than 3000 monthly email disscussions for STATA are more than monthly email disscussions for SAS are less than 1000 monthly email disscussions for SPSS are less than 500 The assumption is being that what you want is that what you talk
  • 14. Introduction - downloads Illustration.1: most important package downloads -Bioconductor
  • 15. Introduction - Downloads Fig.2: Downloads of Bioconductor package
  • 16. Introduction - What Muenchen Said? His book, ”R for SAS and SPSS users” is a great work for miners and analyst. He studied popularity of data analysis software with respect certain factors(https://sites.google.com/site/r4statistics/popularity): sales downloads Language popularity measures Internet discussions Competition Usage Literature books Impact on scholarly activity Website popularity Growth in pupularity IT Research firms Job markets
  • 17. Meunchen Survey - Number of Users 3 3 Fig. 3: Number of Users and Analytics
  • 18. Meunchen Survey - Email Discussions 4 4 Fig. 4: Traffic on Email Discussons
  • 19. Meunchen Survey - Scholers Hits 5 5 Fig. 5: Scholers hits on software
  • 20. Meunchen Survey - Job Market 6 6 Fig. 6: Jobs for analytics software on Indeed.com
  • 21. Introduction - Comparison According to Brendan O’Conner (expert of artificial intelligence and social science researcher): there are two big divisions of solutions; they are: programming oriented solutions like R, Matlab, Python analytic solutions like Excel, Stata, and SPSS Python is “immature” Matlab is certainly “weak”, but might be better for mathematical algorithms SPSS and Stata are equal in capabilities; perhaps Stata might be much cheaper than SPSS These two are for those who crave for easy ways and short-cuts..... SAS is favoured by older crowd.... SAS people complain that that the graphical outputs are poor Matlab visualization too is in little controversy compared to R So, why not we try R!
  • 22. Introduction - O’Conner’s Comparison Name R Advantages Library support Visualization Matlab Elegant visualization matrix support Python SciPy/ NumPy/ Matplotlib Excel SAS Stata SPSS Easy; visual flexible Large datasets Easy statistical analysis Like stata but more expensive and wost 7 7 Disadvantages Steep learning curve Expensive Immature Large datasets Expensive outdated programming language
  • 23. Introduction - O’Conner’s Comparison - Continued Name R Matlab SciPy/ NumPy/ Matplotlib Excel SAS Stata SPSS Open Source Yes No Yes Typical Users Finance and Statistics Engineering Engineering No No No No Business Business;Government Science Business; Academics 8 8 Illustration-3: comparison 2
  • 24. Last but not least...... Ista Zahn 9 says that .... A ”I am the only person in my department who uses LTEX and R. Because Sweave simply provides a way to integrate these two programs, it follows that I am the only Sweave user as well. Why have I taken the time and eort to learn these programs instead of following the crowd and sticking with Word and SPSS? Quite simply, I made the switch because A using LTEX and R is actually easier. It took me some time to become familiar with these programs, but after using them for a couple of months I am firmly convinced that I am more productive with these programs than I ever was with Word and SPSS. 9 Zahn, I. (2008). Learning to Sweave in APA Style. The PracTEX Journal, No-1
  • 25. R Help Manuals Help Files Help.Search (?help.search) The wikis The mailing lists Journals
  • 26. Thanks for Your Patient Listening Any Questions?