SlideShare une entreprise Scribd logo
1  sur  19
Télécharger pour lire hors ligne
Key Roles In Data-Driven
Organisation
Presented By:
Durgesh Gupta,
Mayura Zadane
Agenda:
● Introduction
● Key Roles in Data-Driven Organisation.
○ Data Analyst
○ Data Engineer
○ Applied ML Engineering
■ Data Scientist
■ Statistician
■ Applied ML Engineer
■ Ethicist
■ Social Scientist
■ Researcher
○ Tech Lead
■ Analytics Manager
■ Decision Maker
Introduction
● Data Science jobs are one of the hottest jobs
of 21st century and its demand is increasing
by the day
● In industry, there are different data science
roles we come across
● It’s tough to get a general understanding of
how they differ in terms of skill sets and what
they work on
● Getting brief insights of key job roles and
responsibilities of each title along with
skills/qualifications can help in understanding
roles in data science field.
Introduction
Key Roles in Data-Driven
Organisation
Data Engineer
● For many organizations, Data Engineers are first hires on a data team.
● Data Engineers develops, constructs, tests and maintains architectures of databases and systems.
● They gather data from other websites through web scraping, API’s or IoT devices and ingests the data
into the data warehouse.
● Data Engineers create ETL (Extract, Transform and Load) processes to make sure that the data gets into
the data warehouse.
● Responsible for building efficient data pipelines.
Skill sets:
● Big data tools: Hadoop, Spark, Kafka, etc.
● SQL and NoSQL databases like PostgreSQL, Cassandra, MongoDB etc.
● R, Python, C/C++ Programming Languages.
● Cloud Services
Data Analyst
● A Data Analyst collects, processes, performs statistical analysis and creates visualizations on data.
● Analysts implement feature engineering, feature selection, clean the data using programming
languages, spreadsheets, and business intelligence tools to describe and categorize the data.
● The master data collected is managed by an analyst including creation, updation, deletion and
processing confidential data.
● Analyst creates report and analysis. Provides expertise on data storage structure, data mining and data
cleaning.
Skills sets:
● Structured Query Language(SQL) or any databases
● Data Mining, cleaning
● Data Analysis, Visualizations
● R or Python Programming Language
● Presentation skills
Applied ML Engineering
Statistician
● Statisticians are professionals who apply statistical methods and models to real-world problems.
● They gather, analyze, and interpret data to aid in many business decision-making processes.
● Statisticians are valuable employees in a range of industries, and often seek roles in areas such as business,
health and medicine, government, physical sciences, and environmental sciences.
● Daily tasks are likely to include:
○ Collecting, analyzing, and interpreting data
○ Identifying trends and relationships in data
○ Designing processes for data collection
○ Communicating findings to stakeholders
○ Advising organizational and business strategy
○ Assisting in decision making
Skill sets:
● Statistical theory and methods. Data Mining & Machine Learning
● Distributed Computing (Hadoop)
● Databases (SQL and NoSQL)
● R, Python, Spark programming Language
Applied ML Engineer
● The work of a Machine Learning Engineer is to bridge the gap between Data Scientist’s work and
production environment.
● Machine Learning Engineer is more concerned with deploying production-ready models.
● Removes errors from data sets and find correct data representation methods.
● Deploys the machine learning model to be integrated into the application/ website.
● Scaling and optimizing the model for production.
● Monitoring and maintenance of deployed models
Skill sets:
● Probability & Statistics
● Data Modeling and Evaluation.
● MLOps.
● Applying Machine Learning algorithms and libraries(Tensorflow, Pytorch)
● Software Engineering and system design(AWS, Azure, GCP)
Data Scientist
● A Data Scientist work based on the visualization provided by the data analytics team to build and
optimize classifiers using machine learning techniques
● Thoroughly clean data to discard irrelevant information and prepare the data for preprocessing and
modeling
● Performs exploratory data analysis (EDA) to determine how to handle missing data.
● Discovers new algorithms to solve problems & build programs to improve current strategies.
● Perform feature engineering, feature selection to implement analytical methods, machine learning and
statistical methods to prepare data for use in predictive and prescriptive modeling
Skill sets:
● Programming: Python, Java
● Applying Machine Learning algorithms and libraries(Scikit Learn, Tensorflow, PyTorch)
● Predictive Modeling
● Maths and Stats
● Effective Communication
Ethicist
● Data ethics is a cross-cutting discipline that assesses the wider societal impact of technology, producing
recommendations for technologists and data professionals. It involves thinking about fairness,
accountability, the law, moral dilemmas, and the risks involved in creating technology and data products
and policies.
● Data Ethicist in teams will enable Data Engineers and Data Scientists to innovate responsibly and respond to
the ongoing demand for implementing data ethics best practice.
● This critical role has been extremely successful in recent years in the private sector, and has been
instrumental in the development of high-risk data and artificial intelligence (AI) products.
● Skill Sets:
○ communication skills (data)
○ applied knowledge of social sciences
○ stakeholder relationship management
○ analysis and synthesis (data ethics)
○ bridging the gap between the technical and non-technical (data ethics)
○ product development (data ethics)
○ empathy and inclusivity
○ ethics and privacy
○ Problem-solving
○ facilitating decisions and risks
Social ScientistA social scientist
● AI has the potential to bring along diverse benefits for our health, safety and general well-being.
● A Social Scientist performs research on link between AI and societal impact of it.
● They can detect potential use of AI by considering societal implications of these technologies.
● Such individuals may be especially equipped to spot the problems in AI that aggravate long-ingrained
prejudices.
● They have proper domain knowledge on problem statement for which AI is used.
Social Scientist
Researcher
● AI researchers conceptualize and explore new ways of leveraging data by developing new AI algorithms,
i.e., they create and ask new questions that can be answered using AI.
● AI researchers focus on finding ways to analyze data in innovative ways for automated decision-making
and action.
● AI researchers, research novel forms of AI technology to create new applications that use data to drive
independent actions.
● Skill Set:
○ AI programming skills: This one goes without saying, but coding skills is a given for any professional in
the AI and data science domain. The best programming languages for AI development currently are
Python, Lisp, Prolog, R, C/C++ and Java. Out of these languages, Python is most preferred by both tech
companies and AI researchers themselves, possibly because of its ease of use.
○ Analytical thinking: Since artificial intelligence is closely intertwined with data analysis, analytical skills
are necessary for potential AI researchers. Having good analytical skills translates into the ability to
■ make sense of data
■ verify the validity of the data gathered
■ identify connections between different variables, and
■ form logical conclusions based on the available data.
Tech Lead Roles
Analytics Manager
● The complete cycle revolves around the enterprise goal.
● Identify the key business variables that the analysis needs to predict.
● Define the project goals by asking and refining "sharp" questions that are relevant, specific, and
unambiguous.
● Find the relevant data that helps you answer the questions that define the objectives of the
project.
● An Analytics Manager manages a team of analysts and data scientists
Skills sets:
● R, Python , SQL, SAS, Java Programming
● Leadership & project management
● Data Mining & Predictive modeling
● Interpersonal Communication
Decision Maker
● Real-world data sets are often noisy, are missing values, or have a host of other discrepancies.
● Aim is to produce a clean, high-quality data set whose relationship to the target variables is
understood.
● Develop a solution architecture of the data pipeline that refreshes and scores the data regularly
Key Roles In Data-Driven Organisation
Key Roles In Data-Driven Organisation

Contenu connexe

Similaire à Key Roles In Data-Driven Organisation

How can a data scientist expert solve real world problems?
How can a data scientist expert solve real world problems? How can a data scientist expert solve real world problems?
How can a data scientist expert solve real world problems? priyanka rajput
 
Colloquium(7)_DataScience:ShivShaktiGhosh&MohitGarg
Colloquium(7)_DataScience:ShivShaktiGhosh&MohitGargColloquium(7)_DataScience:ShivShaktiGhosh&MohitGarg
Colloquium(7)_DataScience:ShivShaktiGhosh&MohitGargShiv Shakti Ghosh
 
How to become a data scientist
How to become a data scientist How to become a data scientist
How to become a data scientist Manjunath Sindagi
 
Data Engineer vs Data Scientist vs Data Analyst.pptx
Data Engineer vs Data Scientist vs Data Analyst.pptxData Engineer vs Data Scientist vs Data Analyst.pptx
Data Engineer vs Data Scientist vs Data Analyst.pptxCarolineRebeccaD
 
Data Analytics Course In Bangalore-November
Data Analytics Course In Bangalore-NovemberData Analytics Course In Bangalore-November
Data Analytics Course In Bangalore-NovemberDataMites
 
Data Analytics Course In Bangalore
Data Analytics Course In BangaloreData Analytics Course In Bangalore
Data Analytics Course In BangaloreDataMites
 
Data Analytics Course In Pune-October
Data Analytics Course In Pune-OctoberData Analytics Course In Pune-October
Data Analytics Course In Pune-OctoberDataMites
 
Data Analytics Course In Pune
Data Analytics Course In PuneData Analytics Course In Pune
Data Analytics Course In PuneDataMites
 
Data Analytics Course In Chennai-August
Data Analytics Course In Chennai-AugustData Analytics Course In Chennai-August
Data Analytics Course In Chennai-AugustDataMites
 
Data Analytics Course In Delhi-November
Data Analytics Course In Delhi-NovemberData Analytics Course In Delhi-November
Data Analytics Course In Delhi-NovemberDataMites
 
Data Analytics Course In Chennai
Data Analytics Course In ChennaiData Analytics Course In Chennai
Data Analytics Course In ChennaiDataMites
 
Data Scientist Salary, Skills, Jobs And Resume | Data Scientist Career | Data...
Data Scientist Salary, Skills, Jobs And Resume | Data Scientist Career | Data...Data Scientist Salary, Skills, Jobs And Resume | Data Scientist Career | Data...
Data Scientist Salary, Skills, Jobs And Resume | Data Scientist Career | Data...Simplilearn
 
Data Analytics Course In Mumbai
Data Analytics Course In MumbaiData Analytics Course In Mumbai
Data Analytics Course In MumbaiDataMites
 
Career in Python and data science
Career in Python and data science Career in Python and data science
Career in Python and data science Sagar Hedau
 
Data Analytics Course In Mumbai-November
Data Analytics Course In Mumbai-NovemberData Analytics Course In Mumbai-November
Data Analytics Course In Mumbai-NovemberDataMites
 
Data Analytics Course In Chennai-November
Data Analytics Course In Chennai-NovemberData Analytics Course In Chennai-November
Data Analytics Course In Chennai-NovemberDataMites
 
Data Analytics Course In Bangalore-August
Data Analytics Course In Bangalore-AugustData Analytics Course In Bangalore-August
Data Analytics Course In Bangalore-AugustDataMites
 
Introduction to Data Science.pdf
Introduction to Data Science.pdfIntroduction to Data Science.pdf
Introduction to Data Science.pdfUniversity of Sindh
 

Similaire à Key Roles In Data-Driven Organisation (20)

How can a data scientist expert solve real world problems?
How can a data scientist expert solve real world problems? How can a data scientist expert solve real world problems?
How can a data scientist expert solve real world problems?
 
Colloquium(7)_DataScience:ShivShaktiGhosh&MohitGarg
Colloquium(7)_DataScience:ShivShaktiGhosh&MohitGargColloquium(7)_DataScience:ShivShaktiGhosh&MohitGarg
Colloquium(7)_DataScience:ShivShaktiGhosh&MohitGarg
 
How to become a data scientist
How to become a data scientist How to become a data scientist
How to become a data scientist
 
Data Engineer vs Data Scientist vs Data Analyst.pptx
Data Engineer vs Data Scientist vs Data Analyst.pptxData Engineer vs Data Scientist vs Data Analyst.pptx
Data Engineer vs Data Scientist vs Data Analyst.pptx
 
Data Analytics Course In Bangalore-November
Data Analytics Course In Bangalore-NovemberData Analytics Course In Bangalore-November
Data Analytics Course In Bangalore-November
 
Data Analytics Course In Bangalore
Data Analytics Course In BangaloreData Analytics Course In Bangalore
Data Analytics Course In Bangalore
 
Data Analytics Course In Pune-October
Data Analytics Course In Pune-OctoberData Analytics Course In Pune-October
Data Analytics Course In Pune-October
 
Data Analytics Course In Pune
Data Analytics Course In PuneData Analytics Course In Pune
Data Analytics Course In Pune
 
Data Analytics Course In Chennai-August
Data Analytics Course In Chennai-AugustData Analytics Course In Chennai-August
Data Analytics Course In Chennai-August
 
Data Analytics Course In Delhi-November
Data Analytics Course In Delhi-NovemberData Analytics Course In Delhi-November
Data Analytics Course In Delhi-November
 
Data Analytics Course in Noida. pptx
Data Analytics  Course in Noida.     pptxData Analytics  Course in Noida.     pptx
Data Analytics Course in Noida. pptx
 
Data Analytics Course In Chennai
Data Analytics Course In ChennaiData Analytics Course In Chennai
Data Analytics Course In Chennai
 
Data Scientist Salary, Skills, Jobs And Resume | Data Scientist Career | Data...
Data Scientist Salary, Skills, Jobs And Resume | Data Scientist Career | Data...Data Scientist Salary, Skills, Jobs And Resume | Data Scientist Career | Data...
Data Scientist Salary, Skills, Jobs And Resume | Data Scientist Career | Data...
 
Data Analytics Course In Mumbai
Data Analytics Course In MumbaiData Analytics Course In Mumbai
Data Analytics Course In Mumbai
 
Career in Python and data science
Career in Python and data science Career in Python and data science
Career in Python and data science
 
Data Analytics Course In Mumbai-November
Data Analytics Course In Mumbai-NovemberData Analytics Course In Mumbai-November
Data Analytics Course In Mumbai-November
 
Data Analytics Course In Chennai-November
Data Analytics Course In Chennai-NovemberData Analytics Course In Chennai-November
Data Analytics Course In Chennai-November
 
semana1.pptx
semana1.pptxsemana1.pptx
semana1.pptx
 
Data Analytics Course In Bangalore-August
Data Analytics Course In Bangalore-AugustData Analytics Course In Bangalore-August
Data Analytics Course In Bangalore-August
 
Introduction to Data Science.pdf
Introduction to Data Science.pdfIntroduction to Data Science.pdf
Introduction to Data Science.pdf
 

Plus de Knoldus Inc.

Robusta -Tool Presentation (DevOps).pptx
Robusta -Tool Presentation (DevOps).pptxRobusta -Tool Presentation (DevOps).pptx
Robusta -Tool Presentation (DevOps).pptxKnoldus Inc.
 
Optimizing Kubernetes using GOLDILOCKS.pptx
Optimizing Kubernetes using GOLDILOCKS.pptxOptimizing Kubernetes using GOLDILOCKS.pptx
Optimizing Kubernetes using GOLDILOCKS.pptxKnoldus Inc.
 
Azure Function App Exception Handling.pptx
Azure Function App Exception Handling.pptxAzure Function App Exception Handling.pptx
Azure Function App Exception Handling.pptxKnoldus Inc.
 
CQRS Design Pattern Presentation (Java).pptx
CQRS Design Pattern Presentation (Java).pptxCQRS Design Pattern Presentation (Java).pptx
CQRS Design Pattern Presentation (Java).pptxKnoldus Inc.
 
ETL Observability: Azure to Snowflake Presentation
ETL Observability: Azure to Snowflake PresentationETL Observability: Azure to Snowflake Presentation
ETL Observability: Azure to Snowflake PresentationKnoldus Inc.
 
Scripting with K6 - Beyond the Basics Presentation
Scripting with K6 - Beyond the Basics PresentationScripting with K6 - Beyond the Basics Presentation
Scripting with K6 - Beyond the Basics PresentationKnoldus Inc.
 
Getting started with dotnet core Web APIs
Getting started with dotnet core Web APIsGetting started with dotnet core Web APIs
Getting started with dotnet core Web APIsKnoldus Inc.
 
Introduction To Rust part II Presentation
Introduction To Rust part II PresentationIntroduction To Rust part II Presentation
Introduction To Rust part II PresentationKnoldus Inc.
 
Data governance with Unity Catalog Presentation
Data governance with Unity Catalog PresentationData governance with Unity Catalog Presentation
Data governance with Unity Catalog PresentationKnoldus Inc.
 
Configuring Workflows & Validators in JIRA
Configuring Workflows & Validators in JIRAConfiguring Workflows & Validators in JIRA
Configuring Workflows & Validators in JIRAKnoldus Inc.
 
Advanced Python (with dependency injection and hydra configuration packages)
Advanced Python (with dependency injection and hydra configuration packages)Advanced Python (with dependency injection and hydra configuration packages)
Advanced Python (with dependency injection and hydra configuration packages)Knoldus Inc.
 
Azure Databricks (For Data Analytics).pptx
Azure Databricks (For Data Analytics).pptxAzure Databricks (For Data Analytics).pptx
Azure Databricks (For Data Analytics).pptxKnoldus Inc.
 
The Power of Dependency Injection with Dagger 2 and Kotlin
The Power of Dependency Injection with Dagger 2 and KotlinThe Power of Dependency Injection with Dagger 2 and Kotlin
The Power of Dependency Injection with Dagger 2 and KotlinKnoldus Inc.
 
Data Engineering with Databricks Presentation
Data Engineering with Databricks PresentationData Engineering with Databricks Presentation
Data Engineering with Databricks PresentationKnoldus Inc.
 
Databricks for MLOps Presentation (AI/ML)
Databricks for MLOps Presentation (AI/ML)Databricks for MLOps Presentation (AI/ML)
Databricks for MLOps Presentation (AI/ML)Knoldus Inc.
 
NoOps - (Automate Ops) Presentation.pptx
NoOps - (Automate Ops) Presentation.pptxNoOps - (Automate Ops) Presentation.pptx
NoOps - (Automate Ops) Presentation.pptxKnoldus Inc.
 
Mastering Distributed Performance Testing
Mastering Distributed Performance TestingMastering Distributed Performance Testing
Mastering Distributed Performance TestingKnoldus Inc.
 
MLops on Vertex AI Presentation (AI/ML).pptx
MLops on Vertex AI Presentation (AI/ML).pptxMLops on Vertex AI Presentation (AI/ML).pptx
MLops on Vertex AI Presentation (AI/ML).pptxKnoldus Inc.
 
Introduction to Ansible Tower Presentation
Introduction to Ansible Tower PresentationIntroduction to Ansible Tower Presentation
Introduction to Ansible Tower PresentationKnoldus Inc.
 
CQRS with dot net services presentation.
CQRS with dot net services presentation.CQRS with dot net services presentation.
CQRS with dot net services presentation.Knoldus Inc.
 

Plus de Knoldus Inc. (20)

Robusta -Tool Presentation (DevOps).pptx
Robusta -Tool Presentation (DevOps).pptxRobusta -Tool Presentation (DevOps).pptx
Robusta -Tool Presentation (DevOps).pptx
 
Optimizing Kubernetes using GOLDILOCKS.pptx
Optimizing Kubernetes using GOLDILOCKS.pptxOptimizing Kubernetes using GOLDILOCKS.pptx
Optimizing Kubernetes using GOLDILOCKS.pptx
 
Azure Function App Exception Handling.pptx
Azure Function App Exception Handling.pptxAzure Function App Exception Handling.pptx
Azure Function App Exception Handling.pptx
 
CQRS Design Pattern Presentation (Java).pptx
CQRS Design Pattern Presentation (Java).pptxCQRS Design Pattern Presentation (Java).pptx
CQRS Design Pattern Presentation (Java).pptx
 
ETL Observability: Azure to Snowflake Presentation
ETL Observability: Azure to Snowflake PresentationETL Observability: Azure to Snowflake Presentation
ETL Observability: Azure to Snowflake Presentation
 
Scripting with K6 - Beyond the Basics Presentation
Scripting with K6 - Beyond the Basics PresentationScripting with K6 - Beyond the Basics Presentation
Scripting with K6 - Beyond the Basics Presentation
 
Getting started with dotnet core Web APIs
Getting started with dotnet core Web APIsGetting started with dotnet core Web APIs
Getting started with dotnet core Web APIs
 
Introduction To Rust part II Presentation
Introduction To Rust part II PresentationIntroduction To Rust part II Presentation
Introduction To Rust part II Presentation
 
Data governance with Unity Catalog Presentation
Data governance with Unity Catalog PresentationData governance with Unity Catalog Presentation
Data governance with Unity Catalog Presentation
 
Configuring Workflows & Validators in JIRA
Configuring Workflows & Validators in JIRAConfiguring Workflows & Validators in JIRA
Configuring Workflows & Validators in JIRA
 
Advanced Python (with dependency injection and hydra configuration packages)
Advanced Python (with dependency injection and hydra configuration packages)Advanced Python (with dependency injection and hydra configuration packages)
Advanced Python (with dependency injection and hydra configuration packages)
 
Azure Databricks (For Data Analytics).pptx
Azure Databricks (For Data Analytics).pptxAzure Databricks (For Data Analytics).pptx
Azure Databricks (For Data Analytics).pptx
 
The Power of Dependency Injection with Dagger 2 and Kotlin
The Power of Dependency Injection with Dagger 2 and KotlinThe Power of Dependency Injection with Dagger 2 and Kotlin
The Power of Dependency Injection with Dagger 2 and Kotlin
 
Data Engineering with Databricks Presentation
Data Engineering with Databricks PresentationData Engineering with Databricks Presentation
Data Engineering with Databricks Presentation
 
Databricks for MLOps Presentation (AI/ML)
Databricks for MLOps Presentation (AI/ML)Databricks for MLOps Presentation (AI/ML)
Databricks for MLOps Presentation (AI/ML)
 
NoOps - (Automate Ops) Presentation.pptx
NoOps - (Automate Ops) Presentation.pptxNoOps - (Automate Ops) Presentation.pptx
NoOps - (Automate Ops) Presentation.pptx
 
Mastering Distributed Performance Testing
Mastering Distributed Performance TestingMastering Distributed Performance Testing
Mastering Distributed Performance Testing
 
MLops on Vertex AI Presentation (AI/ML).pptx
MLops on Vertex AI Presentation (AI/ML).pptxMLops on Vertex AI Presentation (AI/ML).pptx
MLops on Vertex AI Presentation (AI/ML).pptx
 
Introduction to Ansible Tower Presentation
Introduction to Ansible Tower PresentationIntroduction to Ansible Tower Presentation
Introduction to Ansible Tower Presentation
 
CQRS with dot net services presentation.
CQRS with dot net services presentation.CQRS with dot net services presentation.
CQRS with dot net services presentation.
 

Dernier

CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brandgvaughan
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteDianaGray10
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfPrecisely
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piececharlottematthew16
 
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo DayH2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo DaySri Ambati
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionDilum Bandara
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clashcharlottematthew16
 

Dernier (20)

CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test Suite
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piece
 
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo DayH2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An Introduction
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clash
 

Key Roles In Data-Driven Organisation

  • 1. Key Roles In Data-Driven Organisation Presented By: Durgesh Gupta, Mayura Zadane
  • 2. Agenda: ● Introduction ● Key Roles in Data-Driven Organisation. ○ Data Analyst ○ Data Engineer ○ Applied ML Engineering ■ Data Scientist ■ Statistician ■ Applied ML Engineer ■ Ethicist ■ Social Scientist ■ Researcher ○ Tech Lead ■ Analytics Manager ■ Decision Maker
  • 3. Introduction ● Data Science jobs are one of the hottest jobs of 21st century and its demand is increasing by the day ● In industry, there are different data science roles we come across ● It’s tough to get a general understanding of how they differ in terms of skill sets and what they work on ● Getting brief insights of key job roles and responsibilities of each title along with skills/qualifications can help in understanding roles in data science field.
  • 5. Key Roles in Data-Driven Organisation
  • 6. Data Engineer ● For many organizations, Data Engineers are first hires on a data team. ● Data Engineers develops, constructs, tests and maintains architectures of databases and systems. ● They gather data from other websites through web scraping, API’s or IoT devices and ingests the data into the data warehouse. ● Data Engineers create ETL (Extract, Transform and Load) processes to make sure that the data gets into the data warehouse. ● Responsible for building efficient data pipelines. Skill sets: ● Big data tools: Hadoop, Spark, Kafka, etc. ● SQL and NoSQL databases like PostgreSQL, Cassandra, MongoDB etc. ● R, Python, C/C++ Programming Languages. ● Cloud Services
  • 7. Data Analyst ● A Data Analyst collects, processes, performs statistical analysis and creates visualizations on data. ● Analysts implement feature engineering, feature selection, clean the data using programming languages, spreadsheets, and business intelligence tools to describe and categorize the data. ● The master data collected is managed by an analyst including creation, updation, deletion and processing confidential data. ● Analyst creates report and analysis. Provides expertise on data storage structure, data mining and data cleaning. Skills sets: ● Structured Query Language(SQL) or any databases ● Data Mining, cleaning ● Data Analysis, Visualizations ● R or Python Programming Language ● Presentation skills
  • 9. Statistician ● Statisticians are professionals who apply statistical methods and models to real-world problems. ● They gather, analyze, and interpret data to aid in many business decision-making processes. ● Statisticians are valuable employees in a range of industries, and often seek roles in areas such as business, health and medicine, government, physical sciences, and environmental sciences. ● Daily tasks are likely to include: ○ Collecting, analyzing, and interpreting data ○ Identifying trends and relationships in data ○ Designing processes for data collection ○ Communicating findings to stakeholders ○ Advising organizational and business strategy ○ Assisting in decision making Skill sets: ● Statistical theory and methods. Data Mining & Machine Learning ● Distributed Computing (Hadoop) ● Databases (SQL and NoSQL) ● R, Python, Spark programming Language
  • 10. Applied ML Engineer ● The work of a Machine Learning Engineer is to bridge the gap between Data Scientist’s work and production environment. ● Machine Learning Engineer is more concerned with deploying production-ready models. ● Removes errors from data sets and find correct data representation methods. ● Deploys the machine learning model to be integrated into the application/ website. ● Scaling and optimizing the model for production. ● Monitoring and maintenance of deployed models Skill sets: ● Probability & Statistics ● Data Modeling and Evaluation. ● MLOps. ● Applying Machine Learning algorithms and libraries(Tensorflow, Pytorch) ● Software Engineering and system design(AWS, Azure, GCP)
  • 11. Data Scientist ● A Data Scientist work based on the visualization provided by the data analytics team to build and optimize classifiers using machine learning techniques ● Thoroughly clean data to discard irrelevant information and prepare the data for preprocessing and modeling ● Performs exploratory data analysis (EDA) to determine how to handle missing data. ● Discovers new algorithms to solve problems & build programs to improve current strategies. ● Perform feature engineering, feature selection to implement analytical methods, machine learning and statistical methods to prepare data for use in predictive and prescriptive modeling Skill sets: ● Programming: Python, Java ● Applying Machine Learning algorithms and libraries(Scikit Learn, Tensorflow, PyTorch) ● Predictive Modeling ● Maths and Stats ● Effective Communication
  • 12. Ethicist ● Data ethics is a cross-cutting discipline that assesses the wider societal impact of technology, producing recommendations for technologists and data professionals. It involves thinking about fairness, accountability, the law, moral dilemmas, and the risks involved in creating technology and data products and policies. ● Data Ethicist in teams will enable Data Engineers and Data Scientists to innovate responsibly and respond to the ongoing demand for implementing data ethics best practice. ● This critical role has been extremely successful in recent years in the private sector, and has been instrumental in the development of high-risk data and artificial intelligence (AI) products. ● Skill Sets: ○ communication skills (data) ○ applied knowledge of social sciences ○ stakeholder relationship management ○ analysis and synthesis (data ethics) ○ bridging the gap between the technical and non-technical (data ethics) ○ product development (data ethics) ○ empathy and inclusivity ○ ethics and privacy ○ Problem-solving ○ facilitating decisions and risks
  • 13. Social ScientistA social scientist ● AI has the potential to bring along diverse benefits for our health, safety and general well-being. ● A Social Scientist performs research on link between AI and societal impact of it. ● They can detect potential use of AI by considering societal implications of these technologies. ● Such individuals may be especially equipped to spot the problems in AI that aggravate long-ingrained prejudices. ● They have proper domain knowledge on problem statement for which AI is used. Social Scientist
  • 14. Researcher ● AI researchers conceptualize and explore new ways of leveraging data by developing new AI algorithms, i.e., they create and ask new questions that can be answered using AI. ● AI researchers focus on finding ways to analyze data in innovative ways for automated decision-making and action. ● AI researchers, research novel forms of AI technology to create new applications that use data to drive independent actions. ● Skill Set: ○ AI programming skills: This one goes without saying, but coding skills is a given for any professional in the AI and data science domain. The best programming languages for AI development currently are Python, Lisp, Prolog, R, C/C++ and Java. Out of these languages, Python is most preferred by both tech companies and AI researchers themselves, possibly because of its ease of use. ○ Analytical thinking: Since artificial intelligence is closely intertwined with data analysis, analytical skills are necessary for potential AI researchers. Having good analytical skills translates into the ability to ■ make sense of data ■ verify the validity of the data gathered ■ identify connections between different variables, and ■ form logical conclusions based on the available data.
  • 16. Analytics Manager ● The complete cycle revolves around the enterprise goal. ● Identify the key business variables that the analysis needs to predict. ● Define the project goals by asking and refining "sharp" questions that are relevant, specific, and unambiguous. ● Find the relevant data that helps you answer the questions that define the objectives of the project. ● An Analytics Manager manages a team of analysts and data scientists Skills sets: ● R, Python , SQL, SAS, Java Programming ● Leadership & project management ● Data Mining & Predictive modeling ● Interpersonal Communication
  • 17. Decision Maker ● Real-world data sets are often noisy, are missing values, or have a host of other discrepancies. ● Aim is to produce a clean, high-quality data set whose relationship to the target variables is understood. ● Develop a solution architecture of the data pipeline that refreshes and scores the data regularly