SlideShare une entreprise Scribd logo
1  sur  25
made with from innovation lab
AI Hackathon.
Team: Vera Ekimenko
made with from innovation lab
Technology Stack.
Spark for data wrangling
Spark ML Random Forests for the model
HTMLUnit for web scrapping
made with from innovation lab
Approach - Phases
Phases:
1. 0 days before Departure (all given data) <= used for evaluation
2. 2 days before Departure
3. 5 days before Departure
4. 14 days before Departure
5. 100 days before Departure
6. 0 days after Booking
7. 7 days after Booking
Departure
Booking 7 days
Evaluation Date
made with from innovation lab
Approach - Feature Engineering
Booking Departure
First Last
Event types:
- 1. TLT action
- 2. Passenger details
- 3. Payments
- 4. Service requests
- 5. Tickets issue
Dates
• Every
event has
a date
Duration
since/prior
• Every
range has
2 dates
Number of
days
• Every
action has
4 time
features
Counts
• Every
action is
an event
Occurrence
since/prior
• Calculate
how many
times
occurred
Sum of
occurrence
• Every
action has
2 count
features
• The number of days between the booking and the
first/last addition of individual passenger details
• The number of days between the first/last payments and
the departure
• The number of additions of TLT record
made with from innovation lab
Travel types based on segments analysis
MELDXB-DXBMAA-CCUDXB-DXBMEL
SINDXB-DXBATH-ATHDXB-DXBSIN
WAWDXB-DXBIKA-DXBWAW
DACDXB-DXBBAH
55%
37%
4%
5%
Two
destinations
One way
Disperse
One
destination
made with from innovation lab
External data
Passport
Index
TCdata 360Holidays
Airports
USA travel
advisories
Segments
Destination 1 Destination 2Boarding Point
Departure Date
made with from innovation lab
External data – airports
made with from innovation lab
External data – TC data 360
Travel and Tourism Competitiveness Report 2017
made with from innovation lab
External data – Passport Index
Visa requirements by country
made with from innovation lab
External data – Holidays
made with from innovation lab
External data – USA travel advisories
made with from innovation lab
Performance
made with from innovation lab
Accuracy
0%
20%
40%
60%
80%
100%
120%
With external data
Without external data
made with from innovation lab
External data boost
0.00%
1.00%
2.00%
3.00%
4.00%
5.00%
6.00%
7.00%
Accuracy PR AUC ROC AUC
made with from innovation lab
Feature Importance for different phases - 1
made with from innovation lab
Feature Importance for different phases - 2
made with from innovation lab
Scalability
airport
•Country data
•Politics
organizator
•Travel agency
•Regular group
travellers
season
•Christmas
•Events
Technical scalability Features scalability
made with from innovation lab
Self-learning
• Selected
features
• Train data
Initial model
• Monitor model
deterioration
• Re-generate the
adjusted model
Test accuracy
nightly • New features
• New data
Adjustments
made with from innovation lab
Reasons why it’s the best solution
• Native for HELIX -> easy to deploy and maintain
• Low maintenance -> easy to update the model
• Fast and scalable -> evaluate group bookings nightly with no fuzz
• Good foundation for other models -> Recommender for new stations
• Pluggable -> can be used to enrich the existing models
• Transparency -> Easy to communicate non-tech how the model works with
Feature Importance
• Robustness -> the model works even if data quality is not perfect
made with from innovation lab
Annex (1) – Full list of features used in the model
PAX - Total number of passenger in the group
Pcc2City - 1 = PCC equals to City
IsGWS - 1 = GWS_ID is not empty
DepDateDays - the number of days between the booking and the departure
NumberSegments - the number of segments booked
BookingToRequestDays - the number of days between the request and the booking
RequestCreatedPriorDays - the number of days between the request and the departure
IndPaxAdditionFromDays - the number of days between the booking and the first addition of individual passenger
details
IndPaxAdditionFromPriorDays - the number of days between the first addition of individual passenger details and
the departure
IndPaxAdditionToDays - the number of days between the last addition of individual passenger details
IndPaxAdditionToPriorDays - the number of days between the last addition of individual passenger details and the
departure
IndividualPaxAdditionSum - the number of addition of individual passenger details
IndividualPaxRemovalSum - the number of removal of individual passenger details
PaymentsFromDays - the number of days between the booking and the first payment
PaymentsFromPriorDays - the number of days between the first of payment and the departure
PaymentsToDays - the number of days between the last of payment
PaymentsToPriorDays - the number of days between the last of payment and the departure
IncludedCSR - The payment is included in the sales report
made with from innovation lab
Annex (2) – Full list of features used in the model
TLTActionDateAdditionsFromDays - the number of days between the booking and the first addition of TLT record
TLTActionDateAdditionsFromPriorDays - the number of days between the first addition of TLT record and the
departure
TLTActionDateAdditionsToDays - the number of days between the last addition of TLT record
TLTActionDateAdditionsToPriorDays - the number of days between the last addition of TLT record and the
departure
TLTActionDateRemovalFromDays - the number of days between the booking and the first removal of TLT record
TLTActionDateRemovalFromPriorDays - the number of days between the first removal of TLT record and the
departure
TLTActionDateRemovalToDays - the number of days between the last removal of TLT record
TLTActionDateRemovalToPriorDays - the number of days between the last removal of TLT record and the departure
TLTAdditionsSum - the number of addition of TLT record
TLTRemovalSum - the number of removal of TLT record
ServicesCount - The number of the service requests added
isonedest - The journey has one destination
ismultidest - The journey has two destinations
ismultirtn - The journey has multiple returning points
isoneleg - The journey is one way
isgathering - The journey has multiple boarding points
made with from innovation lab
Annex (3) – Full list of features used in the model
ServicesDepartureDateMinDays - the number of days between the booking and the earliest departure date for the added
services
ServicesDepartureDateMinPriorDays - the number of days between the earliest departure date for the added services and the
departure
FirstDepDateMonth - the months of the departure
FirstDepDateDay - the day of the departure
IsIATA - 1=The agent is a member of IATA
ACCEPTED_GROUP_SIZE - The accepted group size
ActuallyGivenAndAccepted - The difference between the accepted group size and total number of passenger
KidPerAdult - The number of infants and children passengers per adult passenger
NotAcceptedAdult - The difference between the requested number of number of adult passengers and accepted number of adult
passengers
NotAcceptedChild - The difference between the requested number of number of child passengers and accepted number of child
passengers
NotAcceptedInfant - The difference between the requested number of number of infant passengers and accepted number of
infant passengers
ACC_ADULT - The accepted number of adult passengers
ACC_CHILD - The accepted number of child passengers
ACC_INF - The accepted number of infant passengers
made with from innovation lab
Annex (4) – Full list of features used in the model
airport_infrastructure - The difference in levels of airport infrastructure in the first destination country
business_environment - The difference in levels of business environment in the first destination country
culresources_bustravel - The difference in levels of cultural resources and business travel in the first destination country
enabling_environment - The difference in levels of enabling environment in the first destination country
environmental_sustainability - The difference in levels of environmental sustainability in the first destination country
tourism_priority - The difference in levels of tourism priority in the first destination country
ground_port_infrastructure - The difference in levels of ground port infrastructure in the first destination country
health_hygiene - The difference in levels of health hygiene in the first destination country
labor_market - The difference in levels of labour market in the first destination country
infrastructure_subindex - The difference in levels of infrastructure sub-index in the first destination country
international_openness - The difference in levels of international openness in the first destination country
natural_cultural_resources - The difference in levels of natural and cultural resources in the first destination country
natural_resources - The difference in levels of natural resources in the first destination country
price_competitiveness - The difference in levels of price competitiveness in the first destination country
safety_security - The difference in levels of safety and security in the first destination country
tourist_infrastructure - The difference in levels of tourist infrastructure in the first destination country
travel_ict_readiness - The difference in levels of travel and tourism ICT readiness in the first destination country
travel_policy - The difference in levels of travel policy in the first destination country
travel_competitiveness - The difference in levels of Travel and Tourism policy and enabling conditions in the first destination
country
made with from innovation lab
Annex (5) – Full list of features used in the model
dest1_passport_requirements - The level of difficulty to get a visa to the first destination country
dest2_passport_requirements - The level of difficulty to get a visa to the second destination country if any
dest1_distance - The geographical distance between the boarding point and the first destination
dest1_timediff - The time lag between the boarding point and the first destination
adv_dest1_levelNN - USA travel advisory level for the first destination country
adv_dest2_levelNN - USA travel advisory level for the second destination country if any
AroundHoliday - The departure date is a public holiday (+/- 3 days) in the original country
AroundWeekend2 - The departure date is a weekend (+ / - 1 day) in the original country
countriesOHE - The original country
made with from innovation lab
Thank You

Contenu connexe

Tendances

Internet Traffic Forecasting using Time Series Methods
Internet Traffic Forecasting using Time Series MethodsInternet Traffic Forecasting using Time Series Methods
Internet Traffic Forecasting using Time Series Methods
Ajay Ohri
 
Hand Written Character Recognition Using Neural Networks
Hand Written Character Recognition Using Neural Networks Hand Written Character Recognition Using Neural Networks
Hand Written Character Recognition Using Neural Networks
Chiranjeevi Adi
 
AI for Everyone: Demystifying Large Language Models (LLMs) Like ChatGPT
AI for Everyone: Demystifying Large Language Models (LLMs) Like ChatGPTAI for Everyone: Demystifying Large Language Models (LLMs) Like ChatGPT
AI for Everyone: Demystifying Large Language Models (LLMs) Like ChatGPT
Cprime
 

Tendances (20)

Introduction to XGBoost
Introduction to XGBoostIntroduction to XGBoost
Introduction to XGBoost
 
INTEGRATE 2022 - Data Mapping in the Microsoft Cloud
INTEGRATE 2022 - Data Mapping in the Microsoft CloudINTEGRATE 2022 - Data Mapping in the Microsoft Cloud
INTEGRATE 2022 - Data Mapping in the Microsoft Cloud
 
GENERATIVE AI, THE FUTURE OF PRODUCTIVITY
GENERATIVE AI, THE FUTURE OF PRODUCTIVITYGENERATIVE AI, THE FUTURE OF PRODUCTIVITY
GENERATIVE AI, THE FUTURE OF PRODUCTIVITY
 
Chatbot ppt
Chatbot pptChatbot ppt
Chatbot ppt
 
Hackathon winning pitch
Hackathon winning pitchHackathon winning pitch
Hackathon winning pitch
 
Knowledge Extraction
Knowledge ExtractionKnowledge Extraction
Knowledge Extraction
 
Introduction to Knowledge Graphs
Introduction to Knowledge GraphsIntroduction to Knowledge Graphs
Introduction to Knowledge Graphs
 
Vector database
Vector databaseVector database
Vector database
 
Machine Learning
Machine LearningMachine Learning
Machine Learning
 
Generative AI and Student Writing.pptx
Generative AI and Student Writing.pptxGenerative AI and Student Writing.pptx
Generative AI and Student Writing.pptx
 
Api presentation
Api presentationApi presentation
Api presentation
 
Internet Traffic Forecasting using Time Series Methods
Internet Traffic Forecasting using Time Series MethodsInternet Traffic Forecasting using Time Series Methods
Internet Traffic Forecasting using Time Series Methods
 
Hand Written Character Recognition Using Neural Networks
Hand Written Character Recognition Using Neural Networks Hand Written Character Recognition Using Neural Networks
Hand Written Character Recognition Using Neural Networks
 
Knowledge Graph Generation from Wikipedia in the Age of ChatGPT: Knowledge ...
Knowledge Graph Generation  from Wikipedia in the Age of ChatGPT:  Knowledge ...Knowledge Graph Generation  from Wikipedia in the Age of ChatGPT:  Knowledge ...
Knowledge Graph Generation from Wikipedia in the Age of ChatGPT: Knowledge ...
 
Machine Learning project presentation
Machine Learning project presentationMachine Learning project presentation
Machine Learning project presentation
 
Semantic AI
Semantic AISemantic AI
Semantic AI
 
AI for Everyone: Demystifying Large Language Models (LLMs) Like ChatGPT
AI for Everyone: Demystifying Large Language Models (LLMs) Like ChatGPTAI for Everyone: Demystifying Large Language Models (LLMs) Like ChatGPT
AI for Everyone: Demystifying Large Language Models (LLMs) Like ChatGPT
 
Publish & Subscribe to events using an Event Aggregator
Publish & Subscribe to events using an Event AggregatorPublish & Subscribe to events using an Event Aggregator
Publish & Subscribe to events using an Event Aggregator
 
Housing price prediction
Housing price predictionHousing price prediction
Housing price prediction
 
Deep Learning Tutorial | Deep Learning Tutorial for Beginners | Neural Networ...
Deep Learning Tutorial | Deep Learning Tutorial for Beginners | Neural Networ...Deep Learning Tutorial | Deep Learning Tutorial for Beginners | Neural Networ...
Deep Learning Tutorial | Deep Learning Tutorial for Beginners | Neural Networ...
 

Similaire à Artificial Intelligence Hackathon

Participatory Project
Participatory ProjectParticipatory Project
Participatory Project
#Xiao Zhe#
 
orashid_2016_New
orashid_2016_Neworashid_2016_New
orashid_2016_New
Omar Rashid
 
orashid_2016_New
orashid_2016_Neworashid_2016_New
orashid_2016_New
Omar Rashid
 

Similaire à Artificial Intelligence Hackathon (20)

Unit 2 hci in software process
Unit 2   hci in software processUnit 2   hci in software process
Unit 2 hci in software process
 
Participatory Project
Participatory ProjectParticipatory Project
Participatory Project
 
Furuyama - analysis of factors that affect productivity
Furuyama - analysis of factors that affect productivityFuruyama - analysis of factors that affect productivity
Furuyama - analysis of factors that affect productivity
 
Project synopsis.
Project synopsis.Project synopsis.
Project synopsis.
 
IRJET- Intelligent Traffic Management System
IRJET- Intelligent Traffic Management SystemIRJET- Intelligent Traffic Management System
IRJET- Intelligent Traffic Management System
 
Harnessing business intelligence and big data. Is collaboration the key to su...
Harnessing business intelligence and big data. Is collaboration the key to su...Harnessing business intelligence and big data. Is collaboration the key to su...
Harnessing business intelligence and big data. Is collaboration the key to su...
 
m-government & android application development
m-government & android application developmentm-government & android application development
m-government & android application development
 
orashid_2016_New
orashid_2016_Neworashid_2016_New
orashid_2016_New
 
orashid_2016_New
orashid_2016_Neworashid_2016_New
orashid_2016_New
 
Airline Flight Schedule Notification Application (AFSNA)
Airline Flight Schedule Notification Application (AFSNA)Airline Flight Schedule Notification Application (AFSNA)
Airline Flight Schedule Notification Application (AFSNA)
 
TRAFFIC FORECAST FOR INTELLECTUAL TRANSPORTATION SYSTEM USING MACHINE LEARNING
TRAFFIC FORECAST FOR INTELLECTUAL TRANSPORTATION SYSTEM USING MACHINE LEARNINGTRAFFIC FORECAST FOR INTELLECTUAL TRANSPORTATION SYSTEM USING MACHINE LEARNING
TRAFFIC FORECAST FOR INTELLECTUAL TRANSPORTATION SYSTEM USING MACHINE LEARNING
 
Commercial Drone Best Practices: How to Incorporate Data and Job Specs
Commercial Drone Best Practices: How to Incorporate Data and Job SpecsCommercial Drone Best Practices: How to Incorporate Data and Job Specs
Commercial Drone Best Practices: How to Incorporate Data and Job Specs
 
Top Ten Skills 2020 (Logistics & Distribution)
Top Ten Skills 2020 (Logistics & Distribution)Top Ten Skills 2020 (Logistics & Distribution)
Top Ten Skills 2020 (Logistics & Distribution)
 
E-TICKETING ON RAILWAY TICKET RESERVATION
E-TICKETING ON RAILWAY TICKET RESERVATIONE-TICKETING ON RAILWAY TICKET RESERVATION
E-TICKETING ON RAILWAY TICKET RESERVATION
 
Big Data Analytics and Artifical Intelligence
Big Data Analytics and Artifical IntelligenceBig Data Analytics and Artifical Intelligence
Big Data Analytics and Artifical Intelligence
 
Determination and visualization of density210409
Determination and visualization of density210409 Determination and visualization of density210409
Determination and visualization of density210409
 
Driving Efficiency with Splunk Cloud at Gatwick Airport
Driving Efficiency with Splunk Cloud at Gatwick AirportDriving Efficiency with Splunk Cloud at Gatwick Airport
Driving Efficiency with Splunk Cloud at Gatwick Airport
 
Strategic Location Analysis for setting up of Manufacturing Facility
Strategic Location Analysis for setting up of Manufacturing FacilityStrategic Location Analysis for setting up of Manufacturing Facility
Strategic Location Analysis for setting up of Manufacturing Facility
 
Strategic Selection for setting up a new manufacturing facility for a global ...
Strategic Selection for setting up a new manufacturing facility for a global ...Strategic Selection for setting up a new manufacturing facility for a global ...
Strategic Selection for setting up a new manufacturing facility for a global ...
 
Strategic Selection of FTZ for setting up a new manufacturing facility for a ...
Strategic Selection of FTZ for setting up a new manufacturing facility for a ...Strategic Selection of FTZ for setting up a new manufacturing facility for a ...
Strategic Selection of FTZ for setting up a new manufacturing facility for a ...
 

Plus de Vera Ekimenko

KeyAchivementsMimecast
KeyAchivementsMimecastKeyAchivementsMimecast
KeyAchivementsMimecast
Vera Ekimenko
 
KeyAchivementsJustisPublishing
KeyAchivementsJustisPublishingKeyAchivementsJustisPublishing
KeyAchivementsJustisPublishing
Vera Ekimenko
 

Plus de Vera Ekimenko (13)

Data Quality with AI
Data Quality with AIData Quality with AI
Data Quality with AI
 
AML Knowledge Graph
AML Knowledge GraphAML Knowledge Graph
AML Knowledge Graph
 
Deep Reinforcement Learning for Portfolio Optimization
Deep Reinforcement Learning for Portfolio OptimizationDeep Reinforcement Learning for Portfolio Optimization
Deep Reinforcement Learning for Portfolio Optimization
 
Artificial Intelligence for Data Quality
Artificial Intelligence for Data QualityArtificial Intelligence for Data Quality
Artificial Intelligence for Data Quality
 
Unsupervised AI for Data Quality
Unsupervised AI for Data QualityUnsupervised AI for Data Quality
Unsupervised AI for Data Quality
 
Deep Learning Hackathon
Deep Learning HackathonDeep Learning Hackathon
Deep Learning Hackathon
 
Cloudera migration oozie_hadoop_ci_cd_pipeline
Cloudera migration oozie_hadoop_ci_cd_pipelineCloudera migration oozie_hadoop_ci_cd_pipeline
Cloudera migration oozie_hadoop_ci_cd_pipeline
 
CSharp
CSharpCSharp
CSharp
 
DWHRestructure
DWHRestructureDWHRestructure
DWHRestructure
 
KeyAchivementsMimecast
KeyAchivementsMimecastKeyAchivementsMimecast
KeyAchivementsMimecast
 
KeyAchivementsJustisPublishing
KeyAchivementsJustisPublishingKeyAchivementsJustisPublishing
KeyAchivementsJustisPublishing
 
buy_in
buy_inbuy_in
buy_in
 
HCM Access Insight Dashboard
HCM Access Insight DashboardHCM Access Insight Dashboard
HCM Access Insight Dashboard
 

Dernier

Abortion pills in Jeddah | +966572737505 | Get Cytotec
Abortion pills in Jeddah | +966572737505 | Get CytotecAbortion pills in Jeddah | +966572737505 | Get Cytotec
Abortion pills in Jeddah | +966572737505 | Get Cytotec
Abortion pills in Riyadh +966572737505 get cytotec
 
➥🔝 7737669865 🔝▻ mahisagar Call-girls in Women Seeking Men 🔝mahisagar🔝 Esc...
➥🔝 7737669865 🔝▻ mahisagar Call-girls in Women Seeking Men  🔝mahisagar🔝   Esc...➥🔝 7737669865 🔝▻ mahisagar Call-girls in Women Seeking Men  🔝mahisagar🔝   Esc...
➥🔝 7737669865 🔝▻ mahisagar Call-girls in Women Seeking Men 🔝mahisagar🔝 Esc...
amitlee9823
 
👉 Amritsar Call Girl 👉📞 6367187148 👉📞 Just📲 Call Ruhi Call Girl Phone No Amri...
👉 Amritsar Call Girl 👉📞 6367187148 👉📞 Just📲 Call Ruhi Call Girl Phone No Amri...👉 Amritsar Call Girl 👉📞 6367187148 👉📞 Just📲 Call Ruhi Call Girl Phone No Amri...
👉 Amritsar Call Girl 👉📞 6367187148 👉📞 Just📲 Call Ruhi Call Girl Phone No Amri...
karishmasinghjnh
 
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get CytotecAbortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Riyadh +966572737505 get cytotec
 
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
amitlee9823
 
➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men 🔝Bangalore🔝 Esc...
➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men  🔝Bangalore🔝   Esc...➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men  🔝Bangalore🔝   Esc...
➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men 🔝Bangalore🔝 Esc...
amitlee9823
 
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
amitlee9823
 
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
amitlee9823
 
CHEAP Call Girls in Rabindra Nagar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Rabindra Nagar  (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Rabindra Nagar  (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Rabindra Nagar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
9953056974 Low Rate Call Girls In Saket, Delhi NCR
 

Dernier (20)

Abortion pills in Jeddah | +966572737505 | Get Cytotec
Abortion pills in Jeddah | +966572737505 | Get CytotecAbortion pills in Jeddah | +966572737505 | Get Cytotec
Abortion pills in Jeddah | +966572737505 | Get Cytotec
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysis
 
➥🔝 7737669865 🔝▻ mahisagar Call-girls in Women Seeking Men 🔝mahisagar🔝 Esc...
➥🔝 7737669865 🔝▻ mahisagar Call-girls in Women Seeking Men  🔝mahisagar🔝   Esc...➥🔝 7737669865 🔝▻ mahisagar Call-girls in Women Seeking Men  🔝mahisagar🔝   Esc...
➥🔝 7737669865 🔝▻ mahisagar Call-girls in Women Seeking Men 🔝mahisagar🔝 Esc...
 
Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -
Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -
Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -
 
👉 Amritsar Call Girl 👉📞 6367187148 👉📞 Just📲 Call Ruhi Call Girl Phone No Amri...
👉 Amritsar Call Girl 👉📞 6367187148 👉📞 Just📲 Call Ruhi Call Girl Phone No Amri...👉 Amritsar Call Girl 👉📞 6367187148 👉📞 Just📲 Call Ruhi Call Girl Phone No Amri...
👉 Amritsar Call Girl 👉📞 6367187148 👉📞 Just📲 Call Ruhi Call Girl Phone No Amri...
 
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
 
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
 
Capstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics ProgramCapstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics Program
 
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get CytotecAbortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
 
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
 
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
 
hybrid Seed Production In Chilli & Capsicum.pptx
hybrid Seed Production In Chilli & Capsicum.pptxhybrid Seed Production In Chilli & Capsicum.pptx
hybrid Seed Production In Chilli & Capsicum.pptx
 
➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men 🔝Bangalore🔝 Esc...
➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men  🔝Bangalore🔝   Esc...➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men  🔝Bangalore🔝   Esc...
➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men 🔝Bangalore🔝 Esc...
 
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
 
DATA SUMMIT 24 Building Real-Time Pipelines With FLaNK
DATA SUMMIT 24  Building Real-Time Pipelines With FLaNKDATA SUMMIT 24  Building Real-Time Pipelines With FLaNK
DATA SUMMIT 24 Building Real-Time Pipelines With FLaNK
 
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightCheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
 
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
 
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
 
CHEAP Call Girls in Rabindra Nagar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Rabindra Nagar  (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Rabindra Nagar  (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Rabindra Nagar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFx
 

Artificial Intelligence Hackathon

  • 1. made with from innovation lab AI Hackathon. Team: Vera Ekimenko
  • 2. made with from innovation lab Technology Stack. Spark for data wrangling Spark ML Random Forests for the model HTMLUnit for web scrapping
  • 3. made with from innovation lab Approach - Phases Phases: 1. 0 days before Departure (all given data) <= used for evaluation 2. 2 days before Departure 3. 5 days before Departure 4. 14 days before Departure 5. 100 days before Departure 6. 0 days after Booking 7. 7 days after Booking Departure Booking 7 days Evaluation Date
  • 4. made with from innovation lab Approach - Feature Engineering Booking Departure First Last Event types: - 1. TLT action - 2. Passenger details - 3. Payments - 4. Service requests - 5. Tickets issue Dates • Every event has a date Duration since/prior • Every range has 2 dates Number of days • Every action has 4 time features Counts • Every action is an event Occurrence since/prior • Calculate how many times occurred Sum of occurrence • Every action has 2 count features • The number of days between the booking and the first/last addition of individual passenger details • The number of days between the first/last payments and the departure • The number of additions of TLT record
  • 5. made with from innovation lab Travel types based on segments analysis MELDXB-DXBMAA-CCUDXB-DXBMEL SINDXB-DXBATH-ATHDXB-DXBSIN WAWDXB-DXBIKA-DXBWAW DACDXB-DXBBAH 55% 37% 4% 5% Two destinations One way Disperse One destination
  • 6. made with from innovation lab External data Passport Index TCdata 360Holidays Airports USA travel advisories Segments Destination 1 Destination 2Boarding Point Departure Date
  • 7. made with from innovation lab External data – airports
  • 8. made with from innovation lab External data – TC data 360 Travel and Tourism Competitiveness Report 2017
  • 9. made with from innovation lab External data – Passport Index Visa requirements by country
  • 10. made with from innovation lab External data – Holidays
  • 11. made with from innovation lab External data – USA travel advisories
  • 12. made with from innovation lab Performance
  • 13. made with from innovation lab Accuracy 0% 20% 40% 60% 80% 100% 120% With external data Without external data
  • 14. made with from innovation lab External data boost 0.00% 1.00% 2.00% 3.00% 4.00% 5.00% 6.00% 7.00% Accuracy PR AUC ROC AUC
  • 15. made with from innovation lab Feature Importance for different phases - 1
  • 16. made with from innovation lab Feature Importance for different phases - 2
  • 17. made with from innovation lab Scalability airport •Country data •Politics organizator •Travel agency •Regular group travellers season •Christmas •Events Technical scalability Features scalability
  • 18. made with from innovation lab Self-learning • Selected features • Train data Initial model • Monitor model deterioration • Re-generate the adjusted model Test accuracy nightly • New features • New data Adjustments
  • 19. made with from innovation lab Reasons why it’s the best solution • Native for HELIX -> easy to deploy and maintain • Low maintenance -> easy to update the model • Fast and scalable -> evaluate group bookings nightly with no fuzz • Good foundation for other models -> Recommender for new stations • Pluggable -> can be used to enrich the existing models • Transparency -> Easy to communicate non-tech how the model works with Feature Importance • Robustness -> the model works even if data quality is not perfect
  • 20. made with from innovation lab Annex (1) – Full list of features used in the model PAX - Total number of passenger in the group Pcc2City - 1 = PCC equals to City IsGWS - 1 = GWS_ID is not empty DepDateDays - the number of days between the booking and the departure NumberSegments - the number of segments booked BookingToRequestDays - the number of days between the request and the booking RequestCreatedPriorDays - the number of days between the request and the departure IndPaxAdditionFromDays - the number of days between the booking and the first addition of individual passenger details IndPaxAdditionFromPriorDays - the number of days between the first addition of individual passenger details and the departure IndPaxAdditionToDays - the number of days between the last addition of individual passenger details IndPaxAdditionToPriorDays - the number of days between the last addition of individual passenger details and the departure IndividualPaxAdditionSum - the number of addition of individual passenger details IndividualPaxRemovalSum - the number of removal of individual passenger details PaymentsFromDays - the number of days between the booking and the first payment PaymentsFromPriorDays - the number of days between the first of payment and the departure PaymentsToDays - the number of days between the last of payment PaymentsToPriorDays - the number of days between the last of payment and the departure IncludedCSR - The payment is included in the sales report
  • 21. made with from innovation lab Annex (2) – Full list of features used in the model TLTActionDateAdditionsFromDays - the number of days between the booking and the first addition of TLT record TLTActionDateAdditionsFromPriorDays - the number of days between the first addition of TLT record and the departure TLTActionDateAdditionsToDays - the number of days between the last addition of TLT record TLTActionDateAdditionsToPriorDays - the number of days between the last addition of TLT record and the departure TLTActionDateRemovalFromDays - the number of days between the booking and the first removal of TLT record TLTActionDateRemovalFromPriorDays - the number of days between the first removal of TLT record and the departure TLTActionDateRemovalToDays - the number of days between the last removal of TLT record TLTActionDateRemovalToPriorDays - the number of days between the last removal of TLT record and the departure TLTAdditionsSum - the number of addition of TLT record TLTRemovalSum - the number of removal of TLT record ServicesCount - The number of the service requests added isonedest - The journey has one destination ismultidest - The journey has two destinations ismultirtn - The journey has multiple returning points isoneleg - The journey is one way isgathering - The journey has multiple boarding points
  • 22. made with from innovation lab Annex (3) – Full list of features used in the model ServicesDepartureDateMinDays - the number of days between the booking and the earliest departure date for the added services ServicesDepartureDateMinPriorDays - the number of days between the earliest departure date for the added services and the departure FirstDepDateMonth - the months of the departure FirstDepDateDay - the day of the departure IsIATA - 1=The agent is a member of IATA ACCEPTED_GROUP_SIZE - The accepted group size ActuallyGivenAndAccepted - The difference between the accepted group size and total number of passenger KidPerAdult - The number of infants and children passengers per adult passenger NotAcceptedAdult - The difference between the requested number of number of adult passengers and accepted number of adult passengers NotAcceptedChild - The difference between the requested number of number of child passengers and accepted number of child passengers NotAcceptedInfant - The difference between the requested number of number of infant passengers and accepted number of infant passengers ACC_ADULT - The accepted number of adult passengers ACC_CHILD - The accepted number of child passengers ACC_INF - The accepted number of infant passengers
  • 23. made with from innovation lab Annex (4) – Full list of features used in the model airport_infrastructure - The difference in levels of airport infrastructure in the first destination country business_environment - The difference in levels of business environment in the first destination country culresources_bustravel - The difference in levels of cultural resources and business travel in the first destination country enabling_environment - The difference in levels of enabling environment in the first destination country environmental_sustainability - The difference in levels of environmental sustainability in the first destination country tourism_priority - The difference in levels of tourism priority in the first destination country ground_port_infrastructure - The difference in levels of ground port infrastructure in the first destination country health_hygiene - The difference in levels of health hygiene in the first destination country labor_market - The difference in levels of labour market in the first destination country infrastructure_subindex - The difference in levels of infrastructure sub-index in the first destination country international_openness - The difference in levels of international openness in the first destination country natural_cultural_resources - The difference in levels of natural and cultural resources in the first destination country natural_resources - The difference in levels of natural resources in the first destination country price_competitiveness - The difference in levels of price competitiveness in the first destination country safety_security - The difference in levels of safety and security in the first destination country tourist_infrastructure - The difference in levels of tourist infrastructure in the first destination country travel_ict_readiness - The difference in levels of travel and tourism ICT readiness in the first destination country travel_policy - The difference in levels of travel policy in the first destination country travel_competitiveness - The difference in levels of Travel and Tourism policy and enabling conditions in the first destination country
  • 24. made with from innovation lab Annex (5) – Full list of features used in the model dest1_passport_requirements - The level of difficulty to get a visa to the first destination country dest2_passport_requirements - The level of difficulty to get a visa to the second destination country if any dest1_distance - The geographical distance between the boarding point and the first destination dest1_timediff - The time lag between the boarding point and the first destination adv_dest1_levelNN - USA travel advisory level for the first destination country adv_dest2_levelNN - USA travel advisory level for the second destination country if any AroundHoliday - The departure date is a public holiday (+/- 3 days) in the original country AroundWeekend2 - The departure date is a weekend (+ / - 1 day) in the original country countriesOHE - The original country
  • 25. made with from innovation lab Thank You