SlideShare une entreprise Scribd logo
1  sur  15
Introduction to Lantea
.NET Open Source Big Data Solution
What is Lantea
• Open source big data platform
• Rich ETL (Extract-Transform-Load) features
• A platform that can help Data Scientist to collect and deal with data easily
• Import data from different source is extremely easy
Highlighted features of Lantea
• A lot of different data sources on different media
• Query aggregation data via SQL
• Very easy to collect data from websites, local file systems, emails and
databases
• Export data via a lot of formats and APIs
Target User of Lantea
• Data Scientists
• Marketing Analyzer
• Managers who needs BI
• Researchers
• Big data/BI Developers
• Deep Machine Learning Developers
Non-
Commercial
Commercial
Researchers
Data
Scientists
Big data/BI
Developers
Marketing
Analyzer
Open source
developers
Managers
who needs BI
Essential Elements of Big Data Platform
• Data/File Extraction
• Data Cleaning and Filtering
• Different ways of Analyzing data
• Real-time Processing
• Data Collection from Different Source
• Connect to Different Database Types
• Analysis Result Rendering
• Advanced Parameter Adjustment
Big Data
Extraction
Cleaning
Analysis
Data
Processing
Data
Collection
Parameter
Adjustment
Introduction to Lantea
Architecture Design and Use Case
Third-party Projects Included
• Toxy – Data Extraction framework
• Spidey – Web Spider framework
• EQueue – Queue Implementation
• CacheAdapter – Cache Provider
• Irony – Compiler Implementation
• ServiceStack.Redis– Redis Client
• ScrapySharp – Html Parser and Selector
• Autofac – IOC Container
• Log4net – Configurable Logging System
• Datatables.js – Web Spreadsheet
• Thinkecture Identity Server
- Social account integration
• Nepy
– Parsers for Natural Language Processing
License Candidate
• LGPL
• Apache 2.0
• MIT
• Custom Open Source license
Architecture Design v1
Key Features
• Web Crawling Service
• Data Extraction Service
• Queue Service
• CQLR
(Common Query Language Runtime)
• Rich Formats Outputs and APIs
• Restful and ODATA support
Schedule for Lantea
Use Case 1 – Regional Manager Report Collection
Use Case 1 – Lantea Solution
Use Case 2 – Data Aggregation from Websites
Use Case 2 – Lantea Solution
– the Studio behind Lantea
Our Mission
• Re-create .NET Ecosystem
• Provide .NET-based solutions for clients
• Create something non-exist for .NET Community
• Contribute to Global Open Source Community
• Change the way human lives

Contenu connexe

Tendances

KD-2013-Optimizing-Document-Search-using-Lucene
KD-2013-Optimizing-Document-Search-using-LuceneKD-2013-Optimizing-Document-Search-using-Lucene
KD-2013-Optimizing-Document-Search-using-LuceneHarshakumar Ummerpillai
 
Informatica Online Training
Informatica Online TrainingInformatica Online Training
Informatica Online TrainingRao Rao
 
Ciel, mes données ne sont plus relationnelles
Ciel, mes données ne sont plus relationnellesCiel, mes données ne sont plus relationnelles
Ciel, mes données ne sont plus relationnellesXavier Gorse
 
Introduction to LoCloud Collections, Marcin Werla, Poznan Supercomputing Centre
Introduction to LoCloud Collections, Marcin Werla, Poznan Supercomputing CentreIntroduction to LoCloud Collections, Marcin Werla, Poznan Supercomputing Centre
Introduction to LoCloud Collections, Marcin Werla, Poznan Supercomputing Centrelocloud
 
The LoCloud MORE aggregator, Gavrilis Dimitris Afiontzi Eleni, Makri Dimit...
The LoCloud MORE aggregator, Gavrilis Dimitris   Afiontzi Eleni,  Makri Dimit...The LoCloud MORE aggregator, Gavrilis Dimitris   Afiontzi Eleni,  Makri Dimit...
The LoCloud MORE aggregator, Gavrilis Dimitris Afiontzi Eleni, Makri Dimit...locloud
 
Beyond sparql linked data, software, services and applications. Keynote at D...
Beyond sparql  linked data, software, services and applications. Keynote at D...Beyond sparql  linked data, software, services and applications. Keynote at D...
Beyond sparql linked data, software, services and applications. Keynote at D...John Domingue
 
Introduction to Total Library Solution- TLS
Introduction to Total Library Solution- TLSIntroduction to Total Library Solution- TLS
Introduction to Total Library Solution- TLSAta Rehman
 
Continuous Optimization for Distributed BigData Analysis
Continuous Optimization for Distributed BigData AnalysisContinuous Optimization for Distributed BigData Analysis
Continuous Optimization for Distributed BigData AnalysisKai Sasaki
 
the tooling of a modern and agile oracle dba
the tooling of a modern and agile oracle dbathe tooling of a modern and agile oracle dba
the tooling of a modern and agile oracle dbaBertrandDrouvot
 
A Walkthrough of InfluxCloud 2.0 by Tim Hall
A Walkthrough of InfluxCloud 2.0 by Tim HallA Walkthrough of InfluxCloud 2.0 by Tim Hall
A Walkthrough of InfluxCloud 2.0 by Tim HallInfluxData
 
(ATS6-APP06) Accelrys LIMS and Accelrys ELN integration
(ATS6-APP06) Accelrys LIMS and Accelrys ELN integration    (ATS6-APP06) Accelrys LIMS and Accelrys ELN integration
(ATS6-APP06) Accelrys LIMS and Accelrys ELN integration BIOVIA
 
IRUS R5: open and flexible access to standardised repository usage data
IRUS R5: open and flexible access to standardised repository usage dataIRUS R5: open and flexible access to standardised repository usage data
IRUS R5: open and flexible access to standardised repository usage dataJisc
 
MarcEdit for Everyone with Katie Dunn
MarcEdit for Everyone with Katie DunnMarcEdit for Everyone with Katie Dunn
MarcEdit for Everyone with Katie DunnWiLS
 
On-Demand RDF Graph Databases in the Cloud
On-Demand RDF Graph Databases in the CloudOn-Demand RDF Graph Databases in the Cloud
On-Demand RDF Graph Databases in the CloudMarin Dimitrov
 
Adobe Spark Meetup - 9/19/2018 - San Jose, CA
Adobe Spark Meetup - 9/19/2018 - San Jose, CAAdobe Spark Meetup - 9/19/2018 - San Jose, CA
Adobe Spark Meetup - 9/19/2018 - San Jose, CAJaemi Bremner
 

Tendances (20)

KD-2013-Optimizing-Document-Search-using-Lucene
KD-2013-Optimizing-Document-Search-using-LuceneKD-2013-Optimizing-Document-Search-using-Lucene
KD-2013-Optimizing-Document-Search-using-Lucene
 
Informatica Online Training
Informatica Online TrainingInformatica Online Training
Informatica Online Training
 
Ciel, mes données ne sont plus relationnelles
Ciel, mes données ne sont plus relationnellesCiel, mes données ne sont plus relationnelles
Ciel, mes données ne sont plus relationnelles
 
Introduction to LoCloud Collections, Marcin Werla, Poznan Supercomputing Centre
Introduction to LoCloud Collections, Marcin Werla, Poznan Supercomputing CentreIntroduction to LoCloud Collections, Marcin Werla, Poznan Supercomputing Centre
Introduction to LoCloud Collections, Marcin Werla, Poznan Supercomputing Centre
 
SortaSQL
SortaSQLSortaSQL
SortaSQL
 
The LoCloud MORE aggregator, Gavrilis Dimitris Afiontzi Eleni, Makri Dimit...
The LoCloud MORE aggregator, Gavrilis Dimitris   Afiontzi Eleni,  Makri Dimit...The LoCloud MORE aggregator, Gavrilis Dimitris   Afiontzi Eleni,  Makri Dimit...
The LoCloud MORE aggregator, Gavrilis Dimitris Afiontzi Eleni, Makri Dimit...
 
Beyond sparql linked data, software, services and applications. Keynote at D...
Beyond sparql  linked data, software, services and applications. Keynote at D...Beyond sparql  linked data, software, services and applications. Keynote at D...
Beyond sparql linked data, software, services and applications. Keynote at D...
 
Introduction to Total Library Solution- TLS
Introduction to Total Library Solution- TLSIntroduction to Total Library Solution- TLS
Introduction to Total Library Solution- TLS
 
Continuous Optimization for Distributed BigData Analysis
Continuous Optimization for Distributed BigData AnalysisContinuous Optimization for Distributed BigData Analysis
Continuous Optimization for Distributed BigData Analysis
 
the tooling of a modern and agile oracle dba
the tooling of a modern and agile oracle dbathe tooling of a modern and agile oracle dba
the tooling of a modern and agile oracle dba
 
A Walkthrough of InfluxCloud 2.0 by Tim Hall
A Walkthrough of InfluxCloud 2.0 by Tim HallA Walkthrough of InfluxCloud 2.0 by Tim Hall
A Walkthrough of InfluxCloud 2.0 by Tim Hall
 
(ATS6-APP06) Accelrys LIMS and Accelrys ELN integration
(ATS6-APP06) Accelrys LIMS and Accelrys ELN integration    (ATS6-APP06) Accelrys LIMS and Accelrys ELN integration
(ATS6-APP06) Accelrys LIMS and Accelrys ELN integration
 
IRUS R5: open and flexible access to standardised repository usage data
IRUS R5: open and flexible access to standardised repository usage dataIRUS R5: open and flexible access to standardised repository usage data
IRUS R5: open and flexible access to standardised repository usage data
 
MarcEdit for Everyone with Katie Dunn
MarcEdit for Everyone with Katie DunnMarcEdit for Everyone with Katie Dunn
MarcEdit for Everyone with Katie Dunn
 
Reporting
ReportingReporting
Reporting
 
Integrations
IntegrationsIntegrations
Integrations
 
Elisa curve fitting-analysis with ReaderFit.com
Elisa curve fitting-analysis with ReaderFit.comElisa curve fitting-analysis with ReaderFit.com
Elisa curve fitting-analysis with ReaderFit.com
 
On-Demand RDF Graph Databases in the Cloud
On-Demand RDF Graph Databases in the CloudOn-Demand RDF Graph Databases in the Cloud
On-Demand RDF Graph Databases in the Cloud
 
Gilreath diving into the details apwa_presentation_final
Gilreath diving into the details apwa_presentation_finalGilreath diving into the details apwa_presentation_final
Gilreath diving into the details apwa_presentation_final
 
Adobe Spark Meetup - 9/19/2018 - San Jose, CA
Adobe Spark Meetup - 9/19/2018 - San Jose, CAAdobe Spark Meetup - 9/19/2018 - San Jose, CA
Adobe Spark Meetup - 9/19/2018 - San Jose, CA
 

En vedette

Introduction to Toxy
Introduction to ToxyIntroduction to Toxy
Introduction to ToxyNeuzilla
 
Data Science Conference Belgrade
Data Science Conference BelgradeData Science Conference Belgrade
Data Science Conference BelgradeDarko Marjanovic
 
Hadoop and IoT Sinergija 2014
Hadoop and IoT Sinergija 2014Hadoop and IoT Sinergija 2014
Hadoop and IoT Sinergija 2014Darko Marjanovic
 
Hadoop i sveprisutno racunarstvo
Hadoop i sveprisutno racunarstvoHadoop i sveprisutno racunarstvo
Hadoop i sveprisutno racunarstvoDarko Marjanovic
 
Final الأسمدة-الكيميائية-وخطرها-على-صحةالانسان-و-التلوث-البيئة
Final الأسمدة-الكيميائية-وخطرها-على-صحةالانسان-و-التلوث-البيئةFinal الأسمدة-الكيميائية-وخطرها-على-صحةالانسان-و-التلوث-البيئة
Final الأسمدة-الكيميائية-وخطرها-على-صحةالانسان-و-التلوث-البيئةDURAID ALTAY
 
Big Data: Apache Spark -novo pojačanje tradicionalnom BI ili ne?
Big Data: Apache Spark -novo pojačanje tradicionalnom BI ili ne?Big Data: Apache Spark -novo pojačanje tradicionalnom BI ili ne?
Big Data: Apache Spark -novo pojačanje tradicionalnom BI ili ne?Darko Marjanovic
 
Hadoop ekosistem u praksi - socijalne mreže, unapređenje prodaje i servisa
Hadoop ekosistem u praksi - socijalne mreže, unapređenje prodaje i servisaHadoop ekosistem u praksi - socijalne mreže, unapređenje prodaje i servisa
Hadoop ekosistem u praksi - socijalne mreže, unapređenje prodaje i servisaDarko Marjanovic
 
Big Data tools in practice
Big Data tools in practiceBig Data tools in practice
Big Data tools in practiceDarko Marjanovic
 
Hadoop infrastructure for education
Hadoop infrastructure for educationHadoop infrastructure for education
Hadoop infrastructure for educationDarko Marjanovic
 

En vedette (11)

Introduction to Toxy
Introduction to ToxyIntroduction to Toxy
Introduction to Toxy
 
Data Science Conference Belgrade
Data Science Conference BelgradeData Science Conference Belgrade
Data Science Conference Belgrade
 
Hadoop and IoT Sinergija 2014
Hadoop and IoT Sinergija 2014Hadoop and IoT Sinergija 2014
Hadoop and IoT Sinergija 2014
 
Hadoop i sveprisutno racunarstvo
Hadoop i sveprisutno racunarstvoHadoop i sveprisutno racunarstvo
Hadoop i sveprisutno racunarstvo
 
Baza podataka
Baza podatakaBaza podataka
Baza podataka
 
Big Data - pojam i značaj
Big Data - pojam i značajBig Data - pojam i značaj
Big Data - pojam i značaj
 
Final الأسمدة-الكيميائية-وخطرها-على-صحةالانسان-و-التلوث-البيئة
Final الأسمدة-الكيميائية-وخطرها-على-صحةالانسان-و-التلوث-البيئةFinal الأسمدة-الكيميائية-وخطرها-على-صحةالانسان-و-التلوث-البيئة
Final الأسمدة-الكيميائية-وخطرها-على-صحةالانسان-و-التلوث-البيئة
 
Big Data: Apache Spark -novo pojačanje tradicionalnom BI ili ne?
Big Data: Apache Spark -novo pojačanje tradicionalnom BI ili ne?Big Data: Apache Spark -novo pojačanje tradicionalnom BI ili ne?
Big Data: Apache Spark -novo pojačanje tradicionalnom BI ili ne?
 
Hadoop ekosistem u praksi - socijalne mreže, unapređenje prodaje i servisa
Hadoop ekosistem u praksi - socijalne mreže, unapređenje prodaje i servisaHadoop ekosistem u praksi - socijalne mreže, unapređenje prodaje i servisa
Hadoop ekosistem u praksi - socijalne mreže, unapređenje prodaje i servisa
 
Big Data tools in practice
Big Data tools in practiceBig Data tools in practice
Big Data tools in practice
 
Hadoop infrastructure for education
Hadoop infrastructure for educationHadoop infrastructure for education
Hadoop infrastructure for education
 

Similaire à Lantea platform

SharePoint Development
SharePoint DevelopmentSharePoint Development
SharePoint DevelopmentMalin De Silva
 
Building Scalable Big Data Infrastructure Using Open Source Software Presenta...
Building Scalable Big Data Infrastructure Using Open Source Software Presenta...Building Scalable Big Data Infrastructure Using Open Source Software Presenta...
Building Scalable Big Data Infrastructure Using Open Source Software Presenta...ssuserd3a367
 
Pimping the ForgeRock Identity Platform for a Billion Users
Pimping the ForgeRock Identity Platform for a Billion UsersPimping the ForgeRock Identity Platform for a Billion Users
Pimping the ForgeRock Identity Platform for a Billion UsersForgeRock
 
Restful风格ž„web服务架构
Restful风格ž„web服务架构Restful风格ž„web服务架构
Restful风格ž„web服务架构Benjamin Tan
 
SharePoint Saturday San Antonio: SharePoint 2010 Performance
SharePoint Saturday San Antonio: SharePoint 2010 PerformanceSharePoint Saturday San Antonio: SharePoint 2010 Performance
SharePoint Saturday San Antonio: SharePoint 2010 PerformanceBrian Culver
 
Scalable Preservation Workflows
Scalable Preservation WorkflowsScalable Preservation Workflows
Scalable Preservation WorkflowsSCAPE Project
 
Lessons Learned from Real-World Deployments of Java EE 7 at JavaOne 2014
Lessons Learned from Real-World Deployments of Java EE 7 at JavaOne 2014Lessons Learned from Real-World Deployments of Java EE 7 at JavaOne 2014
Lessons Learned from Real-World Deployments of Java EE 7 at JavaOne 2014Arun Gupta
 
Choosing the Right Business Intelligence Tools for Your Data and Architectura...
Choosing the Right Business Intelligence Tools for Your Data and Architectura...Choosing the Right Business Intelligence Tools for Your Data and Architectura...
Choosing the Right Business Intelligence Tools for Your Data and Architectura...Victor Holman
 
Cloud-based Linked Data Management for Self-service Application Development
Cloud-based Linked Data Management for Self-service Application DevelopmentCloud-based Linked Data Management for Self-service Application Development
Cloud-based Linked Data Management for Self-service Application DevelopmentPeter Haase
 
Replacing your fileshare with SharePoint 2013 Farm - SharePoint User Group UK...
Replacing your fileshare with SharePoint 2013 Farm - SharePoint User Group UK...Replacing your fileshare with SharePoint 2013 Farm - SharePoint User Group UK...
Replacing your fileshare with SharePoint 2013 Farm - SharePoint User Group UK...Chirag Patel
 
Informatica power center online training
Informatica power center online trainingInformatica power center online training
Informatica power center online trainingSmartittrainings
 
Introduction to SharePoint for SQLserver DBAs
Introduction to SharePoint for SQLserver DBAsIntroduction to SharePoint for SQLserver DBAs
Introduction to SharePoint for SQLserver DBAsSteve Knutson
 
SharePoint Saturday The Conference 2011 - SP2010 Performance
SharePoint Saturday The Conference 2011 - SP2010 PerformanceSharePoint Saturday The Conference 2011 - SP2010 Performance
SharePoint Saturday The Conference 2011 - SP2010 PerformanceBrian Culver
 
Apache Geode Meetup, London
Apache Geode Meetup, LondonApache Geode Meetup, London
Apache Geode Meetup, LondonApache Geode
 
Share Point Sat Share Point 2010 And Content Migration
Share Point Sat Share Point 2010 And Content MigrationShare Point Sat Share Point 2010 And Content Migration
Share Point Sat Share Point 2010 And Content MigrationNadir Kamdar
 

Similaire à Lantea platform (20)

SharePoint Development
SharePoint DevelopmentSharePoint Development
SharePoint Development
 
Apache drill
Apache drillApache drill
Apache drill
 
Oracle bi apps training
Oracle bi apps trainingOracle bi apps training
Oracle bi apps training
 
RDAP @ .at
RDAP @ .at RDAP @ .at
RDAP @ .at
 
Building Scalable Big Data Infrastructure Using Open Source Software Presenta...
Building Scalable Big Data Infrastructure Using Open Source Software Presenta...Building Scalable Big Data Infrastructure Using Open Source Software Presenta...
Building Scalable Big Data Infrastructure Using Open Source Software Presenta...
 
Pimping the ForgeRock Identity Platform for a Billion Users
Pimping the ForgeRock Identity Platform for a Billion UsersPimping the ForgeRock Identity Platform for a Billion Users
Pimping the ForgeRock Identity Platform for a Billion Users
 
Restful风格ž„web服务架构
Restful风格ž„web服务架构Restful风格ž„web服务架构
Restful风格ž„web服务架构
 
SharePoint Saturday San Antonio: SharePoint 2010 Performance
SharePoint Saturday San Antonio: SharePoint 2010 PerformanceSharePoint Saturday San Antonio: SharePoint 2010 Performance
SharePoint Saturday San Antonio: SharePoint 2010 Performance
 
Where to save my data, for devs!
Where to save my data, for devs!Where to save my data, for devs!
Where to save my data, for devs!
 
Scalable Preservation Workflows
Scalable Preservation WorkflowsScalable Preservation Workflows
Scalable Preservation Workflows
 
Lessons Learned from Real-World Deployments of Java EE 7 at JavaOne 2014
Lessons Learned from Real-World Deployments of Java EE 7 at JavaOne 2014Lessons Learned from Real-World Deployments of Java EE 7 at JavaOne 2014
Lessons Learned from Real-World Deployments of Java EE 7 at JavaOne 2014
 
Choosing the Right Business Intelligence Tools for Your Data and Architectura...
Choosing the Right Business Intelligence Tools for Your Data and Architectura...Choosing the Right Business Intelligence Tools for Your Data and Architectura...
Choosing the Right Business Intelligence Tools for Your Data and Architectura...
 
Cloud-based Linked Data Management for Self-service Application Development
Cloud-based Linked Data Management for Self-service Application DevelopmentCloud-based Linked Data Management for Self-service Application Development
Cloud-based Linked Data Management for Self-service Application Development
 
Asp.net
Asp.netAsp.net
Asp.net
 
Replacing your fileshare with SharePoint 2013 Farm - SharePoint User Group UK...
Replacing your fileshare with SharePoint 2013 Farm - SharePoint User Group UK...Replacing your fileshare with SharePoint 2013 Farm - SharePoint User Group UK...
Replacing your fileshare with SharePoint 2013 Farm - SharePoint User Group UK...
 
Informatica power center online training
Informatica power center online trainingInformatica power center online training
Informatica power center online training
 
Introduction to SharePoint for SQLserver DBAs
Introduction to SharePoint for SQLserver DBAsIntroduction to SharePoint for SQLserver DBAs
Introduction to SharePoint for SQLserver DBAs
 
SharePoint Saturday The Conference 2011 - SP2010 Performance
SharePoint Saturday The Conference 2011 - SP2010 PerformanceSharePoint Saturday The Conference 2011 - SP2010 Performance
SharePoint Saturday The Conference 2011 - SP2010 Performance
 
Apache Geode Meetup, London
Apache Geode Meetup, LondonApache Geode Meetup, London
Apache Geode Meetup, London
 
Share Point Sat Share Point 2010 And Content Migration
Share Point Sat Share Point 2010 And Content MigrationShare Point Sat Share Point 2010 And Content Migration
Share Point Sat Share Point 2010 And Content Migration
 

Dernier

Taming Distributed Systems: Key Insights from Wix's Large-Scale Experience - ...
Taming Distributed Systems: Key Insights from Wix's Large-Scale Experience - ...Taming Distributed Systems: Key Insights from Wix's Large-Scale Experience - ...
Taming Distributed Systems: Key Insights from Wix's Large-Scale Experience - ...Natan Silnitsky
 
Introduction Computer Science - Software Design.pdf
Introduction Computer Science - Software Design.pdfIntroduction Computer Science - Software Design.pdf
Introduction Computer Science - Software Design.pdfFerryKemperman
 
Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...
Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...
Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...confluent
 
Ahmed Motair CV April 2024 (Senior SW Developer)
Ahmed Motair CV April 2024 (Senior SW Developer)Ahmed Motair CV April 2024 (Senior SW Developer)
Ahmed Motair CV April 2024 (Senior SW Developer)Ahmed Mater
 
What is Fashion PLM and Why Do You Need It
What is Fashion PLM and Why Do You Need ItWhat is Fashion PLM and Why Do You Need It
What is Fashion PLM and Why Do You Need ItWave PLM
 
Cyber security and its impact on E commerce
Cyber security and its impact on E commerceCyber security and its impact on E commerce
Cyber security and its impact on E commercemanigoyal112
 
What are the key points to focus on before starting to learn ETL Development....
What are the key points to focus on before starting to learn ETL Development....What are the key points to focus on before starting to learn ETL Development....
What are the key points to focus on before starting to learn ETL Development....kzayra69
 
Folding Cheat Sheet #4 - fourth in a series
Folding Cheat Sheet #4 - fourth in a seriesFolding Cheat Sheet #4 - fourth in a series
Folding Cheat Sheet #4 - fourth in a seriesPhilip Schwarz
 
How to submit a standout Adobe Champion Application
How to submit a standout Adobe Champion ApplicationHow to submit a standout Adobe Champion Application
How to submit a standout Adobe Champion ApplicationBradBedford3
 
React Server Component in Next.js by Hanief Utama
React Server Component in Next.js by Hanief UtamaReact Server Component in Next.js by Hanief Utama
React Server Component in Next.js by Hanief UtamaHanief Utama
 
Recruitment Management Software Benefits (Infographic)
Recruitment Management Software Benefits (Infographic)Recruitment Management Software Benefits (Infographic)
Recruitment Management Software Benefits (Infographic)Hr365.us smith
 
Open Source Summit NA 2024: Open Source Cloud Costs - OpenCost's Impact on En...
Open Source Summit NA 2024: Open Source Cloud Costs - OpenCost's Impact on En...Open Source Summit NA 2024: Open Source Cloud Costs - OpenCost's Impact on En...
Open Source Summit NA 2024: Open Source Cloud Costs - OpenCost's Impact on En...Matt Ray
 
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...stazi3110
 
SpotFlow: Tracking Method Calls and States at Runtime
SpotFlow: Tracking Method Calls and States at RuntimeSpotFlow: Tracking Method Calls and States at Runtime
SpotFlow: Tracking Method Calls and States at Runtimeandrehoraa
 
Xen Safety Embedded OSS Summit April 2024 v4.pdf
Xen Safety Embedded OSS Summit April 2024 v4.pdfXen Safety Embedded OSS Summit April 2024 v4.pdf
Xen Safety Embedded OSS Summit April 2024 v4.pdfStefano Stabellini
 
Implementing Zero Trust strategy with Azure
Implementing Zero Trust strategy with AzureImplementing Zero Trust strategy with Azure
Implementing Zero Trust strategy with AzureDinusha Kumarasiri
 
英国UN学位证,北安普顿大学毕业证书1:1制作
英国UN学位证,北安普顿大学毕业证书1:1制作英国UN学位证,北安普顿大学毕业证书1:1制作
英国UN学位证,北安普顿大学毕业证书1:1制作qr0udbr0
 
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed DataAlluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed DataAlluxio, Inc.
 
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptxKnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptxTier1 app
 

Dernier (20)

Taming Distributed Systems: Key Insights from Wix's Large-Scale Experience - ...
Taming Distributed Systems: Key Insights from Wix's Large-Scale Experience - ...Taming Distributed Systems: Key Insights from Wix's Large-Scale Experience - ...
Taming Distributed Systems: Key Insights from Wix's Large-Scale Experience - ...
 
Introduction Computer Science - Software Design.pdf
Introduction Computer Science - Software Design.pdfIntroduction Computer Science - Software Design.pdf
Introduction Computer Science - Software Design.pdf
 
Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...
Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...
Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...
 
Ahmed Motair CV April 2024 (Senior SW Developer)
Ahmed Motair CV April 2024 (Senior SW Developer)Ahmed Motair CV April 2024 (Senior SW Developer)
Ahmed Motair CV April 2024 (Senior SW Developer)
 
What is Fashion PLM and Why Do You Need It
What is Fashion PLM and Why Do You Need ItWhat is Fashion PLM and Why Do You Need It
What is Fashion PLM and Why Do You Need It
 
Cyber security and its impact on E commerce
Cyber security and its impact on E commerceCyber security and its impact on E commerce
Cyber security and its impact on E commerce
 
What are the key points to focus on before starting to learn ETL Development....
What are the key points to focus on before starting to learn ETL Development....What are the key points to focus on before starting to learn ETL Development....
What are the key points to focus on before starting to learn ETL Development....
 
Folding Cheat Sheet #4 - fourth in a series
Folding Cheat Sheet #4 - fourth in a seriesFolding Cheat Sheet #4 - fourth in a series
Folding Cheat Sheet #4 - fourth in a series
 
How to submit a standout Adobe Champion Application
How to submit a standout Adobe Champion ApplicationHow to submit a standout Adobe Champion Application
How to submit a standout Adobe Champion Application
 
2.pdf Ejercicios de programación competitiva
2.pdf Ejercicios de programación competitiva2.pdf Ejercicios de programación competitiva
2.pdf Ejercicios de programación competitiva
 
React Server Component in Next.js by Hanief Utama
React Server Component in Next.js by Hanief UtamaReact Server Component in Next.js by Hanief Utama
React Server Component in Next.js by Hanief Utama
 
Recruitment Management Software Benefits (Infographic)
Recruitment Management Software Benefits (Infographic)Recruitment Management Software Benefits (Infographic)
Recruitment Management Software Benefits (Infographic)
 
Open Source Summit NA 2024: Open Source Cloud Costs - OpenCost's Impact on En...
Open Source Summit NA 2024: Open Source Cloud Costs - OpenCost's Impact on En...Open Source Summit NA 2024: Open Source Cloud Costs - OpenCost's Impact on En...
Open Source Summit NA 2024: Open Source Cloud Costs - OpenCost's Impact on En...
 
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
 
SpotFlow: Tracking Method Calls and States at Runtime
SpotFlow: Tracking Method Calls and States at RuntimeSpotFlow: Tracking Method Calls and States at Runtime
SpotFlow: Tracking Method Calls and States at Runtime
 
Xen Safety Embedded OSS Summit April 2024 v4.pdf
Xen Safety Embedded OSS Summit April 2024 v4.pdfXen Safety Embedded OSS Summit April 2024 v4.pdf
Xen Safety Embedded OSS Summit April 2024 v4.pdf
 
Implementing Zero Trust strategy with Azure
Implementing Zero Trust strategy with AzureImplementing Zero Trust strategy with Azure
Implementing Zero Trust strategy with Azure
 
英国UN学位证,北安普顿大学毕业证书1:1制作
英国UN学位证,北安普顿大学毕业证书1:1制作英国UN学位证,北安普顿大学毕业证书1:1制作
英国UN学位证,北安普顿大学毕业证书1:1制作
 
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed DataAlluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
 
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptxKnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
 

Lantea platform

  • 1. Introduction to Lantea .NET Open Source Big Data Solution
  • 2. What is Lantea • Open source big data platform • Rich ETL (Extract-Transform-Load) features • A platform that can help Data Scientist to collect and deal with data easily • Import data from different source is extremely easy
  • 3. Highlighted features of Lantea • A lot of different data sources on different media • Query aggregation data via SQL • Very easy to collect data from websites, local file systems, emails and databases • Export data via a lot of formats and APIs
  • 4. Target User of Lantea • Data Scientists • Marketing Analyzer • Managers who needs BI • Researchers • Big data/BI Developers • Deep Machine Learning Developers Non- Commercial Commercial Researchers Data Scientists Big data/BI Developers Marketing Analyzer Open source developers Managers who needs BI
  • 5. Essential Elements of Big Data Platform • Data/File Extraction • Data Cleaning and Filtering • Different ways of Analyzing data • Real-time Processing • Data Collection from Different Source • Connect to Different Database Types • Analysis Result Rendering • Advanced Parameter Adjustment Big Data Extraction Cleaning Analysis Data Processing Data Collection Parameter Adjustment
  • 7. Third-party Projects Included • Toxy – Data Extraction framework • Spidey – Web Spider framework • EQueue – Queue Implementation • CacheAdapter – Cache Provider • Irony – Compiler Implementation • ServiceStack.Redis– Redis Client • ScrapySharp – Html Parser and Selector • Autofac – IOC Container • Log4net – Configurable Logging System • Datatables.js – Web Spreadsheet • Thinkecture Identity Server - Social account integration • Nepy – Parsers for Natural Language Processing
  • 8. License Candidate • LGPL • Apache 2.0 • MIT • Custom Open Source license
  • 9. Architecture Design v1 Key Features • Web Crawling Service • Data Extraction Service • Queue Service • CQLR (Common Query Language Runtime) • Rich Formats Outputs and APIs • Restful and ODATA support
  • 11. Use Case 1 – Regional Manager Report Collection
  • 12. Use Case 1 – Lantea Solution
  • 13. Use Case 2 – Data Aggregation from Websites
  • 14. Use Case 2 – Lantea Solution
  • 15. – the Studio behind Lantea Our Mission • Re-create .NET Ecosystem • Provide .NET-based solutions for clients • Create something non-exist for .NET Community • Contribute to Global Open Source Community • Change the way human lives