2. What is Lantea
• Open source big data platform
• Rich ETL (Extract-Transform-Load) features
• A platform that can help Data Scientist to collect and deal with data easily
• Import data from different source is extremely easy
3. Highlighted features of Lantea
• A lot of different data sources on different media
• Query aggregation data via SQL
• Very easy to collect data from websites, local file systems, emails and
databases
• Export data via a lot of formats and APIs
4. Target User of Lantea
• Data Scientists
• Marketing Analyzer
• Managers who needs BI
• Researchers
• Big data/BI Developers
• Deep Machine Learning Developers
Non-
Commercial
Commercial
Researchers
Data
Scientists
Big data/BI
Developers
Marketing
Analyzer
Open source
developers
Managers
who needs BI
5. Essential Elements of Big Data Platform
• Data/File Extraction
• Data Cleaning and Filtering
• Different ways of Analyzing data
• Real-time Processing
• Data Collection from Different Source
• Connect to Different Database Types
• Analysis Result Rendering
• Advanced Parameter Adjustment
Big Data
Extraction
Cleaning
Analysis
Data
Processing
Data
Collection
Parameter
Adjustment
9. Architecture Design v1
Key Features
• Web Crawling Service
• Data Extraction Service
• Queue Service
• CQLR
(Common Query Language Runtime)
• Rich Formats Outputs and APIs
• Restful and ODATA support
15. – the Studio behind Lantea
Our Mission
• Re-create .NET Ecosystem
• Provide .NET-based solutions for clients
• Create something non-exist for .NET Community
• Contribute to Global Open Source Community
• Change the way human lives