In-memory technology is shaking up the industry faster than lightning, because it's all about queries being faster than lightning. It is not, however, about replacing your current investments to achieve this, but about enabling them with in-memory technology!
This Web Briefing: "Accelerate Reporting and Advanced Analytics with Kognitio and Nexus" covers how an in-memory analytical platform co-exists and enhances existing Enterprise Data Warehouse investments (like Teradata) to enable a 10-100x query performance gain. Keynote speaker Tom Coffing, CEO of Coffing Data Warehousing headlines the discussion, which also featured a demo and conversation aboute advanced analytics with in-memory processing.
27. Teradata has Secondary Indexes
AMP AMP AMP
USI SubtableUSI SubtableUSI Subtable
Stover
Davis
Gomez
Rivers
Khan
Kertzel
Kinski
Swartz
1,1 2,1 3,1
4,1 5,1 6,1
7,1 8,1 9,1
Emp_Intl Emp_Intl Emp_Intl
NUSI SubtableNUSI SubtableNUSI Subtable
The Base Table has a Primary Index of Last_Name. The USI was created on EmpNo and
the NUSI on First_Name. The USI rows are hashed to different AMPs, but the NUSI
rows are AMP local. Both subtables contain the same Base Table Row‐IDs.
25,1 Maria 2,1
16,1 1002 2,1
30,1 Rafael 1,1
22,1 1001 1,1
18,1 1004 4,1
40,1 Kyle 4,1
50,1 Sushma 7,1
21,1 1007 7,1
35,1 Rob 5,1
19,1 1005 5,1
41,1 Mo 8,1
15,1 1008 8,1
28,1 Charl 3,1
14,1 1003 3,1
65,1 Inna 6,1
17,1 1006 6,1
70,1 Mo 9,1
20,1 1009 9,1
Minal Rafael 100
1100
4
Kyle
1007Sushma
Maria 1002
1005Rob
1008 Mo
Charl 1003
1006Inna
1009Mo
USI’s
are
Hashed
NUSI’s
are
AMP
Local
28. Teradata Join Quiz
Do you know which statement above is False?
Which Statement is NOT true!
1. Each Table in Teradata has a Primary Index, unless it is a NoPI table.
2. The Primary Index is the mechanism that allows Teradata to physically distribute the rows
of a table across the AMPs using a Hash Formula and the Hash Map.
3. Each AMP Sorts its rows by the Row‐ID, unless it is a Partitioned table, and then it sorts first
by the Partition and then by Row‐ID which is actually the Row Key.
4. For two rows to be Joined together Teradata insists that both rows are physically on the
same AMP.
5. Teradata will either Redistribute one or both of the tables or Duplicate the smaller table
across all AMPs to ensure that the matching rows are on the same AMP in FSG Cache.
Once the matching rows are on the same AMP the join can take place.
40. Kognitio: Analytical Accelerator for Teradata
Comprehensive
• Real-time, full data volume, new sources, cross-correlation
Engage Big Data and enable Hadoop
without changing your environment
Flexible
• Accelerate queries, enable departmental self-service for
every departmental need
Universal
• Standardize connections without custom coding
41. Analytical Platform: “The Golden Layer”
Analytical
Platform
Layer
Near-line
Storage
(optional)
Application &
Client Layer
All BI Tools All OLAP Clients Excel
Persistence
Layer Hadoop
Clusters
Enterprise Data
Warehouses
Legacy
Systems
Kognitio
Storage
Reporting
Cloud
Storage
42. Performance acceleration with Kognitio
0
0.2
0.4
0.6
0.8
1
1.2
1.4
1.6
1.8
Teradata Kognitio
Report speed relativeto median Kognitio speed
Max
Median
Min
0
0.5
1
1.5
2
2.5
3
3.5
4
SQL Server Kognitio
Query Speed relativeto median Kognitio speed
Bigger is better!
43. Big Data: Bring the Analytics TO the Data
Kognitio Hadoop Integration
• Kognitio Map/Reduce Agent uploads itself to
Hadoop nodes
• Query passes selections, relevant predicates
• Data filtering & projection locally on each node
• Data filtered as it is read from file(s)
• Only data of interest is transferred and loaded
into memory via parallel load streams