Apache Atlas provides data governance capabilities for Hadoop including data classification, metadata management, and data lineage/provenance. It models metadata using a flexible type system and stores metadata in a property graph database for relationships and lineage queries. Key features include cross-component lineage mapping, reusable tagging policies for access control, and a business catalog to organize assets by common business terms.
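The graph-plus-tags idea can be pictured with a small in-memory sketch. This is not the Atlas API: the `MetadataGraph` class, its methods, and the table names are hypothetical, and inheriting tags along lineage edges is an assumption made here for illustration.

```python
from collections import defaultdict, deque


class MetadataGraph:
    """Toy in-memory property graph (hypothetical; not the Atlas API)."""

    def __init__(self):
        self.entities = {}                  # name -> {"type": str, "tags": set}
        self.downstream = defaultdict(set)  # name -> entities derived from it

    def add_entity(self, name, type_name):
        self.entities[name] = {"type": type_name, "tags": set()}

    def add_lineage(self, source, target):
        # Directed edge: target was produced from source.
        self.downstream[source].add(target)

    def tag(self, name, classification):
        self.entities[name]["tags"].add(classification)

    def lineage(self, name):
        """Return every entity reachable downstream of `name` (breadth-first)."""
        seen, queue = set(), deque([name])
        while queue:
            current = queue.popleft()
            for child in self.downstream[current]:
                if child not in seen:
                    seen.add(child)
                    queue.append(child)
        return seen

    def effective_tags(self, name):
        """Own tags plus tags inherited from upstream entities (assumed policy)."""
        tags = set(self.entities[name]["tags"])
        for other in self.entities:
            if name in self.lineage(other):
                tags |= self.entities[other]["tags"]
        return tags


# Illustrative data only: three tables linked by lineage, one tagged at the source.
graph = MetadataGraph()
graph.add_entity("raw_claims", "hive_table")
graph.add_entity("claims_clean", "hive_table")
graph.add_entity("claims_report", "hive_table")
graph.add_lineage("raw_claims", "claims_clean")
graph.add_lineage("claims_clean", "claims_report")
graph.tag("raw_claims", "PII")
```

Tagging the source table once and deriving the tag for downstream tables is what would make a tag-based access policy reusable across components in this sketch.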
Was the product well understood?
Is the product something they would use?
Where is the value?
How fast? 7 months!
Show – clearly identify customer metadata. Change
Add customer classification example – Aetna – make the use case story have continuity. Use DX procedures to diagnosis
** Bring metadata from external systems into Hadoop – keep it together
Collibra - business process workflow + mapping to regulations in various countries and standards
Alation - socializing of analytics - SQL / traditional EDW based
Meta Integration (MITI)
Paxata - wrangling
Syncsort - ETL - specializing traditional system and Mainframe
Talend - ETL, metadata management
Attivio - ingestion / discovery
Make sure audits are demoed for policy denials and acceptances