This document discusses deduplication and data fusion software. It introduces the benefits of identifying duplicated records across databases and merging data from different sources. The process section outlines how the software allows configuring input formats, similarity computations, filters, and validation steps. Success stories show how a health service and a beer manufacturer used the software to clean databases and identify incorrect deliveries. A demo is available to see the software in action.
22. Assign types to columns so that the most suitable automatic filters can be applied. Workflow shown on the slide: CSV, Configurations, Execution, Validation, Exportation (Excel, PDF, XML, CSV).
24. Set the percentage importance of each column in the similarity computation. The percentages must add up to 100% (slide example: 30% + 35% + 35% = 100%).
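A minimal sketch of how per-column percentages could combine into one record similarity. Only the 30% / 35% / 35% split comes from the slide; the column names, the function names, and the use of Python's `difflib.SequenceMatcher` as the per-column measure are assumptions for illustration, not the software's actual method.

```python
from difflib import SequenceMatcher

# Hypothetical columns; only the 30/35/35 split is taken from the slide.
WEIGHTS = {"name": 0.30, "birth_date": 0.35, "address": 0.35}


def column_similarity(a: str, b: str) -> float:
    """Similarity in [0, 1] between two values (stand-in measure)."""
    return SequenceMatcher(None, a.strip().lower(), b.strip().lower()).ratio()


def record_similarity(rec_a: dict, rec_b: dict, weights: dict = WEIGHTS) -> float:
    """Weighted sum of per-column similarities; weights must total 100%."""
    if abs(sum(weights.values()) - 1.0) > 1e-9:
        raise ValueError("column weights must add up to 100%")
    return sum(
        weight * column_similarity(rec_a[col], rec_b[col])
        for col, weight in weights.items()
    )
```

With these weights, two records that agree on date and address but have unrelated names still score about 0.70, so a validation threshold would decide whether such a pair is reviewed as a possible duplicate.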
27. Automatic and specific filters are available for values such as names, dates, and addresses.
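As an illustration of type-driven filters, the sketch below normalizes a value according to the type assigned to its column. The filter functions, the abbreviation table, the accepted date layouts, and the type names are all hypothetical; the slides do not describe how the software's own filters work.

```python
import re
import unicodedata
from datetime import datetime


def filter_name(value: str) -> str:
    """Strip accents, lowercase, and collapse repeated whitespace."""
    decomposed = unicodedata.normalize("NFKD", value)
    no_accents = "".join(ch for ch in decomposed if not unicodedata.combining(ch))
    return " ".join(no_accents.lower().split())


def filter_date(value: str) -> str:
    """Rewrite a few common layouts as ISO 8601; leave other values as they are."""
    for fmt in ("%Y-%m-%d", "%d/%m/%Y", "%d-%m-%Y"):
        try:
            return datetime.strptime(value.strip(), fmt).date().isoformat()
        except ValueError:
            continue
    return value.strip()


def filter_address(value: str) -> str:
    """Lowercase, drop punctuation, and expand a few abbreviations."""
    abbreviations = {"st": "street", "av": "avenue", "ave": "avenue"}
    cleaned = re.sub(r"[^\w\s]", " ", value.lower())
    return " ".join(abbreviations.get(tok, tok) for tok in cleaned.split())


# Assumed mapping from column type to filter.
FILTERS = {"name": filter_name, "date": filter_date, "address": filter_address}


def apply_filters(record: dict, column_types: dict) -> dict:
    """Apply the filter for each column's type; untyped columns are only trimmed."""
    return {
        col: FILTERS.get(column_types.get(col), str.strip)(val)
        for col, val in record.items()
    }
```

Running the filters before the similarity step means that values differing only in accents, case, date layout, or abbreviations compare as equal.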
49. Success stories: Beer manufacturer
Who? Beer manufacturer.
Objective: Detect dealers that deliver to centers not previously assigned to them.
Solution: Identify duplicates in each dealer’s delivery database and delete them (deduplication with DAURUM). Detect deliveries to centers shared between different dealers (fusion with DAURUM).
Result: A master database free of repetitions, and detection of dealers with wrong deliveries.
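The two steps of this solution can be sketched as follows. This is a simplified illustration that assumes each delivery is a hypothetical (dealer, center) pair and matches records exactly after trimming and lowercasing; DAURUM's similarity-based matching is not reproduced here.

```python
from collections import defaultdict


def deduplicate(deliveries):
    """Drop repeated (dealer, center) pairs, keeping first-seen order."""
    seen = set()
    unique = []
    for dealer, center in deliveries:
        key = (dealer.strip().lower(), center.strip().lower())
        if key not in seen:
            seen.add(key)
            unique.append(key)
    return unique


def shared_centers(deliveries):
    """Map each center served by more than one dealer to its sorted dealers."""
    dealers_by_center = defaultdict(set)
    for dealer, center in deliveries:
        dealers_by_center[center].add(dealer)
    return {
        center: sorted(dealers)
        for center, dealers in dealers_by_center.items()
        if len(dealers) > 1
    }
```

Centers returned by `shared_centers` are candidates for review: at least one of the listed dealers may be delivering to a center that was not assigned to it.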
54. Thanks for your attention. Any questions?
Pere Baleta Ferrer, CEO, pbaleta@sparsity-technologies.com
Josep Lluís Larriba Pey, Founder, larri@sparsity-technologies.com
SPARSITY-TECHNOLOGIES, Jordi Girona, 1-3, Edifici K2M, 08034 Barcelona
info@sparsity-technologies.com
http://www.sparsity-technologies.com