SlideShare une entreprise Scribd logo
1  sur  16
Amnesia
Data anonymization made easy
https://amnesia.openaire.eu
Manolis Terrovitis
mter@imis.athena-innovation.gr
http://web.imsi.athenarc.gr/~mter/
Research Center Athena, IMSI
Amnesia – Webinar 24/4/2018
Data anonymization?
• Data anonymization facilitates the publication of micro data(vs.
aggregated macrodata) , e.g., data used in scientific research
• Micro data often reveal important private information, e.g., the
medical condition of a person
o Individuals are afraid to provide their data
o Companies are afraid to share data with experts
o GDPR makes a strict protection scheme obligatory
• The aim of anonymization methods is to allow sharing such data,
without compromising the privacy of the users.
Amnesia - Webinar 24/4/2018
Data anonymization and
Amnesia
• Data anonymization
• Removal of direct identifiers, e.g., Names, SSN etc
• Removal of infrequent combinations of quasi-identifiers, e.g., unique combinations of
birth dates and zipcodes
• Infrequent combinations are removed through generalization, e.g., birth date
14/01/1977 becomes **/**/1977
• Amnesia is a scalable anonymization tool
• It offers several versions of k-anonymity
• It allows the user to select and customize possible solutions
• It offers graphical tools that allow the user to analyze the anonymized dataset
• It is scalable and uses all available CPU cores in the anonymization process
Amnesia - Webinar 24/4/2018
Link attacks
Amnesia - Webinar 24/4/2018
k-anonymity
• Each entry becomes indistinguishable from
other k-1 entries
o k-anonymity is achieved through suppression and
generalization
id Zipcode Age National. Disease
1 13053 28 Russian Heart Disease
2 13068 29 American Heart Disease
3 13068 21 Japanese Viral Infection
4 13053 23 American Viral Infection
5 14853 50 Indian Cancer
6 14853 55 Russian Heart Disease
7 14850 47 American Viral Infection
8 14850 49 American Viral Infection
9 13053 31 American Cancer
10 13053 37 Indian Cancer
11 13068 36 Japanese Cancer
12 13068 35 American Cancer
id Zipcode Age National. Disease
1 130** <30 ∗ Heart Disease
2 130** <30 ∗ Heart Disease
3 130** <30 ∗ Viral Infection
4 130** <30 ∗ Viral Infection
5 1485* ≥40 ∗ Cancer
6 1485* ≥40 ∗ Heart Disease
7 1485* ≥40 ∗ Viral Infection
8 1485* ≥40 ∗ Viral Infection
9 130** 3∗ ∗ Cancer
10 130** 3∗ ∗ Cancer
11 130** 3∗ ∗ Cancer
12 130** 3∗ ∗ Cancer
Amnesia - Webinar 24/4/2018
Generalization Hierarchy
Amnesia - Webinar 24/4/2018
7 9 16 18
0-10 10-20
*
Structural information
• We need to anonymize all relevant information about a
person, not just a tuple
• Information tends to gather over time
• Information is linked through semantic properties, it’s schema
is irrelevant
• Personal data tend to accumulate over time
• Research focuses on simple data and complicated
guaranties but real world has complex data and requires
simple guaranties
Amnesia - Webinar 24/4/2018
Limitsofk-anonymity
• 2-anonymous
Fruits Meat Vegetables Fish
Vassilis Χ Χ
Manolis Χ Χ Χ
Eleni Χ
Maria Χ Χ
Kostas Χ Χ
Food
Vassilis Χ
Manolis Χ
Eleni Χ
Maria Χ
Kostas Χ
Amnesia - Webinar 24/4/2018
km-anonymity
• 22-anonymous
• Any
combination of
m items will not
appear less
than k times
Fruits Meat Vegetables Fish
Vassilis Χ Χ
Manolis Χ Χ Χ
Eleni Χ
Maria Χ Χ
Kostas Χ Χ
Fruits Meat Other food
Vassilis Χ Χ
Manolis X Χ X
Eleni X
Maria Χ X
Kostas Χ X
Amnesia - Webinar 24/4/2018
Strengths and Weaknesses
• Strengths
o Simple to understand
• Can be the basis for consent
o Close to previous and existing legal definitions
o Low information loss
o Customizable by non-experts
• Weaknesses
o Not very strict
o Does not take into account sensitive values
Amnesia - Webinar 24/4/2018
Anonymization challenges
• Anonymization techniques have not been tested in practice
extensively
o Mapping the social notion of privacy to technical notions is not easy
• Data utility has not been studied extensively in research
o Few artificial information loss measures
• Data utility is difficult to estimate in practice
o Different applications have different needs
o No easy to quantify the loss of information
Amnesia - Webinar 24/4/2018
Amensia
• Amnesia is a data anonymization tool developed by Research
Center Athena
• Amnesia is build with Java and Javascript
• k-anonymity and km-anonymity
• Tuples and set-values
• Visual tools
o Estimating data utility
o Building hierarchies
o Customizing anonymization solutions
Amnesia - Webinar 24/4/2018
Amnesia status
• Amnesia is available as a public beta version at
o https://amnesia.openaire.eu
• On-line version is for demonstration and testing purposes mostly
• Sensitive data can be anonymized locally by downloading the
application
o Security
o Scalability
• We are in process of adjusting it to health data
Amnesia - Webinar 24/4/2018
Amensia Challenges
Is it easy to use by data owners? Are anoymized data useful?
Amnesia - Webinar 24/4/2018
• Give us feedback!!
o amnesia-helpdesk@imis.athena-
innovation.gr
• Can it anonymize your data?
o Let us know about your use case
o Ask us for help
• We need feedback for data
analysis
o Let us know if you have shared
anonymized results
• Please contact us with your
needs
Next steps
Work on the feedback More features
Amnesia - Webinar 24/4/2018
• Improve user experience
• Add support for specific
domain data
• Fix bugs!
• New algorithms
o Additional privacy guaranties
o More data types
• Better scaling capabilities
o Disk based solutions
o More efficient memory usage
HTTPS://AMNESIA.OPENAIRE.EU/
Thank you!
Amnesia - Webinar 24/4/2018

Contenu connexe

Plus de OpenAIRE

Open Research Gateway for the ELIXIR-GR Infrastructure (Part 2)
Open Research Gateway for the ELIXIR-GR Infrastructure (Part 2)Open Research Gateway for the ELIXIR-GR Infrastructure (Part 2)
Open Research Gateway for the ELIXIR-GR Infrastructure (Part 2)OpenAIRE
 
Open Research Gateway for the ELIXIR-GR Infrastructure (Part 1)
Open Research Gateway for the ELIXIR-GR Infrastructure (Part 1)Open Research Gateway for the ELIXIR-GR Infrastructure (Part 1)
Open Research Gateway for the ELIXIR-GR Infrastructure (Part 1)OpenAIRE
 
6th Content Providers Community Call
6th Content Providers Community Call6th Content Providers Community Call
6th Content Providers Community CallOpenAIRE
 
20200504_OpenAIRE Legal Policy Webinar: GDPR and Sharing Data
20200504_OpenAIRE Legal Policy Webinar: GDPR and Sharing Data20200504_OpenAIRE Legal Policy Webinar: GDPR and Sharing Data
20200504_OpenAIRE Legal Policy Webinar: GDPR and Sharing DataOpenAIRE
 
20200504_Research Data & the GDPR: How Open is Open?
20200504_Research Data & the GDPR: How Open is Open?20200504_Research Data & the GDPR: How Open is Open?
20200504_Research Data & the GDPR: How Open is Open?OpenAIRE
 
20200504_Data, Data Ownership and Open Science
20200504_Data, Data Ownership and Open Science20200504_Data, Data Ownership and Open Science
20200504_Data, Data Ownership and Open ScienceOpenAIRE
 
20200429_Research Data & the GDPR: How Open is Open? (updated version)
20200429_Research Data & the GDPR: How Open is Open? (updated version)20200429_Research Data & the GDPR: How Open is Open? (updated version)
20200429_Research Data & the GDPR: How Open is Open? (updated version)OpenAIRE
 
20200429_Data, Data Ownership and Open Science
20200429_Data, Data Ownership and Open Science20200429_Data, Data Ownership and Open Science
20200429_Data, Data Ownership and Open ScienceOpenAIRE
 
20200429_OpenAIRE Legal Policy Webinar: GDPR and Sharing Data
20200429_OpenAIRE Legal Policy Webinar: GDPR and Sharing Data20200429_OpenAIRE Legal Policy Webinar: GDPR and Sharing Data
20200429_OpenAIRE Legal Policy Webinar: GDPR and Sharing DataOpenAIRE
 
COVID-19: Activities, tools, best practice and contact points in Greece
 COVID-19: Activities, tools, best practice and contact points in Greece COVID-19: Activities, tools, best practice and contact points in Greece
COVID-19: Activities, tools, best practice and contact points in GreeceOpenAIRE
 
5th Content Providers Community Call
5th Content Providers Community Call5th Content Providers Community Call
5th Content Providers Community CallOpenAIRE
 
4th Content Providers Community Call
4th Content Providers Community Call4th Content Providers Community Call
4th Content Providers Community CallOpenAIRE
 
3rd Content Providers Community Call
3rd Content Providers Community Call3rd Content Providers Community Call
3rd Content Providers Community CallOpenAIRE
 
2nd Content Providers Community Call
2nd Content Providers Community Call2nd Content Providers Community Call
2nd Content Providers Community CallOpenAIRE
 
1st Content Providers Community Call
1st Content Providers Community Call1st Content Providers Community Call
1st Content Providers Community CallOpenAIRE
 
20200130_Mannocci_OpenAIRE_ResearchGraph
20200130_Mannocci_OpenAIRE_ResearchGraph20200130_Mannocci_OpenAIRE_ResearchGraph
20200130_Mannocci_OpenAIRE_ResearchGraphOpenAIRE
 
IPR and Exploitation
IPR and Exploitation IPR and Exploitation
IPR and Exploitation OpenAIRE
 
Eosc_OpenAIRE_onboarding_v2
Eosc_OpenAIRE_onboarding_v2Eosc_OpenAIRE_onboarding_v2
Eosc_OpenAIRE_onboarding_v2OpenAIRE
 
Open Science infrastructure in the EU
Open Science infrastructure in the EUOpen Science infrastructure in the EU
Open Science infrastructure in the EUOpenAIRE
 
OpenAIRE Open Innovation call: Next Generation Repositories
OpenAIRE Open Innovation call: Next Generation RepositoriesOpenAIRE Open Innovation call: Next Generation Repositories
OpenAIRE Open Innovation call: Next Generation RepositoriesOpenAIRE
 

Plus de OpenAIRE (20)

Open Research Gateway for the ELIXIR-GR Infrastructure (Part 2)
Open Research Gateway for the ELIXIR-GR Infrastructure (Part 2)Open Research Gateway for the ELIXIR-GR Infrastructure (Part 2)
Open Research Gateway for the ELIXIR-GR Infrastructure (Part 2)
 
Open Research Gateway for the ELIXIR-GR Infrastructure (Part 1)
Open Research Gateway for the ELIXIR-GR Infrastructure (Part 1)Open Research Gateway for the ELIXIR-GR Infrastructure (Part 1)
Open Research Gateway for the ELIXIR-GR Infrastructure (Part 1)
 
6th Content Providers Community Call
6th Content Providers Community Call6th Content Providers Community Call
6th Content Providers Community Call
 
20200504_OpenAIRE Legal Policy Webinar: GDPR and Sharing Data
20200504_OpenAIRE Legal Policy Webinar: GDPR and Sharing Data20200504_OpenAIRE Legal Policy Webinar: GDPR and Sharing Data
20200504_OpenAIRE Legal Policy Webinar: GDPR and Sharing Data
 
20200504_Research Data & the GDPR: How Open is Open?
20200504_Research Data & the GDPR: How Open is Open?20200504_Research Data & the GDPR: How Open is Open?
20200504_Research Data & the GDPR: How Open is Open?
 
20200504_Data, Data Ownership and Open Science
20200504_Data, Data Ownership and Open Science20200504_Data, Data Ownership and Open Science
20200504_Data, Data Ownership and Open Science
 
20200429_Research Data & the GDPR: How Open is Open? (updated version)
20200429_Research Data & the GDPR: How Open is Open? (updated version)20200429_Research Data & the GDPR: How Open is Open? (updated version)
20200429_Research Data & the GDPR: How Open is Open? (updated version)
 
20200429_Data, Data Ownership and Open Science
20200429_Data, Data Ownership and Open Science20200429_Data, Data Ownership and Open Science
20200429_Data, Data Ownership and Open Science
 
20200429_OpenAIRE Legal Policy Webinar: GDPR and Sharing Data
20200429_OpenAIRE Legal Policy Webinar: GDPR and Sharing Data20200429_OpenAIRE Legal Policy Webinar: GDPR and Sharing Data
20200429_OpenAIRE Legal Policy Webinar: GDPR and Sharing Data
 
COVID-19: Activities, tools, best practice and contact points in Greece
 COVID-19: Activities, tools, best practice and contact points in Greece COVID-19: Activities, tools, best practice and contact points in Greece
COVID-19: Activities, tools, best practice and contact points in Greece
 
5th Content Providers Community Call
5th Content Providers Community Call5th Content Providers Community Call
5th Content Providers Community Call
 
4th Content Providers Community Call
4th Content Providers Community Call4th Content Providers Community Call
4th Content Providers Community Call
 
3rd Content Providers Community Call
3rd Content Providers Community Call3rd Content Providers Community Call
3rd Content Providers Community Call
 
2nd Content Providers Community Call
2nd Content Providers Community Call2nd Content Providers Community Call
2nd Content Providers Community Call
 
1st Content Providers Community Call
1st Content Providers Community Call1st Content Providers Community Call
1st Content Providers Community Call
 
20200130_Mannocci_OpenAIRE_ResearchGraph
20200130_Mannocci_OpenAIRE_ResearchGraph20200130_Mannocci_OpenAIRE_ResearchGraph
20200130_Mannocci_OpenAIRE_ResearchGraph
 
IPR and Exploitation
IPR and Exploitation IPR and Exploitation
IPR and Exploitation
 
Eosc_OpenAIRE_onboarding_v2
Eosc_OpenAIRE_onboarding_v2Eosc_OpenAIRE_onboarding_v2
Eosc_OpenAIRE_onboarding_v2
 
Open Science infrastructure in the EU
Open Science infrastructure in the EUOpen Science infrastructure in the EU
Open Science infrastructure in the EU
 
OpenAIRE Open Innovation call: Next Generation Repositories
OpenAIRE Open Innovation call: Next Generation RepositoriesOpenAIRE Open Innovation call: Next Generation Repositories
OpenAIRE Open Innovation call: Next Generation Repositories
 

Dernier

COST ESTIMATION FOR A RESEARCH PROJECT.pptx
COST ESTIMATION FOR A RESEARCH PROJECT.pptxCOST ESTIMATION FOR A RESEARCH PROJECT.pptx
COST ESTIMATION FOR A RESEARCH PROJECT.pptxFarihaAbdulRasheed
 
GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)Areesha Ahmad
 
300003-World Science Day For Peace And Development.pptx
300003-World Science Day For Peace And Development.pptx300003-World Science Day For Peace And Development.pptx
300003-World Science Day For Peace And Development.pptxryanrooker
 
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...chandars293
 
Call Girls Ahmedabad +917728919243 call me Independent Escort Service
Call Girls Ahmedabad +917728919243 call me Independent Escort ServiceCall Girls Ahmedabad +917728919243 call me Independent Escort Service
Call Girls Ahmedabad +917728919243 call me Independent Escort Serviceshivanisharma5244
 
Connaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verified
Connaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verifiedConnaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verified
Connaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verifiedDelhi Call girls
 
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 60009654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000Sapana Sha
 
Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....
Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....
Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....muralinath2
 
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPirithiRaju
 
Pulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceuticsPulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceuticssakshisoni2385
 
pumpkin fruit fly, water melon fruit fly, cucumber fruit fly
pumpkin fruit fly, water melon fruit fly, cucumber fruit flypumpkin fruit fly, water melon fruit fly, cucumber fruit fly
pumpkin fruit fly, water melon fruit fly, cucumber fruit flyPRADYUMMAURYA1
 
module for grade 9 for distance learning
module for grade 9 for distance learningmodule for grade 9 for distance learning
module for grade 9 for distance learninglevieagacer
 
Digital Dentistry.Digital Dentistryvv.pptx
Digital Dentistry.Digital Dentistryvv.pptxDigital Dentistry.Digital Dentistryvv.pptx
Digital Dentistry.Digital Dentistryvv.pptxMohamedFarag457087
 
Factory Acceptance Test( FAT).pptx .
Factory Acceptance Test( FAT).pptx       .Factory Acceptance Test( FAT).pptx       .
Factory Acceptance Test( FAT).pptx .Poonam Aher Patil
 
Module for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learningModule for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learninglevieagacer
 
Thyroid Physiology_Dr.E. Muralinath_ Associate Professor
Thyroid Physiology_Dr.E. Muralinath_ Associate ProfessorThyroid Physiology_Dr.E. Muralinath_ Associate Professor
Thyroid Physiology_Dr.E. Muralinath_ Associate Professormuralinath2
 
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsBiogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsSérgio Sacani
 
Sector 62, Noida Call girls :8448380779 Model Escorts | 100% verified
Sector 62, Noida Call girls :8448380779 Model Escorts | 100% verifiedSector 62, Noida Call girls :8448380779 Model Escorts | 100% verified
Sector 62, Noida Call girls :8448380779 Model Escorts | 100% verifiedDelhi Call girls
 
SAMASTIPUR CALL GIRL 7857803690 LOW PRICE ESCORT SERVICE
SAMASTIPUR CALL GIRL 7857803690  LOW PRICE  ESCORT SERVICESAMASTIPUR CALL GIRL 7857803690  LOW PRICE  ESCORT SERVICE
SAMASTIPUR CALL GIRL 7857803690 LOW PRICE ESCORT SERVICEayushi9330
 
Conjugation, transduction and transformation
Conjugation, transduction and transformationConjugation, transduction and transformation
Conjugation, transduction and transformationAreesha Ahmad
 

Dernier (20)

COST ESTIMATION FOR A RESEARCH PROJECT.pptx
COST ESTIMATION FOR A RESEARCH PROJECT.pptxCOST ESTIMATION FOR A RESEARCH PROJECT.pptx
COST ESTIMATION FOR A RESEARCH PROJECT.pptx
 
GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)
 
300003-World Science Day For Peace And Development.pptx
300003-World Science Day For Peace And Development.pptx300003-World Science Day For Peace And Development.pptx
300003-World Science Day For Peace And Development.pptx
 
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
 
Call Girls Ahmedabad +917728919243 call me Independent Escort Service
Call Girls Ahmedabad +917728919243 call me Independent Escort ServiceCall Girls Ahmedabad +917728919243 call me Independent Escort Service
Call Girls Ahmedabad +917728919243 call me Independent Escort Service
 
Connaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verified
Connaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verifiedConnaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verified
Connaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verified
 
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 60009654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
 
Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....
Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....
Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....
 
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
 
Pulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceuticsPulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceutics
 
pumpkin fruit fly, water melon fruit fly, cucumber fruit fly
pumpkin fruit fly, water melon fruit fly, cucumber fruit flypumpkin fruit fly, water melon fruit fly, cucumber fruit fly
pumpkin fruit fly, water melon fruit fly, cucumber fruit fly
 
module for grade 9 for distance learning
module for grade 9 for distance learningmodule for grade 9 for distance learning
module for grade 9 for distance learning
 
Digital Dentistry.Digital Dentistryvv.pptx
Digital Dentistry.Digital Dentistryvv.pptxDigital Dentistry.Digital Dentistryvv.pptx
Digital Dentistry.Digital Dentistryvv.pptx
 
Factory Acceptance Test( FAT).pptx .
Factory Acceptance Test( FAT).pptx       .Factory Acceptance Test( FAT).pptx       .
Factory Acceptance Test( FAT).pptx .
 
Module for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learningModule for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learning
 
Thyroid Physiology_Dr.E. Muralinath_ Associate Professor
Thyroid Physiology_Dr.E. Muralinath_ Associate ProfessorThyroid Physiology_Dr.E. Muralinath_ Associate Professor
Thyroid Physiology_Dr.E. Muralinath_ Associate Professor
 
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsBiogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
 
Sector 62, Noida Call girls :8448380779 Model Escorts | 100% verified
Sector 62, Noida Call girls :8448380779 Model Escorts | 100% verifiedSector 62, Noida Call girls :8448380779 Model Escorts | 100% verified
Sector 62, Noida Call girls :8448380779 Model Escorts | 100% verified
 
SAMASTIPUR CALL GIRL 7857803690 LOW PRICE ESCORT SERVICE
SAMASTIPUR CALL GIRL 7857803690  LOW PRICE  ESCORT SERVICESAMASTIPUR CALL GIRL 7857803690  LOW PRICE  ESCORT SERVICE
SAMASTIPUR CALL GIRL 7857803690 LOW PRICE ESCORT SERVICE
 
Conjugation, transduction and transformation
Conjugation, transduction and transformationConjugation, transduction and transformation
Conjugation, transduction and transformation
 

Amnesia: Data anonymization made easy

  • 1. Amnesia Data anonymization made easy https://amnesia.openaire.eu Manolis Terrovitis mter@imis.athena-innovation.gr http://web.imsi.athenarc.gr/~mter/ Research Center Athena, IMSI Amnesia – Webinar 24/4/2018
  • 2. Data anonymization? • Data anonymization facilitates the publication of micro data(vs. aggregated macrodata) , e.g., data used in scientific research • Micro data often reveal important private information, e.g., the medical condition of a person o Individuals are afraid to provide their data o Companies are afraid to share data with experts o GDPR makes a strict protection scheme obligatory • The aim of anonymization methods is to allow sharing such data, without compromising the privacy of the users. Amnesia - Webinar 24/4/2018
  • 3. Data anonymization and Amnesia • Data anonymization • Removal of direct identifiers, e.g., Names, SSN etc • Removal of infrequent combinations of quasi-identifiers, e.g., unique combinations of birth dates and zipcodes • Infrequent combinations are removed through generalization, e.g., birth date 14/01/1977 becomes **/**/1977 • Amnesia is a scalable anonymization tool • It offers several versions of k-anonymity • It allows the user to select and customize possible solutions • It offers graphical tools that allow the user to analyze the anonymized dataset • It is scalable and uses all available CPU cores in the anonymization process Amnesia - Webinar 24/4/2018
  • 4. Link attacks Amnesia - Webinar 24/4/2018
  • 5. k-anonymity • Each entry becomes indistinguishable from other k-1 entries o k-anonymity is achieved through suppression and generalization id Zipcode Age National. Disease 1 13053 28 Russian Heart Disease 2 13068 29 American Heart Disease 3 13068 21 Japanese Viral Infection 4 13053 23 American Viral Infection 5 14853 50 Indian Cancer 6 14853 55 Russian Heart Disease 7 14850 47 American Viral Infection 8 14850 49 American Viral Infection 9 13053 31 American Cancer 10 13053 37 Indian Cancer 11 13068 36 Japanese Cancer 12 13068 35 American Cancer id Zipcode Age National. Disease 1 130** <30 ∗ Heart Disease 2 130** <30 ∗ Heart Disease 3 130** <30 ∗ Viral Infection 4 130** <30 ∗ Viral Infection 5 1485* ≥40 ∗ Cancer 6 1485* ≥40 ∗ Heart Disease 7 1485* ≥40 ∗ Viral Infection 8 1485* ≥40 ∗ Viral Infection 9 130** 3∗ ∗ Cancer 10 130** 3∗ ∗ Cancer 11 130** 3∗ ∗ Cancer 12 130** 3∗ ∗ Cancer Amnesia - Webinar 24/4/2018
  • 6. Generalization Hierarchy Amnesia - Webinar 24/4/2018 7 9 16 18 0-10 10-20 *
  • 7. Structural information • We need to anonymize all relevant information about a person, not just a tuple • Information tends to gather over time • Information is linked through semantic properties, it’s schema is irrelevant • Personal data tend to accumulate over time • Research focuses on simple data and complicated guaranties but real world has complex data and requires simple guaranties Amnesia - Webinar 24/4/2018
  • 8. Limitsofk-anonymity • 2-anonymous Fruits Meat Vegetables Fish Vassilis Χ Χ Manolis Χ Χ Χ Eleni Χ Maria Χ Χ Kostas Χ Χ Food Vassilis Χ Manolis Χ Eleni Χ Maria Χ Kostas Χ Amnesia - Webinar 24/4/2018
  • 9. km-anonymity • 22-anonymous • Any combination of m items will not appear less than k times Fruits Meat Vegetables Fish Vassilis Χ Χ Manolis Χ Χ Χ Eleni Χ Maria Χ Χ Kostas Χ Χ Fruits Meat Other food Vassilis Χ Χ Manolis X Χ X Eleni X Maria Χ X Kostas Χ X Amnesia - Webinar 24/4/2018
  • 10. Strengths and Weaknesses • Strengths o Simple to understand • Can be the basis for consent o Close to previous and existing legal definitions o Low information loss o Customizable by non-experts • Weaknesses o Not very strict o Does not take into account sensitive values Amnesia - Webinar 24/4/2018
  • 11. Anonymization challenges • Anonymization techniques have not been tested in practice extensively o Mapping the social notion of privacy to technical notions is not easy • Data utility has not been studied extensively in research o Few artificial information loss measures • Data utility is difficult to estimate in practice o Different applications have different needs o No easy to quantify the loss of information Amnesia - Webinar 24/4/2018
  • 12. Amensia • Amnesia is a data anonymization tool developed by Research Center Athena • Amnesia is build with Java and Javascript • k-anonymity and km-anonymity • Tuples and set-values • Visual tools o Estimating data utility o Building hierarchies o Customizing anonymization solutions Amnesia - Webinar 24/4/2018
  • 13. Amnesia status • Amnesia is available as a public beta version at o https://amnesia.openaire.eu • On-line version is for demonstration and testing purposes mostly • Sensitive data can be anonymized locally by downloading the application o Security o Scalability • We are in process of adjusting it to health data Amnesia - Webinar 24/4/2018
  • 14. Amensia Challenges Is it easy to use by data owners? Are anoymized data useful? Amnesia - Webinar 24/4/2018 • Give us feedback!! o amnesia-helpdesk@imis.athena- innovation.gr • Can it anonymize your data? o Let us know about your use case o Ask us for help • We need feedback for data analysis o Let us know if you have shared anonymized results • Please contact us with your needs
  • 15. Next steps Work on the feedback More features Amnesia - Webinar 24/4/2018 • Improve user experience • Add support for specific domain data • Fix bugs! • New algorithms o Additional privacy guaranties o More data types • Better scaling capabilities o Disk based solutions o More efficient memory usage