SlideShare une entreprise Scribd logo
1  sur  36
Télécharger pour lire hors ligne
More generally: A discussion of ethics for data, research,
and publishing
Mason A. Porter (@masonporter)
Department of Mathematics
UCLA
§ It’s important.
§ People need to be able to replicate our work.
§ Making sure their own code is correct
§ Natural self-correction in science (and ability to understand precisely every choice we
make in our work)
§ Not traditionally part of mathematical training, but increasingly we are using social
data — including potentially personal data — in our research
§ We use a lot more real data nowadays, and in particular this includes a lot of human
(and animal) data.
§ Much less a part of the research (and thus training) tradition in mathematics than in other
disciplines
§ Other disciplines have thought a lot more about ethics than mathematics
§ In many cases, unfortunately, because they’ve messed up the ethics historically,
sometimes substantially, and we need to learn from the best practices they’ve developed
"Look, lady. Just because my grandfather didn't rape the
environment and exploit the workers doesn't make me a
peasant. And it's not that he didn't want to rape the
environment and exploit the workers; I'm sure he did. It's
just that as a barber, he didn't have that much opportunity."
– Roger Cobb [Steve Martin], All of Me (1984)
Thanks to Peter Mucha for
the quote suggestion (and
an excuse to allude to this
movie)
§ Be honest and fair (obviously)
§ Design ethically thoughtful research
§ Explain your decisions to others
§ [Points 2 and 3 taken from slides by Matt Salganik]
FOUR PRINCIPLES
§ Respect for persons
§ (Note: Animal research also has thorny ethical issues!)
§ Beneficence
§ Justice
§ Respect for Law and Public Interest
How do you balance these four principles?
§ Always be honest about your work
§ If you are working with personal data, you need to check with your Institutional
Review Board (IRB) to ensure that you are doing the work in an ethical way.
§ They may tell you that you don’t need to submit a formal application, or they may tell you
that you do. Let them know briefly what data you have access to (or plan to acquire, and
how) and what you plan to do with it.
§ Different IRBs of course can rule differently.
§ Rules differ in different countries
§ Human data versus animal data
§ In these slides, I have human data in mind, but animal data and its acquisition of course also has
major ethical considerations.
§ Look through UCLA’s website for the Office of the Human Research Protection
Program (OHRPP): http://ora.research.ucla.edu/ohrpp/Pages/OHRPPHome.aspx
§ “IRB is a floor, not a ceiling” (from Matt Salganik’s slides)
§ A well-known, heavily-used set of courses:
https://www.citiprogram.org/index.cfm?pageID=86
§ I found this from a link from UCLA’s OHRPP website.
§ Several years ago, I did some IRB training. (When preparing these slides, I couldn’t
find the specific online course I took.) In addition to helping to think about issues, if
something does go wrong, you do (from a practical point of view) want to be able to
say that you have appropriate ethics training.
§ Note:The training required/expected/available differs substantially across
countries.
§ Example: From my experience, my impression is that the UK appears to be less stringent
about human data than the US, but it appears to be more stringent about non-human
animals.
§ The more your research has the potential to violate personal privacy, the more
helpful for humanity the outcome needs to have the potential to be.
§ Informed Consent
§ Understanding and managing informational risk
§ Privacy
§ Making decisions in the face of uncertainty
§ Other notes
§ Put yourself in everyone else’s shoes
§ Think of research ethics as continuous, not discrete (sliding scale)
Bullet points from Matt Salganik’s slides
§ You must provide sufficient (and precise) detail for people to be able to replicate
your work!
§ Try to include it in your papers, but people are human, so if somebody e-mails you
to ask for a clarification, copy of code (even if poorly commented), or something
else, you should respond and send it to them, provide it’s something that you have
the right to send them.
§ To the extent possible, you should publish your data and usable (and well-commented) code
along with your work.
§ There can be tension between these ideals and issues of personal privacy, nondisclosure agreements, and
so on.
§ If using synthetic data, publish code to generate the data and the generated examples that you
used in your paper.
§ Supplementary material for the paper on the journal website, Github, Figshare, and other venues
§ Likely relevant for literally all of you
§ E.g., if you are doing any numerical computations at all, this is desirable
§ E.g., adjacency matrices for graphs in a definition–theorem–proof paper is also useful for readers (though
level of necessity depends on how large the graphs are)
§ Admission: I have been trying to get better about this over the years. I am very good about
responding to e-mail queries, and the goal (though there exist practical considerations) is to be
precise about all of my steps and to put as much online as feasible.
§ For empirical data, if you have permission to post something (e.g., does the data
“belong” to somebody else?) and it doesn’t invade privacy, you should post it
because that promotes good science.
§ Alternative name:“replication crisis”
§ https://en.wikipedia.org/wiki/Replication_crisis
Take a look, e.g., at the work of Victoria Snodden:
http://web.stanford.edu/~vcs/
§ Be explicit about anything you did, so that others can know what choices you made
and evaluate whether they think it is the best procedure for your analysis
§ E.g., sampling biases change properties of data
§ There are many reasons that one makes choices, so it’s not that you shouldn’t make
them, but it’s part of your scientific procedure, so tell people exactly what you did
so they know exactly what these choices were. (They may want to make different
choices.)
§ “Manipulating” is a loaded word; here I mean it in a neutral way (i.e.,“changes”),
rather than in a negative one.
§ When are things actually “anonymous”
§ Is “full” anonymization even possible?
Slide from Matt Salganik
Slide from Matt Salganik
§ https://en.wikipedia.org/wiki/Netflix_Prize#Cancelled_sequel
§ Acknowledge all sources of data
§ Include precise means of how you got data and how somebody else can get the
data (e.g., who do they contact?), especially if there is a reason that you are unable
to post the data itself
§ Be generous when acknowledging people in papers: useful discussions, ideas, etc.
§ Be fair and appropriate when discussing work by authors in past papers
§ You are standing on the shoulders of giants. :) Given credit where it is due.
§ Difference between somebody “showing” something in a past paper versus “reporting” it.
The former is a statement of verifying validity; the latter is a historical fact (assuming what
you write is accurate).
§ There can be complications in posting data to the public, no matter how well-
intentioned.
§ This is a great data set to advance several avenues of research in network science, and my goal
is for people to be able to do that.
§ Learning the hard way
§ Urgently arranging a phone meeting with the head of Facebook’s Data Science team
§ An important learning experience for me
§ A small chapter in the long story of data privacy
§ A blog entry that is very critical of me (though this differs from my side of the story):
http://www.michaelzimmer.org/2011/02/15/facebook-data-of-1-2-million-users-from-2005-
released/
§ Led to my learning much more about these issues (though under very stressful circumstances),
a page about research using human data in Oxford’s Mathematical Institute, etc.
§ https://www.maths.ox.ac.uk/members/policies/data-protection/research-using-data-involving-humans
§ Research in collaboration with companies or government:What is it ok to include
in a publication or post online?
§ Tension between open data and personal privacy
§ Terms-of-service agreements and nondisclosure agreements
§ In what sense can you replicate work if you can’t post everything?
§ “Softer” replication: do you observe similar phenomena in circumstances that have some
similarities but are not the same?
§ E.g., human behavior in different social networks
§ See, e.g., the discussion around this paper:
http://science.sciencemag.org/content/early/2015/05/06/science.aaa1160.full
§ Eytan Bakshy, Solomon Messing, & Lada Adamic, Exposure to ideologically diverse news
and opinion on Facebook, Science, 2015
§ They can’t tell us Facebook’s sampling algorithm, so how are we as scientists going to go
about “replicating” their work?
§ Note: Do their insights apply to other online social networks? One should be able to do a weaker
form of replication such that the most interesting qualitative results are not merely a property of
specifics on Facebook
§ Also:What about this work being public versus being entirely within Facebook and
us never seeing any of it?
§ A. D. I. Kramer, J. E. Guillory, and J.T. Hancock. Experimental evidence of massive-
scale emotional contagion through social networks. Proceedings of the National
Academy of Sciences of the United States of America, 111(24):8788–8790, 2014
§ Look up articles on this one
§ Experiments on Facebook with changes in people’s feeds
§ Also:What about this work being public versus being entirely within Facebook and
us never seeing any of it?
Note: Academic researchers have IRBs that need to approve a study before it starts, whereas
Facebook has a publication review board to approve publication of a study after it's already been
done.Thus, we know that this study occurred because FB concluded that it could be published.
We don’t know about what stuff is done with our data from FB and other companies when it
doesn’t get published.
Should academic researchers and
companies follow the same rules?
§ You can apply this comment generally to “data science” if you like, though the
property of connectivity in networks provides substantial additional issues beyond
just data science (and “Big Data”, etc.).
§ Short essay by Johan Ugander (Management Science & Engineering, Stanford):
https://medium.com/@jugander/truth-lies-and-an-ethics-of-personalization-
e4ccfa7f2b84#.rzap3hm70
§ As an example, he discusses “Cambridge Analytica, identified by the NY Times as
the hired guns behind Trump’s online targeting.”
§ Alexander Nix (CEO of CA) gave the following example in a video. Quoting
Ugander’s essay:“if you own a private beach, he notes, you’d have more success
keeping people off your beach by putting up a “Warning: sharks beyond this point”
sign vs. a “private property” sign.The problem is: he recommends this strategy —
 and personalized versions of it — without any consideration to whether there
actually are any sharks, advocating “behavioral communication” that is completely
detached from any truth about reality. In fewer words: crafting lies, and then
targeting them.”
§ http://callingbullshit.org
§ Full title:“Calling Bullshit in the Age of Big Data”
§ A course designed by Carl Bergstrom and Jevin West (University of Washington)
§ Excellent syllabus and reading materials
§ Various parts of it relate to ethics, and they also have a unit directly about ethics:
http://callingbullshit.org/syllabus.html#Ethics
§ Targeted advertising (different trailers for people of different races) for the movie
"Straight outta Compton": http://www.businessinsider.com/why-straight-outta-
compton-had-different-trailers-for-people-of-different-races
§ Different levels of prior familiarity with gangsta rap pioneers N.W. A. (Ice Cube, Dr. Dre, etc.)
§ Papers by Arvind Narayanan and collaborators, including:
§ http://senglehardt.com/papers/ccs16_online_tracking.pdf
§ https://5harad.com/papers/twivacy.pdf
§ J. Su et al.,“De-anonymizing Web Browsing Data with Social Networks”, 2016
§ C. Kanich et al.,“Spamalytics: An Empirical Analysis of Spam Marketing Conversion”
(2008):
§ http://www.umiacs.umd.edu/~tdumitra/courses/ENEE757/Fall14/papers/Kanich08.pdf
§ B. Markines et al.,“Social spam detection” (2009):
§ http://dl.acm.org/citation.cfm?doid=1531914.1531924
§ “Tastes,Ties, and Time” Facebook data set
§ One discussion about the controversy associated with this data set:
http://www.chronicle.com/article/Harvards-Privacy-Meltdown/128166/
§ Research by Sinan Aral and collaborators on manipulation of voting on social
media sites
§ One discussion: https://techcrunch.com/2013/08/11/reddit-science-herd/
§ Mathematicians are relatively new to using human data, but we don’t yet have the
ethics training to help us deal with the thorny issues
§ Learn from the best practices (and past mistakes) from other disciplines
§ As in those other disciplines, mathematicians should be getting ethics training
§ Read about — and think about and discuss — various controversies and other
studies.We all may set our bars in a different place, but we need to do it
conscientiously.
§ It’s a sliding bar: the more potential for invasion of personal privacy, the more valuable
the potential outcome has to be for humanity
§ IRB approval is only a lower bound
§ While I have more training and experience with these issues than most
mathematicians, I am very much an amateur on data ethics compared to people
from the social and human sciences, for whom this is a standard part of the training
from the beginning of their education.
§ With this in mind, please contact me with any suggestions on these slides. Did I
miss any salient points? Do you disagree with any of the discussed points? Are
there any other studies that are especially crucial to bring up?
§ Eventually, I hope to develop these slides further into an article for a venue in the
mathematical sciences. Let me know if you are interested in being involved in
writing this article.
§ Several suggestions for resources from Johan Ugander
§ Several comments on my slides and suggestions for resources from Peter Mucha
§ Website from Matt Salganik’s class on Computational Social Science (Fall 2016):
http://www.princeton.edu/~mjs3/soc596_f2016/
§ I drew some material and ideas from his slides on ethics
§ It would be pretty ironic if I plagiarized these slides, wouldn’t it?

Contenu connexe

Tendances

05 Communities in Network
05 Communities in Network05 Communities in Network
05 Communities in Networkdnac
 
01 Introduction to Networks Methods and Measures
01 Introduction to Networks Methods and Measures01 Introduction to Networks Methods and Measures
01 Introduction to Networks Methods and Measuresdnac
 
Introduction to Topological Data Analysis
Introduction to Topological Data AnalysisIntroduction to Topological Data Analysis
Introduction to Topological Data AnalysisMason Porter
 
13 An Introduction to Stochastic Actor-Oriented Models (aka SIENA)
13 An Introduction to Stochastic Actor-Oriented Models (aka SIENA)13 An Introduction to Stochastic Actor-Oriented Models (aka SIENA)
13 An Introduction to Stochastic Actor-Oriented Models (aka SIENA)dnac
 
Social network analysis intro part I
Social network analysis intro part ISocial network analysis intro part I
Social network analysis intro part ITHomas Plotkowiak
 
Social Network Theory and Google
Social Network Theory and GoogleSocial Network Theory and Google
Social Network Theory and GoogleEdward Alonzo
 
Social Network Analysis Workshop
Social Network Analysis WorkshopSocial Network Analysis Workshop
Social Network Analysis WorkshopData Works MD
 
Ethical and Legal Issues in Computational Social Science - Lecture 7 in Intro...
Ethical and Legal Issues in Computational Social Science - Lecture 7 in Intro...Ethical and Legal Issues in Computational Social Science - Lecture 7 in Intro...
Ethical and Legal Issues in Computational Social Science - Lecture 7 in Intro...Lauri Eloranta
 
Intro to social network analysis | What is Network Analysis? | History of (So...
Intro to social network analysis | What is Network Analysis? | History of (So...Intro to social network analysis | What is Network Analysis? | History of (So...
Intro to social network analysis | What is Network Analysis? | History of (So...Gaditek
 
A Perspective on Graph Theory and Network Science
A Perspective on Graph Theory and Network ScienceA Perspective on Graph Theory and Network Science
A Perspective on Graph Theory and Network ScienceMarko Rodriguez
 
00 Introduction to SN&H: Key Concepts and Overview
00 Introduction to SN&H: Key Concepts and Overview00 Introduction to SN&H: Key Concepts and Overview
00 Introduction to SN&H: Key Concepts and OverviewDuke Network Analysis Center
 
Social Network Analysis - Lecture 4 in Introduction to Computational Social S...
Social Network Analysis - Lecture 4 in Introduction to Computational Social S...Social Network Analysis - Lecture 4 in Introduction to Computational Social S...
Social Network Analysis - Lecture 4 in Introduction to Computational Social S...Lauri Eloranta
 

Tendances (20)

04 Network Data Collection
04 Network Data Collection04 Network Data Collection
04 Network Data Collection
 
03 Communities in Networks (2017)
03 Communities in Networks (2017)03 Communities in Networks (2017)
03 Communities in Networks (2017)
 
05 Communities in Network
05 Communities in Network05 Communities in Network
05 Communities in Network
 
01 Introduction to Networks Methods and Measures
01 Introduction to Networks Methods and Measures01 Introduction to Networks Methods and Measures
01 Introduction to Networks Methods and Measures
 
02 Descriptive Statistics (2017)
02 Descriptive Statistics (2017)02 Descriptive Statistics (2017)
02 Descriptive Statistics (2017)
 
Introduction to Topological Data Analysis
Introduction to Topological Data AnalysisIntroduction to Topological Data Analysis
Introduction to Topological Data Analysis
 
01 Network Data Collection (2017)
01 Network Data Collection (2017)01 Network Data Collection (2017)
01 Network Data Collection (2017)
 
15 Network Visualization and Communities
15 Network Visualization and Communities15 Network Visualization and Communities
15 Network Visualization and Communities
 
13 An Introduction to Stochastic Actor-Oriented Models (aka SIENA)
13 An Introduction to Stochastic Actor-Oriented Models (aka SIENA)13 An Introduction to Stochastic Actor-Oriented Models (aka SIENA)
13 An Introduction to Stochastic Actor-Oriented Models (aka SIENA)
 
13 Community Detection
13 Community Detection13 Community Detection
13 Community Detection
 
07 Network Visualization
07 Network Visualization07 Network Visualization
07 Network Visualization
 
Social network analysis intro part I
Social network analysis intro part ISocial network analysis intro part I
Social network analysis intro part I
 
Social Network Theory and Google
Social Network Theory and GoogleSocial Network Theory and Google
Social Network Theory and Google
 
Social Network Analysis Workshop
Social Network Analysis WorkshopSocial Network Analysis Workshop
Social Network Analysis Workshop
 
Ethical and Legal Issues in Computational Social Science - Lecture 7 in Intro...
Ethical and Legal Issues in Computational Social Science - Lecture 7 in Intro...Ethical and Legal Issues in Computational Social Science - Lecture 7 in Intro...
Ethical and Legal Issues in Computational Social Science - Lecture 7 in Intro...
 
Intro to social network analysis | What is Network Analysis? | History of (So...
Intro to social network analysis | What is Network Analysis? | History of (So...Intro to social network analysis | What is Network Analysis? | History of (So...
Intro to social network analysis | What is Network Analysis? | History of (So...
 
A Perspective on Graph Theory and Network Science
A Perspective on Graph Theory and Network ScienceA Perspective on Graph Theory and Network Science
A Perspective on Graph Theory and Network Science
 
00 Introduction to SN&H: Key Concepts and Overview
00 Introduction to SN&H: Key Concepts and Overview00 Introduction to SN&H: Key Concepts and Overview
00 Introduction to SN&H: Key Concepts and Overview
 
Social Network Analysis - Lecture 4 in Introduction to Computational Social S...
Social Network Analysis - Lecture 4 in Introduction to Computational Social S...Social Network Analysis - Lecture 4 in Introduction to Computational Social S...
Social Network Analysis - Lecture 4 in Introduction to Computational Social S...
 
10 Network Experiments
10 Network Experiments10 Network Experiments
10 Network Experiments
 

En vedette

Slide show test
Slide show testSlide show test
Slide show testsdhamilton
 
Vinmonopolet - Norges beste omdømme
Vinmonopolet - Norges beste omdømmeVinmonopolet - Norges beste omdømme
Vinmonopolet - Norges beste omdømmeApeland
 
[Android] Multimedia Programming
[Android] Multimedia Programming[Android] Multimedia Programming
[Android] Multimedia ProgrammingNikmesoft Ltd
 
Digital news report 2016 Reuters
Digital news report 2016 ReutersDigital news report 2016 Reuters
Digital news report 2016 ReutersPatricia Bertolotti
 
Question 2
Question 2Question 2
Question 2Lara243
 
Me&U - Atkins Official Newsletter
Me&U - Atkins Official NewsletterMe&U - Atkins Official Newsletter
Me&U - Atkins Official Newsletternkumarmba
 
Richard Holbrooke's World: Memories of a Newsweek Special Correspondent - A Q...
Richard Holbrooke's World: Memories of a Newsweek Special Correspondent - A Q...Richard Holbrooke's World: Memories of a Newsweek Special Correspondent - A Q...
Richard Holbrooke's World: Memories of a Newsweek Special Correspondent - A Q...GLOBAL HEAVYLIFT HOLDINGS
 
Ajar v2-14-the-full-monty
Ajar v2-14-the-full-montyAjar v2-14-the-full-monty
Ajar v2-14-the-full-montyguuled
 

En vedette (20)

SLAS2015 JALA & JBS Author Workshop: How to Get Your Work Published
SLAS2015 JALA & JBS Author Workshop: How to Get Your Work PublishedSLAS2015 JALA & JBS Author Workshop: How to Get Your Work Published
SLAS2015 JALA & JBS Author Workshop: How to Get Your Work Published
 
Slide show test
Slide show testSlide show test
Slide show test
 
SLAS Tips for Scientific Publishing
SLAS Tips for Scientific PublishingSLAS Tips for Scientific Publishing
SLAS Tips for Scientific Publishing
 
SLAS Labware Leachables Special Interest Group SLAS2017 Presentation
SLAS Labware Leachables Special Interest Group SLAS2017 PresentationSLAS Labware Leachables Special Interest Group SLAS2017 Presentation
SLAS Labware Leachables Special Interest Group SLAS2017 Presentation
 
Praveen Balimane Addresses SLAS ADMET Special Interest Group at SLAS2015
Praveen Balimane Addresses SLAS ADMET Special Interest Group at SLAS2015Praveen Balimane Addresses SLAS ADMET Special Interest Group at SLAS2015
Praveen Balimane Addresses SLAS ADMET Special Interest Group at SLAS2015
 
Robert T. Dunn, II, Ph.D., DABT, SLAS ADMET Special Interest Group Meeting p...
 Robert T. Dunn, II, Ph.D., DABT, SLAS ADMET Special Interest Group Meeting p... Robert T. Dunn, II, Ph.D., DABT, SLAS ADMET Special Interest Group Meeting p...
Robert T. Dunn, II, Ph.D., DABT, SLAS ADMET Special Interest Group Meeting p...
 
Vinmonopolet - Norges beste omdømme
Vinmonopolet - Norges beste omdømmeVinmonopolet - Norges beste omdømme
Vinmonopolet - Norges beste omdømme
 
FINAL_Report
FINAL_ReportFINAL_Report
FINAL_Report
 
Libro de calculo 3
Libro de calculo 3Libro de calculo 3
Libro de calculo 3
 
[Android] Multimedia Programming
[Android] Multimedia Programming[Android] Multimedia Programming
[Android] Multimedia Programming
 
Digital news report 2016 Reuters
Digital news report 2016 ReutersDigital news report 2016 Reuters
Digital news report 2016 Reuters
 
The M-Jirga
The M-JirgaThe M-Jirga
The M-Jirga
 
Cambodia
Cambodia Cambodia
Cambodia
 
Geo study
Geo study Geo study
Geo study
 
Question 2
Question 2Question 2
Question 2
 
Me&U - Atkins Official Newsletter
Me&U - Atkins Official NewsletterMe&U - Atkins Official Newsletter
Me&U - Atkins Official Newsletter
 
Richard Holbrooke's World: Memories of a Newsweek Special Correspondent - A Q...
Richard Holbrooke's World: Memories of a Newsweek Special Correspondent - A Q...Richard Holbrooke's World: Memories of a Newsweek Special Correspondent - A Q...
Richard Holbrooke's World: Memories of a Newsweek Special Correspondent - A Q...
 
As Biologists
As BiologistsAs Biologists
As Biologists
 
Ajar v2-14-the-full-monty
Ajar v2-14-the-full-montyAjar v2-14-the-full-monty
Ajar v2-14-the-full-monty
 
Mercator Ocean newsletter 30
Mercator Ocean newsletter 30Mercator Ocean newsletter 30
Mercator Ocean newsletter 30
 

Similaire à Data Ethics for Mathematicians

The Architecture of Understanding
The Architecture of UnderstandingThe Architecture of Understanding
The Architecture of UnderstandingPeter Morville
 
The Architecture of Understanding
The Architecture of UnderstandingThe Architecture of Understanding
The Architecture of UnderstandingPeter Morville
 
The Architecture of Understanding
The Architecture of UnderstandingThe Architecture of Understanding
The Architecture of UnderstandingPeter Morville
 
Inspiration Architecture: The Future of Libraries
Inspiration Architecture: The Future of LibrariesInspiration Architecture: The Future of Libraries
Inspiration Architecture: The Future of LibrariesPeter Morville
 
AoA Presentation.v.6Feb2024.pptx
AoA Presentation.v.6Feb2024.pptxAoA Presentation.v.6Feb2024.pptx
AoA Presentation.v.6Feb2024.pptxklaus110316
 
'Drinking from the fire hose? The pitfalls and potential of Big Data'.
'Drinking from the fire hose? The pitfalls and potential of Big Data'.'Drinking from the fire hose? The pitfalls and potential of Big Data'.
'Drinking from the fire hose? The pitfalls and potential of Big Data'.Josh Cowls
 
Due Jan 6Week 8Using Figure 1.2 in Ch. 1 of Exploring Resear.docx
Due Jan 6Week 8Using Figure 1.2 in Ch. 1 of Exploring Resear.docxDue Jan 6Week 8Using Figure 1.2 in Ch. 1 of Exploring Resear.docx
Due Jan 6Week 8Using Figure 1.2 in Ch. 1 of Exploring Resear.docxjacksnathalie
 
Social Side of Research CARL 2014
Social Side of Research CARL 2014Social Side of Research CARL 2014
Social Side of Research CARL 2014Michele Van Hoeck
 
The Architecture of Understanding
The Architecture of UnderstandingThe Architecture of Understanding
The Architecture of UnderstandingPeter Morville
 
WK 2 DQ 1Read the journal article The Ethics of Internet Resear.docx
WK 2 DQ 1Read the journal article The Ethics of Internet Resear.docxWK 2 DQ 1Read the journal article The Ethics of Internet Resear.docx
WK 2 DQ 1Read the journal article The Ethics of Internet Resear.docxambersalomon88660
 
The data science handbook pre release (1)
The data science handbook   pre release (1)The data science handbook   pre release (1)
The data science handbook pre release (1)Lakshmi Prasanna
 
Citation 101: Teaching Students to Avoid Plagiarism
Citation 101: Teaching Students to Avoid PlagiarismCitation 101: Teaching Students to Avoid Plagiarism
Citation 101: Teaching Students to Avoid Plagiarismemdowney
 
Research questions and research design
Research questions and research design Research questions and research design
Research questions and research design Normative
 
Enhancing Learning & Participation: Critical Thinking Strategies & Practice
Enhancing Learning & Participation: Critical Thinking Strategies & PracticeEnhancing Learning & Participation: Critical Thinking Strategies & Practice
Enhancing Learning & Participation: Critical Thinking Strategies & PracticeSt. Petersburg College
 
IbelongInCSsample.pdf
IbelongInCSsample.pdfIbelongInCSsample.pdf
IbelongInCSsample.pdfAngela DeHart
 
Making our mark: the important role of social scientists in the ‘era of big d...
Making our mark: the important role of social scientists in the ‘era of big d...Making our mark: the important role of social scientists in the ‘era of big d...
Making our mark: the important role of social scientists in the ‘era of big d...The Higher Education Academy
 
Using Figure 1.2 in Ch. 1 of Exploring Research, create a flowchar.docx
Using Figure 1.2 in Ch. 1 of Exploring Research, create a flowchar.docxUsing Figure 1.2 in Ch. 1 of Exploring Research, create a flowchar.docx
Using Figure 1.2 in Ch. 1 of Exploring Research, create a flowchar.docxdickonsondorris
 
Developing a Socially-Aware Engineering Identity Through Transdisciplinary Le...
Developing a Socially-Aware Engineering Identity Through Transdisciplinary Le...Developing a Socially-Aware Engineering Identity Through Transdisciplinary Le...
Developing a Socially-Aware Engineering Identity Through Transdisciplinary Le...colin gray
 

Similaire à Data Ethics for Mathematicians (20)

The Architecture of Understanding
The Architecture of UnderstandingThe Architecture of Understanding
The Architecture of Understanding
 
The Architecture of Understanding
The Architecture of UnderstandingThe Architecture of Understanding
The Architecture of Understanding
 
The Architecture of Understanding
The Architecture of UnderstandingThe Architecture of Understanding
The Architecture of Understanding
 
Inspiration Architecture: The Future of Libraries
Inspiration Architecture: The Future of LibrariesInspiration Architecture: The Future of Libraries
Inspiration Architecture: The Future of Libraries
 
AoA Presentation.v.6Feb2024.pptx
AoA Presentation.v.6Feb2024.pptxAoA Presentation.v.6Feb2024.pptx
AoA Presentation.v.6Feb2024.pptx
 
'Drinking from the fire hose? The pitfalls and potential of Big Data'.
'Drinking from the fire hose? The pitfalls and potential of Big Data'.'Drinking from the fire hose? The pitfalls and potential of Big Data'.
'Drinking from the fire hose? The pitfalls and potential of Big Data'.
 
Due Jan 6Week 8Using Figure 1.2 in Ch. 1 of Exploring Resear.docx
Due Jan 6Week 8Using Figure 1.2 in Ch. 1 of Exploring Resear.docxDue Jan 6Week 8Using Figure 1.2 in Ch. 1 of Exploring Resear.docx
Due Jan 6Week 8Using Figure 1.2 in Ch. 1 of Exploring Resear.docx
 
Social Side of Research CARL 2014
Social Side of Research CARL 2014Social Side of Research CARL 2014
Social Side of Research CARL 2014
 
The Architecture of Understanding
The Architecture of UnderstandingThe Architecture of Understanding
The Architecture of Understanding
 
WK 2 DQ 1Read the journal article The Ethics of Internet Resear.docx
WK 2 DQ 1Read the journal article The Ethics of Internet Resear.docxWK 2 DQ 1Read the journal article The Ethics of Internet Resear.docx
WK 2 DQ 1Read the journal article The Ethics of Internet Resear.docx
 
The data science handbook pre release (1)
The data science handbook   pre release (1)The data science handbook   pre release (1)
The data science handbook pre release (1)
 
Citation 101: Teaching Students to Avoid Plagiarism
Citation 101: Teaching Students to Avoid PlagiarismCitation 101: Teaching Students to Avoid Plagiarism
Citation 101: Teaching Students to Avoid Plagiarism
 
Research questions and research design
Research questions and research design Research questions and research design
Research questions and research design
 
3Rd Person Essay
3Rd Person Essay3Rd Person Essay
3Rd Person Essay
 
Enhancing Learning & Participation: Critical Thinking Strategies & Practice
Enhancing Learning & Participation: Critical Thinking Strategies & PracticeEnhancing Learning & Participation: Critical Thinking Strategies & Practice
Enhancing Learning & Participation: Critical Thinking Strategies & Practice
 
IbelongInCSsample.pdf
IbelongInCSsample.pdfIbelongInCSsample.pdf
IbelongInCSsample.pdf
 
Making our mark: the important role of social scientists in the ‘era of big d...
Making our mark: the important role of social scientists in the ‘era of big d...Making our mark: the important role of social scientists in the ‘era of big d...
Making our mark: the important role of social scientists in the ‘era of big d...
 
Using Figure 1.2 in Ch. 1 of Exploring Research, create a flowchar.docx
Using Figure 1.2 in Ch. 1 of Exploring Research, create a flowchar.docxUsing Figure 1.2 in Ch. 1 of Exploring Research, create a flowchar.docx
Using Figure 1.2 in Ch. 1 of Exploring Research, create a flowchar.docx
 
Analytic Essay
Analytic EssayAnalytic Essay
Analytic Essay
 
Developing a Socially-Aware Engineering Identity Through Transdisciplinary Le...
Developing a Socially-Aware Engineering Identity Through Transdisciplinary Le...Developing a Socially-Aware Engineering Identity Through Transdisciplinary Le...
Developing a Socially-Aware Engineering Identity Through Transdisciplinary Le...
 

Plus de Mason Porter

Opinion Dynamics on Generalized Networks
Opinion Dynamics on Generalized NetworksOpinion Dynamics on Generalized Networks
Opinion Dynamics on Generalized NetworksMason Porter
 
Social Dynamics on Networks
Social Dynamics on NetworksSocial Dynamics on Networks
Social Dynamics on NetworksMason Porter
 
Topological Data Analysis of Complex Spatial Systems
Topological Data Analysis of Complex Spatial SystemsTopological Data Analysis of Complex Spatial Systems
Topological Data Analysis of Complex Spatial SystemsMason Porter
 
Topological Data Analysis of Complex Spatial Systems
Topological Data Analysis of Complex Spatial SystemsTopological Data Analysis of Complex Spatial Systems
Topological Data Analysis of Complex Spatial SystemsMason Porter
 
The Science of "Chaos"
The Science of "Chaos"The Science of "Chaos"
The Science of "Chaos"Mason Porter
 
Paper Writing in Applied Mathematics (slightly updated slides)
Paper Writing in Applied Mathematics (slightly updated slides)Paper Writing in Applied Mathematics (slightly updated slides)
Paper Writing in Applied Mathematics (slightly updated slides)Mason Porter
 
Tutorial on Paper-Writing in Applied Mathematics (Preliminary Draft of Slides)
Tutorial on Paper-Writing in Applied Mathematics (Preliminary Draft of Slides)Tutorial on Paper-Writing in Applied Mathematics (Preliminary Draft of Slides)
Tutorial on Paper-Writing in Applied Mathematics (Preliminary Draft of Slides)Mason Porter
 
Snowbird comp-top-may2017
Snowbird comp-top-may2017Snowbird comp-top-may2017
Snowbird comp-top-may2017Mason Porter
 
Networks in Space: Granular Force Networks and Beyond
Networks in Space: Granular Force Networks and BeyondNetworks in Space: Granular Force Networks and Beyond
Networks in Space: Granular Force Networks and BeyondMason Porter
 
Cascades and Social Influence on Networks, UCSB, 3 Oct 2014
Cascades and Social Influence on Networks, UCSB, 3 Oct 2014Cascades and Social Influence on Networks, UCSB, 3 Oct 2014
Cascades and Social Influence on Networks, UCSB, 3 Oct 2014Mason Porter
 
Multilayer tutorial-netsci2014-slightlyupdated
Multilayer tutorial-netsci2014-slightlyupdatedMultilayer tutorial-netsci2014-slightlyupdated
Multilayer tutorial-netsci2014-slightlyupdatedMason Porter
 

Plus de Mason Porter (13)

Opinion Dynamics on Generalized Networks
Opinion Dynamics on Generalized NetworksOpinion Dynamics on Generalized Networks
Opinion Dynamics on Generalized Networks
 
Social Dynamics on Networks
Social Dynamics on NetworksSocial Dynamics on Networks
Social Dynamics on Networks
 
Topological Data Analysis of Complex Spatial Systems
Topological Data Analysis of Complex Spatial SystemsTopological Data Analysis of Complex Spatial Systems
Topological Data Analysis of Complex Spatial Systems
 
Topological Data Analysis of Complex Spatial Systems
Topological Data Analysis of Complex Spatial SystemsTopological Data Analysis of Complex Spatial Systems
Topological Data Analysis of Complex Spatial Systems
 
The Science of "Chaos"
The Science of "Chaos"The Science of "Chaos"
The Science of "Chaos"
 
Paper Writing in Applied Mathematics (slightly updated slides)
Paper Writing in Applied Mathematics (slightly updated slides)Paper Writing in Applied Mathematics (slightly updated slides)
Paper Writing in Applied Mathematics (slightly updated slides)
 
Tutorial on Paper-Writing in Applied Mathematics (Preliminary Draft of Slides)
Tutorial on Paper-Writing in Applied Mathematics (Preliminary Draft of Slides)Tutorial on Paper-Writing in Applied Mathematics (Preliminary Draft of Slides)
Tutorial on Paper-Writing in Applied Mathematics (Preliminary Draft of Slides)
 
Snowbird comp-top-may2017
Snowbird comp-top-may2017Snowbird comp-top-may2017
Snowbird comp-top-may2017
 
Networks in Space: Granular Force Networks and Beyond
Networks in Space: Granular Force Networks and BeyondNetworks in Space: Granular Force Networks and Beyond
Networks in Space: Granular Force Networks and Beyond
 
Ds15 minitute-v2
Ds15 minitute-v2Ds15 minitute-v2
Ds15 minitute-v2
 
Matchmaker110714
Matchmaker110714Matchmaker110714
Matchmaker110714
 
Cascades and Social Influence on Networks, UCSB, 3 Oct 2014
Cascades and Social Influence on Networks, UCSB, 3 Oct 2014Cascades and Social Influence on Networks, UCSB, 3 Oct 2014
Cascades and Social Influence on Networks, UCSB, 3 Oct 2014
 
Multilayer tutorial-netsci2014-slightlyupdated
Multilayer tutorial-netsci2014-slightlyupdatedMultilayer tutorial-netsci2014-slightlyupdated
Multilayer tutorial-netsci2014-slightlyupdated
 

Dernier

Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17  How to Extend Models Using Mixin ClassesMixin Classes in Odoo 17  How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17 How to Extend Models Using Mixin ClassesCeline George
 
Understanding Accommodations and Modifications
Understanding  Accommodations and ModificationsUnderstanding  Accommodations and Modifications
Understanding Accommodations and ModificationsMJDuyan
 
Single or Multiple melodic lines structure
Single or Multiple melodic lines structureSingle or Multiple melodic lines structure
Single or Multiple melodic lines structuredhanjurrannsibayan2
 
Google Gemini An AI Revolution in Education.pptx
Google Gemini An AI Revolution in Education.pptxGoogle Gemini An AI Revolution in Education.pptx
Google Gemini An AI Revolution in Education.pptxDr. Sarita Anand
 
Spellings Wk 3 English CAPS CARES Please Practise
Spellings Wk 3 English CAPS CARES Please PractiseSpellings Wk 3 English CAPS CARES Please Practise
Spellings Wk 3 English CAPS CARES Please PractiseAnaAcapella
 
How to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POSHow to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POSCeline George
 
Towards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptxTowards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptxJisc
 
Graduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - EnglishGraduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - Englishneillewis46
 
On National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan FellowsOn National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan FellowsMebane Rash
 
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...ZurliaSoop
 
Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
Kodo Millet  PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...Kodo Millet  PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...pradhanghanshyam7136
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxheathfieldcps1
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...christianmathematics
 
Fostering Friendships - Enhancing Social Bonds in the Classroom
Fostering Friendships - Enhancing Social Bonds  in the ClassroomFostering Friendships - Enhancing Social Bonds  in the Classroom
Fostering Friendships - Enhancing Social Bonds in the ClassroomPooky Knightsmith
 
Dyslexia AI Workshop for Slideshare.pptx
Dyslexia AI Workshop for Slideshare.pptxDyslexia AI Workshop for Slideshare.pptx
Dyslexia AI Workshop for Slideshare.pptxcallscotland1987
 
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdfUGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdfNirmal Dwivedi
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxVishalSingh1417
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfciinovamais
 
FSB Advising Checklist - Orientation 2024
FSB Advising Checklist - Orientation 2024FSB Advising Checklist - Orientation 2024
FSB Advising Checklist - Orientation 2024Elizabeth Walsh
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsTechSoup
 

Dernier (20)

Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17  How to Extend Models Using Mixin ClassesMixin Classes in Odoo 17  How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
 
Understanding Accommodations and Modifications
Understanding  Accommodations and ModificationsUnderstanding  Accommodations and Modifications
Understanding Accommodations and Modifications
 
Single or Multiple melodic lines structure
Single or Multiple melodic lines structureSingle or Multiple melodic lines structure
Single or Multiple melodic lines structure
 
Google Gemini An AI Revolution in Education.pptx
Google Gemini An AI Revolution in Education.pptxGoogle Gemini An AI Revolution in Education.pptx
Google Gemini An AI Revolution in Education.pptx
 
Spellings Wk 3 English CAPS CARES Please Practise
Spellings Wk 3 English CAPS CARES Please PractiseSpellings Wk 3 English CAPS CARES Please Practise
Spellings Wk 3 English CAPS CARES Please Practise
 
How to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POSHow to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POS
 
Towards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptxTowards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptx
 
Graduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - EnglishGraduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - English
 
On National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan FellowsOn National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan Fellows
 
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
 
Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
Kodo Millet  PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...Kodo Millet  PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
 
Fostering Friendships - Enhancing Social Bonds in the Classroom
Fostering Friendships - Enhancing Social Bonds  in the ClassroomFostering Friendships - Enhancing Social Bonds  in the Classroom
Fostering Friendships - Enhancing Social Bonds in the Classroom
 
Dyslexia AI Workshop for Slideshare.pptx
Dyslexia AI Workshop for Slideshare.pptxDyslexia AI Workshop for Slideshare.pptx
Dyslexia AI Workshop for Slideshare.pptx
 
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdfUGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptx
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
 
FSB Advising Checklist - Orientation 2024
FSB Advising Checklist - Orientation 2024FSB Advising Checklist - Orientation 2024
FSB Advising Checklist - Orientation 2024
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The Basics
 

Data Ethics for Mathematicians

  • 1. More generally: A discussion of ethics for data, research, and publishing Mason A. Porter (@masonporter) Department of Mathematics UCLA
  • 2. § It’s important. § People need to be able to replicate our work. § Making sure their own code is correct § Natural self-correction in science (and ability to understand precisely every choice we make in our work) § Not traditionally part of mathematical training, but increasingly we are using social data — including potentially personal data — in our research
  • 3. § We use a lot more real data nowadays, and in particular this includes a lot of human (and animal) data. § Much less a part of the research (and thus training) tradition in mathematics than in other disciplines § Other disciplines have thought a lot more about ethics than mathematics § In many cases, unfortunately, because they’ve messed up the ethics historically, sometimes substantially, and we need to learn from the best practices they’ve developed "Look, lady. Just because my grandfather didn't rape the environment and exploit the workers doesn't make me a peasant. And it's not that he didn't want to rape the environment and exploit the workers; I'm sure he did. It's just that as a barber, he didn't have that much opportunity." – Roger Cobb [Steve Martin], All of Me (1984) Thanks to Peter Mucha for the quote suggestion (and an excuse to allude to this movie)
  • 4. § Be honest and fair (obviously) § Design ethically thoughtful research § Explain your decisions to others § [Points 2 and 3 taken from slides by Matt Salganik]
  • 5. FOUR PRINCIPLES § Respect for persons § (Note: Animal research also has thorny ethical issues!) § Beneficence § Justice § Respect for Law and Public Interest How do you balance these four principles?
  • 6. § Always be honest about your work
  • 7. § If you are working with personal data, you need to check with your Institutional Review Board (IRB) to ensure that you are doing the work in an ethical way. § They may tell you that you don’t need to submit a formal application, or they may tell you that you do. Let them know briefly what data you have access to (or plan to acquire, and how) and what you plan to do with it. § Different IRBs of course can rule differently. § Rules differ in different countries § Human data versus animal data § In these slides, I have human data in mind, but animal data and its acquisition of course also has major ethical considerations. § Look through UCLA’s website for the Office of the Human Research Protection Program (OHRPP): http://ora.research.ucla.edu/ohrpp/Pages/OHRPPHome.aspx § “IRB is a floor, not a ceiling” (from Matt Salganik’s slides)
  • 8. § A well-known, heavily-used set of courses: https://www.citiprogram.org/index.cfm?pageID=86 § I found this from a link from UCLA’s OHRPP website. § Several years ago, I did some IRB training. (When preparing these slides, I couldn’t find the specific online course I took.) In addition to helping to think about issues, if something does go wrong, you do (from a practical point of view) want to be able to say that you have appropriate ethics training. § Note:The training required/expected/available differs substantially across countries. § Example: From my experience, my impression is that the UK appears to be less stringent about human data than the US, but it appears to be more stringent about non-human animals.
  • 9.
  • 10.
  • 11.
  • 12. § The more your research has the potential to violate personal privacy, the more helpful for humanity the outcome needs to have the potential to be.
  • 13. § Informed Consent § Understanding and managing informational risk § Privacy § Making decisions in the face of uncertainty § Other notes § Put yourself in everyone else’s shoes § Think of research ethics as continuous, not discrete (sliding scale) Bullet points from Matt Salganik’s slides
  • 14. § You must provide sufficient (and precise) detail for people to be able to replicate your work! § Try to include it in your papers, but people are human, so if somebody e-mails you to ask for a clarification, copy of code (even if poorly commented), or something else, you should respond and send it to them, provide it’s something that you have the right to send them.
  • 15. § To the extent possible, you should publish your data and usable (and well-commented) code along with your work. § There can be tension between these ideals and issues of personal privacy, nondisclosure agreements, and so on. § If using synthetic data, publish code to generate the data and the generated examples that you used in your paper. § Supplementary material for the paper on the journal website, Github, Figshare, and other venues § Likely relevant for literally all of you § E.g., if you are doing any numerical computations at all, this is desirable § E.g., adjacency matrices for graphs in a definition–theorem–proof paper is also useful for readers (though level of necessity depends on how large the graphs are) § Admission: I have been trying to get better about this over the years. I am very good about responding to e-mail queries, and the goal (though there exist practical considerations) is to be precise about all of my steps and to put as much online as feasible.
  • 16. § For empirical data, if you have permission to post something (e.g., does the data “belong” to somebody else?) and it doesn’t invade privacy, you should post it because that promotes good science.
  • 17. § Alternative name:“replication crisis” § https://en.wikipedia.org/wiki/Replication_crisis Take a look, e.g., at the work of Victoria Snodden: http://web.stanford.edu/~vcs/
  • 18. § Be explicit about anything you did, so that others can know what choices you made and evaluate whether they think it is the best procedure for your analysis § E.g., sampling biases change properties of data § There are many reasons that one makes choices, so it’s not that you shouldn’t make them, but it’s part of your scientific procedure, so tell people exactly what you did so they know exactly what these choices were. (They may want to make different choices.) § “Manipulating” is a loaded word; here I mean it in a neutral way (i.e.,“changes”), rather than in a negative one.
  • 19. § When are things actually “anonymous” § Is “full” anonymization even possible?
  • 20. Slide from Matt Salganik
  • 21. Slide from Matt Salganik
  • 23. § Acknowledge all sources of data § Include precise means of how you got data and how somebody else can get the data (e.g., who do they contact?), especially if there is a reason that you are unable to post the data itself § Be generous when acknowledging people in papers: useful discussions, ideas, etc. § Be fair and appropriate when discussing work by authors in past papers § You are standing on the shoulders of giants. :) Given credit where it is due. § Difference between somebody “showing” something in a past paper versus “reporting” it. The former is a statement of verifying validity; the latter is a historical fact (assuming what you write is accurate).
  • 24. § There can be complications in posting data to the public, no matter how well- intentioned. § This is a great data set to advance several avenues of research in network science, and my goal is for people to be able to do that. § Learning the hard way § Urgently arranging a phone meeting with the head of Facebook’s Data Science team § An important learning experience for me § A small chapter in the long story of data privacy § A blog entry that is very critical of me (though this differs from my side of the story): http://www.michaelzimmer.org/2011/02/15/facebook-data-of-1-2-million-users-from-2005- released/ § Led to my learning much more about these issues (though under very stressful circumstances), a page about research using human data in Oxford’s Mathematical Institute, etc. § https://www.maths.ox.ac.uk/members/policies/data-protection/research-using-data-involving-humans
  • 25. § Research in collaboration with companies or government:What is it ok to include in a publication or post online? § Tension between open data and personal privacy § Terms-of-service agreements and nondisclosure agreements § In what sense can you replicate work if you can’t post everything? § “Softer” replication: do you observe similar phenomena in circumstances that have some similarities but are not the same? § E.g., human behavior in different social networks
  • 26. § See, e.g., the discussion around this paper: http://science.sciencemag.org/content/early/2015/05/06/science.aaa1160.full § Eytan Bakshy, Solomon Messing, & Lada Adamic, Exposure to ideologically diverse news and opinion on Facebook, Science, 2015 § They can’t tell us Facebook’s sampling algorithm, so how are we as scientists going to go about “replicating” their work? § Note: Do their insights apply to other online social networks? One should be able to do a weaker form of replication such that the most interesting qualitative results are not merely a property of specifics on Facebook § Also:What about this work being public versus being entirely within Facebook and us never seeing any of it?
  • 27. § A. D. I. Kramer, J. E. Guillory, and J.T. Hancock. Experimental evidence of massive- scale emotional contagion through social networks. Proceedings of the National Academy of Sciences of the United States of America, 111(24):8788–8790, 2014 § Look up articles on this one § Experiments on Facebook with changes in people’s feeds § Also:What about this work being public versus being entirely within Facebook and us never seeing any of it? Note: Academic researchers have IRBs that need to approve a study before it starts, whereas Facebook has a publication review board to approve publication of a study after it's already been done.Thus, we know that this study occurred because FB concluded that it could be published. We don’t know about what stuff is done with our data from FB and other companies when it doesn’t get published.
  • 28. Should academic researchers and companies follow the same rules?
  • 29. § You can apply this comment generally to “data science” if you like, though the property of connectivity in networks provides substantial additional issues beyond just data science (and “Big Data”, etc.).
  • 30. § Short essay by Johan Ugander (Management Science & Engineering, Stanford): https://medium.com/@jugander/truth-lies-and-an-ethics-of-personalization- e4ccfa7f2b84#.rzap3hm70 § As an example, he discusses “Cambridge Analytica, identified by the NY Times as the hired guns behind Trump’s online targeting.” § Alexander Nix (CEO of CA) gave the following example in a video. Quoting Ugander’s essay:“if you own a private beach, he notes, you’d have more success keeping people off your beach by putting up a “Warning: sharks beyond this point” sign vs. a “private property” sign.The problem is: he recommends this strategy —  and personalized versions of it — without any consideration to whether there actually are any sharks, advocating “behavioral communication” that is completely detached from any truth about reality. In fewer words: crafting lies, and then targeting them.”
  • 31. § http://callingbullshit.org § Full title:“Calling Bullshit in the Age of Big Data” § A course designed by Carl Bergstrom and Jevin West (University of Washington) § Excellent syllabus and reading materials § Various parts of it relate to ethics, and they also have a unit directly about ethics: http://callingbullshit.org/syllabus.html#Ethics
  • 32. § Targeted advertising (different trailers for people of different races) for the movie "Straight outta Compton": http://www.businessinsider.com/why-straight-outta- compton-had-different-trailers-for-people-of-different-races § Different levels of prior familiarity with gangsta rap pioneers N.W. A. (Ice Cube, Dr. Dre, etc.) § Papers by Arvind Narayanan and collaborators, including: § http://senglehardt.com/papers/ccs16_online_tracking.pdf § https://5harad.com/papers/twivacy.pdf § J. Su et al.,“De-anonymizing Web Browsing Data with Social Networks”, 2016 § C. Kanich et al.,“Spamalytics: An Empirical Analysis of Spam Marketing Conversion” (2008): § http://www.umiacs.umd.edu/~tdumitra/courses/ENEE757/Fall14/papers/Kanich08.pdf § B. Markines et al.,“Social spam detection” (2009): § http://dl.acm.org/citation.cfm?doid=1531914.1531924
  • 33. § “Tastes,Ties, and Time” Facebook data set § One discussion about the controversy associated with this data set: http://www.chronicle.com/article/Harvards-Privacy-Meltdown/128166/ § Research by Sinan Aral and collaborators on manipulation of voting on social media sites § One discussion: https://techcrunch.com/2013/08/11/reddit-science-herd/
  • 34. § Mathematicians are relatively new to using human data, but we don’t yet have the ethics training to help us deal with the thorny issues § Learn from the best practices (and past mistakes) from other disciplines § As in those other disciplines, mathematicians should be getting ethics training § Read about — and think about and discuss — various controversies and other studies.We all may set our bars in a different place, but we need to do it conscientiously. § It’s a sliding bar: the more potential for invasion of personal privacy, the more valuable the potential outcome has to be for humanity § IRB approval is only a lower bound
  • 35. § While I have more training and experience with these issues than most mathematicians, I am very much an amateur on data ethics compared to people from the social and human sciences, for whom this is a standard part of the training from the beginning of their education. § With this in mind, please contact me with any suggestions on these slides. Did I miss any salient points? Do you disagree with any of the discussed points? Are there any other studies that are especially crucial to bring up? § Eventually, I hope to develop these slides further into an article for a venue in the mathematical sciences. Let me know if you are interested in being involved in writing this article.
  • 36. § Several suggestions for resources from Johan Ugander § Several comments on my slides and suggestions for resources from Peter Mucha § Website from Matt Salganik’s class on Computational Social Science (Fall 2016): http://www.princeton.edu/~mjs3/soc596_f2016/ § I drew some material and ideas from his slides on ethics § It would be pretty ironic if I plagiarized these slides, wouldn’t it?