19. OSS
• OSS is software that is under an Open Source License
20. OSS
• OSS is software that is under an Open Source License
• The big shift: industry wants to use it
• Car makers/Software developers/TVs/Phone/Internet-of-
things…
27. Architectural Analysis
• What are the components of a system?
• How are they connected?
• What files are part of what components?
• What components are actually used?
31. Examples
• Apple’s OS X:
• Contains many open source components
• Under a variety of licenses (although GPL-licensed components are
being replaced with BSD-licensed ones)
32. Examples
• Apple’s OS X:
• Contains many open source components
• Under a variety of licenses (although GPL-licensed components are
being replaced with BSD-licensed ones)
• FreeBSD
33. Examples
• Apple’s OS X:
• Contains many open source components
• Under a variety of licenses (although GPL-licensed components are
being replaced with BSD-licensed ones)
• FreeBSD
• Licensed under the BSD-2 license, but contains GPL’d software
34. Examples
• Apple’s OS X:
• Contains many open source components
• Under a variety of licenses (although GPL-licensed components are
being replaced with BSD-licensed ones)
• FreeBSD
• Licensed under the BSD-2 license, but contains GPL’d software
• Disabled by default
49. Provenance Discovery
• Provenance:
• Evidence of origin, history and integrity
• External:
• Where does this module/file/snippet come from?
• Internal:
• How has this file being modified by the project?
55. For effective provenance you need a very large corpus
Journal of Empirical Software Engineering, 2013
56. For effective provenance you need a very large corpus
Journal of Empirical Software Engineering, 2013
Tools without corpus are useless
57. For effective provenance you need a very large corpus
Journal of Empirical Software Engineering, 2013
Tools without corpus are useless
Industry does not want methods or tools; it wants consulting
68. • What are the different licenses used in GitHub?
• What are the common license change patterns?
• Why do they change?
Journal of Empirical Software Engineering, 2017
69. • What are the different licenses used in GitHub?
• What are the common license change patterns?
• Why do they change?
Journal of Empirical Software Engineering, 2017
70. • What are the different licenses used in GitHub?
• What are the common license change patterns?
• Why do they change?
Journal of Empirical Software Engineering, 2017
71. • What are the different licenses used in GitHub?
• What are the common license change patterns?
• Why do they change?
Most frequently: changes to licensing are documented but not explained
Journal of Empirical Software Engineering, 2017
85. Patrick McHardy
• Former chair of the Netfilter Core Development Team
• Part of Linux; widely used
86. Patrick McHardy
• Former chair of the Netfilter Core Development Team
• Part of Linux; widely used
• Has been seeking non-compliant reusers of the Linux Kernel
• Allegedly for financial gain (it is estimated he has made more than
€2M)
87. Patrick McHardy
• Former chair of the Netfilter Core Development Team
• Part of Linux; widely used
• Has been seeking non-compliant reusers of the Linux Kernel
• Allegedly for financial gain (it is estimated he has made more than
€2M)
• In 2016 he was suspended from the Netfilter Core Development Team
88. – Greg Kroah-Hartman et. al
“While the kernel community has always supported enforcement efforts to
bring companies into compliance, we have never even considered enforcement
for the purpose of extracting monetary gain.”
http://kroah.com/log/blog/2017/10/16/linux-kernel-community-enforcement-statement/
89. Geniatec vs McHardy
in the words of H. Welte (Netfilter)
• “The court recognized that there is no co-authorship / joint authorship (German:
Miturheber) in the Linux kernel as a whole, as it was not a group of people
planning+developing a given program together, but it is a program that has been
released by Linus Torvalds and has since been edited by more than 15.000
developers without any "grand joint plan" but rather in successive iterations. This
situation constitutes "editing authorship" (German: Bearbeiterurheber)”
• "The court further recognized that being listed as "head of the netfilter core team"
or a "subsystem maintainer" doesn't necessarily mean that one is contributing
copyrightable works. Reviewing thousands of patches doesn't mean you own
copyright on them, drawing an analogy to an editorial office at a publisher.”
• “The court understood there are plenty of Linux versions that may not even
contain any of Patrick McHardy's code (such as older versions)”
http://laforge.gnumonks.org/blog/20180307-mchardy-gpl/
90. But…
• “The Linux kernel development model does not support the claim of Patrick
McHardy having co-authored Linux. In so far, he is only an editing
author (Bearbeiterurheber), and not a co-author. Nevertheless, even
an editing author has the right to ask for cease and desist, but only on
those portions that he authored/edited, and not on the entire Linux kernel.”
• “The plaintiff did not sufficiently show what exactly his contributions were
and how they were forming themselves copyrightable works”
• “The plaintiff being a member of the netfilter core team or even the head of
the core team still doesn't support the claim of being a co-author, as
netfilter substantially existed since 1999, three years before Patrick's first
contribution to netfilter, and five years before joining the core team in 2004.”
91. Important questions:
• What has McHardy contributed to Linux?
• What copyrightable material has he contributed to Linux?
• What is currently in Linux?
92. Context
• 26.5 years of development
• 806k changes
• By 17k different persons
• 63k files
93. Version control and Linux
1991-2002 NONE
2002-2005 Bitkeeper
2005 - today git
95. Linux Foundation Request
• Can we know where every “character” of the source code of
the Linux kernel comes from?
96. Linux Foundation (sidebar)
• The Linux Foundation (LF) is not Linux
• LF is a “business league” (non for profit, industrial members)
• Its goal: “Building sustainable ecosystems around open source
projects to accelerate technology development and commercial
adoption.”
• It pays the salaries of Linus Torvalds and Greg Kroah-Hartman
97. Linux Foundation efforts
• SPDX: Software Package Data Exchange (SPDX)
• Document open source licenses
• OpenChain
• Documents good-practice processes to improve license compliance
• CHAOSS: Community Health Analytics of Open Source Software
• Create metrics to measure the “health” of an open source project
99. However…
• cregit only tells us what change introduced what
• Open questions:
• What changes are copyrightable?
• Where does the code in a change come from?
• Is the person doing the change the copyright holder?
100. At a more holistic level
• Who are the copyrights holders of:
• A system/module/file/function/API
• Is there “residual” IP when code is replaced
with new code?
• On purpose or by evolution