Changeset - 49aad6b046fb
[Not reviewed]
0 1 0
Bradley Kuhn (bkuhn) - 5 years ago 2016-08-09 04:38:02
Add introduction text.
1 file changed with 10 insertions and 0 deletions:
0 comments (0 inline, 0 general)
Show inline comments
@@ -3,6 +3,16 @@
{% block submenuselection %}VMwareCodeSimilarity{% endblock %}
{% block content %}

<h1>Similarity Analysis and Contribution Analysis of Christoph Hellwig's
  Linux Code as found in VMware ESXi 5.5</h1>

<p>This analysis verifies by reproducible analysis a set of specific
contributions that are clearly made by Christoph Hellwig to Linux, and shows
  how those contributions appear in the VMware ESXi 5.5 product.</p>

<p>This analysis was prepared and written by Bradley M. Kuhn.</p>

<h1 id="understanding-code-similarity-and-cloning">Understanding Code Similarity and &quot;Cloning&quot;</h1>
<p>Software is often modified in various ways; indeed, Linux developers form a community that encourages and enables modification by many parties. Given this development model, communities often find it valuable to determine when software source code moves from one place to another with only minor modifications. Various scientifically-vetted techniques can be used to identify &quot;clones&quot; -- a portion of code that is substantially similar to pre-existing source code. The specific area of academic research is called &quot;code cloning detection&quot; or &quot;code duplication detection&quot;. The area has been under active research since the mid-1990s <a href="#fn1" class="footnoteRef" id="fnref1"><sup>1</sup></a>. In 2002, Japanese researchers published a tool called CCFinder <a href="#fn2" class="footnoteRef" id="fnref2"><sup>2</sup></a>, which, in its updated incarnation (called CCFinderX), is widely used and referenced by academic researchers in the field <a href="#fn3" class="footnoteRef" id="fnref3"><sup>3</sup></a> and has specifically been used to explore reuses of code in GPL'd software such as Linux <a href="#fn4" class="footnoteRef" id="fnref4"><sup>4</sup></a>.</p>
<p>CCFinderX uses a token-based clone detection method and a suffix-tree matching algorithm; both techniques have been highly vetted and considered in the academic literature. The techniques are considered viable and useful in detecting clones. Many academic papers on the subject have been peer-reviewed and published, and nearly every newly published paper compares its new techniques of clone detection to the seminal results found by CCFinderX. For purposes of our analysis, we have therefore chosen to use CCFinderX. These results can be easily reproduced since CCFinderX is, itself, also Open Source software.</p>
0 comments (0 inline, 0 general)