diff --git a/www/conservancy/static/copyleft-compliance/vmware-code-similarity.html b/www/conservancy/static/copyleft-compliance/vmware-code-similarity.html
index 4321778c00975a6fc46728a0ddb728e758cf36f4..4d68c7484699c9b1b8388e5bf5b67f90c916d500 100644
--- a/www/conservancy/static/copyleft-compliance/vmware-code-similarity.html
+++ b/www/conservancy/static/copyleft-compliance/vmware-code-similarity.html
@@ -24,7 +24,13 @@
(Submitted\s+by|original\s+patch|patch\s+(from|by)|originally\s+(from|by)).*Hellwig

Specifically, I used a script to extract a list of commit ids from the historical Linux repository. This method found 1,012 separate occasions of contribution by Hellwig from 2002-02-04 through 2005-04-03.
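The matching step described above can be illustrated with a minimal Python sketch. It is not the author's `commit-id-list-matching-regex.plx` script, whose source is not shown here; the function names, the `(commit_id, message)` input shape, and the sample commits are all hypothetical. Whether the original script matched case-insensitively is not stated, so this sketch matches case-sensitively.

```python
import re

# Pattern quoted from the text above. Case sensitivity of the original
# Perl script is unknown; this sketch assumes case-sensitive matching.
CREDIT_RE = re.compile(
    r"(Submitted\s+by|original\s+patch|patch\s+(from|by)|originally\s+(from|by)).*Hellwig"
)

def credits_hellwig(message):
    """Return True if the commit message contains a credit phrase
    followed, on the same line, by the name Hellwig."""
    return CREDIT_RE.search(message) is not None

def matching_commit_ids(commits):
    """Filter an iterable of (commit_id, message) pairs (hypothetical
    input shape) down to the ids whose message matches the pattern."""
    return [cid for cid, msg in commits if credits_hellwig(msg)]

# Made-up commits for illustration only.
sample = [
    ("aaa111", "Fix locking\n\nSubmitted by Christoph Hellwig"),
    ("bbb222", "Cleanup\n\npatch  from Christoph Hellwig"),
    ("ccc333", "Unrelated change by someone else"),
]
print(matching_commit_ids(sample))
```

Because `.` does not match a newline by default, the credit phrase and the name have to appear on the same line of the message for a commit to be selected.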

After finding these separate occasions of contribution, I then extracted the source code lines that Hellwig added or changed in each contribution in this repository. I did so by carefully cross-referencing the commits that Hellwig performed with the output of git blame. I wrote a script that extracted only the lines that Hellwig changed or added in that repository, and placed only those contributions identifiable as Hellwig's into new files whose names matched the original filenames. This created a corpus of code that can be verified as added or changed by Hellwig and no one else.
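One way to picture the cross-referencing step is the following Python sketch. It assumes the blame data comes from `git blame --line-porcelain`, where each source line is preceded by a header line starting with the commit id and the line content itself is prefixed with a tab. The author's actual `extract-code-added-in-commits.plx` script is not shown, so the function name and the sample data below are hypothetical, and writing the kept lines to per-file outputs is left out.

```python
def lines_from_commits(porcelain, commit_ids):
    """Keep only the source lines that blame attributes to one of the
    given commit ids.

    porcelain:  text in the format of `git blame --line-porcelain`
    commit_ids: set of full commit ids to keep
    """
    kept = []
    current = None  # commit id of the line group being read
    for raw in porcelain.split("\n"):
        if raw.startswith("\t"):
            # A tab-prefixed line is the source line itself.
            if current in commit_ids:
                kept.append(raw[1:])
            current = None  # the next line starts a new group
        elif current is None:
            # First line of a group: "<commit id> <orig line> <final line> ..."
            current = raw.split(" ", 1)[0]
        # Other header fields (author, summary, ...) are skipped.
    return kept

# Made-up blame output for illustration only.
sample_blame = (
    "1111111111111111111111111111111111111111 1 1 2\n"
    "author A\n"
    "\tint x;\n"
    "1111111111111111111111111111111111111111 2 2\n"
    "author A\n"
    "\tint y;\n"
    "2222222222222222222222222222222222222222 3 3 1\n"
    "author B\n"
    "\treturn 0;\n"
)
wanted = {"1111111111111111111111111111111111111111"}
print(lines_from_commits(sample_blame, wanted))
```

In a sketch like this, the set of wanted ids would be the commit-id list produced by the matching step, and only lines attributed to those commits survive.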

-Here are the specific commands I ran: $ git clone git://git.kernel.org/pub/scm/linux/kernel/git/tglx/history.git linux-historical $ ./commit-id-list-matching-regex.plx pwd/linux-historical/.git Hellwig '(Submitted+by|originals+patch|patch+from|originally+by).' > hellwig-historical.ids $ ./extract-code-added-in-commits.plx --repository=pwd/linux-historical --output-dir=pwd/hellwig-historical --central-commit e7e173af42dbf37b1d946f9ee00219cb3b2bea6a --progress --blame-opts=-M --blame-opts=-C < ./hellwig-historical.ids $ git clone git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git linux-current $ ./commit-id-list-matching-regex.plx pwd/linux-current/.git Hellwig '(Submitted+by|original+patch|patch+(from|by)|originally+(from|by)).' > ./hellwig-current.ids $ ./extract-code-added-in-commits.plx --progress --repository=pwd/linux-current --output-dir=pwd/hellwig-through-2.6.34 --fork-limit=14 --blame-opts=-M --blame-opts=-M --blame-opts=-C --blame-opts=-C --central-commit e40152ee1e1c7a63f4777791863215e3faa37a86 < hellwig-current.ids

+Here are the specific commands I ran:

+$ git clone git://git.kernel.org/pub/scm/linux/kernel/git/tglx/history.git linux-historical
+$ ./commit-id-list-matching-regex.plx `pwd`/linux-historical/.git Hellwig '(Submitted\s+by|originals+patch|patch\s+from|originally\s+by).*' > hellwig-historical.ids
+$ ./extract-code-added-in-commits.plx --repository=`pwd`/linux-historical --output-dir=`pwd`/hellwig-historical --central-commit e7e173af42dbf37b1d946f9ee00219cb3b2bea6a --progress --blame-opts=-M --blame-opts=-C < ./hellwig-historical.ids
+$ git clone git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git linux-current
+$ ./commit-id-list-matching-regex.plx `pwd`/linux-current/.git Hellwig '(Submitted\s+by|original\s+patch|patch\s+(from|by)|originally\s+(from|by)).*' > ./hellwig-current.ids
+$ ./extract-code-added-in-commits.plx --progress --repository=`pwd`/linux-current --output-dir=`pwd`/hellwig-through-2.6.34 --fork-limit=14 --blame-opts=-M  --blame-opts=-M --blame-opts=-C --blame-opts=-C --central-commit e40152ee1e1c7a63f4777791863215e3faa37a86   < hellwig-current.ids 

Note: e40152ee1e1c7a63f4777791863215e3faa37a86 is the 2.6.34 version created by Linus Torvalds