FOSSology
Advancing open source analysis and development
 

The Automated Collection and Presentation of FOSS Notices

The following is a collection of requirements from Martin von Willebrand and others for improving license and copyright reports.

  • Change “List Files based on License” to display the match %
  • Add filters (see also 'process tracking' below) to License Browser to show:
    1. All licenses (today's behavior)
    2. Only files with <100% license matches, including files with no license
    3. Only files with <90% license matches, including files with no license
    4. Files with no license

The above applies only to the new sentence classifier license analyzer and the (deprecated) bSAM license analyzer. Match percentages are not available from the new Nomos license analyzer. A 100% match is based on a template match, not an absolute character match, so words and characters irrelevant to the license can be ignored.
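The four filter modes listed above could be sketched as a single predicate. This is only an illustration, not FOSSology code: the names `LicenseFilter`, `FileResult`, and `passes` are hypothetical, and the handling of a file that has a license but no match percentage (as with Nomos results) is an assumption.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Optional


class LicenseFilter(Enum):
    """Hypothetical names for the four License Browser filters."""
    ALL = 1         # all licenses (today's behavior)
    BELOW_100 = 2   # <100% matches, plus files with no license
    BELOW_90 = 3    # <90% matches, plus files with no license
    NO_LICENSE = 4  # only files with no license


@dataclass
class FileResult:
    filename: str
    license_name: Optional[str]     # None when no license was found
    match_percent: Optional[float]  # None when the analyzer reports no percentage


def passes(result: FileResult, mode: LicenseFilter) -> bool:
    """Return True if the file should be shown under the given filter."""
    has_license = result.license_name is not None
    if mode is LicenseFilter.ALL:
        return True
    if mode is LicenseFilter.NO_LICENSE:
        return not has_license
    # Both percentage filters also include files with no license.
    if not has_license:
        return True
    # Assumption: a license without a match percentage cannot be
    # evaluated against a threshold, so it is left out.
    if result.match_percent is None:
        return False
    threshold = 100.0 if mode is LicenseFilter.BELOW_100 else 90.0
    return result.match_percent < threshold
```

Keeping the threshold logic in one function means a further cut-off (for example 80%) would only need a new enum member and one more threshold value.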

  • Copyright notices should be saved in the database with:
    1. filename
    2. copyright notice
    3. copyright years
    4. copyright holder name
  • Process tracking:
    1. Files that are manually processed (license checked) should be so indicated in the database.
    2. Allow comments to be attached to files, directories, packages, uploads.
    3. Allow marking files, directories, packages, uploads as having been processed.
    4. Allow flags or keywords to be attached to files, directories, packages, uploads. This allows you to flag a file as “needs more review” or “ok by agreement with copyright holder”, etc.
    5. Report based on process tracking. This includes user-selectable criteria for processed, unprocessed, match %, comments, and flags/keywords that may be combined to yield reports like “unprocessed files with no license”, or “unprocessed files with license having <90% match”.
  • Reports
    1. Full document: grouped by file name; a long document that gives the name of each file followed by the notices found in that file.
    2. Full document, no file names: same as 1, but file names are omitted. Why is it important to omit file names?
    3. License based document: grouped by license text; states each license text only once, then lists all copyright notices together with the name of the file to which each notice and license applies.
    4. License based document, no file names: same as 3, but file names are omitted.
    5. License based document, copyright notices summarized: same as 3, but copyright notices of the same holder are summarized (e.g. 20 notices for 2005, 20 for 2006-2007, and 40 for 2008 are summarized as one 2005-2008 notice instead of 80 notices), and a list of the files relevant to each notice is shown after the notice (see the combined copyright notices example).
    6. License based document, copyright notices summarized, no file names: same as 5, but file names are omitted.
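As an illustration of the copyright notice record and the summarization described for reports 5 and 6, a minimal sketch follows. The record fields mirror the four items listed under copyright notices; the names `CopyrightNotice`, `summarize_years`, and `summarize_by_holder` are hypothetical, and collapsing only consecutive years into a range is an assumption about how the summary should behave.

```python
from collections import defaultdict
from dataclasses import dataclass
from typing import Dict, Iterable, List, Tuple


@dataclass(frozen=True)
class CopyrightNotice:
    """One stored notice: filename, notice text, years, holder name."""
    filename: str
    notice: str
    years: Tuple[int, ...]
    holder: str


def summarize_years(years: Iterable[int]) -> str:
    """Collapse years into ranges, e.g. 2005, 2006, 2007, 2008 -> '2005-2008'."""
    ordered = sorted(set(years))
    if not ordered:
        return ""
    ranges: List[Tuple[int, int]] = []
    start = prev = ordered[0]
    for year in ordered[1:]:
        if year == prev + 1:
            prev = year  # extend the current run of consecutive years
            continue
        ranges.append((start, prev))
        start = prev = year
    ranges.append((start, prev))
    return ", ".join(str(a) if a == b else f"{a}-{b}" for a, b in ranges)


def summarize_by_holder(
    notices: Iterable[CopyrightNotice],
) -> Dict[str, Tuple[str, List[str]]]:
    """Map each holder to (summarized years, sorted list of files)."""
    years = defaultdict(set)
    files = defaultdict(set)
    for n in notices:
        years[n.holder].update(n.years)
        files[n.holder].add(n.filename)
    return {h: (summarize_years(years[h]), sorted(files[h])) for h in years}
```

For report 6, the file list in each result would simply be dropped when the document is rendered.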
 
task/willebrand.txt · Last modified: 2009/06/08 13:33 by bobg

Copyright (C) 2007-2009 Hewlett-Packard Development Company, L.P.
FOSSology Project documentation is licensed under the GNU Free Documentation License Version 1.2