The following is a collection of requirements from Martin von Willebrand, and others for improving license and copyright reports.
The above only applies to the new sentence classifier license analyzer
and the (depricated) bSAM license analyzer. Match percentages are not available from the new Nomos license analyzer. A 100% match is based on a template match, not an absolute character match. This allows words and characters irrelevant to the license to be ignored.
Process tracking:
Files that are manually processed (license checked) should be so indicated in the database.
Allow comments to be attached to files, directories, packages, uploads.
Allow marking files, directories, packages, uploads as having been processed.
Allow flags or keywords to be attached to files, directories, packages, uploads. This allow you to flag a file as “needs more review” or “ok by agreement with copyright holder”, etc.
Report based on process tracking. This includes user selectable criteria for processed, unprocessed, match %, comments, flags/keywords that may be combined to yield reports like “unprocessed files no licenses”, or “unprocessed files with license having <90% match”.
Reports
Full document: grouped by file names, creates a long document detailing the name of the file and under that the notices under such file and repeating this for each file.
Full document, no file names: same as 1, but file names are omitted Why is it important to omit file names?
License based document: grouped by license texts; states the same license text only once and then lists all copyright notices together with the name of the file applicable for such notice and license
License based document, no file names: same as 3, but file names are omitted.
License based document, copyright notices summarized: Same as 3, but copyright notices of the same holder are summarized (e.g. copyright notice years 20×2005; 20×2006-2007; 40×2008 are summarized as 2005-2008, instead of 80 notices) and a list of files relevant for each notice is shown after the notice
Combined copyright notices example.
License based document, copyright notices summarized, no file names: Same as 5, but file names are omitted