cssutil 20040312-BlameSnow-auto.1
cssutil - create, summarize and edit a .css file
cssutil file.css (.css file name) [-b (brief: print only summary, don't start shell)] [-h (print help)] [-r (print report)] [-s NNNN (number of buckets in new .css file)] [-S NNNN (number of buckets in new .css file, rounded up to next 2^n + 1 boundary)]
Creating .css files
The cssutil program can be used to create information-less
.css files:
cssutil -b -r spam.css cssutil -b -r nonspam.css. This creates the full-size files ./spam.css and ./nonspam.css, holding no information.
Editing .css files
The cssutil utility can be used to edit .css spectral files. The default
behaviour of cssutil (when no optional arguments are passed), is to first give
you statistics on the file somefile.css. After this you'll enter the cssutil
shell, which allows you to rescale, clip, and otherwise manage your .css files.
Available commands in the cssutil shell:
Z n - zero bins at or below a value S n - subtract a constant from all bins D n - divide all bins by a constant R - rescan P - pack Q - quit
Inspecting .css files
You can check that the .css files are reasonable. Invoke cssutil as:
cssutil -b -r spam.css cssutil -b -r nonspam.cssYou should get back a report something like this:
Sparse spectra file spam.css statistics: Total available buckets : 1048576 Total buckets in use : 506987 Total hashed datums in file : 1605968 Average datums per bucket : 3.17 Maximum length of overflow chain : 39 Average length of overflow chain : 1.84 Average packing density : 0.48Note that the packing density is 0.48; this means that this .css file is about half full of features. Once the packing density gets above about 0.9, you will notice that CRM114 will take longer to process text. The penalty is small below packing densities below about 0.95 and only about a factor of 2 at 0.97 . Best is to keep it below .7 to .8.
The CRM114 homepage is at http://crm114.sf.net/ .
This program is free software; you can redistribute it and/or modify it under the terms of the GNU General Public License as published by the Free Software Foundation; either version 2 of the License, or (at your option) any later version.
This program is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License for more details.
You should have received a copy of the GNU General Public License along with this program (see COPYING); if not, check with http://www.gnu.org/copyleft/gpl.html or write to the Free Software Foundation, Inc., 59 Temple Place - Suite 330, Boston, MA 02111, USA.