gist-classify
Description: Classify a set of unlabeled vectors using a trained support vector machine.
Usage:
gist-classify [options] -train <filename> -learned <filename> -test <filename>
Input:
- -train <filename> - a tab-delimited file of training examples. The first column contains labels, and the remaining columns contain real-valued features. Missing values are not allowed.
- -learned <filename> - a file of weights, as output from
compute-weights
. The kernel parameters are read from the header of this file.- -test <filename> - a tab-delimited file of test examples to be classified. Missing values are not allowed.
Note that, if the given weights file was trained on an externally defined kernel (using the '-matrix' option to
compute-weights
), then the file specified by the '-test' option should contain a kernel matrix, rather than test set examples. The kernel matrix is an n+1 (rows) by m+1 (columns) matrix, where n is the number of test examples and m is the number of training examples. The first row and column contain data labels. The matrix entry for row x, column y, contains the kernel value K(x,y). If a kernel matrix is supplied in place of the test set, then a training set should not be specified. However, if the kernel is normalized or is radial basis, then the '-selftrain' and '-selftest' files must be supplied (see below).Output: A three-column file containing data labels, classifications (-1 or 1) and discriminants for the test set.
Options:
- -selftrain <filename> - Read from the given file a series of n values of the form K(x,x), where K is the base kernel function and x is an element in the training set. This option is only necessary if the base kernel matrix is supplied from a file (using the -matrix option in
compute-weights
) and the kernel is normalized or radial basis. The input file should be in tab-delimited format, with titles in the first row, data labels in the first column and values in the second column.- -selftest <filename> - Similar to '-selftrain', but for the test set. This option is only needed if you used the -matrix option in
compute-weights
.- -rdb - Allow the program to read and create RDB formatted files, which contain an additional format line after the first line of text.
- -kernelout - Compute and print the kernel matrix to stdout. Do not compute the classifications.
- -notime - Do not include timing information in the output header.
- -precision <value> - Number of digits after the decimal place in the output file. The default is 6.
- -verbose 1|2|3|4|5 - Set the verbosity level of the output to stderr. The default level is 2.
Gist