One of the first real-world experiments carried out with SVM was done at AT&T Bell Labs using optical character recognition (OCR) data ([17,54]). These early experiments already showed the astonishingly high accuracies for SVMs which was on a par with convolutive multi-layer perceptrons. Below we record the classification performance of SVM, some variants not discussed in this chapter, and other classifiers on the USPS (US-Postal Service) benchmark (parts from [62]). The task is to classify handwritten digits into one of ten classes. Standard SVMs as presented here already exhibit a performance similar to other methods.
A benchmark problem larger than the USPS data set ( patterns)
was collected by NIST and contains
handwritten
digits. Invariant SVMs achieve an error rate of
([19]) on this challenging and more realistic data set,
slightly better than the tangent distance method (
) or single neural
networks (LeNet
:
). An ensemble of LeNet
networks
that was trained on a vast number of artificially generated patterns
(using invariance transformations) is on a par with the best SVM and
also achieved
([36]).
The high dimensional problem of text categorization seems to be an
application for which SVMs have performed particularly well. A popular
benchmark is the Reuters- text corpus, where Reuters
collected
news stories from 1997, and partitioned and
indexed them into
different categories to simplify the
access. The feature typically used to classify Reuters documents are
-dimensional vectors containing word frequencies within
a document. With such a coding SVMs have achieved excellent results
(see e.g. [31]).
A challenging problem of computational chemistry is the discovery of active chemical compounds. The goal is to found the compounds that bind to a certain molecule in as few iterations of biological testing as possible. An astonishingly effective application of SVM to this problem has been recently proposed by [75].
There are numerous other applications to which SVM were successfully applied. Examples are object and face recognition tasks ([46]), inverse problems ([71,77]), gene array expression monitoring ([15]), remote protein homology detection ([30]), or splice site detection ([79,64]). This list can be extended.