|
Ntrain = Different Methods |
5 | 10 | 20 | 50 |
| Image Correlation | 4.58 ± 0.26% | 5.06 ± 0.26% | 6.60 ± 0.26% | 8.85 ± 0.26% |
| Spatial Pyramid Matching (algorithm of Lazebnik et. al.) |
18.74 ± 0.48% | 25.01 ± 0.50% | 31.31 ± 0.74% | 39.01 ± 0.45% |
| Combining Multiple Kernels in an SVM Framework (submitted by Manik Varma) |
43.99% | 53.34% | 59.84% | 66.44% |
On the top left we plot the error rates for each algorithm and each category, with SPM error rates on the x-axis and the Manik Varma's error rates on the y-axis. There are 256 dots corresponding to 256 categories. Except for a few outliers, the Varma algorithm does better on almost every category. To be specific, it performs better on 92% of the categories.
The fact that this scatter plot is so one-sided suggests that the vanilla SPM algorithm does not add much, if anything, to Varma's method based on a linear combination of kernels. In other words, it has very little originality compared to Varma. This is perhaps not surprising, since SPM is one of the kernels that Varma uses in his linear ensemble.

Define the distance between any two categories as the mean distance to the nearest common parent. For example, on the tree to the left, the distance between duck and goose is 1, since they are each 1 level away from the node animate=>animal=>air. The distance between boom-box and breadmaker is 2 since this is the mean distance to their nearest common ancestor inanimate=>electronics.
For any two categories, compute their distance on each of the above hierarchies. Take the minimim of these two distances and apply the following formula to arrive at the cost of misclassification:
| Distance | Non-Hierarchical Cost | Hierarchical Cost | Categories are... |
| 0 | 0.0 | 0.0 | correct |
| 1 | 1.0 | 0.5 | reasonable mistake |
| 2 | 1.0 | 1.0 | mistake |
| ≥3 | 1.0 | 2.0 | bad mistake |
In the non-hierarchical case, we do not have a tree which lets us discriminate between similar and different categories. Thus every misclassification carries the same unit cost. Using the above relations we derive misclassification costs matrices C which looks like this:
| Similarity Measured Using : | Ordinary Cost Matrix (0 to 100%) |
Hierarchical Cost Matrix (-100 to 100%) | ||||||
|
Ntrain : |
5 | 10 | 20 | 50 | 5 | 10 | 20 | 50 |
| Image Correlation | 4.6% | 5.1% | 6.6% | 8.8% | -58.9% | -57.3% | -54.5% | -50.4% |
| Spatial Pyramid Matching | 18.7% | 25.0% | 31.3% | 39.0% | -23.2% | -12.5% | -2.2% | 10.1% |
| Combining Multiple Kernels (Manik Varma) |
44.0% | 53.3 | 59.8 | 66.4 | 13.4% | 28.4 | 38.6 | 48.7 |
|