deposit_your_work

Learning minimum volume sets with support vector machines

Files in this item

Files Size Format View
Dav2006Sep5Learningmi.PDF 731.0Kb application/pdf Thumbnail

Show full item record

Item Metadata

Title: Learning minimum volume sets with support vector machines
Author: Davenport, Mark A.; Baraniuk, Richard G.; Scott, Clayton D.
Type: Conference Paper
Citation: M. A. Davenport, R. G. Baraniuk and C. D. Scott,"Learning minimum volume sets with support vector machines," in IEEE Workshop on Machine Learning for Signal Processing (MLSP),, pp. 301-306.
Abstract: Given a probability law P on d-dimensional Euclidean space, the minimum volume set (MV-set) with mass beta , 0 < beta < 1, is the set with smallest volume enclosing a probability mass of at least beta. We examine the use of support vector machines (SVMs) for estimating an MV-set from a collection of data points drawn from P, a problem with applications in clustering and anomaly detection. We investigate both one-class and two-class methods. The two-class approach reduces the problem to Neyman-Pearson (NP) classification, where we artificially generate a second class of data points according to a uniform distribution. The simple approach to generating the uniform data suffers from the curse of dimensionality. In this paper we (1) describe the reduction of MV-set estimation to NP classification, (2) devise improved methods for generating artificial uniform data for the two-class approach, (3) advocate a new performance measure for systematic comparison of MV-set algorithms, and (4) establish a set of benchmark experiments to serve as a point of reference for future MV-set algorithms. We find that, in general, the two-class method performs more reliably.
Date Published: 2006-09-01

This item appears in the following Collection(s)

  • ECE Publications [1082 items]
    Publications by Rice University Electrical and Computer Engineering faculty and graduate students
  • DSP Publications [508 items]
    Publications by Rice Faculty and graduate students in digital signal processing.