Implicit Feature Selection with the Value Difference Metric

T R Payne, Peter Edwards

Research output: Chapter in Book/Report/Conference proceedingPublished conference contribution

19 Downloads (Pure)

Abstract

The nearest neighbour paradigm provides an effective approach to supervised learning. However, it is especially susceptible to the presence of irrelevant attributes.
Whilst many approaches have been proposed that select only the most relevant attributes within a data set, these approaches involve pre-processing the data in some way, and can often be computationally complex. The Value Difference Metric (VDM) is a symbolic distance metric used by a number of different nearest neighbour learning algorithms. This paper demonstrates how the VDM can be used to reduce the impact
of irrelevant attributes on classification accuracy without the need for pre-processing the data. We illustrate how this metric uses simple probabilistic techniques to weight features in the instance space, and then apply this weighting technique to an
alternative symbolic distance metric. The resulting distance metrics are compared in terms of classification accuracy, on a number of real-world and artificial data sets.
Original languageEnglish
Title of host publicationProceedings of the European Conference on Artificial Intelligence - ECAI-98
EditorsHenri Prade
PublisherWiley
Pages450-454
Publication statusPublished - 1998

Keywords

  • machine learning
  • nearest-neighbour
  • feature selection

Fingerprint

Dive into the research topics of 'Implicit Feature Selection with the Value Difference Metric'. Together they form a unique fingerprint.

Cite this