Combining information extraction with genetic algorithms for text mining

John Atkinson, Christopher Stuart Mellish, Stuart Aitken

Research output: Contribution to journalArticle

37 Citations (Scopus)

Abstract

An evolutionary approach that combines information extraction technology and genetic algorithms can produce a new, integrated model for text mining. Text mining discovers unseen patterns in textual databases. We've brought together the benefits of GAs for data mining and IE technology to propose a new approach for high-level knowledge discovery. Unlike previous KDT approaches, our model doesn't rely on external resources or conceptual descriptions. Instead, it performs the discovery using only information from the original corpus of text documents and from training data computed from them. The GA that produces the hypotheses is strongly guided by semantic constraints, which means that several specifically defined metrics evaluate the quality and plausibility.
Original languageEnglish
Pages (from-to)22-30
Number of pages9
JournalIEEE Intelligent Systems
Volume19
Issue number3
DOIs
Publication statusPublished - May 2004

Fingerprint

Data mining
Genetic algorithms
Semantics

Keywords

  • data mining
  • genetic algorithms
  • knowledge discovery from texts (KDTs)
  • multiobjective optimization
  • semantic analysis
  • text mining
  • information extraction
  • textual database
  • information analysis
  • natural languages
  • pattern analysis
  • performance analysis
  • robustness

Cite this

Combining information extraction with genetic algorithms for text mining. / Atkinson, John; Mellish, Christopher Stuart; Aitken, Stuart.

In: IEEE Intelligent Systems, Vol. 19, No. 3, 05.2004, p. 22-30.

Research output: Contribution to journalArticle

Atkinson, John ; Mellish, Christopher Stuart ; Aitken, Stuart. / Combining information extraction with genetic algorithms for text mining. In: IEEE Intelligent Systems. 2004 ; Vol. 19, No. 3. pp. 22-30.
@article{07f2b7514f83460792bd1534b5c89687,
title = "Combining information extraction with genetic algorithms for text mining",
abstract = "An evolutionary approach that combines information extraction technology and genetic algorithms can produce a new, integrated model for text mining. Text mining discovers unseen patterns in textual databases. We've brought together the benefits of GAs for data mining and IE technology to propose a new approach for high-level knowledge discovery. Unlike previous KDT approaches, our model doesn't rely on external resources or conceptual descriptions. Instead, it performs the discovery using only information from the original corpus of text documents and from training data computed from them. The GA that produces the hypotheses is strongly guided by semantic constraints, which means that several specifically defined metrics evaluate the quality and plausibility.",
keywords = "data mining, genetic algorithms, knowledge discovery from texts (KDTs), multiobjective optimization, semantic analysis, text mining, information extraction, textual database, information analysis, natural languages, pattern analysis, performance analysis, robustness",
author = "John Atkinson and Mellish, {Christopher Stuart} and Stuart Aitken",
year = "2004",
month = "5",
doi = "10.1109/MIS.2004.4",
language = "English",
volume = "19",
pages = "22--30",
journal = "IEEE Intelligent Systems",
issn = "1541-1672",
publisher = "Institute of Electrical and Electronics Engineers Inc.",
number = "3",

}

TY - JOUR

T1 - Combining information extraction with genetic algorithms for text mining

AU - Atkinson, John

AU - Mellish, Christopher Stuart

AU - Aitken, Stuart

PY - 2004/5

Y1 - 2004/5

N2 - An evolutionary approach that combines information extraction technology and genetic algorithms can produce a new, integrated model for text mining. Text mining discovers unseen patterns in textual databases. We've brought together the benefits of GAs for data mining and IE technology to propose a new approach for high-level knowledge discovery. Unlike previous KDT approaches, our model doesn't rely on external resources or conceptual descriptions. Instead, it performs the discovery using only information from the original corpus of text documents and from training data computed from them. The GA that produces the hypotheses is strongly guided by semantic constraints, which means that several specifically defined metrics evaluate the quality and plausibility.

AB - An evolutionary approach that combines information extraction technology and genetic algorithms can produce a new, integrated model for text mining. Text mining discovers unseen patterns in textual databases. We've brought together the benefits of GAs for data mining and IE technology to propose a new approach for high-level knowledge discovery. Unlike previous KDT approaches, our model doesn't rely on external resources or conceptual descriptions. Instead, it performs the discovery using only information from the original corpus of text documents and from training data computed from them. The GA that produces the hypotheses is strongly guided by semantic constraints, which means that several specifically defined metrics evaluate the quality and plausibility.

KW - data mining

KW - genetic algorithms

KW - knowledge discovery from texts (KDTs)

KW - multiobjective optimization

KW - semantic analysis

KW - text mining

KW - information extraction

KW - textual database

KW - information analysis

KW - natural languages

KW - pattern analysis

KW - performance analysis

KW - robustness

U2 - 10.1109/MIS.2004.4

DO - 10.1109/MIS.2004.4

M3 - Article

VL - 19

SP - 22

EP - 30

JO - IEEE Intelligent Systems

JF - IEEE Intelligent Systems

SN - 1541-1672

IS - 3

ER -