Protein-Protein Interactions Classification from Text via Local Learning with Class Priors

Yulan He; Chenghua Lin

doi:10.1007/978-3-642-12550-8_15

Computing Science

Research output: Chapter in Book/Report/Conference proceeding › Published conference contribution

1 Citation (Scopus)

Abstract

Text classification is essential for narrowing down the number of documents relevant to a particular topic for further pursual, especially when searching through large biomedical databases. Protein-protein interactions are an example of such a topic with databases being devoted specifically to them. This paper proposed a semi-supervised learning algorithm via local learning with class priors (LL-CP) for biomedical text classification where unlabeled data points are classified in a vector space based on their proximity to labeled nodes. The algorithm has been evaluated on a corpus of biomedical documents to identify abstracts containing information about protein-protein interactions with promising results. Experimental results show that LL-CP outperforms the traditional semi-supervised learning algorithms such as SVM and it also performs better than local learning without incorporating class priors.
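The abstract's central idea, labelling unlabeled points by their proximity to labeled ones while taking class priors into account, can be illustrated with a small sketch. This is not the authors' LL-CP algorithm, whose details are not given in this record. It substitutes a simple distance-weighted nearest-neighbour vote, scaled by assumed class priors. The function name `classify_with_priors`, the toy vectors, the class names, and the prior values are all hypothetical.

```python
import math
from collections import defaultdict

def classify_with_priors(labeled, unlabeled, priors, k=3):
    """Label each unlabeled vector by a prior-weighted vote of its k nearest labeled vectors.

    Illustrative stand-in only; not the LL-CP method from the paper.
    """
    predictions = []
    for x in unlabeled:
        # Keep the k labeled points closest to x (Euclidean distance).
        nearest = sorted(labeled, key=lambda item: math.dist(item[0], x))[:k]
        scores = defaultdict(float)
        for vec, label in nearest:
            # Closer neighbours contribute more; the constant avoids division by zero.
            scores[label] += 1.0 / (math.dist(vec, x) + 1e-9)
        # Scale each class score by its assumed prior probability.
        for label in scores:
            scores[label] *= priors.get(label, 0.0)
        predictions.append(max(scores, key=scores.get))
    return predictions

# Toy 2-D "document vectors" (hypothetical data, not from the paper's corpus).
labeled = [
    ((0.0, 0.0), "non-PPI"),
    ((0.1, 0.2), "non-PPI"),
    ((1.0, 1.0), "PPI"),
    ((0.9, 1.1), "PPI"),
]
priors = {"PPI": 0.3, "non-PPI": 0.7}  # assumed class priors
print(classify_with_priors(labeled, [(0.95, 1.0), (0.05, 0.1)], priors, k=3))
```

In this sketch the priors only rescale the neighbourhood vote, so a strongly skewed prior can override a narrow local majority. How LL-CP itself incorporates the priors is described in the paper, not here.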

Original language	English
Title of host publication	Natural Language Processing and Information Systems
Subtitle of host publication	14th International Conference on Applications of Natural Language to Information Systems, NLDB 2009, Saarbrücken, Germany, June 24-26, 2009. Revised Papers
Publisher	Springer Berlin / Heidelberg
Pages	182-191
Number of pages	10
ISBN (Electronic)	978-3-642-12550-8
ISBN (Print)	978-3-642-12549-2
DOIs	https://doi.org/10.1007/978-3-642-12550-8_15
Publication status	Published - 2010

Publication series

Name	Lecture Notes in Computer Science
Publisher	Springer Berlin Heidelberg
Volume	5723
ISSN (Print)	0302-9743

Cite this

He, Y., & Lin, C. (2010). Protein-Protein Interactions Classification from Text via Local Learning with Class Priors. In Natural Language Processing and Information Systems: 14th International Conference on Applications of Natural Language to Information Systems, NLDB 2009, Saarbrücken, Germany, June 24-26, 2009. Revised Papers (pp. 182-191). (Lecture Notes in Computer Science; Vol. 5723). Springer Berlin / Heidelberg. https://doi.org/10.1007/978-3-642-12550-8_15

Protein-Protein Interactions Classification from Text via Local Learning with Class Priors. / He, Yulan; Lin, Chenghua.
Natural Language Processing and Information Systems: 14th International Conference on Applications of Natural Language to Information Systems, NLDB 2009, Saarbrücken, Germany, June 24-26, 2009. Revised Papers. Springer Berlin / Heidelberg, 2010. p. 182-191 (Lecture Notes in Computer Science; Vol. 5723).

He, Y & Lin, C 2010, Protein-Protein Interactions Classification from Text via Local Learning with Class Priors. in Natural Language Processing and Information Systems: 14th International Conference on Applications of Natural Language to Information Systems, NLDB 2009, Saarbrücken, Germany, June 24-26, 2009. Revised Papers. Lecture Notes in Computer Science, vol. 5723, Springer Berlin / Heidelberg, pp. 182-191. https://doi.org/10.1007/978-3-642-12550-8_15

He Y, Lin C. Protein-Protein Interactions Classification from Text via Local Learning with Class Priors. In Natural Language Processing and Information Systems: 14th International Conference on Applications of Natural Language to Information Systems, NLDB 2009, Saarbrücken, Germany, June 24-26, 2009. Revised Papers. Springer Berlin / Heidelberg. 2010. p. 182-191. (Lecture Notes in Computer Science). doi: 10.1007/978-3-642-12550-8_15

He, Yulan ; Lin, Chenghua. / Protein-Protein Interactions Classification from Text via Local Learning with Class Priors. Natural Language Processing and Information Systems: 14th International Conference on Applications of Natural Language to Information Systems, NLDB 2009, Saarbrücken, Germany, June 24-26, 2009. Revised Papers. Springer Berlin / Heidelberg, 2010. pp. 182-191 (Lecture Notes in Computer Science).

@inproceedings{28be7b83c32544069ffba0b9f0a84eba,
  title = "Protein-Protein Interactions Classification from Text via Local Learning with Class Priors",
  abstract = "Text classification is essential for narrowing down the number of documents relevant to a particular topic for further pursual, especially when searching through large biomedical databases. Protein-protein interactions are an example of such a topic with databases being devoted specifically to them. This paper proposed a semi-supervised learning algorithm via local learning with class priors (LL-CP) for biomedical text classification where unlabeled data points are classified in a vector space based on their proximity to labeled nodes. The algorithm has been evaluated on a corpus of biomedical documents to identify abstracts containing information about protein-protein interactions with promising results. Experimental results show that LL-CP outperforms the traditional semi-supervised learning algorithms such as SVM and it also performs better than local learning without incorporating class priors.",
  author = "Yulan He and Chenghua Lin",
  year = "2010",
  doi = "10.1007/978-3-642-12550-8_15",
  language = "English",
  isbn = "978-3-642-12549-2",
  series = "Lecture Notes in Computer Science",
  publisher = "Springer Berlin / Heidelberg",
  pages = "182--191",
  booktitle = "Natural Language Processing and Information Systems",
}

TY  - GEN
T1  - Protein-Protein Interactions Classification from Text via Local Learning with Class Priors
AU  - He, Yulan
AU  - Lin, Chenghua
PY  - 2010
Y1  - 2010
AB  - Text classification is essential for narrowing down the number of documents relevant to a particular topic for further pursual, especially when searching through large biomedical databases. Protein-protein interactions are an example of such a topic with databases being devoted specifically to them. This paper proposed a semi-supervised learning algorithm via local learning with class priors (LL-CP) for biomedical text classification where unlabeled data points are classified in a vector space based on their proximity to labeled nodes. The algorithm has been evaluated on a corpus of biomedical documents to identify abstracts containing information about protein-protein interactions with promising results. Experimental results show that LL-CP outperforms the traditional semi-supervised learning algorithms such as SVM and it also performs better than local learning without incorporating class priors.
DO  - 10.1007/978-3-642-12550-8_15
M3  - Published conference contribution
SN  - 978-3-642-12549-2
T3  - Lecture Notes in Computer Science
SP  - 182
EP  - 191
BT  - Natural Language Processing and Information Systems
PB  - Springer Berlin / Heidelberg
ER  -
