Comparing Approaches to Subjectivity Classification: A Study on Portuguese Tweets

Siliva Moraes; Andre Santos; Matheus Redecker; Rackel Machado; Felipe Meneguzzi

doi:10.1007/978-3-319-41552-9_8

Comparing Approaches to Subjectivity Classification: A Study on Portuguese Tweets

Siliva Moraes, Andre Santos^*, Matheus Redecker, Rackel Machado, Felipe Meneguzzi

^*Corresponding author for this work

Computing Science

Pontifícia Universidade Católica do Rio Grande do Sul

Research output: Chapter in Book/Report/Conference proceeding › Published conference contribution

7 Citations (Scopus)

Abstract

In this paper, we compare lexicon-based and machine learning-based approaches to define the subjectivity of tweets in Portuguese. We tested SentiLex and WordAffectBR lexicons, and Sequential Machine Optimization and Naive Bayes algorithms for this task. In our study, we used the Computer-BR corpus that contains messages about the technology area. We obtained better results using the Comprehensive Measurement Feature Selection method and the Sequential Machine Optimization algorithm as the classifier. We achieved considerable accuracy when we included the polarities of words in the vector space model of tweets.

Original language	English
Title of host publication	Computational Processing of the Portuguese Language
Subtitle of host publication	PROPOR 2016
Editors	J. Silva, R Ribeiro, P. Quaresma, A. Adami, A. Branco
Publisher	Springer
Pages	86-94
Number of pages	9
ISBN (Electronic)	978-3-319-41552-9
ISBN (Print)	978-3-319-41551-2
DOIs	https://doi.org/10.1007/978-3-319-41552-9_8
Publication status	Published - 21 Jun 2016

Publication series

Name	Lecture Notes in Computer Science
Volume	9727

Bibliographical note

Moraes, S.M.W., Santos, A.L.L., Redecker, M., Machado, R.M., Meneguzzi, F.R. (2016). Comparing Approaches to Subjectivity Classification: A Study on Portuguese Tweets. In: Silva, J., Ribeiro, R., Quaresma, P., Adami, A., Branco, A. (eds) Computational Processing of the Portuguese Language. PROPOR 2016. Lecture Notes in Computer Science(), vol 9727. Springer, Cham. https://doi.org/10.1007/978-3-319-41552-9_8

Access to Document

10.1007/978-3-319-41552-9_8Licence: Unspecified

Cite this

Moraes, S., Santos, A., Redecker, M., Machado, R., & Meneguzzi, F. (2016). Comparing Approaches to Subjectivity Classification: A Study on Portuguese Tweets. In J. Silva, R. Ribeiro, P. Quaresma, A. Adami, & A. Branco (Eds.), Computational Processing of the Portuguese Language: PROPOR 2016 (pp. 86-94). (Lecture Notes in Computer Science; Vol. 9727). Springer . https://doi.org/10.1007/978-3-319-41552-9_8

Comparing Approaches to Subjectivity Classification: A Study on Portuguese Tweets. / Moraes, Siliva; Santos, Andre; Redecker, Matheus et al.
Computational Processing of the Portuguese Language: PROPOR 2016. ed. / J. Silva; R Ribeiro; P. Quaresma; A. Adami; A. Branco. Springer , 2016. p. 86-94 (Lecture Notes in Computer Science; Vol. 9727).

Research output: Chapter in Book/Report/Conference proceeding › Published conference contribution

Moraes, S, Santos, A, Redecker, M, Machado, R & Meneguzzi, F 2016, Comparing Approaches to Subjectivity Classification: A Study on Portuguese Tweets. in J Silva, R Ribeiro, P Quaresma, A Adami & A Branco (eds), Computational Processing of the Portuguese Language: PROPOR 2016. Lecture Notes in Computer Science, vol. 9727, Springer , pp. 86-94. https://doi.org/10.1007/978-3-319-41552-9_8

@inproceedings{a46491108b6242ee91581a3597cb993e,

title = "Comparing Approaches to Subjectivity Classification: A Study on Portuguese Tweets",

abstract = "In this paper, we compare lexicon-based and machine learning-based approaches to define the subjectivity of tweets in Portuguese. We tested SentiLex and WordAffectBR lexicons, and Sequential Machine Optimization and Naive Bayes algorithms for this task. In our study, we used the Computer-BR corpus that contains messages about the technology area. We obtained better results using the Comprehensive Measurement Feature Selection method and the Sequential Machine Optimization algorithm as the classifier. We achieved considerable accuracy when we included the polarities of words in the vector space model of tweets.",

author = "Siliva Moraes and Andre Santos and Matheus Redecker and Rackel Machado and Felipe Meneguzzi",

note = "Moraes, S.M.W., Santos, A.L.L., Redecker, M., Machado, R.M., Meneguzzi, F.R. (2016). Comparing Approaches to Subjectivity Classification: A Study on Portuguese Tweets. In: Silva, J., Ribeiro, R., Quaresma, P., Adami, A., Branco, A. (eds) Computational Processing of the Portuguese Language. PROPOR 2016. Lecture Notes in Computer Science(), vol 9727. Springer, Cham. https://doi.org/10.1007/978-3-319-41552-9_8",

year = "2016",

month = jun,

day = "21",

doi = "10.1007/978-3-319-41552-9_8",

language = "English",

isbn = "978-3-319-41551-2",

series = "Lecture Notes in Computer Science",

publisher = "Springer ",

pages = "86--94",

editor = "J. Silva and R Ribeiro and P. Quaresma and A. Adami and A. Branco",

booktitle = "Computational Processing of the Portuguese Language",

}

TY - GEN

T1 - Comparing Approaches to Subjectivity Classification

T2 - A Study on Portuguese Tweets

AU - Moraes, Siliva

AU - Santos, Andre

AU - Redecker, Matheus

AU - Machado, Rackel

AU - Meneguzzi, Felipe

N1 - Moraes, S.M.W., Santos, A.L.L., Redecker, M., Machado, R.M., Meneguzzi, F.R. (2016). Comparing Approaches to Subjectivity Classification: A Study on Portuguese Tweets. In: Silva, J., Ribeiro, R., Quaresma, P., Adami, A., Branco, A. (eds) Computational Processing of the Portuguese Language. PROPOR 2016. Lecture Notes in Computer Science(), vol 9727. Springer, Cham. https://doi.org/10.1007/978-3-319-41552-9_8

PY - 2016/6/21

Y1 - 2016/6/21

N2 - In this paper, we compare lexicon-based and machine learning-based approaches to define the subjectivity of tweets in Portuguese. We tested SentiLex and WordAffectBR lexicons, and Sequential Machine Optimization and Naive Bayes algorithms for this task. In our study, we used the Computer-BR corpus that contains messages about the technology area. We obtained better results using the Comprehensive Measurement Feature Selection method and the Sequential Machine Optimization algorithm as the classifier. We achieved considerable accuracy when we included the polarities of words in the vector space model of tweets.

AB - In this paper, we compare lexicon-based and machine learning-based approaches to define the subjectivity of tweets in Portuguese. We tested SentiLex and WordAffectBR lexicons, and Sequential Machine Optimization and Naive Bayes algorithms for this task. In our study, we used the Computer-BR corpus that contains messages about the technology area. We obtained better results using the Comprehensive Measurement Feature Selection method and the Sequential Machine Optimization algorithm as the classifier. We achieved considerable accuracy when we included the polarities of words in the vector space model of tweets.

U2 - 10.1007/978-3-319-41552-9_8

DO - 10.1007/978-3-319-41552-9_8

M3 - Published conference contribution

SN - 978-3-319-41551-2

T3 - Lecture Notes in Computer Science

SP - 86

EP - 94

BT - Computational Processing of the Portuguese Language

A2 - Silva, J.

A2 - Ribeiro, R

A2 - Quaresma, P.

A2 - Adami, A.

A2 - Branco, A.

PB - Springer

ER -

Comparing Approaches to Subjectivity Classification: A Study on Portuguese Tweets

Abstract

Publication series

Bibliographical note

Access to Document

Fingerprint

Cite this