TY - JOUR
T1 - Fully-unsupervised embeddings-based hypernym discovery
AU - Atzori, Maurizio
AU - Balloccu, Simone
N1 - Funding: Supported in part by Sardegna Ricerche project OKgraph (CRP 120) and MIUR PRIN 2017 (2019-2022) project HOPE (High quality Open data Publishing and Enrichment).
PY - 2020/5/18
Y1 - 2020/5/18
N2 - The hypernymy relation is the one occurring between an instance term and its general term (e.g., "lion" and "animal", "Italy" and "country"). This paper addresses Hypernym Discovery, the NLP task of finding valid hypernyms for words in a given text, and proposes HyperRank, an unsupervised approach that, unlike most approaches in the literature, does not require manually labeled training sets. The proposed algorithm exploits the cosine distance between points in the vector space of word embeddings, as in previous state-of-the-art approaches, but the resulting ranking is corrected by also weighting word frequencies and the absolute level of similarity, which is expected to be similar when measuring co-hyponyms and their common hypernym. This brings two major advantages over other approaches: (1) we correct the inadequacy of semantic similarity, which is known to cause a significant performance drop, and (2) we take into account multiple words if provided, making it possible to find common hypernyms for a set of co-hyponyms, a task ignored by other systems but very useful when coupled with set expansion (which finds co-hyponyms automatically). We then evaluate HyperRank on the SemEval 2018 Hypernym Discovery task and show that, regardless of language or domain, our algorithm significantly outperforms all existing unsupervised algorithms and some supervised ones as well. We also evaluate the algorithm on a new dataset to measure the improvement when finding hypernyms for sets of words instead of singletons.
AB - The hypernymy relation is the one occurring between an instance term and its general term (e.g., "lion" and "animal", "Italy" and "country"). This paper addresses Hypernym Discovery, the NLP task of finding valid hypernyms for words in a given text, and proposes HyperRank, an unsupervised approach that, unlike most approaches in the literature, does not require manually labeled training sets. The proposed algorithm exploits the cosine distance between points in the vector space of word embeddings, as in previous state-of-the-art approaches, but the resulting ranking is corrected by also weighting word frequencies and the absolute level of similarity, which is expected to be similar when measuring co-hyponyms and their common hypernym. This brings two major advantages over other approaches: (1) we correct the inadequacy of semantic similarity, which is known to cause a significant performance drop, and (2) we take into account multiple words if provided, making it possible to find common hypernyms for a set of co-hyponyms, a task ignored by other systems but very useful when coupled with set expansion (which finds co-hyponyms automatically). We then evaluate HyperRank on the SemEval 2018 Hypernym Discovery task and show that, regardless of language or domain, our algorithm significantly outperforms all existing unsupervised algorithms and some supervised ones as well. We also evaluate the algorithm on a new dataset to measure the improvement when finding hypernyms for sets of words instead of singletons.
KW - Hypernym discovery
KW - Natural language processing
KW - Natural language understanding
KW - Unsupervised learning
KW - Word embeddings
KW - Word2vec
UR - http://www.scopus.com/inward/record.url?scp=85086072243&partnerID=8YFLogxK
U2 - 10.3390/INFO11050268
DO - 10.3390/INFO11050268
M3 - Article
AN - SCOPUS:85086072243
VL - 11
JO - Information (Switzerland)
JF - Information (Switzerland)
SN - 2078-2489
IS - 5
M1 - 268
ER -