Contextual information usage for the enhancement of basic emotion classification in a weakly labelled social network dataset in Spanish
Resumen: Basic emotion classification is one of the main tasks of Sentiment Analysis usually performed by using several machine learning techniques. One of the main issues in Sentiment Analysis is the availability of tagged resources to properly train supervised classification algorithms. This is of particular concern in languages other than English, such as Spanish, where scarcity of these resources is the norm. In addition, most basic emotion datasets available in Spanish are rather small, containing a few hundred (or thousand) samples. Usually, the samples only contain a short text (frequently a comment) and a tag (the basic emotion), omitting crucial contextual information that may help to improve the classification task results. In this paper, the impact of using contextual information is measured on a recently published Spanish basic emotion dataset and the baseline architecture proposed in the Semantic Evaluation 2019 competition. This particular dataset has two main advantages for this paper. First, it was compiled using Distant Supervision and as a result it contains several hundred thousand samples. Secondly, the authors included valuable contextual information for each comment. The results show that contextual information, such as news headlines or summaries, helps improve the classification accuracy over a dataset of distantly supervised basic emotion labelled comments.
Idioma: Inglés
DOI: 10.1007/s11042-022-13750-x
Año: 2023
Publicado en: Multimedia Tools and Applications 82 (2023), 9871–9890
ISSN: 1380-7501

Factor impacto JCR: 3.0 (2023)
Categ. JCR: COMPUTER SCIENCE, INFORMATION SYSTEMS rank: 95 / 250 = 0.38 (2023) - Q2 - T2
Categ. JCR: ENGINEERING, ELECTRICAL & ELECTRONIC rank: 136 / 353 = 0.385 (2023) - Q2 - T2
Categ. JCR: COMPUTER SCIENCE, THEORY & METHODS rank: 41 / 144 = 0.285 (2023) - Q2 - T1
Categ. JCR: COMPUTER SCIENCE, SOFTWARE ENGINEERING rank: 37 / 132 = 0.28 (2023) - Q2 - T1

Factor impacto CITESCORE: 7.2 - Hardware and Architecture (Q1) - Media Technology (Q1) - Computer Networks and Communications (Q1) - Software (Q2)

Factor impacto SCIMAGO: 0.801 - Media Technology (Q1) - Software (Q2) - Computer Networks and Communications (Q2) - Hardware and Architecture (Q2)

Financiación: info:eu-repo/grantAgreement/ES/DGA-FEDER/T60-20R-AFFECTIVE LAB
Financiación: info:eu-repo/grantAgreement/ES/MICINN/RTI2018-096986-B-C31
Tipo y forma: Article (PostPrint)
Área (Departamento): Área Lenguajes y Sistemas Inf. (Dpto. Informát.Ingenie.Sistms.)
Exportado de SIDERAL (2024-11-22-11:57:52)


Visitas y descargas

Este artículo se encuentra en las siguientes colecciones:
articulos > articulos-por-area > lenguajes_y_sistemas_informaticos



 Notice créée le 2023-09-21, modifiée le 2024-11-25


Postprint:
 PDF
Évaluer ce document:

Rate this document:
1
2
3
 
(Pas encore évalué)