Contextual information usage for the enhancement of basic emotion classification in a weakly labelled social network dataset in Spanish
Resumen: Basic emotion classification is one of the main tasks of Sentiment Analysis usually performed by using several machine learning techniques. One of the main issues in Sentiment Analysis is the availability of tagged resources to properly train supervised classification algorithms. This is of particular concern in languages other than English, such as Spanish, where scarcity of these resources is the norm. In addition, most basic emotion datasets available in Spanish are rather small, containing a few hundred (or thousand) samples. Usually, the samples only contain a short text (frequently a comment) and a tag (the basic emotion), omitting crucial contextual information that may help to improve the classification task results. In this paper, the impact of using contextual information is measured on a recently published Spanish basic emotion dataset and the baseline architecture proposed in the Semantic Evaluation 2019 competition. This particular dataset has two main advantages for this paper. First, it was compiled using Distant Supervision and as a result it contains several hundred thousand samples. Secondly, the authors included valuable contextual information for each comment. The results show that contextual information, such as news headlines or summaries, helps improve the classification accuracy over a dataset of distantly supervised basic emotion labelled comments.
Idioma: Inglés
DOI: 10.1007/s11042-022-13750-x
Año: 2023
Publicado en: Multimedia Tools and Applications 82 (2023), 9871–9890
ISSN: 1380-7501

Factor impacto JCR: 3.0 (2023)
Categ. JCR: COMPUTER SCIENCE, INFORMATION SYSTEMS rank: 95 / 250 = 0.38 (2023) - Q2 - T2
Categ. JCR: ENGINEERING, ELECTRICAL & ELECTRONIC rank: 136 / 353 = 0.385 (2023) - Q2 - T2
Categ. JCR: COMPUTER SCIENCE, THEORY & METHODS rank: 41 / 144 = 0.285 (2023) - Q2 - T1
Categ. JCR: COMPUTER SCIENCE, SOFTWARE ENGINEERING rank: 37 / 132 = 0.28 (2023) - Q2 - T1

Factor impacto CITESCORE: 7.2 - Hardware and Architecture (Q1) - Media Technology (Q1) - Computer Networks and Communications (Q1) - Software (Q2)

Factor impacto SCIMAGO: 0.801 - Media Technology (Q1) - Software (Q2) - Computer Networks and Communications (Q2) - Hardware and Architecture (Q2)

Financiación: info:eu-repo/grantAgreement/ES/DGA-FEDER/T60-20R-AFFECTIVE LAB
Financiación: info:eu-repo/grantAgreement/ES/MICINN/RTI2018-096986-B-C31
Tipo y forma: Artículo (PostPrint)
Área (Departamento): Área Lenguajes y Sistemas Inf. (Dpto. Informát.Ingenie.Sistms.)

Creative Commons Debe reconocer adecuadamente la autoría, proporcionar un enlace a la licencia e indicar si se han realizado cambios. Puede hacerlo de cualquier manera razonable, pero no de una manera que sugiera que tiene el apoyo del licenciador o lo recibe por el uso que hace.


Exportado de SIDERAL (2024-11-22-11:57:52)


Visitas y descargas

Este artículo se encuentra en las siguientes colecciones:
Artículos > Artículos por área > Lenguajes y Sistemas Informáticos



 Registro creado el 2023-09-21, última modificación el 2024-11-25


Postprint:
 PDF
Valore este documento:

Rate this document:
1
2
3
 
(Sin ninguna reseña)