Polysemy in Spoken Conversations and Written Texts - Télécom Paris
Conference paper, Year: 2022

Polysemy in Spoken Conversations and Written Texts

Abstract

Our discourses are full of potential lexical ambiguities, due in part to the pervasive use of words with multiple senses. Sometimes a single word is even used in more than one sense within the same text. But to what extent does this hold across different kinds of texts? Does the use of polysemous words change when a discourse involves two people, or when speakers have time to plan what to say? We investigate these questions by comparing the polysemy level of texts of different kinds, with a focus on spontaneous spoken dialogs, unlike previous work, which examines solely scripted, written, monolog-like data. We compare multiple metrics that presuppose different conceptualizations of text polysemy: they consider the observed or the potential number of senses of words, or their sense distribution in a discourse. We show that the polysemy level of texts varies greatly depending on the kind of text considered, with dialog and spoken discourses generally having a higher polysemy level than written monologs. Additionally, our results emphasize the need to relax the popular "one sense per discourse" hypothesis.
Main file
2022.lrec-1.179.pdf (381.99 KB)
Origin: Publisher files allowed on an open archive

Dates and versions

hal-03860827, version 1 (18-11-2022)

Identifiers

  • HAL Id: hal-03860827, version 1

Cite

Aina Garí Soler, Matthieu Labeau, Chloé Clavel. Polysemy in Spoken Conversations and Written Texts. 13th Conference on Language Resources and Evaluation (LREC 2022), Jun 2022, Marseille, France. ⟨hal-03860827⟩
