Making a dictionary without words: lemmatization problems in a sign language dictionary

Jette Hedegaard Kristoffersen, Thomas Troelsgård

Publikation: Bidrag til bog/antologi/rapportKonferenceartikel i proceedingpeer review

Abstract

This paper addresses some of the particular problems connected with lemma representation and lemmatization in a sign language dictionary. The paper is mainly based on the authors' work experience from the Danish Sign Language Dictionary project. In a sign language dictionary sign representation constitutes a problem. as there is - at least for Danish Sign Language - no conventional notation used by native signers and the various other sign user groups. We look into the different possibilities of representing signs and present the solution that we chose for the Danish Sign Language Dictionary. Defining the criteria for lernmatization is another area where sign language dictionaries differ from written language dictionaries. The criteria should obviously include the manual expression of the signs, but a sign's manual expression has features from several categories (e.g. handshape, place of articulation and movement). Also non-manual elements such as mouth movement could be taken into consideration when defining the lemmatization criteria. As we defined the lemmatization criteria for the Danish Sign Language Dictionary we aimed for a solution that would result in relatively few homonyms, but that at the same time would not lead to very large polysemous entries. We also tried to define the criteria so that the resulting entries would reflect the lexicon of Danish Sign Language rather than resembling a Danish dictionary.
OriginalsprogEngelsk
TitelE-lexicography in the 21st century: New challenges, new applications : proceedings of eLex 2009, Louvain-la Neuve, 22-24 october 2009
RedaktørerSylviane Granger, Magali Paquot
Antal sider8
UdgivelsesstedLouvain
ForlagPresses Universitaires de Louvain
Publikationsdato2010
Sider165-172
ISBN (Trykt)978-2-87463-211-2
StatusUdgivet - 2010
BegivenhedeLEX : Electronic lexicography in the 21st century: New challenges, new applications - Centre for English Corpus Linguistics (CECL), Université Catholique de Louvian, Louvain-la-Neuve, Belgien
Varighed: 22 okt. 200924 okt. 2009
Konferencens nummer: 1
http://www.uclouvain.be/en-cecl-elexicography.html

Konference

KonferenceeLEX : Electronic lexicography in the 21st century
Nummer1
LokationCentre for English Corpus Linguistics (CECL), Université Catholique de Louvian
Land/OmrådeBelgien
ByLouvain-la-Neuve
Periode22/10/0924/10/09
Internetadresse
NavnCahiers du Cental (Louvain-la-Neuve)

Emneord

  • Medier, kommunikation og sprog
  • Dictionary
  • Lemmatisation
  • Lemmatisering
  • Lexicography
  • Ordbog
  • Ordbog over Dansk Tegnsprog
  • Sign Language
  • Sign Language Dictionary
  • Tegnsprog
  • The Danish Sign Language Dictionary
  • ODTS: Ordbog over Dansk Tegnsprog

    Troelsgård, T. (Projektdeltager), Pedersen, M. J. (Projektdeltager), Hårdell, A. K. S. (Projektdeltager), Kristoffersen, J. H. (Projektleder), Abildgaard, E. (Projektdeltager) & Kjeldsen, K. K. (Projektdeltager)

    01/08/03 → …

    Projekter: ProjektForskning

    Fil

Citationsformater