Making a dictionary without words: lemmatization problems in a sign language dictionary

Jette Hedegaard Kristoffersen, Thomas Troelsgård

Publikation: Bidrag til bog/antologi/rapportKonferenceartikel i proceedingForskningpeer review


This paper addresses some of the particular problems connected with lemma representation and lemmatization in a sign language dictionary. The paper is mainly based on the authors' work experience from the Danish Sign Language Dictionary project. In a sign language dictionary sign representation constitutes a problem. as there is - at least for Danish Sign Language - no conventional notation used by native signers and the various other sign user groups. We look into the different possibilities of representing signs and present the solution that we chose for the Danish Sign Language Dictionary. Defining the criteria for lernmatization is another area where sign language dictionaries differ from written language dictionaries. The criteria should obviously include the manual expression of the signs, but a sign's manual expression has features from several categories (e.g. handshape, place of articulation and movement). Also non-manual elements such as mouth movement could be taken into consideration when defining the lemmatization criteria. As we defined the lemmatization criteria for the Danish Sign Language Dictionary we aimed for a solution that would result in relatively few homonyms, but that at the same time would not lead to very large polysemous entries. We also tried to define the criteria so that the resulting entries would reflect the lexicon of Danish Sign Language rather than resembling a Danish dictionary.
TitelE-lexicography in the 21st century: New challenges, new applications : proceedings of eLex 2009, Louvain-la Neuve, 22-24 october 2009
RedaktørerSylviane Granger, Magali Paquot
Antal sider8
ForlagPresses Universitaires de Louvain
ISBN (Trykt)978-2-87463-211-2
StatusUdgivet - 2010
BegivenhedeLEX : Electronic lexicography in the 21st century: New challenges, new applications - Centre for English Corpus Linguistics (CECL), Université Catholique de Louvian, Louvain-la-Neuve, Belgien
Varighed: 22 okt. 200924 okt. 2009
Konferencens nummer: 1


KonferenceeLEX : Electronic lexicography in the 21st century
LokationCentre for English Corpus Linguistics (CECL), Université Catholique de Louvian
NavnCahiers du Cental (Louvain-la-Neuve)


  • Medier, kommunikation og sprog
  • ODTS: Ordbog over Dansk Tegnsprog

    Troelsgård, T., Hårdell, A. K. S., Kristoffersen, J. H., Abildgaard, E., Pedersen, M. J. & Kjeldsen, K. K.

    01/08/03 → …

    Projekter: ProjektForskning