Making a dictionary without words: lemmatization problems in a sign language dictionary

Jette Hedegaard Kristoffersen, Thomas Troelsgård

Research output: Chapter in Book/Report/Conference proceedingConference contribution to proceedingResearchpeer-review

Abstract

This paper addresses some of the particular problems connected with lemma representation and lemmatization in a sign language dictionary. The paper is mainly based on the authors' work experience from the Danish Sign Language Dictionary project. In a sign language dictionary sign representation constitutes a problem. as there is - at least for Danish Sign Language - no conventional notation used by native signers and the various other sign user groups. We look into the different possibilities of representing signs and present the solution that we chose for the Danish Sign Language Dictionary. Defining the criteria for lernmatization is another area where sign language dictionaries differ from written language dictionaries. The criteria should obviously include the manual expression of the signs, but a sign's manual expression has features from several categories (e.g. handshape, place of articulation and movement). Also non-manual elements such as mouth movement could be taken into consideration when defining the lemmatization criteria. As we defined the lemmatization criteria for the Danish Sign Language Dictionary we aimed for a solution that would result in relatively few homonyms, but that at the same time would not lead to very large polysemous entries. We also tried to define the criteria so that the resulting entries would reflect the lexicon of Danish Sign Language rather than resembling a Danish dictionary.
Original languageEnglish
Title of host publicationE-lexicography in the 21st century: New challenges, new applications : proceedings of eLex 2009, Louvain-la Neuve, 22-24 october 2009
EditorsSylviane Granger, Magali Paquot
Number of pages8
Place of PublicationLouvain
PublisherPresses Universitaires de Louvain
Publication date2010
Pages165-172
ISBN (Print)978-2-87463-211-2
Publication statusPublished - 2010
EventeLEX : Electronic lexicography in the 21st century: New challenges, new applications - Centre for English Corpus Linguistics (CECL), Université Catholique de Louvian, Louvain-la-Neuve, Belgium
Duration: 22 Oct 200924 Oct 2009
Conference number: 1
http://www.uclouvain.be/en-cecl-elexicography.html

Conference

ConferenceeLEX : Electronic lexicography in the 21st century
Number1
LocationCentre for English Corpus Linguistics (CECL), Université Catholique de Louvian
Country/TerritoryBelgium
CityLouvain-la-Neuve
Period22/10/0924/10/09
Internet address
SeriesCahiers du Cental (Louvain-la-Neuve)

Keywords

  • Media, communication and languages

Cite this