Designing a Lexical Database for a Combined Use of Corpus Annotation and Dictionary Editing

Jette Hedegaard Kristoffersen, Thomas Troelsgård, Gabriele Langer, Reiner Konrad, Thomas Hanke, Susanne König

    Research output: Chapter in Book/Report/Conference proceedingConference contribution to proceedingpeer-review

    Abstract

    In a combined corpus-dictionary project, you would need one lexical database that could serve as a shared “backbone” for both
    corpus annotation and dictionary editing, but it is not that easy to define a database structure that applies satisfactorily to both these
    purposes. In this paper, we will exemplify the problem and present ideas on how to model structures in a lexical database that
    facilitate corpus annotation as well as dictionary editing. The paper is a joint work between the DGS Corpus Project and the DTS
    Dictionary Project. The two projects come from opposite sides of the spectrum (one adjusting a lexical database grown from
    dictionary making for corpus annotating, one building a lexical database in parallel with corpus annotation and editing a
    corpus-based dictionary), and we will consider requirements and feasible structures for a database that can serve both corpus and
    dictionary.
    Translated title of the contributionUdvikling af en leksikalsk database til brug for både korpusannotation og ordbogsredaktion
    Original languageEnglish
    Title of host publicationCorpus Mining : Proceedings of the 7th Workshop on the Representation and Processing of Sign Languages. 10th International Conference on Language Resources and Evaluation, LREC 2016, Portorõz, Slovenien.
    EditorsEleni Efthimiou, Evita Fotinea, Jette Hedegaard Kristofffersen, Johanna Mesch, Julie Hochgesang, Thomas Hanke
    Number of pages10
    Place of PublicationParis
    PublisherELRA
    Publication date2016
    Pages143-152
    Publication statusPublished - 2016
    EventLanguage Resources and Evaluation Conference, 23-28 May 2016: 7 th Workshop on the Representation and Processing of - Hotel Grand Bernadin, Portorož, Slovenia
    Duration: 23 May 201628 May 2016
    Conference number: 10
    http://lrec2016.lrec-conf.org/en/

    Conference

    ConferenceLanguage Resources and Evaluation Conference, 23-28 May 2016
    Number10
    LocationHotel Grand Bernadin
    CountrySlovenia
    CityPortorož
    Period23/05/1628/05/16
    Internet address
    SeriesLREC Proceedings from workshops on the Representation and Processing of Sign Languages
    Number7

    Keywords

    • Media, communication and languages

    Cite this