Morlex - a lexical database for French
Morlex is a lexical database containing over 33.000 entries,
for use in both basic research and NLP applications, such as
information retrieval, speech synthesis and speech recognition.
Each entry represents a lemma (i.e. a base form) and contains the following information:
- orthographical form
- word class (part-of-speech)
- several morphological properties (gender, number, class,...), depending on the word class
- information to derive/recognize inflected forms of a base form
- a representation of pronunciation. This is a so-called morpho-phonological representation. It is used
- to obtain the pronunciation of a word form (its phonological of phonetic form)
- for basic research on morphology.
|part-of-speech ||entries ||analysis ||generation ||oral form |
|adjective ||3200 ||+ ||- ||+ |
|noun ||20000 ||+ ||- ||+ |
|adj/noun ||2200 ||+ ||- ||+ |
|verb ||6773 ||+ ||+ ||- |
|adverb ||1300 ||+ ||- ||+ |
|preposition ||56 ||+ ||- ||- |
|others ||165 ||+ ||- ||- |
The lexical database is compiled into a form which can be used by a computer program.
Morlex software for analysis and generation of word forms
The Morlex database is accompanied by software for morphological analysis and generation.
- Analysis - Lemmatisation
This program recognizes inflected forms of verbs, nouns and adjectives,
and returns the corresponding lemma and morphological properties.
For morphologically ambiguous word forms, all interpretations are given.
marchons = marcher,verbe,indicatif,présent,1,plur
livres = livre,noun,masc,plur
livres = livre,noun,fem,plur
livres = livrer,verbe,indicatif,présent,2,sing
livres = livrer,verbe,subjonctif,présent,2,sing
Click here for an on-line demonstration of verb lemmatisation
This program generates inflected forms of verbs, starting from infinitive and
Click here for an on-line demonstration of verb conjugation.
- Rule-based approach, written in Prolog.
- The Morlex database and analysis software is integrated in our complete
analysis system, as a preliminary stage before syntactic analysis.
This system is used for part-of-speech tagging, among other things.
I thank the students who, over the last few years, directly or indirectly contributed to
the construction of Morlex:
Inge Hoorne, Filip Vercammen, Kevin Mouton, Kim Hermans.
Thanks to Michel Bagein, for providing list of errors.
Thanks to Eric Broeders, for the verbal morphology software.
This file is maintained by Piet Mertens
Last updated: Oct. 29, 1999