As one envisions a document model where language, physical location and medium - electronic, paper or other - impose no barrier
to effective use, natural language processing will play an increasing role, especially in the context of digital libraries.
This paper presents language components based mostly on finite-state technology that improve our capabilities for exploring,
enriching and interacting in various ways with documents. These components range from morphological analysis to part-of-speech
tagging, NP extraction and shallow parsing.
We then focus on a series of ongoing projects that illustrate how this technology is already impacting the building and
sharing of knowledge through digital libraries.