Arabic is a large used language across world, but the lack of linguistics tools and resource make it still under-resourced language which influences on development and research.
Many studies and research are done on Arabic for academic and experimental, but didn’t really adapted for development process and end-user usage and can’t be integrated in existing systems.
The problem under investigation is to develop tools and resources which must be open source, multipurpose, usable by researchers, developers and end-users.
Our solution named Adawat has many applications, API and corpora like: Light stemmer, verb conjugator, morphology analyzer, Spell checker, Text to speech system, Mishkal diacrtizer, vocalized texts corpus, synonyms dictionary, collocations etc… We use mainly rule based approach to build rules and data.
These tools are developed to be integrated with existing systems like Hunspell spell checker used by millions users under Firefox and LibreOffice, and eSpeak text to speech. The availability of our tools and resources give high impact on new researches, which use mainly Tashkeela corpus, and Tashaphyne stemmer.
Figures - uploaded by
Taha ZerroukiAuthor contentAll figure content in this area was uploaded by Taha Zerrouki
Content may be subject to copyright.