Figure - available via license: Creative Commons Attribution 4.0 International
Source publication
With the ever-increasing speed of the digitization process, a large collection of Ottoman documents is accessible to researchers and the general public. However, the majority of users interested in these documents cannot read them unless they are transcribed into the modern Turkish script, which uses an extended version of the Latin alphab...
Contexts in source publication
Context 1
... there is also the option of using n-gram word statistics in WBS, it is not used in this experiment. Table 2 presents the experiments evaluating the effects of corpus and language model on the WBS algorithm. Here, different corpora are used for generating the recognition lexicon. ...
Context 2
... those figures, we can say that the corpus size is not enough to provide useful n-gram statistics. In fact, in all of the WBS experiments in Table 2, the use of n-gram statistics, with or without forecasting, fails to improve the results over lexicon mode. A larger corpus could help integrate reliable n-gram statistics into the decoding process, particularly for agglutinative languages like Turkish, which are afflicted with the vocabulary explosion problem [51]. ...
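The sparsity argument in Context 2 can be illustrated with a minimal sketch. It builds a recognition lexicon and bigram counts from a tokenized corpus, then measures how many distinct bigrams occur only once. The function names and the toy corpus are hypothetical, chosen for illustration; this is not the WBS decoder and not the corpus used in the source publication.

```python
from collections import Counter


def build_lexicon(tokens):
    """Return the set of distinct word forms (a simple recognition lexicon)."""
    return set(tokens)


def bigram_counts(tokens):
    """Count adjacent word pairs in the token sequence."""
    return Counter(zip(tokens, tokens[1:]))


def singleton_ratio(counts):
    """Fraction of distinct n-grams that were observed exactly once."""
    if not counts:
        return 0.0
    singles = sum(1 for c in counts.values() if c == 1)
    return singles / len(counts)


# Hypothetical toy corpus: a few Turkish stems with different suffixes,
# mimicking how agglutination multiplies the number of surface forms.
corpus = "ev evde evden evler evlerde ev evde kitap kitaplar kitaplarda".split()

lexicon = build_lexicon(corpus)
bigrams = bigram_counts(corpus)

print(len(corpus), "tokens,", len(lexicon), "distinct word forms")
print("singleton bigram ratio:", singleton_ratio(bigrams))
```

When most distinct bigrams are seen only once, their estimated probabilities rest on a single observation each, which is one plausible reading of why n-gram statistics from a small corpus do not help over lexicon mode.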