Data mining is a very popular subject, especially nowadays. It is the process of accessing information within large-scale data and extracting knowledge from it. In the literature, it is most commonly defined as the automatic or semi-automatic processing of large amounts of data to find meaningful patterns.
As Internet usage spreads, digital media is replacing traditional media, and the volume of digital content is increasing day by day. The amount of text in digital media is also very large. For this reason, text mining techniques are needed to analyze this text.
Text mining is a computer-based approach that makes text meaningful by automatically extracting, from unstructured text, information that is not readily apparent and would otherwise be regarded as insignificant.
Text mining is a new, interdisciplinary field that combines data mining, machine learning and statistics. Its commercial potential appears to be quite high, since most information is stored as text. At the same time, the largest source of data currently available to text mining is unstructured text on the Internet.
Data and text mining have become very popular in recent years, and studies in these fields are also being carried out in Turkey. This thesis includes economic research on the importance given to these fields in Turkey and in the world, their market sizes, the success of the pre-university education provided in this area, and the added value of the work done in these fields.
This study examines the information and communication technologies sector in the world, the European Union 2020 Innovation Indicators, the information and communication technologies sector in Turkey, a comparison of India and Turkey in the field of information and communication technologies, education in information and communication technologies, and qualified labor issues in communication technologies. The introduction discusses the traditional recommendation system and recommendation engine approaches applied as the volume of digital data on the Internet grows, along with machine learning and artificial intelligence, which are current topics in information technology.
The second chapter discusses the economic effects of data analysis. In this context, it examines the importance of information and communication technologies in the world and in Turkey, the importance of qualified labor and education in this area, and a comparison of India and Turkey.
The third chapter introduces the concept of data analysis, together with data mining, text mining, recommendation engines and ethics in data mining.
The fourth chapter presents a literature review and discusses in detail the application steps in text analysis and recommendation engines.
The fifth chapter describes the aim, structure and flow of the personalized recommendation engine developed for the application.
The sixth chapter explains the application, including the data collection, preprocessing and recommendation engine implementation carried out in this context.
In the conclusion, the thesis application is interpreted and evaluated, and final remarks are given.
The microeconomic effect reported in the conclusion is inferred from the extra advertising revenue generated by the increase in on-site interaction, which is attributed to the recommendation engine supported by Turkish text mining and built on the advertisement areas of "Gazetemsi", a Web content site.
As a result of this project, the time users spent on the site and the number of related content items they read both increased, yielding an additional gain of approximately TL 22,230 per month. The project has therefore achieved its development objective.
Based on these results, a domestic recommendation engine supported by Turkish text mining has been developed in the project as a national achievement. The recommendation engine project covered in this study was supported by TÜBİTAK under the program "TUBITAK-TEYDEB 1507-SME R&D Start-Up Support Program", with the project name "Personalized Recommendation Engine Supported by Text and Data Mining Approaches for Content Based Website".