Conference Paper

Improving folksonomies quality by syntactic tag variations grouping.

DOI: 10.1145/1529282.1529559 Conference: Proceedings of the 2009 ACM Symposium on Applied Computing (SAC), Honolulu, Hawaii, USA, March 9-12, 2009
Source: DBLP

ABSTRACT Folksonomies offer an easy method to organize information in the current Web. This fact and their collaborative features have derived in an extensive involvement in many Social Web projects. However they present important drawbacks regarding their limited exploring and searching capabilities, in contrast with other methods as taxonomies, thesauruses and ontologies. One of these drawbacks is an effect of its flexibility for tagging, producing frequently multiple variations of a same tag. In this paper we propose a method to group syntactic variations of tags using pattern matching techniques. We propose the utilization of a fuzzy similarity measure and we conclude that this technique offers better results than other classic techniques after comparing them on a large real dataset.

Full-text

Available from: Francisco Echarte, Jun 12, 2015
0 Followers
 · 
111 Views
  • [Show abstract] [Hide abstract]
    ABSTRACT: Social tagging systems have become a popular system to organize information in many web 2.0 sites. They are also being rapidly adopted in enterprises to enhance information sharing, knowledge sharing and emerged as a novel categorization scheme based on the collective knowledge of people. Scalability is an issue of the categorization of the resources of social tagging systems. Scalability has highlighted a critical trade-off between accuracy and complexity. As social tagging systems evolve over time, resource categories can appear or disappear either by grouping new resources or disaggregating existing ones, and this implies the re-assignation of the resources involved to others categories. This makes the methods and/or algorithms that categorize resources of social tagging systems to be nonscalable, and then not efficiently implementable on real social tagging systems. This paper presents a simple method for categorizing resources on social tagging systems which is self-adaptive, scalable and implementable in any real social tagging system.
    Expert Systems with Applications 07/2013; 40:3696–3714. DOI:10.1016/j.eswa.2012.12.075 · 1.97 Impact Factor
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: Folksonomies are a widely used tool of collaboratively creating and managing tags to annotate and categorize Internet resources (Web 2.0). The process of annotation and tag management by users of social networks is extremely easy and simple; however, it involves serious problems of navigation and search unlike what happens with taxonomies, thesauri and ontologies. The use of fuzzy similarity measures allows the correct identification of syntactic variations when tag lengths are greater or equal than five symbols, been inadequate for smaller length tags. This article presents a method that combines both fuzzy similarity and cosine measures in order to provide a proper classification of tags even with smaller tag lengths. This method allows the proper classification of the 95% of the syntactic variations of tags analyzed in the experiments.
    IEEE Latin America Transactions 03/2010; 8(1):88-93. DOI:10.1109/TLA.2010.5453951 · 0.19 Impact Factor
  • [Show abstract] [Hide abstract]
    ABSTRACT: Folksonomies have emerged as a common way of annotating and categorizing content using a set of tags that are created and managed in a collaborative way. Tags carry the semantic information within a folksonomy, and provide thus the link to ontologies. The appeal of folksonomies comes from the fact that they require a low effort for creation and maintenance since they are community-generated. However they present important drawbacks regarding their limited navigation and searching capabilities, in contrast with other methods as taxonomies, thesauruses and ontologies. One of these drawbacks is an effect of its flexibility for tagging, producing frequently multiple syntactic variations of a same tag. Similarity measures allow the correct identification of tag variations when tag lengths are greater than five symbols. In this paper we propose the use of cosine relatedness measures in order to cluster tags with lengths lower or equal than five symbols. We build a discriminator based on the combination of a fuzzy similarity and a cosine measures and we analyze the results obtained.
    Web Engineering, 9th International Conference, ICWE 2009, San Sebastián, Spain, June 24-26, 2009, Proceedings; 01/2009