Conference Paper

SUITE 2009: First International Workshop on Search-Driven Development - Users, Infrastructure, Tools and Evaluation

DOI: 10.1109/ICSE-COMPANION.2009.5071054 Conference: 31st International Conference on Software Engineering, ICSE 2009, May 16-24, 2009, Vancouver, Canada, Companion Volume
Source: DBLP

ABSTRACT SUITE is a new workshop series that specifically
focuses on exploring the notion of search as a
fundamental activity during software development.
The goal of the workshop is to bring researchers and
practitioners with special interest on search
technology for software developers together.
Participants will have broad range of expertise in
topics ranging from building software tools and
infrastructure, Information Retrieval, user studies
and Human-computer interaction, benchmarking and
evaluation. The first edition of SUITE is held in
conjunction with the 31st International Conference
in Software Engineering (May 16th, 2009. Vancouver,


Available from: Yunwen Ye, May 22, 2015
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: Free and open source software strongly promotes the reuse of source code. Some open source Java components/libraries are distributed as jar archives only containing the bytecode and some additional information. For whoever wanting to integrate this jar in her own project, it is important to determine the license(s) of the code from which the jar archive was produced, as this affects the way that such component can be used. This paper proposes an automatic approach to determine the license of jar archives, combining the use of a code-search engine with the automatic classification of licenses contained in textual flies enclosed in the jar. Results of an empirical study performed on 37 jars - from 17 different systems - indicate that this approach is able to successfully infer the jar licenses in over 95% of the cases, but that in many cases the license in textual flies may differ from the one of the classes contained in the jar.
    Mining Software Repositories (MSR), 2010 7th IEEE Working Conference on; 06/2010
  • [Show abstract] [Hide abstract]
    ABSTRACT: This paper sketches a research path that seeks to examine the search for suitable code problem, based on the observation that when code retargeting is included within a code search activity, developers can justify the suitability of these results upfront and thus reduce their searching efforts looking for suitable code. To support this observation, this paper introduces the Snippet Retargeting Approach, or simply SNIPR. SNIPR complements code search with code retargeting capabilities. These capabilities' intent is to help expedite the process of determining if a found example is a best fit. They do that by allowing developers to explore code modification ideas in place, without requiring to leave the search interface. With SNIPR, developers engage in a virtuous loop where they find code, retarget code, and select only code choices they can justify as suitable. This assures immediate feedback on retargeted examples and thus saves valuable time searching for appropriate code.
    Software Engineering (ICSE), 2013 35th International Conference on; 01/2013
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: This paper presents an analysis of a year long usage log of Koders, the first commercially available Internet-Scale code search engine ( The usage log comprises about ten million activities from more than three million users. Analysis of the usage data shows that despite of attracting a large number of visitors, Koders has a very sparse usage and that it lacks regular usage from many of its users. When compared to Web search, search behavior in Koders showed many similar patterns. A topic modeling analysis of the usage data shows what topics users of Koders are looking for. Observations on the prevalence of these topics among the users, and observations on how search and download activities vary across topics, lead to the conclusion that users who find code search engines usable are those who already know to a high level of specificity what to look for. This paper also presents a general categorization of these topics that provides insights on the different ways code search engine users express their queries. It identifies various forms of queries in Koders’s log and the kinds of results addressed by the queries. It also provides several suggestions for improvements in code search engines based on the analysis of usage, topics, and query forms. The work presented in this paper is the first of its kind that reveals several insights on the usage of an Internet-Scale code search engine. KeywordsCode search engine-Usage log analysis-Mining topics
    Empirical Software Engineering 08/2012; 17(4-5):1-43. DOI:10.1007/s10664-010-9144-6 · 1.64 Impact Factor