Yehuda Koren’s research while affiliated with Google Inc. and other places

What is this page?


This page lists works of an author who doesn't have a ResearchGate profile or hasn't added the works to their profile yet. It is automatically generated from public (personal) data to further our legitimate goal of comprehensive and accurate scientific recordkeeping. If you are this author and want this page removed, please let us know.

Publications (88)


Collaborative Filtering for Implicit Feedback Datasets
  • Conference Paper
  • Full-text available

December 2008

·

20,879 Reads

·

3,440 Citations

·

Yehuda Koren

·

Chris Volinsky

A common task of recommender systems is to improve customer experience through personalized recommenda- tions based on prior implicit feedback. These systems pas- sively track different sorts of user behavior, such as pur- chase history, watching habits and browsing activity, in or- der to model user preferences. Unlike the much more ex- tensively researched explicit feedback, we do not have any direct input from the users regarding their preferences. In particular, we lack substantial evidence on which products consumer dislike. In this work we identify unique proper- ties of implicit feedback datasets. We propose treating the data as indication of positive and negative preference asso- ciated with vastly varying confidence levels. This leads to a factor model which is especially tailored for implicit feed- back recommenders. We also suggest a scalable optimiza- tion procedure, which scales linearly with the data size. The algorithmis used successfully within a recommender system for television shows. It compares favorably with well tuned implementations of other known methods. In addition, we offer a novel way to give explanations to recommendations given by this factor model.

Download

The Binary Stress Model for Graph Drawing

September 2008

·

164 Reads

·

27 Citations

Lecture Notes in Computer Science

We introduce a new force-directed model for computing graph layout. The model bridges the two more popular force directed approaches – the stress and the electrical-spring models – through the binary stress cost function, which is a carefully defined energy function with low descriptive complexity allowing fast computation via a Barnes-Hut scheme. This allows us to overcome optimization pitfalls from which previous methods suffer. In addition, the binary stress model often offers a unique viewpoint to the graph, which can occasionally add useful insight to its topology. The model uniformly spreads the nodes within a circle. This helps in achieving an efficient utilization of the drawing area. Moreover, the ability to uniformly spread nodes regardless of topology, becomes particularly helpful for graphs with low connectivity, or even with multiple connected components, where there is not enough structure for defining a readable layout.


Factorization meets the neighborhood: A multifaceted collaborative filtering model

August 2008

·

2,572 Reads

·

4,250 Citations

Recommender systems provide users with personalized suggestions for products or services. These systems often rely on Collaborating Filtering (CF), where past transactions are analyzed in order to establish connections between users and products. The two more successful approaches to CF are latent factor models, which directly profile both users and products, and neighborhood models, which analyze similarities between products or users. In this work we introduce some innovations to both approaches. The factor and neighborhood models can now be smoothly merged, thereby building a more accurate combined model. Further accuracy improvements are achieved by extending the models to exploit both explicit and implicit feedback by the users. The methods are tested on the Netflix data. Results are better than those previously published on that dataset. In addition, we suggest a new evaluation metric, which highlights the differences among methods, based on their performance at a top-K recommendation task.



Direction-Aware Proximity on Graphs

January 2008

·

9 Reads

In many graph mining settings, measuring node proximity is a fundamental problem. While most of existing measurements are (implicitly or explicitly) designed for undirected graphs; edge directions in the graph provide a new perspective to proximity measurement: measuring the proximity from A to B; rather than between A and B. (See Figure 1 as an example). In this chapter, we study the role of edge direction in measuring proximity on graphs. To be specific, we will address the following fundamental research questions in the context of direction-aware proximity: 1. Problem definitions: How to define a directionaware proximity? 2. Computational issues: How to compute the proximity score efficiently? 3. Applications: How can direction-aware proximity benefit graph mining?



Figure 4: A family of graphs characterized by the number of s-t paths
Figure 5: A family of graphs with varying degree of a
Figure 7: Boxplots showing % delivered proximity as a function of α for the Phone data and the Netflix Data. The box marks the space between the 25th and 75th percentiles of the distribution.
Figure 7: Boxplots showing % delivered proximity as a function of α for the Phone data and the Netflix Data. The box marks the space between the 25th and 75th percentiles of the distribution. In Figure 8, we explore how large the subgraphs need to be to capture a meaningful percentage of proximity. Looking at the resulting subgraphs for 2000 pairs using α = 10, we plot graph size against the percent overall captured proximity in the bottom figure. On top we plot a histogram showing proximity graph size. One can see that the majority of the proximity graphs have 50 nodes or less,  
Figure 8: Plot showing size of proximity graphs and their relation to captured proximity. The top plot shows the distribution of graph sizes for our sample of 2000 pairs. The bottom figure plots graph size against the percent captured proximity, with a smoothing spline plotted through the data.  
Measuring and extracting proximity graphs in networks

December 2007

·

351 Reads

·

56 Citations

ACM Transactions on Knowledge Discovery from Data

Measuring distance or some other form of proximity between objects is a standard data mining tool. Connection subgraphs were recently proposed as a way to demonstrate proximity between nodes in networks. We propose a new way of measuring and extracting proximity in networks called "cycle- free effective conductance" (CFEC). Importantly, the measured proximity is accompanied with a proximity subgraph which allows assessing and understanding measured values. Our proximity calculation can handle more than two endpoints, directed edges, is statistically well behaved, and produces an effectiveness score for the computed subgraphs. We provide an efficient algorithm to measure and extract proximity. Also, we report experimental results and show examples for four large network datasets: a telecommunications calling graph, the IMDB actors graph, an academic coauthorship network, and a movie recommendation system.


Scalable Collaborative Filtering with Jointly Derived Neighborhood Interpolation Weights

November 2007

·

303 Reads

·

547 Citations

Recommender systems based on collaborative filtering predict user preferences for products or services by learning past user-item relationships. A predominant approach to collaborative filtering is neighborhood based ("k-nearest neighbors"), where a user-item preference rating is interpolated from ratings of similar items and/or users. We enhance the neighborhood-based approach leading to substantial improvement of prediction accuracy, without a meaningful increase in running time. First, we remove certain so-called "global effects" from the data to make the ratings more comparable, thereby improving interpolation accuracy. Second, we show how to simultaneously derive interpolation weights for all nearest neighbors, unlike previous approaches where each weight is computed separately. By globally solving a suitable optimization problem, this simultaneous interpolation accounts for the many interactions between neighbors leading to improved accuracy. Our method is very fast in practice, generating a prediction in about 0.2 milliseconds. Importantly, it does not require training many parameters or a lengthy preprocessing, making it very practical for large scale applications. Finally, we show how to apply these methods to the perceivably much slower user-oriented approach. To this end, we suggest a novel scheme for low dimensional embedding of the users. We evaluate these methods on the netflix dataset, where they deliver significantly better results than the commercial netflix cinematch recommender system.


Improved neighborhood-based collaborative filtering

September 2007

·

434 Reads

·

206 Citations

Recommender systems based on collaborative filtering predict user preferences for products or services by learning past user-item re-lationships. A predominant approach to collaborative filtering is neighborhood based ("k-nearest neighbors"), where a user-item pref-erence rating is interpolated from ratings of similar items and/or users. In this work, we enhance the neighborhood-based approach leading to a substantial improvement of prediction accuracy, with-out a meaningful increase in running time. First, we remove certain so-called "global effects" from the data to make the different ratings more comparable, thereby improving interpolation accuracy. Sec-ond, we show how to simultaneously derive interpolation weights for all nearest neighbors. Unlike previous approaches where each interpolation weight is computed separately, simultaneous interpo-lation accounts for the many interactions between neighbors by globally solving a suitable optimization problem, also leading to improved accuracy. Our method is very fast in practice, generat-ing a prediction in about 0.2 milliseconds. Importantly, it does not require training many parameters or a lengthy preprocessing, mak-ing it very practical for large scale applications. The method was evaluated on the Netflix dataset. We could process the 2.8 million queries of the Qualifying set in 10 minutes yielding a RMSE of 0.9086. Moreover, when an extensive training is allowed, such as SVD-factorization at the preprocessing stage, our method can pro-duce results with a RMSE of 0.8982.


Fast Direction-Aware Proximity for Graph Mining

August 2007

·

29 Reads

·

94 Citations

In this paper we study asymmetric proximity measures on directed graphs, which quantify the relationships between two nodes or two groups of nodes. The measures are useful in several graph min- ing tasks, including clustering, link prediction and connection sub- graph discovery. Our proximity measure is based on the concept of escape probability. This way, we strive to summarize the mul- tiple facets of nodes-proximity, while avoiding some of the pit- falls to which alternative proximity measures are susceptible. A unique feature of the measures is accounting for the underlying directional information. We put a special emphasis on computa- tional efficiency, and develop fast solutions that are applicable in several settings. Our experimental study shows the usefulness of our proposed direction-aware proximity method for several appli- cations, and that our algorithms achieve a significant speedup (up to 50,000x) over straightforward implementations.


Citations (79)


... Collaborative filtering is the most common and widely used method for generating recommendations in music streaming services [22]. This algorithm relies on a set of songs that users preferred in the past to predict which song they would like to listen to. ...

Reference:

Content-Based Filtering Technique using Clustering Method for Music Recommender Systems
Advances in Collaborative Filtering
  • Citing Chapter
  • November 2021

... In this paper, we focus on whole-data models with weighted square loss. Weighted Matrix Factorization (WMF), also called iALS [14,20], pioneered this class of models and is still known to achieve competitive results while having highly scalable learning and prediction routines [22]. After its introduction, many extensions were proposed, among which three variants for context-aware recommender systems (CARS) [5,10,11], where each variant uses a different tensor decomposition method. ...

Revisiting the Performance of iALS on Item Recommendation Benchmarks
  • Citing Conference Paper
  • September 2022

... To evaluate our method's effectiveness, we carefully select and compare our proposed model with the following representative non-review-based and RRSs models, as well as the two most recent sentiment debiasing methods, Debias (Lin et al. 2021) and CISD (He et al. 2022). Non-review methods include MF (Koren, Bell, and Volinsky 2009), NeuMF (He et al. 2017b), which are extensively used as baselines in previous works. RRSs models include DeepCoNN (Zheng, Noroozi, and Yu 2017), NARRE (Chen et al. 2018), and MPCN (Tay, Luu, and Hui 2018 Evaluation Metrics. ...

Matrix factorization techniques for recommender systems
  • Citing Article
  • August 2009

Computer

... The authors [13] proposed an approach that might fit the identification of library migrations by analyzing large datasets related to software developmentspecifically, code change histories. This approach searches for migration process patterns and then filters them based on their frequency or associated code changes. ...

Factor in the neighbors
  • Citing Article
  • January 2010

ACM Transactions on Knowledge Discovery from Data

... In [4], semantic change between consecutive queries and the relationship between the changed query and the clicked document is used to infer query context. In addition, query clustering [3], geographical location [15], and association rules [1] are some of the methods used by researchers for better information retrieval. However, we argued that these context extraction methods are confined by the capacity of their employed representation, which is hardly generalizable and not optimal for retrieval tasks. ...

Expediting search trend detection via prediction of query counts
  • Citing Conference Paper
  • February 2013

... We quantitatively evaluate our model in the context of two large datasets containing both numerical and text reviews; the Amazon Review dataset [17] and the Yelp dataset [25]. To avoid the problems frequently highlighted with RMSE-based evaluation [12], we follow the approach of Koren and Sill [31]. 2 The evaluation highlights that our proposed KNN model beats strong baselines for both memory-based and model-based systems. The result is that our model provides both explainability benefits, inherited from memory-based methods, enhanced by now enabling textual-review snippets to be used, as well as competitive performance. ...

Collaborative filtering on ordinal user feedback
  • Citing Conference Paper
  • August 2013

... These methods emphasize the importance of learning item-to-item semantics rather than user-to-item predictions. For example, [14] proposed learning item representations from implicit feedback in a Euclidean space. The I2V model [15] is a popular method for learning static item representations based on CF item cooccurrences [15]. ...

Towards scalable and accurate item-oriented recommendations
  • Citing Conference Paper
  • October 2013

... It has been shown that in collaborative filtering problems, much of the signal lies in simple popularity biases [71]. For example, the winning model in the Netflix Prize competition [10] managed to explain 42.6% of the ratings' variance i.e., R 2 = 42.6%, but the vast majority of the learned signal was attributed to popularity biases which explained a whopping R 2 = 32.5% of the variance (without any personalization) [72]. ...

Web-Scale Media Recommendation Systems
  • Citing Article
  • September 2012

Proceedings of the IEEE

... They streamline access to relevant information by identifying resources aligned with user interests based on historical experiences, aiming to save users time and costs. Originating in ecommerce to combat information overload in the Web 2.0 era, recommender systems quickly expanded into e-learning [2], tourism [27], smart cities [5], music [3], research resources, and television programs. In modern times, platforms like Amazon.com, ...

Recommender Systems Handbook
  • Citing Book
  • October 2010