Keping Yang’s research while affiliated with Alibaba Group and other places

What is this page?


This page lists works of an author who doesn't have a ResearchGate profile or hasn't added the works to their profile yet. It is automatically generated from public (personal) data to further our legitimate goal of comprehensive and accurate scientific recordkeeping. If you are this author and want this page removed, please let us know.

Publications (33)


ReprBERT: Distilling BERT to an Efficient Representation-Based Relevance Model for E-Commerce
  • Conference Paper

August 2022

·

261 Reads

·

32 Citations

·

Jiwei Tan

·

Xi Chen

·

[...]

·

Keping Yang


Personalized Knowledge-Aware Recommendation with Collaborative and Attentive Graph Convolutional Networks

March 2022

·

46 Reads

·

40 Citations

Pattern Recognition

Knowledge graphs (KGs) are increasingly used to solve the data sparsity and cold start problems of collaborative filtering. Recently, graph neural networks (GNNs) have been applied to build KG-based recommender systems and achieved competitive performance. However, existing GNN-based methods are either limited in their ability to capture fine-grained semantics in a KG, or insufficient in effectively modeling user-item interactions. To address these issues, we propose a novel framework with collaborative and attentive graph convolutional networks for personalized knowledge-aware recommendation. Particularly, we model the user-item graph and the KG separately and simultaneously with an efficient graph convolutional network and a personalized knowledge graph attention network, where the former aims to extract informative collaborative signals, while the latter is designed to capture fine-grained semantics. Collectively, they are able to learn meaningful node representations for predicting user-item interactions. Extensive experiments on benchmark datasets demonstrate the effectiveness of the proposed method compared with state-of-the-arts.


Figure 2: The framework of our proposed SBG model.
Dataset Statistics.
The improvement percentages of NDCG@10 over ZAM by DREM (the best baseline) and our SBG.
Modeling User Behavior with Graph Convolution for Personalized Product Search
  • Preprint
  • File available

February 2022

·

90 Reads

User preference modeling is a vital yet challenging problem in personalized product search. In recent years, latent space based methods have achieved state-of-the-art performance by jointly learning semantic representations of products, users, and text tokens. However, existing methods are limited in their ability to model user preferences. They typically represent users by the products they visited in a short span of time using attentive models and lack the ability to exploit relational information such as user-product interactions or item co-occurrence relations. In this work, we propose to address the limitations of prior arts by exploring local and global user behavior patterns on a user successive behavior graph, which is constructed by utilizing short-term actions of all users. To capture implicit user preference signals and collaborative patterns, we use an efficient jumping graph convolution to explore high-order relations to enrich product representations for user preference modeling. Our approach can be seamlessly integrated with existing latent space based methods and be potentially applied in any product retrieval method that uses purchase history to model user preferences. Extensive experiments on eight Amazon benchmarks demonstrate the effectiveness and potential of our approach. The source code is available at \url{https://github.com/floatSDSDS/SBG}.

Download



Embedding-based Product Retrieval in Taobao Search

June 2021

·

84 Reads

Nowadays, the product search service of e-commerce platforms has become a vital shopping channel in people's life. The retrieval phase of products determines the search system's quality and gradually attracts researchers' attention. Retrieving the most relevant products from a large-scale corpus while preserving personalized user characteristics remains an open question. Recent approaches in this domain have mainly focused on embedding-based retrieval (EBR) systems. However, after a long period of practice on Taobao, we find that the performance of the EBR system is dramatically degraded due to its: (1) low relevance with a given query and (2) discrepancy between the training and inference phases. Therefore, we propose a novel and practical embedding-based product retrieval model, named Multi-Grained Deep Semantic Product Retrieval (MGDSPR). Specifically, we first identify the inconsistency between the training and inference stages, and then use the softmax cross-entropy loss as the training objective, which achieves better performance and faster convergence. Two efficient methods are further proposed to improve retrieval relevance, including smoothing noisy training data and generating relevance-improving hard negative samples without requiring extra knowledge and training procedures. We evaluate MGDSPR on Taobao Product Search with significant metrics gains observed in offline experiments and online A/B tests. MGDSPR has been successfully deployed to the existing multi-channel retrieval system in Taobao Search. We also introduce the online deployment scheme and share practical lessons of our retrieval system to contribute to the community.


Figure 1: The overview of the product search system in Taobao. The head of each circle denotes different phases. The bottom is the corresponding number of candidate sets.
Figure 2: The general architecture of the proposed Multi-Grained Deep Semantic Product Retrieval (MGDSPR).
Figure 3: The overview of Taobao search engine.
Figure 4: Deployment system of our MGDSPR model.
Embedding based Product Retrieval in Taobao Search

May 2021

·

2,414 Reads

·

95 Citations

Nowadays, the product search service of e-commerce platforms has become a vital shopping channel in people's life. The retrieval phase of the product determines the search system's quality and gradually attracts researchers' attention. Retrieving the most relevant products from the large-scale corpus while preserving personalized user characteristics remains an open question. Recent approaches in this domain have mainly focused on embedding-based retrieval (EBR) systems. However, after a long period of practice on Taobao, we found that EBR system performance is dramatically degraded due to its: (1) low relevance with a given query and (2) discrepancy between the training and inference phase. To this end, we propose a novel and practical embedded product retrieval model, named Multi-Grained Deep Semantic Product Retrieval (MGDSPR). Specifically, we first identify the inconsistency between the training and inference, and use the softmax function as the training objective, achieving better performance and faster convergence. Two efficient methods are further proposed to promote the model relevance, including smoothing noisy training data and generating relevance hard negative samples without requiring extra knowledge and training procedures. We evaluated MGDSPR on Taobao Product Search with significant metrics gains observed in offline experiments and online A/B test. MGDSPR has been successfully deployed to the existing multi-channel retrieval system in Taobao Search. We also introduce the online deployment scheme and share practical lessons of our retrieval system to contribute to the community.


AutoDebias: Learning to Debias for Recommendation

May 2021

·

70 Reads

Recommender systems rely on user behavior data like ratings and clicks to build personalization model. However, the collected data is observational rather than experimental, causing various biases in the data which significantly affect the learned model. Most existing work for recommendation debiasing, such as the inverse propensity scoring and imputation approaches, focuses on one or two specific biases, lacking the universal capacity that can account for mixed or even unknown biases in the data. Towards this research gap, we first analyze the origin of biases from the perspective of \textit{risk discrepancy} that represents the difference between the expectation empirical risk and the true risk. Remarkably, we derive a general learning framework that well summarizes most existing debiasing strategies by specifying some parameters of the general framework. This provides a valuable opportunity to develop a universal solution for debiasing, e.g., by learning the debiasing parameters from data. However, the training data lacks important signal of how the data is biased and what the unbiased data looks like. To move this idea forward, we propose \textit{AotoDebias} that leverages another (small) set of uniform data to optimize the debiasing parameters by solving the bi-level optimization problem with meta-learning. Through theoretical analyses, we derive the generalization bound for AutoDebias and prove its ability to acquire the appropriate debiasing strategy. Extensive experiments on two real datasets and a simulated dataset demonstrated effectiveness of AutoDebias. The code is available at \url{https://github.com/DongHande/AutoDebias}.



Citations (19)


... Knowledge distillation [6] bridges this gap by transferring LLM capabilities to lighter, deployable models. Knowledge distillation methods can be mainly categorized into three types: response-based [4,7,15,19,38,39], representation-based [13,36], and relation-based methods [14]. Specifically, Hofstätter et al. [7] introduced a margin focused loss (Margin-MSE) to adapt knowledge [25] introduced a novel instruction distillation method to improve the efficiency of LLMs by converting complex pairwise ranking into more efficient pointwise ranking. ...

Reference:

Knowledge Distillation for Enhancing Walmart E-commerce Search Relevance Using Large Language Models
ReprBERT: Distilling BERT to an Efficient Representation-Based Relevance Model for E-Commerce
  • Citing Conference Paper
  • August 2022

... • Amazon Review. It was first introduced by Van Gysel et al. [48,49] and has become a benchmark dataset for evaluating product recommendation methods [14,34,46]. We select the Electronics subset which products a sufficient number of user reviews and includes comprehensive metadata, such as product titles and categories. ...

Modeling User Behavior with Graph Convolution for Personalized Product Search
  • Citing Conference Paper
  • April 2022

... The advent of personalized knowledgeaware recommender systems marked a significant advance in artificial intelligence. Such systems, as described in the study "Personalized Knowledge-Aware Recommendation with Collaborative and Attentive Graph Convolutional Networks [COAT] 9 ," have successfully combined user-item graphs and knowledge graphs for optimized recommendations. They employ graph convolutional networks (GCN) to effectively modulate collaborative signals for user and item embeddings from the user-item graph. ...

Personalized Knowledge-Aware Recommendation with Collaborative and Attentive Graph Convolutional Networks
  • Citing Article
  • March 2022

Pattern Recognition

... We briefly introduce the our experimental environment for the readers' interest. For text matching, we have already deployed query/product tagging matching methods based on manual efforts and knowledge graphs [27] and the query rewriting technique CLE-QR [22]. For personal retrieval, we have graph-based methods [7,13,33] and an embedding-based method [23] online. ...

AliCoCo2: Commonsense Knowledge Extraction, Representation and Application in E-commerce

... Consequently, the learned user representations might not accurately reflect true preferences, leading to suboptimal recommendations. Although some researchers have explored debiasing in RSs (Saito et al. 2020;Li et al. 2022;Chen et al. 2021;Wang et al. 2023), they often focus solely on fixed latent embeddings, overlooking the data's inherent probabilistic nature, which can weaken model robustness and generalization. CH2-posterior collapse: This phenomenon occurs when the learned latent representation ignores the input data and relies ...

AutoDebias: Learning to Debias for Recommendation
  • Citing Conference Paper
  • July 2021

... During model weight updating, each supervised dataset dedicated to unique domains and assigned tasks receives specific updates within the fine-tuning stage. E-commerce applications provide the data source featuring query-product pairs, which receive click-through behavior information and sales conversion labels (Yao et al., 2021). The model obtains specialized knowledge through fine-tuning to grasp specialized terminology and business categories and pursue user goals that would otherwise be hidden in its broad corpus. ...

Learning a Product Relevance Model from Click-Through Data in E-Commerce
  • Citing Conference Paper
  • April 2021

... Huang et al. (2020) designed the Facebook search system with three stages (indexing, retrieval and ranking) under a unified embedding based framework. Six different stages including retrieval, pre-ranking, relevance ranking, ranking, re-ranking, mix-ranking were deployed to build a search system as described in Li et al. (2021). ...

Embedding based Product Retrieval in Taobao Search

... Query category prediction has received significant attention in related works [6,9,[17][18][19]: the proposed approaches target various challenges associated with Q2PT prediction, such as short queries [6,9], long-tail queries [17,18], product type hierarchy [19], etc. However, to the best of our knowledge, there has not yet been studies, which target the issues of Q2PT classification in the multilocale setting. ...

Modeling Across-Context Attention For Long-Tail Query Classification in E-commerce
  • Citing Conference Paper
  • March 2021

... It is worth noting that techniques from many emerging fields like graph learning (Chen et al. , 2023bWu et al. 2024;Wu, Zhang, and Fan 2023), have prompted the fast development of multi-view learning methods. By leveraging the ability to construct a comprehensive understanding of objects, multi-view learning provides versatile solutions for complex data analysis tasks in different domains, such as bioinformatics Luo et al. 2018;Thammasorn et al. 2021;Yu et al. 2024), social network analysis (Lan et al. 2017;Chen et al. 2023c;Ma et al. 2017;Chen et al. 2024b), recommender systems (Zou et al. 2022;Wang et al. 2020;Cui et al. 2018). However, conventional multi-view learning methods presuppose that all views are constantly available. ...

M2GRL: A Multi-task Multi-view Graph Representation Learning Framework for Web-scale Recommender Systems
  • Citing Conference Paper
  • August 2020

... While MMOE attempts to solve the issue of negative transfer, Xi et al. proposed AITM (Xi et al. 2021), which uses an attention mechanism to transfer information from clicks to conversions. Several studies (Ma et al. 2018b;Zhang et al. 2022;Wen et al. 2021Wen et al. , 2020 introduce special auxiliary tasks to transfer new information to the target tasks. ...

Entire Space Multi-Task Modeling via Post-Click Behavior Decomposition for Conversion Rate Prediction
  • Citing Conference Paper
  • July 2020