Latest Trends출처: Netflix Tech조회수 48

Post-Training Generative Recommenders with Advantage-Weighted Supervised Finetuning

By Netflix Technology Blog

2025년 10월 26일

**Post-Training Generative Recommenders with Advantage-Weighted Supervised Finetuning**

Author: Keertana Chidambaram, Qiuling Xu, Ko-Jen Hsiao, Moumita Bhattacharya(*The work was done when Keertana interned at Netflix.)IntroductionThis blog focuses on post-training generative recommender systems. Generative recommenders (GRs) represent a new paradigm in the field of recommendation systems (e.g. These models draw inspiration from recent advancements in transformer architectures used for language and vision tasks. They approach the recommendation problem, including both ranking and retrieval, as a sequential transduction task. This perspective enables generative training, where the model learns by imitating the next event in a sequence of user activities, thereby effectively modeling user behavior over time.However, a key challenge with simply replicating observed user patterns is that it may not always lead to the best possible recommendations...

---

**[devsupporter 해설]**

이 기사는 Netflix Tech에서 제공하는 최신 개발 동향입니다. 관련 도구나 기술에 대해 더 알아보시려면 원본 링크를 참고하세요.

원본 보기

목록으로 돌아가기

Welcome back

Post-Training Generative Recommenders with Advantage-Weighted Supervised Finetuning

DevSupporter

Categories

게시글 정보