This project investigates token quality from a noisy-label perspective and propose a generic token cleaning pipeline for SFT tasks. Our method filters out uninformative tokens while preserving those ...
Abstract: Learning from data streams originating from non-stationary environments is vital for many real-world applications. A notable challenge in this task is concept drift. Most existing methods ...
Abstract: With the rise of e-commerce, personalized recommendation algorithms have received much attention in recent years. Meanwhile, multimodal recommendation algorithms have become the next ...