Researchers from Zhejiang University and Intelligence Indeed present GUI-Robust, a dataset of 5,318 GUI tasks including 200 real-world anomalies, designed to evaluate the robustness of GUI agents. Experiments reveal that state-of-the-art GUI-specific agents and general-purpose MLLMs experience substantial performance degradation in abnormal scenarios, highlighting a critical gap in real-world deployability.
View blogResearchers from Zhejiang University developed DR-GNN, a GNN-based recommendation system that enhances robustness against various distribution shifts by integrating Distributionally Robust Optimization. The method reinterprets GNN aggregation as a graph smoothing regularizer and introduces a Graph Edge-Addition strategy to mitigate data sparsity, demonstrating consistent performance improvements over state-of-the-art baselines across multiple datasets and shift types.
View blogResearchers from Zhejiang University and OPPO Research Institute introduce LLaCTR, a lightweight method for enhancing Click-Through Rate (CTR) prediction by leveraging Large Language Models at the field level rather than instance level, achieving 2.24% AUC improvement across four real-world datasets while reducing computational overhead by 10-100x compared to existing LLM-enhanced approaches.
View blogZhejiang University and OPPO Research Institute researchers introduce SOFT (Self-Optimized Fine-Tuning), a training framework that improves LLM-based recommender systems by combining supervised fine-tuning with self-distilled auxiliary data generation and a self-adaptive curriculum scheduler that dynamically balances training on simplified self-generated examples versus real recommendation data based on semantic distance between model outputs and target items, achieving superior performance across multiple datasets compared to traditional fine-tuning approaches, LLM-enhanced recommenders, and classical recommendation models while demonstrating that curriculum learning principles can effectively bridge the knowledge gap between general LLM capabilities and domain-specific recommendation requirements.
View blog