State Reconstruction for Diffusion Policies (SRDP) improves out-of-distribution generalization in offline reinforcement learning by integrating an auxiliary state reconstruction objective into diffusion policies. This method demonstrates a 167% performance improvement over Diffusion-QL in maze navigation with missing data and achieves a Chamfer distance of 0.071 ± 0.02 against ground truth in real-world UR10 robot experiments.
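The abstract describes adding an auxiliary state reconstruction objective to a diffusion policy. The sketch below is a minimal, hypothetical illustration of that idea (not the authors' code): a shared state encoder feeds both a denoising head and a state decoder, and the training loss sums the diffusion denoising term with a weighted reconstruction term. All module names, the simplified noising schedule, and the loss weight are assumptions for illustration.

```python
# Illustrative sketch of a diffusion policy with an auxiliary state
# reconstruction loss. Architecture and noising schedule are simplified
# assumptions, not the SRDP implementation.
import torch
import torch.nn as nn

class SRDPSketch(nn.Module):
    def __init__(self, state_dim, action_dim, hidden=256):
        super().__init__()
        # Shared state encoder feeding both the denoiser and the decoder.
        self.encoder = nn.Sequential(nn.Linear(state_dim, hidden), nn.ReLU())
        # Denoiser predicts the noise added to the action at diffusion step t.
        self.denoiser = nn.Sequential(
            nn.Linear(hidden + action_dim + 1, hidden), nn.ReLU(),
            nn.Linear(hidden, action_dim))
        # Decoder reconstructs the input state from the shared representation.
        self.decoder = nn.Linear(hidden, state_dim)

    def forward(self, state, noisy_action, t):
        z = self.encoder(state)
        eps_pred = self.denoiser(torch.cat([z, noisy_action, t], dim=-1))
        state_recon = self.decoder(z)
        return eps_pred, state_recon

def srdp_loss(model, state, action, recon_weight=0.5):
    # Sample a diffusion step and corrupt the action with Gaussian noise
    # (a simplified linear interpolation stands in for the real schedule).
    t = torch.rand(action.size(0), 1)
    noise = torch.randn_like(action)
    noisy_action = (1 - t) * action + t * noise
    eps_pred, state_recon = model(state, noisy_action, t)
    diffusion_loss = ((eps_pred - noise) ** 2).mean()
    recon_loss = ((state_recon - state) ** 2).mean()
    return diffusion_loss + recon_weight * recon_loss
```

The reconstruction term encourages the shared representation to capture the full state, which is the mechanism the abstract credits for better behavior on out-of-distribution states.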
As humans learn new skills and apply their existing knowledge while maintaining previously learned information, "continual learning" in machine learning aims to incorporate new data while retaining and utilizing past knowledge. However, existing machine learning methods often do not mimic human learning, where tasks are intermixed due to individual preferences and environmental conditions. Humans typically switch between tasks instead of completely mastering one task before proceeding to the next. To explore how human-like task switching can enhance learning efficiency, we propose a multi-task learning architecture that alternates tasks based on task-agnostic measures such as "learning progress" and "neural computational energy expenditure". To evaluate the efficacy of our method, we run several systematic experiments using a set of effect-prediction tasks executed by a simulated manipulator robot. The experiments show that our approach surpasses random interleaved and sequential task learning in terms of average learning accuracy. Moreover, by including energy expenditure in the task-switching logic, our approach still performs favorably while reducing neural energy expenditure.
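To make the task-switching idea concrete, here is a minimal, hypothetical sketch (not the authors' implementation): each task keeps a window of recent prediction errors, "learning progress" is the drop in error between the older and newer halves of that window, and the next task maximizes progress minus a weighted energy estimate. Window size, the energy update rule, and the weight are illustrative assumptions.

```python
# Illustrative task-switching logic driven by learning progress with an
# optional penalty for estimated neural energy expenditure. All constants
# and update rules are assumptions for the sketch.
from collections import deque

class TaskSwitcher:
    def __init__(self, n_tasks, window=10, energy_weight=0.1):
        self.errors = [deque(maxlen=window) for _ in range(n_tasks)]
        self.energy = [0.0] * n_tasks          # running per-task energy estimate
        self.energy_weight = energy_weight

    def update(self, task_id, prediction_error, energy_cost):
        self.errors[task_id].append(prediction_error)
        self.energy[task_id] = 0.9 * self.energy[task_id] + 0.1 * energy_cost

    def learning_progress(self, task_id):
        errs = list(self.errors[task_id])
        if len(errs) < 2:
            return float("inf")                # unexplored tasks get priority
        half = len(errs) // 2
        # Progress = drop in mean error from the older half to the newer half.
        return sum(errs[:half]) / half - sum(errs[half:]) / (len(errs) - half)

    def next_task(self):
        # Pick the task with the best progress-per-energy trade-off.
        scores = [self.learning_progress(i) - self.energy_weight * self.energy[i]
                  for i in range(len(self.errors))]
        return max(range(len(scores)), key=scores.__getitem__)
```

Setting `energy_weight` to zero recovers a pure learning-progress scheduler, while a positive weight trades some progress for lower energy expenditure, mirroring the comparison reported in the abstract.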