We propose TraceRL, a trajectory-aware reinforcement learning (RL) method for diffusion language models (DLMs), which demonstrates the best performance among RL approaches for DLMs. We also introduce a ...
Abstract: Path planning is an important step in ensuring the safety of unmanned surface vehicle (USV) navigation and executing missions quickly and efficiently. However, current USV path planning ...