1997 Acura RL - Search News

RL-OPC: Mask Optimization With Deep Reinforcement Learning

Abstract: Mask optimization is a vital step in the VLSI manufacturing process in advanced technology nodes. As one of the most representative techniques, optical proximity correction (OPC) is widely ...

GitHub

Revolutionizing Reinforcement Learning Framework for Diffusion Large Language Models

We propose TraceRL, a trajectory-aware reinforcement learning method for diffusion language models, which demonstrates the best performance among RL approaches for DLMs. We also introduce a ...

GitHub

PRIME-RL: Async RL Training at Scale

PRIME-RL is a framework for large-scale asynchronous reinforcement learning. It is designed to be easy-to-use and hackable, yet capable of scaling to 1000+ GPUs. Beyond that, here is why we think you ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results

RL-OPC: Mask Optimization With Deep Reinforcement Learning

Revolutionizing Reinforcement Learning Framework for Diffusion Large Language Models

PRIME-RL: Async RL Training at Scale

Trending now