AI Coding with CodeRL: Toward Mastering Program Synthesis with Deep Reinforcement Learning
TL;DR: CodeRL is a new framework for program synthesis through holistic integration of pretrained language models and deep reinforcement learning. By utilizing unit test feedback as part of model training and inference, and integrating with an improved CodeT5 model, CodeRL achieves state-of-the-art results on competition-level programming tasks. The following
19 Jul 2022 • Henry Hung Le • #reinforcement-learning