CodeScaler: Scaling Code LLM Training and Test-Time Inference via Execution-Free Reward Models
- 👤 Speaker: Zhijiang Guo (HKUST (GZ) | HKUST)
- 📅 Date & Time: Friday 17 April 2026, 12:00 - 13:00
- 📍 Venue: Online only. Google Meet link: https://meet.google.com/cru-hcuo-rhu
Abstract
In this talk, I will present CodeScaler, a novel framework designed to overcome the scalability bottlenecks of Reinforcement Learning from Verifiable Rewards (RLVR) in code generation. While traditional RLVR relies heavily on the availability of high-quality unit tests—which are often scarce or unreliable—CodeScaler introduces an execution-free reward model that scales both training and test-time inference. By leveraging carefully curated preference data, syntax-aware code extraction, and validity-preserving reward shaping, CodeScaler achieves significant performance gains, improving the Qwen3-8B-Base model by an average of +11.72 points across five benchmarks. Furthermore, CodeScaler functions as a highly efficient test-time scaling method, delivering performance comparable to execution-based approaches while reducing latency by 10$\times$. I will discuss how this approach enables robust optimization on synthetic datasets without the need for test cases and its broader implications for enhancing reasoning capabilities in general domains.
Series
This talk is part of the NLIP Seminar Series.