Sololearn Learn to Code

Post-Completion Learning for Language Models

Our code is based on open-r1, with our customized Trainer for mixed SFT+GRPO training. Some other updates focus on the white-box RL (reward function design) and post-completion training (replacement ...

IEEE

Aligning Crowd-Sourced Human Feedback for Reinforcement Learning on Code Generation by Large Language Models

Abstract: This paper studies how AI-assisted programming and large language models (LLM) improve software developers' ability via AI tools (LLM agents) like Github Copilot and Amazon CodeWhisperer, ...

IEEE

“Paper, Meet Code”: A Deep Learning Approach to Linking Scholarly Articles With GitHub Repositories

Abstract: Computer scientists often publish their source code accompanying their publications, prominently using code repositories across various domains. Despite the concurrent existence of scholarly ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results

Post-Completion Learning for Language Models

Aligning Crowd-Sourced Human Feedback for Reinforcement Learning on Code Generation by Large Language Models

“Paper, Meet Code”: A Deep Learning Approach to Linking Scholarly Articles With GitHub Repositories

Trending now