Reinforcement Learning Python Code

I tried vibe coding an app as a beginner - here's what Cursor and Replit taught me

I tried four vibe-coding tools, including Cursor and Replit, with no coding background. Here's what worked (and what didn't).

GitHub

verl: Volcano Engine Reinforcement Learning for LLMs

verl is a flexible, efficient and production-ready RL training library for large language models (LLMs). verl is the open-source version of HybridFlow: A Flexible and Efficient RLHF Framework paper.

IEEE

Aligning Crowd-Sourced Human Feedback for Reinforcement Learning on Code Generation by Large Language Models

Abstract: This paper studies how AI-assisted programming and large language models (LLM) improve software developers' ability via AI tools (LLM agents) like Github Copilot and Amazon CodeWhisperer, ...

IEEE

Multi-Agent Evolutionary Reinforcement Learning Based on Cooperative Games

Abstract: Despite the significant advancements in single-agent evolutionary reinforcement learning, research exploring evolutionary reinforcement learning within multi-agent systems is still in its ...

Microsoft

Agent Lightning: Adding reinforcement learning to AI agents without code rewrites

AI agents are reshaping software development, from writing code to carrying out complex instructions. Yet LLM-based agents are prone to errors and often perform poorly on complicated, multi-step tasks ...

GitHub

RLinf: Reinforcement Learning Infrastructure for Embodied and Agentic AI

RLinf is a flexible and scalable open-source RL infrastructure designed for Embodied and Agentic AI. The 'inf' in RLinf stands for Infrastructure, highlighting its role as a robust backbone for ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results