Official implementation of paper "AutoVLA: A Vision-Language-Action Model for End-to-End Autonomous Driving with Adaptive Reasoning and Reinforcement Fine-Tuning"
-
Updated
Jun 18, 2025
Official implementation of paper "AutoVLA: A Vision-Language-Action Model for End-to-End Autonomous Driving with Adaptive Reasoning and Reinforcement Fine-Tuning"
Official code for the paper, "Stop Summation: Min-Form Credit Assignment Is All Process Reward Model Needs for Reasoning"
Space Group Informed Transformer for Crystalline Materials Generation
MARFT stands for Multi-Agent Reinforcement Fine-Tuning. This repository implements an LLM-based multi-agent reinforcement fine-tuning framework for general agentic tasks, providing a foundational MARFT framework.
Reinforcement Fine-tuning LLMs with GRPO | Deeplearning.ai
Add a description, image, and links to the reinforcement-finetuning topic page so that developers can more easily learn about it.
To associate your repository with the reinforcement-finetuning topic, visit your repo's landing page and select "manage topics."