
41 papers found
A Tutorial on Meta-Reinforcement Learning
Foundations and Trends® in Machine Learning202517 citations
SRT-H: A hierarchical framework for autonomous surgery via language-conditioned imitation learning
Science Robotics202535 citations
ALOHA 2: An Enhanced Low-Cost Hardware for Bimanual Teleoperation
arXiv (Cornell University)20247 citations
Agent Q: Advanced Reasoning and Learning for Autonomous AI Agents
arXiv (Cornell University)20245 citations
Surgical Robot Transformer (SRT): Imitation Learning for Surgical Tasks
arXiv (Cornell University)20246 citations
GENERALIZING SKILLS WITH SEMI-SUPERVISED REINFORCEMENT LEARNING
TIB Data Manager20246 citations
$π_0$: A Vision-Language-Action Flow Model for General Robot Control
arXiv (Cornell University)20246 citations
Mobile ALOHA: Learning Bimanual Mobile Manipulation with Low-Cost Whole-Body Teleoperation
arXiv (Cornell University)202428 citations
OpenVLA: An Open-Source Vision-Language-Action Model
arXiv (Cornell University)202435 citations
AutoRT: Embodied Foundation Models for Large Scale Orchestration of Robotic Agents
arXiv (Cornell University)202414 citations
Waypoint-Based Imitation Learning for Robotic Manipulation
arXiv (Cornell University)20238 citations
Fine-tuning Language Models for Factuality
arXiv (Cornell University)202310 citations
RT-2: Vision-Language-Action Models Transfer Web Knowledge to Robotic Control
arXiv (Cornell University)2023262 citations
Q-Transformer: Scalable Offline Reinforcement Learning via Autoregressive Q-Functions
arXiv (Cornell University)202316 citations
Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning
arXiv (Cornell University)202320 citations
Analyzing and Mitigating Object Hallucination in Large Vision-Language Models
arXiv (Cornell University)202329 citations
Just Ask for Calibration: Strategies for Eliciting Calibrated Confidence Scores from Language Models Fine-Tuned with Human Feedback
arXiv (Cornell University)20235 citations
Open-World Object Manipulation using Pre-trained Vision-Language Models
arXiv (Cornell University)202324 citations
RT-Trajectory: Robotic Task Generalization via Hindsight Trajectory Sketches
arXiv (Cornell University)20235 citations
RoboCLIP: One Demonstration is Enough to Learn Robot Policies
arXiv (Cornell University)20237 citations
Direct Preference Optimization: Your Language Model is Secretly a Reward Model
arXiv (Cornell University)2023268 citations
BridgeData V2: A Dataset for Robot Learning at Scale
arXiv (Cornell University)202312 citations