Abstract: Preference-based reinforcement learning (PbRL) is a suitable approach for style adaptation of pre-trained robotic behavior: adapting the robot's policy to follow human user preferences while ...
Abstract: Cross-database Text-to-SQL tasks require models to perform both structural planning (Skeleton Parsing) and Schema Linking on unfamiliar database schemas. The high coupling of these two ...
Low-rank tensor completion has become a fundamental tool for recovering high-dimensional data from incomplete observations. However, conventional methods rely primarily on algebraic low-rank priors ...
├── core/agent_base.py # ReAct loop, tool registry, cost tracker ├── labs/exercise_04_*/ # 4 exercises (beginner → stretch) ├── infrastructure ...