publications
Please see Google Scholar for more recent works and arXiv papers.
2026
-
Preprint Train Less, Learn More: Adaptive and Efficient Rollout Optimization for Group-Based Reinforcement Learning2026 -
arXiv -
ICLR EDIVAL-Agent: An Object-Centric Framework for Automated, Fine-Grained Evaluation of Multi-Turn EditingIn International Conference on Learning Representations 2026 -
ICLR Single Index Bandits: Generalized Linear Contextual Bandits with Unknown Reward FunctionsIn International Conference on Learning Representations 2026
2025
-
AISTATS Quantile Additive Trend FilteringIn International Conference on Artificial Intelligence and Statistics 2025 -
arXiv -
AISTATS Statistical Guarantees for Lifelong Reinforcement Learning Using PAC-Bayes TheoryIn International Conference on Artificial Intelligence and Statistics 2025 - arXiv
2024
- arXivDODT: Enhanced Online Decision Transformer Learning through Dreamer’s Actor-Critic Trajectory Forecasting2024
-
AISTATS Multivariate Time Series Forecasting by Graph Attention Networks with Theoretical GuaranteesIn International Conference on Artificial Intelligence and Statistics 2024