# Selected Publications

## Selected
### Tackling Length Inflation Without Trade-offs: Group Relative Reward Rescaling for Reinforcement Learning.
- Authors: Zichao Li, Jie Lou, Fangchen Dong, Zhiyuan Fan, Mengjie Ren, Hongyu Lin, Xianpei Han, Debing Zhang, Le Sun, Yaojie Lu and Xing Yu.
- Venue: ICML 2026
- Year: 2026
- Links:
  - [arXiv](https://doi.org/10.48550/arXiv.2603.10535)

### Rethinking Reward Model Evaluation: Are We Barking up the Wrong Tree?
- Authors: Xueru Wen, Jie Lou, Yaojie Lu, Hongyu Lin, Xing Yu, Xinyu Lu, Ben He, Xianpei Han, Debing Zhang and Le Sun.
- Venue: ICLR 2025
- Year: 2025
- Links:
  - [Paper](https://openreview.net/pdf?id=Cnwz9jONi5)

### PPTAgent: Generating and Evaluating Presentations Beyond Text-to-Slides.
- Authors: Hao Zheng, Xinyan Guan, Hao Kong, Wenkai Zhang, Jia Zheng, Weixiang Zhou, Hongyu Lin, Yaojie Lu, Xianpei Han and Le Sun.
- Venue: EMNLP 2025
- Year: 2025
- Links:
  - [Paper](https://aclanthology.org/2025.emnlp-main.728.pdf)
  - [Code](https://github.com/icip-cas/PPTAgent)
  - [Stars](https://img.shields.io/github/stars/icip-cas/PPTAgent?style=social)

### SoFA: Shielded On-the-fly Alignment via Priority Rule Following.
- Authors: Xinyu Lu, Bowen Yu, Yaojie Lu, Hongyu Lin, Haiyang Yu, Le Sun, Xianpei Han and Yongbin Li.
- Venue: ACL Findings 2024
- Year: 2024
- Links:
  - [Paper](https://arxiv.org/pdf/2402.17358)

### Unified Structure Generation for Universal Information Extraction.
- Authors: Yaojie Lu, Qing Liu, Dai Dai, Xinyan Xiao, Hongyu Lin, Xianpei Han, Le Sun and Hua Wu.
- Venue: ACL 2022
- Year: 2022
- Links:
  - [Paper](https://aclanthology.org/2022.acl-long.395/)