Zhu, Lin, Fan Guo, Guohui Cai, and Yumeng Ma. “Structured Preference Modeling for Reinforcement Learning-Based Fine-Tuning of Large Models.” Journal of Computer Technology and Software 4, no. 4 (April 30, 2025). Accessed May 14, 2025. https://ashpress.org/index.php/jcts/article/view/156.