Zhu, Lin, Fan Guo, Guohui Cai, and Yumeng Ma. 2025. “Structured Preference Modeling for Reinforcement Learning-Based Fine-Tuning of Large Models.” Journal of Computer Technology and Software 4 (4). https://doi.org/10.5281/zenodo.15340770.