Zhu, Lin, et al. “Structured Preference Modeling for Reinforcement Learning-Based Fine-Tuning of Large Models”. Journal of Computer Technology and Software, vol. 4, no. 4, Apr. 2025, doi:10.5281/zenodo.15340770.