[1]
Zhu, L. et al. 2025. Structured Preference Modeling for Reinforcement Learning-Based Fine-Tuning of Large Models. Journal of Computer Technology and Software. 4, 4 (Apr. 2025). DOI:https://doi.org/10.5281/zenodo.15340770.