[1]
Akhil Reddy Bairi et al. 2024. Unified Pipelines for Multi-Dimensional LLM Optimization Through SFT, RLHF, and DPO. Journal of AI-Assisted Scientific Discovery. 4, 2 (Sep. 2024), 325–366.