1. Akhil Reddy Bairi, Jawaharbabu Jeyaraman, Debabrata Das. Unified Pipelines for Multi-Dimensional LLM Optimization Through SFT, RLHF, and DPO. Journal of AI-Assisted Scientific Discovery. 2024;4(2):325-366. Accessed January 17, 2025. https://scienceacadpress.com/index.php/jaasd/article/view/285