Akhil Reddy Bairi, Jawaharbabu Jeyaraman, & Debabrata Das. (2024). Unified Pipelines for Multi-Dimensional LLM Optimization Through SFT, RLHF, and DPO. Journal of AI-Assisted Scientific Discovery, 4(2), 325-366. https://scienceacadpress.com/index.php/jaasd/article/view/285