[1] Akhil Reddy Bairi, Jawaharbabu Jeyaraman, and Debabrata Das, “Unified Pipelines for Multi-Dimensional LLM Optimization Through SFT, RLHF, and DPO,” Journal of AI-Assisted Scientific Discovery, vol. 4, no. 2, pp. 325–366, Sep. 2024. Accessed: Jan. 17, 2025. [Online]. Available: https://scienceacadpress.com/index.php/jaasd/article/view/285