Vol. 2 No. 1 (2022): Journal of AI-Assisted Scientific Discovery
Articles

Transformer-based Language Models - Architectures and Applications: Analyzing transformer-based language models such as BERT, GPT, and T5, and their applications in NLP tasks such as text generation and classification

Dr. Maria Fox
Professor of Computer Science, King's College London (UK)
Cover

Published 30-06-2022

Keywords

  • Transformer-based language models,
  • BERT,
  • GPT

How to Cite

[1]
Dr. Maria Fox, “Transformer-based Language Models - Architectures and Applications: Analyzing transformer-based language models such as BERT, GPT, and T5, and their applications in NLP tasks such as text generation and classification”, Journal of AI-Assisted Scientific Discovery, vol. 2, no. 1, pp. 150–164, Jun. 2022, Accessed: Sep. 18, 2024. [Online]. Available: https://scienceacadpress.com/index.php/jaasd/article/view/72

Abstract

Transformer-based language models have revolutionized natural language processing (NLP) by enabling efficient training on large-scale datasets and achieving state-of-the-art performance on various tasks. This paper provides an in-depth analysis of transformer-based language models, focusing on key architectures like BERT, GPT, and T5. We explore the underlying mechanisms of transformers, including self-attention and positional encoding, and discuss how these models have been applied to NLP tasks such as text generation and classification. Additionally, we examine the strengths and limitations of transformer-based models and discuss future research directions in this field.

Downloads

Download data is not yet available.

References

  1. Tatineni, Sumanth. "Beyond Accuracy: Understanding Model Performance on SQuAD 2.0 Challenges." International Journal of Advanced Research in Engineering and Technology (IJARET) 10.1 (2019): 566-581.
  2. Shaik, Mahammad, Srinivasan Venkataramanan, and Ashok Kumar Reddy Sadhu. "Fortifying the Expanding Internet of Things Landscape: A Zero Trust Network Architecture Approach for Enhanced Security and Mitigating Resource Constraints." Journal of Science & Technology 1.1 (2020): 170-192.
  3. Tatineni, Sumanth. "Cost Optimization Strategies for Navigating the Economics of AWS Cloud Services." International Journal of Advanced Research in Engineering and Technology (IJARET) 10.6 (2019): 827-842.