Vol. 2 No. 2 (2022): Journal of AI-Assisted Scientific Discovery
Articles

Kubernetes at the Edge: Enabling AI and Big Data Workloads in Remote Locations

Naresh Dulam
Vice President Sr Lead Software Engineer, JP Morgan Chase, USA
Jayaram Immaneni
Sre Lead, JP Morgan Chase, USA
Madhu Ankam
Vice President Sr Lead Software Engineer, JP Morgan Chase, USA
Cover

Published 18-10-2022

Keywords

  • Kubernetes,
  • AI workloads

How to Cite

[1]
Naresh Dulam, Jayaram Immaneni, and Madhu Ankam, “Kubernetes at the Edge: Enabling AI and Big Data Workloads in Remote Locations”, Journal of AI-Assisted Scientific Discovery, vol. 2, no. 2, pp. 251–277, Oct. 2022, Accessed: Dec. 23, 2024. [Online]. Available: https://scienceacadpress.com/index.php/jaasd/article/view/229

Abstract

The growing demand for real-time analytics and AI-driven decision-making pushes businesses to move computational workloads closer to where data is generated—often in remote or distributed locations. This shift addresses critical needs such as reducing latency, enabling faster insights, and optimizing operational efficiency. Kubernetes, the leading container orchestration platform, plays a pivotal role in this transition by extending cloud-native principles to the edge. Its inherent capabilities for scalability, reliability, workload portability, and efficient resource utilization make it an ideal framework for deploying AI & big data workloads in these environments. However, edge computing presents unique challenges, including limited resources, intermittent connectivity, and higher risks associated with remote operations. Kubernetes rises to these challenges by enabling dynamic workload scheduling, automated scaling, and seamless orchestration across geographically dispersed clusters. Its lightweight distributions and edge-focused adaptations allow it to run effectively on resource-constrained devices while maintaining a consistent developer experience. Furthermore, Kubernetes facilitates data processing and AI inference at the edge, enabling organizations to derive actionable insights in near real-time. Deploying Kubernetes at the edge also requires careful consideration of network bandwidth, security, and infrastructure management to overcome latency, hardware variability, and operational constraints. Best practices include Adopting edge-optimized Kubernetes distributions, Leveraging tools for monitoring and resource efficiency & Implementing strategies to maintain high availability despite unreliable connectivity.

Downloads

Download data is not yet available.

References

  1. Raheja, R. (2020). Enabling kubernetes for distributed ai processing on edge devices (Doctoral dissertation, The University of North Carolina at Charlotte).
  2. Liu, B. (2019). Study and benchmarking of Artificial Intelligence (AI) model serving systems on edge computation units and cloud environments (Master's thesis).
  3. Toka, L., Dobreff, G., Fodor, B., & Sonkoly, B. (2021). Machine learning-based scaling management for kubernetes edge clusters. IEEE Transactions on Network and Service Management, 18(1), 958-972.
  4. Kraemer, F. (2021, May). AI and Big Data Management for Autonomous Driving. In 21. Internationales Stuttgarter Symposium: Automobil-und Motorentechnik (pp. 447-460). Wiesbaden: Springer Fachmedien Wiesbaden.
  5. Rossi, F., Cardellini, V., Presti, F. L., & Nardelli, M. (2020). Geo-distributed efficient deployment of containers with kubernetes. Computer Communications, 159, 161-174.
  6. Zhang, X., Li, L., Wang, Y., Chen, E., & Shou, L. (2021). Zeus: Improving resource efficiency via workload colocation for massive kubernetes clusters. IEEE Access, 9, 105192-105204.
  7. Thurgood, B., & Lennon, R. G. (2019, July). Cloud computing with kubernetes cluster elastic scaling. In Proceedings of the 3rd International Conference on Future Networks and Distributed Systems (pp. 1-7).
  8. Trakadas, P., Nomikos, N., Michailidis, E. T., Zahariadis, T., Facca, F. M., Breitgand, D., ... & Gkonis, P. (2019). Hybrid clouds for data-intensive, 5G-enabled IoT applications: An overview, key issues and relevant architecture. Sensors, 19(16), 3591.
  9. Kommera, A. R. (2013). The Role of Distributed Systems in Cloud Computing: Scalability, Efficiency, and Resilience. NeuroQuantology, 11(3), 507-516.
  10. Wojciechowski, Ł., Opasiak, K., Latusek, J., Wereski, M., Morales, V., Kim, T., & Hong, M. (2021, May). Netmarks: Network metrics-aware kubernetes scheduler powered by service mesh. In IEEE INFOCOM 2021-IEEE Conference on Computer Communications (pp. 1-9). IEEE.
  11. Kim, J., Ullah, S., & Kim, D. H. (2021). GPU-based embedded edge server configuration and offloading for a neural network service. The Journal of Supercomputing, 77(8), 8593-8621.
  12. Sellami, R., Zalila, F., Nuttinck, A., Dupont, S., Deprez, J. C., & Mouton, S. (2020, September). Fadi-a deployment framework for big data management and analytics. In 2020 IEEE 29th International Conference on Enabling Technologies: Infrastructure for Collaborative Enterprises (WETICE) (pp. 153-158). IEEE.
  13. Goethals, T., Volckaert, B., & De Turck, F. (2020). Adaptive Fog Service Placement for Real-time Topology Changes in Kubernetes Clusters. In CLOSER (pp. 161-170).
  14. Serrano, M. A., Marín, C. A., Queralt, A., Cordeiro, C., Gonzalez, M., Pinho, L. M., & Quiñones, E. (2021). An Elastic Software Architecture for Extreme-Scale Big Data Analytics. In Technologies and Applications for Big Data Value (pp. 89-110). Cham: Springer International Publishing.
  15. Gilbert, M. (2018). The role of artificial intelligence for network automation and security. In Artificial Intelligence for Autonomous Networks (pp. 1-23). Chapman and Hall/CRC.
  16. Thumburu, S. K. R. (2021). Integrating Blockchain Technology into EDI for Enhanced Data Security and Transparency. MZ Computing Journal, 2(1).
  17. Thumburu, S. K. R. (2021). Optimizing Data Transformation in EDI Workflows. Innovative Computer Sciences Journal, 7(1).
  18. Gade, K. R. (2021). Cloud Migration: Challenges and Best Practices for Migrating Legacy Systems to the Cloud. Innovative Engineering Sciences Journal, 1(1).
  19. Gade, K. R. (2021). Data Analytics: Data Democratization and Self-Service Analytics Platforms Empowering Everyone with Data. MZ Computing Journal, 2(1).
  20. Katari, A., Muthsyala, A., & Allam, H. HYBRID CLOUD ARCHITECTURES FOR FINANCIAL DATA LAKES: DESIGN PATTERNS AND USE CASES.
  21. Katari, A. Conflict Resolution Strategies in Financial Data Replication Systems.
  22. Komandla, V. Strategic Feature Prioritization: Maximizing Value through User-Centric Roadmaps.
  23. Komandla, V. Transforming Financial Interactions: Best Practices for Mobile Banking App Design and Functionality to Boost User Engagement and Satisfaction.
  24. Thumburu, S. K. R. (2020). Enhancing Data Compliance in EDI Transactions. Innovative Computer Sciences Journal, 6(1).
  25. Thumburu, S. K. R. (2020). Leveraging APIs in EDI Migration Projects. MZ Computing Journal, 1(1).