Vol. 1 No. 2 (2021): Journal of AI-Assisted Scientific Discovery
Articles

A new pattern for managing massive datasets in the Enterprise through Data Fabric and Data Mesh

Sarbaree Mishra
Program Manager at Molina Healthcare Inc., USA
Vineela Komandla
Vice President - Product Manager, JP Morgan, USA
Srikanth Bandi
Software Engineer, JP Morgan Chase, USA
COver

Published 05-12-2021

Keywords

  • Data Fabric,
  • Data Mesh,
  • Enterprise Data Management

How to Cite

[1]
Sarbaree Mishra, Vineela Komandla, and Srikanth Bandi, “A new pattern for managing massive datasets in the Enterprise through Data Fabric and Data Mesh”, Journal of AI-Assisted Scientific Discovery, vol. 1, no. 2, pp. 236–259, Dec. 2021, Accessed: Dec. 23, 2024. [Online]. Available: https://scienceacadpress.com/index.php/jaasd/article/view/236

Abstract

The ever-increasing volume and complexity of enterprise data have exposed the limitations of traditional architectures in handling scalability, governance, and flexibility, prompting the need for innovative approaches like Data Fabric and Data Mesh. Data Fabric focuses on creating a unified, automated framework that connects data across hybrid and multi-cloud environments, ensuring seamless integration, robust governance, and simplified user access. Automating workflows enhances data reliability and compliance while reducing operational overhead. In contrast, Data Mesh shifts the paradigm to a decentralized model, where data ownership is distributed across domain teams that treat data as a product. This approach leverages self-serve platforms & domain-specific expertise to promote agility, innovation, and collaboration, empowering teams to manage their data more effectively without reliance on centralized bottlenecks. Both paradigms address critical challenges of modern data management, but their true potential lies in their complementary strengths. By combining the comprehensive integration & governance of Data Fabric with the domain-driven ownership and scalability of Data Mesh, enterprises can create a dynamic, democratized data ecosystem. Such a hybrid approach enables organizations to meet diverse business needs, foster innovation, and enhance decision-making capabilities while maintaining control and compliance. This article explores both paradigms' core principles, architectural components, and implementation strategies, offering insights into their application in enterprise settings. It also highlights how integrating Data Fabric and Data Mesh can provide a scalable, flexible, and democratized framework that empowers businesses to unlock more excellent value from their massive datasets while adapting to evolving market demands.

Downloads

Download data is not yet available.

References

  1. Raptis, T. P., Passarella, A., & Conti, M. (2019). Data management in industry 4.0: State of the art and open challenges. IEEE Access, 7, 97052-97093.
  2. Kamath, C. (2001). On mining scientific datasets. Data mining for scientific and engineering applications, 1-21.
  3. Cox, M., & Ellsworth, D. (1997, August). Managing big data for scientific visualization. In ACM siggraph (Vol. 97, No. 1, pp. 21-38). MRJ/NASA Ames Research Center.
  4. Rubin, D. B. (1997). Estimating causal effects from large data sets using propensity scores. Annals of internal medicine, 127(8_Part_2), 757-763.
  5. Kamath, C. (2009). Scientific data mining: a practical perspective. Society for Industrial and Applied Mathematics.
  6. Zhou, Z. H., Chawla, N. V., Jin, Y., & Williams, G. J. (2014). Big data opportunities and challenges: Discussions from data analytics perspectives [discussion forum]. IEEE Computational intelligence magazine, 9(4), 62-74.
  7. Praveena, M. A., & Bharathi, B. (2017, February). A survey paper on big data analytics. In 2017 International Conference on Information Communication and Embedded Systems (ICICES) (pp. 1-9). IEEE.
  8. Shoshani, A., & Rotem, D. (Eds.). (2009). Scientific data management: challenges, technology, and deployment. CRC Press.
  9. Post, F. H., Nielson, G., & Bonneau, G. P. (Eds.). (2002). Data visualization: The state of the art.
  10. Chen, M., Mao, S., & Liu, Y. (2014). Big data: A survey. Mobile networks and applications, 19, 171-209.
  11. Dobre, C., & Xhafa, F. (2014). Parallel programming paradigms and frameworks in big data era. International Journal of Parallel Programming, 42(5), 710-738.
  12. Ramnath, S., Haghighi, P., Kim, J. H., Detwiler, D., Berry, M., Shah, J. J., ... & Menzel, S. (2019, August). Automatically generating 60,000 CAD variants for big data applications. In International Design Engineering Technical Conferences and Computers and Information in Engineering Conference (Vol. 59179, p. V001T02A006). American Society of Mechanical Engineers.
  13. Belaud, J. P., Negny, S., Dupros, F., Michéa, D., & Vautrin, B. (2014). Collaborative simulation and scientific big data analysis: Illustration for sustainability in natural hazards management and chemical process engineering. Computers in Industry, 65(3), 521-535.
  14. Kitchin, R., & Lauriault, T. P. (2015). Small data in the era of big data. GeoJournal, 80, 463-475.
  15. Foster, I., Zhao, Y., Raicu, I., & Lu, S. (2008, November). Cloud computing and grid computing 360-degree compared. In 2008 grid computing environments workshop (pp. 1-10). Ieee.
  16. Gade, K. R. (2019). Data Migration Strategies for Large-Scale Projects in the Cloud for Fintech. Innovative Computer Sciences Journal, 5(1).
  17. Gade, K. R. (2017). Integrations: ETL/ELT, Data Integration Challenges, Integration Patterns. Innovative Computer Sciences Journal, 3(1).
  18. Komandla, V. Enhancing Security and Fraud Prevention in Fintech: Comprehensive Strategies for Secure Online Account Opening.
  19. Komandla, V. Transforming Financial Interactions: Best Practices for Mobile Banking App Design and Functionality to Boost User Engagement and Satisfaction.
  20. Gade, K. R. (2018). Real-Time Analytics: Challenges and Opportunities. Innovative Computer Sciences Journal, 4(1).