The Hadoop Ecosystem: An Open-Source Framework for Enterprise-Scale Big Data Processing and Analytics

Nagwa Elmobark; Aymen Saad; Sajjad H. Hasan; Mohamed Badouch

doi:10.4028/p-Yx812u


Abstract:

The exponential growth of digital data presents unprecedented challenges for conventional data processing systems. This research explores the Hadoop ecosystem as an innovative approach to Big Data management, analyzing its architecture, capabilities, and strategic importance in modern data analytics. The study investigates Hadoop's distributed computing framework, which enables parallel processing of large datasets across commodity hardware. Key components, including the Hadoop Distributed File System (HDFS), the MapReduce programming model, and YARN resource management, are analyzed to illustrate the platform's distinctive approach to handling large, complex data workloads. Comparative analysis reveals Hadoop's significant advantages over conventional systems, including cost-effectiveness, scalability, and fault tolerance. The study highlights the ecosystem's evolution, from its origins to contemporary cloud-based implementations, and examines integration capabilities with tools like Hive, Pig, and Spark that extend its analytical potential. While identifying challenges such as operational complexity and security concerns, the study ultimately positions Hadoop as a vital technology for organizations seeking to leverage Big Data for strategic decision-making. The findings underscore Hadoop's potential to transform data processing approaches, offering a robust, flexible solution to the growing demands of modern data-driven enterprises.
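As an illustration of the MapReduce programming model named in the abstract, the following is a minimal single-process sketch in Python. It is not taken from the paper and does not use Hadoop's own API; the function names (`map_phase`, `shuffle_phase`, `reduce_phase`, `word_count`) and the sample input are hypothetical and chosen only to show the three stages. In an actual Hadoop job the map and reduce calls would run in parallel on different nodes, and the framework would perform the grouping step.

```python
from collections import defaultdict
from typing import Dict, Iterable, Iterator, List, Tuple


def map_phase(record: str) -> Iterator[Tuple[str, int]]:
    """Emit a (word, 1) pair for every word in one input record."""
    for word in record.lower().split():
        yield word, 1


def shuffle_phase(pairs: Iterable[Tuple[str, int]]) -> Dict[str, List[int]]:
    """Group emitted values by key (done by the framework in a real cluster)."""
    grouped: Dict[str, List[int]] = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(key: str, values: List[int]) -> Tuple[str, int]:
    """Collapse all values for one key into a single count."""
    return key, sum(values)


def word_count(records: Iterable[str]) -> Dict[str, int]:
    """Run map, shuffle, and reduce in sequence inside one process."""
    mapped = (pair for record in records for pair in map_phase(record))
    grouped = shuffle_phase(mapped)
    return dict(reduce_phase(key, values) for key, values in grouped.items())


if __name__ == "__main__":
    # Hypothetical sample input; each string stands in for one input split.
    sample = ["big data needs big clusters", "data moves to compute"]
    print(word_count(sample))
```

Because each call to `map_phase` depends only on its own record, and each call to `reduce_phase` depends only on one key's values, both stages can be spread across many machines, which is the property the abstract refers to as parallel processing across commodity hardware.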


Info:

Periodical: Engineering Headway (Volume 35)

Pages: 210-226

DOI: https://doi.org/10.4028/p-Yx812u

Online since: February 2026

Authors: Nagwa Elmobark, Aymen Saad, Sajjad H. Hasan, Mohamed Badouch

Keywords: Big Data Processing, Distributed Computing, Hadoop Architecture, Hadoop Ecosystem, MapReduce

