Sunday, November 22, 2015

Components in Hadoop Architecture 1.x


Hadoop follows a Master-Slave architecture. In Hadoop 1.x, the file system (HDFS) uses a default block size of 64 MB.
The default replication factor in Hadoop is 3. Both values are configurable.
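
As a rough worked example of these defaults, the sketch below computes how many blocks a file occupies and how much raw cluster storage its replicas consume. The function names and the 200 MB file size are illustrative assumptions, not part of Hadoop's API.

```python
import math

BLOCK_SIZE_MB = 64      # Hadoop 1.x default block size
REPLICATION_FACTOR = 3  # default replication factor


def block_count(file_size_mb, block_size_mb=BLOCK_SIZE_MB):
    """Number of blocks needed for a file (the last block may be partial)."""
    return math.ceil(file_size_mb / block_size_mb)


def raw_storage_mb(file_size_mb, replication=REPLICATION_FACTOR):
    """Total storage used across the cluster once every block is replicated."""
    return file_size_mb * replication


if __name__ == "__main__":
    size = 200  # hypothetical file size in MB
    print(block_count(size))     # 4 blocks: 64 + 64 + 64 + 8
    print(raw_storage_mb(size))  # 600 MB of raw storage
```

Note that the last block only takes up as much space as the data it holds, which is why the raw storage is 3 x 200 MB rather than 3 x 4 x 64 MB.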

There are a total of 5 components in the Hadoop 1.x architecture:

1. Name Node (NN)
2. Data Node (DN)
3. Secondary Name Node (SNN)
4. Job Tracker (JT)
5. Task Tracker (TT)

Name Node, Data Node and Secondary Name Node are the Storage components in Hadoop, whereas Job Tracker and Task Tracker are the Processing components.

Name Node is the Master Node in Hadoop.
Data Nodes are the Slave Nodes in Hadoop.
The Master Node's MapReduce (MR) component is called the Job Tracker.
The Slave Node's MR component is called the Task Tracker.

Details of each component are given below:

Name Node:

1. The Name Node decides where the blocks of each file are physically stored in the cluster.
2. It also manages the metadata (data about the data) of the files stored in the Data Nodes.
3. The Name Node also keeps track of which blocks, at which physical locations in the Data Nodes, must be combined to reconstruct the actual file.
4. The Name Node persists the metadata in the FSImage file and records subsequent changes in the EditLogs file. At regular intervals the EditLogs are merged into the FSImage.
This process is called the Checkpoint mechanism.

Note: The Name Node maintains the metadata of the complete Hadoop cluster, but it never stores the actual data. The actual data is always stored in the Data Nodes only.
Because Hadoop 1.x has only one Name Node, it is a Single Point of Failure (SPOF).
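
The FSImage and EditLogs bookkeeping described above can be pictured with a small toy model. This is only a sketch: the class, its method names and the in-memory dictionaries are hypothetical and do not mirror Hadoop's real on-disk formats or APIs.

```python
class NameNodeMetadata:
    """Toy model: fsimage is a snapshot, edit_log holds the changes made since."""

    def __init__(self):
        self.fsimage = {}   # file path -> list of block ids (as of last checkpoint)
        self.edit_log = []  # (operation, path, blocks) tuples since last checkpoint

    def add_file(self, path, blocks):
        # A change is only logged; the snapshot is untouched until a checkpoint.
        self.edit_log.append(("add", path, list(blocks)))

    def delete_file(self, path):
        self.edit_log.append(("delete", path, None))

    def checkpoint(self):
        """Replay the edit log onto the snapshot, then start a fresh log."""
        for op, path, blocks in self.edit_log:
            if op == "add":
                self.fsimage[path] = blocks
            elif op == "delete":
                self.fsimage.pop(path, None)
        self.edit_log = []
        return self.fsimage


if __name__ == "__main__":
    meta = NameNodeMetadata()
    meta.add_file("/logs/a.txt", ["blk_1", "blk_2"])  # hypothetical paths and block ids
    meta.add_file("/logs/b.txt", ["blk_3"])
    meta.delete_file("/logs/a.txt")
    print(meta.checkpoint())  # {'/logs/b.txt': ['blk_3']}
```

Keeping the log separate from the snapshot means each change is a cheap append, and the more expensive merge happens only at checkpoint time.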

Data Node:
A Data Node stores the file data in blocks, as per the instructions from the Name Node.
A Data Node sends an RPC signal to the Name Node at regular intervals to report that it is alive and working. This is called the Heartbeat mechanism.
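
A minimal sketch of how the receiving side of a heartbeat mechanism might track liveness is shown below. The class name, the node ids and the 30-second timeout are assumptions chosen for illustration; they are not Hadoop's actual classes or default values.

```python
class HeartbeatMonitor:
    """Toy liveness tracker: a node is dead if its last heartbeat is too old."""

    def __init__(self, timeout_seconds):
        self.timeout_seconds = timeout_seconds
        self.last_seen = {}  # data node id -> time of its last heartbeat

    def record_heartbeat(self, node_id, now):
        self.last_seen[node_id] = now

    def live_nodes(self, now):
        return sorted(n for n, t in self.last_seen.items()
                      if now - t <= self.timeout_seconds)

    def dead_nodes(self, now):
        return sorted(n for n, t in self.last_seen.items()
                      if now - t > self.timeout_seconds)


if __name__ == "__main__":
    monitor = HeartbeatMonitor(timeout_seconds=30)  # hypothetical timeout
    monitor.record_heartbeat("dn1", now=0)
    monitor.record_heartbeat("dn2", now=0)
    monitor.record_heartbeat("dn1", now=25)  # dn2 stays silent
    print(monitor.live_nodes(now=40))  # ['dn1']
    print(monitor.dead_nodes(now=40))  # ['dn2']
```

Timestamps are passed in explicitly so the example is deterministic; a real monitor would read a clock instead.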

Secondary Name Node:
The Secondary Name Node is a checkpointing helper for the Name Node; despite its name, it is not a hot standby.
It stores a copy of the metadata from the Name Node at regular intervals, as per the checkpoint mechanism.
When the Name Node goes down, the Secondary Name Node does not take over automatically. Its latest checkpoint can be used to restore the Name Node's metadata, although changes made after that checkpoint may be lost.

The working of the Secondary Name Node is described in detail in later blog posts.

Job Tracker:
1. The Job Tracker usually resides on the same node as the Name Node.
2. Its job is to assign tasks to the Task Trackers running on the Data Nodes.
3. It also decides the job scheduling for the Task Trackers.
4. In case of a task failure, the Job Tracker reschedules the task on some other node.
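
The assignment and rescheduling duties listed above can be sketched as follows. This is a simplified illustration: the class is hypothetical, and round-robin placement is an assumption made to keep the example short (a real scheduler weighs other factors, such as where the data lives).

```python
class ToyJobTracker:
    """Toy scheduler: assigns tasks round-robin and reassigns them on failure."""

    def __init__(self, task_trackers):
        self.task_trackers = list(task_trackers)
        self.assignments = {}  # task id -> task tracker name

    def assign(self, task_ids):
        """Spread tasks over the task trackers in round-robin order."""
        for i, task_id in enumerate(task_ids):
            tracker = self.task_trackers[i % len(self.task_trackers)]
            self.assignments[task_id] = tracker

    def handle_failure(self, failed_tracker):
        """Move every task of a failed tracker onto the remaining trackers."""
        self.task_trackers.remove(failed_tracker)
        if not self.task_trackers:
            raise RuntimeError("no task trackers left")
        orphaned = [t for t, tt in self.assignments.items() if tt == failed_tracker]
        for i, task_id in enumerate(orphaned):
            tracker = self.task_trackers[i % len(self.task_trackers)]
            self.assignments[task_id] = tracker
        return orphaned


if __name__ == "__main__":
    jt = ToyJobTracker(["tt1", "tt2", "tt3"])  # hypothetical tracker names
    jt.assign(["t0", "t1", "t2", "t3"])
    print(jt.handle_failure("tt1"))  # ['t0', 't3'] were running on tt1
    print(jt.assignments)
    # {'t0': 'tt2', 't1': 'tt2', 't2': 'tt3', 't3': 'tt3'}
```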

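Task Tracker:
1. The Task Tracker's job is to execute the tasks assigned by the Job Tracker.

Note: The Job Tracker and the Task Trackers communicate only about MR jobs: each Task Tracker reports its status through periodic heartbeats, and the Job Tracker assigns tasks in response.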


How do the components in Hadoop work collaboratively?
When a client submits a request to the Hadoop system:
  • The client request is first received by the Master Node.
  • The Master Node's MR component, the "Job Tracker", receives the client's work, divides it into manageable independent tasks, and assigns those tasks to the Task Trackers.
  • The Slave Node's MR component, the "Task Tracker", receives the tasks from the "Job Tracker" and performs the work using MR.
  • Once all the Task Trackers have finished their work, the partial results are combined to produce the final result, and the JT marks the job as complete.
  • Finally, the Hadoop system sends the results back to the client.
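
The flow above can be imitated in a few lines of plain Python, using word counting as the job. This is a single-process sketch under simplifying assumptions: all function names are hypothetical, nothing runs on a cluster, and the split is a simple interleaving of input lines.

```python
from collections import Counter


def split_into_tasks(lines, num_tasks):
    """Job Tracker role: divide the input into independent chunks."""
    return [lines[i::num_tasks] for i in range(num_tasks)]


def run_task(chunk):
    """Task Tracker role: count the words in one chunk."""
    counts = Counter()
    for line in chunk:
        counts.update(line.lower().split())
    return counts


def combine(partial_results):
    """Merge the per-task counts into the final result."""
    total = Counter()
    for partial in partial_results:
        total.update(partial)
    return dict(total)


def run_job(lines, num_tasks=2):
    """Client request: split the work, run every task, combine the results."""
    tasks = split_into_tasks(lines, num_tasks)
    partials = [run_task(chunk) for chunk in tasks]
    return combine(partials)


if __name__ == "__main__":
    data = ["hadoop stores data", "hadoop processes data"]  # hypothetical input
    print(run_job(data))
    # {'hadoop': 2, 'stores': 1, 'data': 2, 'processes': 1}
```

Because each chunk is counted independently, the tasks could run on different nodes in any order and the combined result would be the same.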
