Question

What exactly is involved in namenode formatting. If I type in the following command into my terminal within my hadoop installation folder:

  bin/hadoop namenode -format

What exactly does it accomplish? I am looking to understand principles of namenode formatting & its significance. Thanks...

Was it helpful?

Solution

Hadoop NameNode is the centralized place of an HDFS file system which keeps the directory tree of all files in the file system, and tracks where across the cluster the file data is kept. In short, it keeps the metadata related to datanodes. When we format namenode it formats the meta-data related to data-nodes. By doing that, all the information on the datanodes are lost and they can be reused for new data.

OTHER TIPS

hadoop namenode -format formats your file system at the location specified in hdfs-site.xml

here my namenode directory is /usr/local/hadoop/dfs/name

<property>
<name>dfs.name.dir</name>
<value>/usr/local/hadoop/dfs/name</value>
<final>true</final>
</property>

Simply say,

  • NameNode contains metadata about data stored in DataNodes;
  • If NameNode is formatted, metadata alone gets deleted.
  • Original data in DataNode will not get affected;
Licensed under: CC-BY-SA with attribution
Not affiliated with StackOverflow
scroll top