Question

I am using Cloudera Manager Free Edition on my "cluster", with all services running on a single machine.

My machine acts as the datanode, the namenode, and the secondary namenode.

My HDFS settings related to replication are:

dfs.replication                                   - 1
dfs.replication.min, dfs.namenode.replication.min - 1
dfs.replication.max                               - 1   

Still, I get under-replicated blocks and hence Bad Health.

The namenode log says:

Requested replication 3 exceeds maximum 1
java.io.IOException: file /tmp/.cloudera_health_monitoring_canary_files/.canary_file_2013_10_21-15_33_53 on client 111.222.333.444
Requested replication 3 exceeds maximum 1
    at org.apache.hadoop.hdfs.server.blockmanagement.BlockManager.verifyReplication(BlockManager.java:858)
    at org.apache.hadoop.hdfs.server.namenode.FSNamesystem.startFileInternal(FSNamesystem.java:1848)
    at org.apache.hadoop.hdfs.server.namenode.FSNamesystem.startFileInt(FSNamesystem.java:1771)
    at org.apache.hadoop.hdfs.server.namenode.FSNamesystem.startFile(FSNamesystem.java:1747)
    at org.apache.hadoop.hdfs.server.namenode.NameNodeRpcServer.create(NameNodeRpcServer.java:439)
    at org.apache.hadoop.hdfs.protocolPB.ClientNamenodeProtocolServerSideTranslatorPB.create(ClientNamenodeProtocolServerSideTranslatorPB.java:207)
    at org.apache.hadoop.hdfs.protocol.proto.ClientNamenodeProtocolProtos$ClientNamenodeProtocol$2.callBlockingMethod(ClientNamenodeProtocolProtos.java:44942)
    at org.apache.hadoop.ipc.ProtobufRpcEngine$Server$ProtoBufRpcInvoker.call(ProtobufRpcEngine.java:453)
    at org.apache.hadoop.ipc.RPC$Server.call(RPC.java:1002)
    at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:1751)
    at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:1747)
    at java.security.AccessController.doPrivileged(Native Method)
    at javax.security.auth.Subject.doAs(Subject.java:396)
    at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1408)
    at org.apache.hadoop.ipc.Server$Handler.run(Server.java:1745)

I have altered the values, saved, deployed the client configuration, and restarted too. It's still the same.
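For what it's worth, here is a quick check of what replication value the deployed client configuration actually resolves to (this assumes the hdfs getconf tool from the client packages is on the PATH):

hdfs getconf -confKey dfs.replication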

What property do I need to set to make CM read the replication factor as 1 instead of 3?

Solution 2

It's a client setting: the client wants to replicate the file 3 times, and the canary test acts as a client. It looks like you have to tune the HDFS canary test settings. Alternatively, you could use Cloudera Manager to mark the replication factor property as final, which would forbid clients from overriding it.
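If you want to confirm that the replication factor really is chosen by the client, a minimal sketch (the paths are illustrative) is to write a test file with an explicit client-side override and then print the replication factor HDFS recorded for it:

hadoop fs -D dfs.replication=1 -put /etc/hosts /tmp/repl_test
hadoop fs -stat %r /tmp/repl_test

The put should succeed even with dfs.replication.max set to 1 on the namenode, because the client asked for only one replica, and the stat should print 1.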

OTHER TIPS

Change the replication factor directly in a shell

hadoop fs -setrep -R 1 /

If you have permission problems, what worked for me was to change the replication factor as the user of each file. I had to change the replication factor for oozie files as follows:

sudo -u oozie bash
hadoop fs -setrep -R 1 /

Repeat for each user for which the permissions failed.
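A minimal sketch of that repetition, assuming a hypothetical list of service users (adjust it to whichever users actually own files on your cluster); each pass fixes the files that user can modify, and permission errors on everything else are suppressed:

for u in hdfs oozie hive hue mapred; do
  sudo -u "$u" hadoop fs -setrep -R 1 / 2>/dev/null
done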

I faced this issue too. In my case it was due to missing blocks. To confirm whether that is the case, go to the namenode web UI at http://hostname:50070 and look at the block report. Try to delete or re-upload the files whose blocks are missing. This should resolve your issue; that is how I resolved mine.
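If you prefer the command line, a hedged sketch: hdfs fsck can list the files that have missing or corrupt blocks and, if those files are expendable, delete them outright:

hdfs fsck / -list-corruptfileblocks
hdfs fsck / -delete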

  1. Log in as the HDFS user: # su - hdfs

  2. Execute this set of commands to fix under-replicated blocks in HDFS manually (adjust the target replication factor, 3 here, to whatever suits your cluster):

    # hdfs fsck / | grep 'Under replicated' | awk -F':' '{print $1}' >> /tmp/under_replicated_files
    # for hdfsfile in `cat /tmp/under_replicated_files`; do echo "Fixing $hdfsfile :"; hadoop fs -setrep 3 $hdfsfile; done
    
$ hadoop fs -setrep -R 1 /

or

update this property in your hdfs-site.xml file:

dfs.replication=1
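For reference, a minimal hdfs-site.xml entry might look like the snippet below; the <final> element is optional and, as the accepted answer notes, prevents clients (including the canary test) from overriding the value:

<property>
  <name>dfs.replication</name>
  <value>1</value>
  <final>true</final>
</property>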

Well, it's not recommended to run both the secondary namenode and the namenode on the same node. Put them on separate machines for better results.

Coming to your question: I assume you are testing on that same single machine. Cloudera still assumes you have three replicas, which is why this problem showed up. To form a proper separate cluster, you should have a minimum of 4 systems.

First, check whether your HDFS configuration in hdfs-site.xml contains this property:

<property>
  <name>dfs.replication</name>
  <value>3</value>
</property>

I assume your cluster has only 2 or 3 systems, so the remaining replicas cannot be placed properly, which is why this problem showed up.

You can resolve this problem. Just open a terminal and enter this command:

$ hadoop fs -setrep -R 1 /

This overwrites the replication factor on the existing files and resolves the problem. Otherwise, add a few more systems (three or more) to the existing cluster, i.e. commission new nodes, and your problem will surely be resolved.
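Whichever route you take, you can verify the result with fsck; the under-replicated block count in its summary should drop to zero once replication settles:

hdfs fsck / | grep -i 'under-replicated'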

Licensed under: CC-BY-SA with attribution
Not affiliated with StackOverflow