

HDFS commands for listing files, reading and writing files, uploading and downloading files, file management, filesystem operations, administration, etc.
Typology: Cheat Sheet
hdfs dfs -ls /
  List all files and directories under the given HDFS path.

hdfs dfs -ls -d /hadoop
  List directories as plain files. Here the command shows the details of the hadoop directory itself rather than its contents.

hdfs dfs -ls -h /data
  Format file sizes in a human-readable fashion (e.g. 64.0m instead of 67108864).

hdfs dfs -ls -R /hadoop
  Recursively list all files in the hadoop directory and in all of its subdirectories.

hdfs dfs -ls /hadoop/dat*
  List all files matching the pattern. Here the command lists every file in the hadoop directory whose name starts with 'dat'.

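The listing options above can be combined. The following is a minimal sketch, not an official recipe: `run`, `list_tree`, `list_matching` and the `DRY_RUN` variable are hypothetical names introduced only for illustration. By default the helper prints the command instead of executing it, so the sketch can be tried on a machine without an HDFS client.

```shell
#!/usr/bin/env bash
# Hypothetical helper (not part of HDFS): prints the command by default;
# set DRY_RUN=0 to actually execute it where the hdfs client is installed.
run() {
  if [ "${DRY_RUN:-1}" = "1" ]; then
    echo "$*"
  else
    "$@"
  fi
}

# Recursive listing with human-readable sizes (-R and -h combined).
list_tree() {
  run hdfs dfs -ls -R -h "$1"
}

# The pattern is passed quoted so that the local shell does not try to
# expand it against the local filesystem before HDFS sees it.
list_matching() {
  run hdfs dfs -ls "$1"
}

list_tree /hadoop
list_matching '/hadoop/dat*'
```

Quoting the pattern is a defensive habit: an unquoted `/hadoop/dat*` usually reaches HDFS unchanged, but only because no local path happens to match it.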
hdfs dfs -text /hadoop/derby.log
  Take a source file and output it in text format on the terminal. The allowed formats are zip and TextRecordInputStream.

hdfs dfs -cat /hadoop/test
  Display the content of the HDFS file test on stdout.

hdfs dfs -appendToFile /home/ubuntu/test /hadoop/text
  Append the content of the local file test to the HDFS file text.

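Appending and then reading back is a common way to check the result. The sketch below chains the two commands; `run`, `append_and_show` and `DRY_RUN` are hypothetical names, and the helper only prints the commands unless `DRY_RUN=0` is set.

```shell
#!/usr/bin/env bash
# Hypothetical helper (not part of HDFS): prints the command by default;
# set DRY_RUN=0 to actually execute it where the hdfs client is installed.
run() {
  if [ "${DRY_RUN:-1}" = "1" ]; then
    echo "$*"
  else
    "$@"
  fi
}

# Append a local file to an HDFS file and, only if that succeeds,
# print the resulting HDFS file to stdout.
append_and_show() {
  run hdfs dfs -appendToFile "$1" "$2" && run hdfs dfs -cat "$2"
}

append_and_show /home/ubuntu/test /hadoop/text
```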
hdfs dfs -put /home/ubuntu/sample /hadoop
  Copy the file from the local file system to HDFS.

hdfs dfs -put -f /home/ubuntu/sample /hadoop
  Copy the file from the local file system to HDFS. If the file already exists at the destination path, -f overwrites it.

hdfs dfs -put -l /home/ubuntu/sample /hadoop
  Copy the file from the local file system to HDFS. Allows the DataNode to lazily persist the file to disk. Forces a replication factor of 1.

hdfs dfs -put -p /home/ubuntu/sample /hadoop
  Copy the file from the local file system to HDFS. Passing -p preserves access and modification times, ownership and the mode.

hdfs dfs -get /newfile /home/ubuntu/
  Copy the file from HDFS to the local file system.

hdfs dfs -get -p /newfile /home/ubuntu/
  Copy the file from HDFS to the local file system. Passing -p preserves access and modification times, ownership and the mode.

hdfs dfs -get /hadoop/*.txt /home/ubuntu/
  Copy all files matching the pattern from HDFS to the local file system.

hdfs dfs -copyFromLocal /home/ubuntu/sample /hadoop
  Works like the put command, except that the source is restricted to a local file reference.

hdfs dfs -copyToLocal /newfile /home/ubuntu/
  Works like the get command, except that the destination is restricted to a local file reference.

hdfs dfs -moveFromLocal /home/ubuntu/sample /hadoop
  Works like the put command, except that the local source is deleted after it is copied.

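The direction of each transfer is easy to mix up: put goes from local to HDFS, get goes from HDFS to local. A minimal sketch of both directions follows; `run`, `upload`, `download_matching` and `DRY_RUN` are hypothetical names, and the commands are only printed unless `DRY_RUN=0` is set.

```shell
#!/usr/bin/env bash
# Hypothetical helper (not part of HDFS): prints the command by default;
# set DRY_RUN=0 to actually execute it where the hdfs client is installed.
run() {
  if [ "${DRY_RUN:-1}" = "1" ]; then
    echo "$*"
  else
    "$@"
  fi
}

# Local -> HDFS: overwrite an existing copy (-f) and keep timestamps,
# ownership and mode (-p).
upload() {
  run hdfs dfs -put -f -p "$1" "$2"
}

# HDFS -> local: download every file matching an HDFS pattern.
# The pattern is quoted by the caller so the local shell leaves it alone.
download_matching() {
  run hdfs dfs -get "$1" "$2"
}

upload /home/ubuntu/sample /hadoop
download_matching '/hadoop/*.txt' /home/ubuntu/
```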
hdfs dfs -cp /hadoop/file1 /hadoop1
  Copy a file from source to destination on HDFS. Here file1 is copied from the hadoop directory to the hadoop1 directory.

hdfs dfs -cp -p /hadoop/file1 /hadoop1
  Copy a file from source to destination on HDFS. Passing -p preserves access and modification times, ownership and the mode.

hdfs dfs -cp -f /hadoop/file1 /hadoop1
  Copy a file from source to destination on HDFS. Passing -f overwrites the destination if it already exists.

hdfs dfs -mv /hadoop/file1 /hadoop1
  Move files that match the specified file pattern

hdfs dfs -rm -r /hadoop
hdfs dfs -rm -R /hadoop
hdfs dfs -rmr /hadoop
  Delete the directory and any content under it recursively. The -rmr form is deprecated in favor of -rm -r.

hdfs dfs -rm -skipTrash /hadoop
  The -skipTrash option bypasses the trash, if enabled, and deletes the specified file(s) immediately.

hdfs dfs -rm -f /hadoop
  If the file does not exist, do not display a diagnostic message or modify the exit status to reflect an error.

hdfs dfs -rmdir /hadoop1
  Delete a directory. The directory must be empty.

hdfs dfs -mkdir /hadoop2
  Create a directory in the specified HDFS location.

hdfs dfs -mkdir -p /hadoop
  Create a directory in the specified HDFS location, along with any missing parent directories. With -p the command does not fail even if the directory already exists.

hdfs dfs -touchz /hadoop
  Creates a file of zero length at