[Big Data] Spark - 5: HDFS - Copying Files, MapReduce - SpicyBoyd Blog


SpicyBoyd

tags: `Big Data` `Hadoop`

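Copying Files to HDFS

Sample file: any plain-text (.txt) file will also work.
Place the file you want to practice with in the "Downloads" folder.
Create a directory in HDFS: `hadoop fs -mkdir -p <directory path>`
- Example: `hadoop fs -mkdir -p /user/test/jausten`
Change to the Downloads folder: `cd ~/Downloads`
Copy the file to HDFS: `hadoop fs -copyFromLocal <full file name> <directory path>`
- Example: `hadoop fs -copyFromLocal jane_austen.txt /user/test/jausten`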
Check that the copy succeeded by listing every file in that directory: `hadoop fs -ls <directory path>`
- Example: `hadoop fs -ls /user/test/jausten`
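
The create, copy, and list steps can be collected into one small shell function. This is only a sketch, assuming `hadoop` is on the PATH; the function name `upload_to_hdfs` and its argument order are made up for illustration and are not part of Hadoop.

```shell
# Hypothetical helper (not a Hadoop command): create the HDFS directory,
# upload one local file into it, then list the directory to confirm.
# $1 = local file, $2 = HDFS directory
upload_to_hdfs() {
  local file="$1" dir="$2"
  hadoop fs -mkdir -p "$dir" &&
    hadoop fs -copyFromLocal "$file" "$dir" &&
    hadoop fs -ls "$dir"
}
```

For the example in these notes, the call would be `upload_to_hdfs ~/Downloads/jane_austen.txt /user/test/jausten`. Passing the directory once keeps the path used for listing identical to the path used for copying.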

MapReduce

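WordCount program
- Because it is a jar file, the browser may warn that the download is risky; it is fine to download it anyway.
Complete the steps for copying the file to HDFS first.
Run the WordCount program: `hadoop jar <WordCount jar file> WordCount <text file path> <output directory path>`
- Example: `hadoop jar wordcount2.jar WordCount /user/test/jausten/jane_austen.txt /user/test/output`
Check the results
- List every file in the output directory: `hadoop fs -ls /user/test/output`
- Print the contents of the result file: `hadoop fs -cat /user/test/output/part-r-00000`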
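
To get a feel for what `part-r-00000` contains, the pipeline below imitates a word count locally with standard Unix tools. It is a sketch under assumptions: it splits on whitespace and writes tab-separated word and count lines sorted by word, as the stock Hadoop WordCount example does. The course's `wordcount2.jar` may tokenize differently. The function name `wordcount_local` is hypothetical.

```shell
# Hypothetical local stand-in for a WordCount job (not the course jar).
# "map": put one word per line; "shuffle": sort so equal words are adjacent;
# "reduce": count each run of equal words and print word<TAB>count.
wordcount_local() {
  tr -s '[:space:]' '\n' |
    sed '/^$/d' |
    LC_ALL=C sort |
    LC_ALL=C uniq -c |
    awk '{ printf "%s\t%s\n", $2, $1 }'
}
```

One way to compare it with the job output would be `hadoop fs -cat /user/test/jausten/jane_austen.txt | wordcount_local | head`.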
Remove a file or directory: `hadoop fs -rm -r <path>`
- Example: `hadoop fs -rm -r /user/test/output`
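
Jobs that use Hadoop's standard file output formats generally fail if the output directory already exists, so the remove step matters when running WordCount a second time. The sketch below chains the removal, the job, and the result check. The function name `rerun_wordcount` is hypothetical, and it assumes the job writes a single `part-r-00000` file as in the example above.

```shell
# Hypothetical helper (not a Hadoop command): clear old output, run the job, print the result.
# $1 = jar file, $2 = HDFS input file, $3 = HDFS output directory
rerun_wordcount() {
  local jar="$1" input="$2" output="$3"
  # Remove the previous output directory only if it is there.
  if hadoop fs -test -d "$output"; then
    hadoop fs -rm -r "$output" || return 1
  fi
  hadoop jar "$jar" WordCount "$input" "$output" &&
    hadoop fs -cat "$output/part-r-00000"
}
```

With the paths from these notes, the call would be `rerun_wordcount wordcount2.jar /user/test/jausten/jane_austen.txt /user/test/output`.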

Further Reading

Introduction to Hadoop commands: https://ithelp.ithome.com.tw/articles/10191116

References

Course handouts: https://tims.etraining.gov.tw/TIMSonline/index3.aspx?OCID=113442
Cover image: https://www.ithome.com.tw/node/73978

