Big Data – Hive and Impala

I’m stuck on a Computer Science question and need an explanation.

Lab#6

Hive and Impala

1)Import table “webpage” via Sqoop

$ sqoop import

–connect jdbc:mysql://localhost/loudacre

–username training –password training

–table webpage

–target-dir /loudacre/webpage

–fields-terminated-by “t”

2)Validate the imported data in HDFS ( Please provide screenshot from Terminal and NOT HUE )

3)Create External Table via Hive session

CREATE EXTERNAL TABLE webpage

(page_id SMALLINT,

name STRING,

assoc_files STRING)

ROW FORMAT DELIMITED

FIELDS TERMINATED BY ‘t’

LOCATION ‘/loudacre/webpage’

4)Open FireFox browser and run the below query from Hive

SELECT * FROM webpage WHERE name LIKE “ifruit%”

5)Open FireFox browser and run the below query from Impala

SELECT * FROM webpage WHERE name LIKE “ifruit%”