1/21/2013

My Intro to Cloudera Hive (day 3)

Today I installed the Hive component of the Cloudera Hadoop ecosystem.

Hive Documentation:

http://archive.cloudera.com/cdh/3/hive/

Hive Installation:

https://ccp.cloudera.com/display/CDHDOC/Hive+Installation


Install the necessary files to the directory: /usr/lib/hive/lib

 
Add:  /usr/lib/hive/lib/* to the HADOOP_CLASSPATH so it can reference the necessary JAR files to run HIVE...
 
Running Hive:
 
 
$ hive


Run the script:


and it gives the results:


And now you can download sample data files from this URL:

https://cwiki.apache.org/confluence/display/Hive/GettingStarted#GettingStarted-ExampleQueries

My first impression is that HIVE is a pseudo SQL like language.

You 'create' the table, load the data, do 'selects' against the data and then 'drop' the table.

It still has a 'structured' feel to it.

So much to learn.

Having fun!

*(Here's my Day 2 Post) http://www.bloomconsultingbi.com/2013/01/cloudera-hadoop-day-2.html
*(Here's my Day 1 Post) http://www.bloomconsultingbi.com/2013/01/first-try-at-cloudera-hadoop.html
*(Here's my Day 0 Post) http://www.bloomconsultingbi.com/2013/01/getting-started-with-cloudera-hadoop.html

No comments:

Post a Comment

Note: Only a member of this blog may post a comment.