I signed up for the Hortonworks Certified Associate exam last Thursday. Figured if I sign up, I'd have to take the test. And if I tak...
Data becomes information. Information adds value if used properly to align business practices and streamline processes, with the net result of incr...
Data is the new oil. Sort of a good analogy. Except new oil is constantly required. And there are only so many oil wells on the planet. A...
What do you want to do when you grow up? For some of us, we still haven't decided. After close to 50 years. Chances are, if you chos...
Pig, Hive and Sqoop Reference Links
Starting to get ramped up for the upcoming Hadoop project.
Found a good reference cheat sheet for Apache Pig.
And a good reference tutorial on Apache Hive. With PDF download.
Apache Derby, an Apache DB subproject, is an open source relational database implemented entirely in Java.
Apache Sqoop, for transferring data and files to and from HDFS using connectors that run as jobs. And Sqoop Basics.
And a few tweets:
Hadoop is a valuable tool that augments the traditional data warehouse. It's got some complexity, but it's worth the effort.