10/21/2013

Installation #Hortonworks #Hadoop 1.3 Part 1

So today we are going to install Hadoop 1.3 single node cluster onto a Hyper-V system.

Download the files from the Hortonworks website:

http://hortonworks.com/products/hdp-windows/

Version 1.3


 Click the link to begin the download.  Unzip the file, creates a folder:
 
 Viewing the folder contents...
 
See the text file "clusterproperties.txt"
 
 
Install and load Hyper-V (Windows 8).  Create a new VM.  Load Windows 2012 Server. 
 

Start the Server:

Be sure to Create a Network Adapter, I created an "Internal" adapter:
 
 
 Then set the network configuration (Version 4):
 
 


 
Next I copied the files up to the VM Server.  Then begin the install.  Using the Hortonworks page as a reference:
 
 
Next open the Hortonworks page to view the pre-requisites for the install...
 
 
 
Download Python:
 
 
 
Create a folder on the VM, I named it HWHadoop13:
 
 
Copy the Python install to the VM as well, and update the Path variable...


Open PowerShell as Administrator.  Rewrite the line of code in PowerShell...Execute...

Python 2.7.5
*** MESSAGE TO READER ***
Be sure to add the Python executable path to the Environment Variable "PATH"...

Use the following instructions to manually install Python in your local environment:

  1. Download Python from here to the workspace directory.
  2. Update the PATH environment variable. Using Administrator privileges. From the Powershell window, execute the following commands as Administrator user:
    msiexec  /qn /norestart /log %WORKSPACE%\python-2.7.5.log /i %WORKSPACE%\python-2.7.5.msi
    setx PATH "$env:path;C:\Python27" /m
    where
    • %WORKSPACE% is the full workspace directory path.
    • $env is the Environment setting for your cluster.

[Important] Important
Ensure the downloaded Python MSI name matches python-2.7.5.msi. If not, change the above command to match the MSI file name.

Next download the C++ 2010 Redistributable Package...

http://www.microsoft.com/en-us/download/details.aspx?id=14632

 
Copy the file to the HWHadoop (Your home directory for Hadoop) folder...
 
 
Type this in the PowerShell command line...
 
 
Microsoft Visual C++ 2010 Redistributable Package (64-bit)

  1. Use the instructions provided here to download Microsoft Visual C++ 2010 Redistributable Package (64-bit) to the workspace directory.
  2. Execute the following command from Powershell with Administrator privileges:

    %WORKSPACE%\vcredist_x64.exe /q /norestart

    For example:
    C:\prereqs\vcredist_x64.exe /q /norestart
 
 
Now, download the Microsoft Framework...
 
Microsoft.NET framework 4.0
 
*** MESSAGE TO READER ***
Be sure to connected to the internet, because it has to pull some files off the web, if you're not connected, the install will fail...
  1. Use the instructions provided here to download Microsoft.NET framework 4.0 to the workspace directory.
  2. Execute the following command from Powershell with Administrator privileges:

    %WORKSPACE%\slavesetup\dotNetFx40_Full_setup.exe /q /norestart /log %WORKSPACE%/dotNetFx40_Full_setup.exe
 
 

 
And now for the JDK:
 
JDK 6.31 or higher
 
*** MESSAGE TO READER ***
During the installation process, it threw an error.  Turns out you can not have spaces in the path for JAVA_HOME.  So uninstall and re-install to new directory, i.e. C:\Java instead of C:\Program Files\...
 
Use the instructions provided below to manually install JDK to the workspace directory:

  1. Check the version. From a command shell or Powershell window, type:
    java -version
  2. (Optional): Uninstall the Java package if the JDK version is less than v1.6 update 31.
  3. Go to Oracle Java SE 6 Downloads page and accept the license.
    Download the JDK installer to the workspace directory.
    [Important] Important
    Ensure that no whitespace characters are present in the installation directory's path.
    For example, C:\Program Files is not allowed.

  4. From Powershell with Administrator privileges, execute the following commands:
    %WORKSPACE%\jdk-6u31-windows-x64.exe /qn /norestart /log %WORKSPACE%\jdk-6u31-windows-x64.log  INSTALLDIR=C:\java\jdk1.6.0_31
    setx JAVA_HOME "C:\java\jdk1.6.0_31" /m
    
    where %WORKSPACE% is the full workspace directory path.
    [Important] Important
    Ensure the downloaded JDK .exe file's name matches with jdk-6u31-windows-x64.exe. If not, change the above command to match the EXE file name.
    For example:
    C:\prereqs\jdk-6u31-windows-x64.exe /qn /norestart/log C:\prereqs\jdk-6u31-windows-x64.log
    INSTALLDIR=C:\java\jdk1.6.0_31
     
     
    http://www.oracle.com/technetwork/java/javasebusiness/downloads/java-archive-downloads-javase6-419409.html#jdk-6u31-oth-JPR
     
    Only problem is you have to have an Oracle account or you must create one.
     
    execute the Power Shell command...
     
     
     
    Now on to Page 2 of the installation...
     
    http://docs.hortonworks.com/HDPDocuments/HDP1/HDP-Win-1.3.0/bk_installing_hdp_for_windows/content/win-chap2-singlenode.html
     
     
Continue here to Part 2 of the installation of Hortonworks Hadoop version 1.3 for Windows ....
http://www.bloomconsultingbi.com/2013/10/installation-hortonworks-hadoop-13-part_22.html

No comments:

Post a Comment