Archive

Archive for the ‘HDInsight’ Category

HDInsight on Windows: Building PowerPivot Report from Hive in a Few Easy Steps (PDF)

January 7, 2013 Leave a comment

Download Step by Step PDF from Slide Share

Categories: HDInsight, PowerPivot

HDInsight on Windows: Building PowerPivot Report from Hive in a Few Easy Steps

January 4, 2013 2 comments

What are we Covering:

  • Hive
    • Create table
    • Load Data
    • Query Data
  • Configure ODBC
  • Create PowerPivot
    • With DataSource as HDInsight
    • Import Data from HDInsight 
  • Create Pivot Chat using imported data

 

(1)Open Hadoop Command Line

After installing HDInsight for Windows  you will get Hadoop Command Line Shortcut on desktop

Click on Hadoop Command Line Shortcut and it will open Cmd Line for Hadoop

clip_image002

(2)Type Hive and it will change to Hive Command

clip_image004

(3)Create Tables

CREATE TABLE emp(id int,name string)ROW FORMAT DELIMITED FIELDS TERMINATED BY ‘,’;

CREATE TABLE sales(id int,sales int)ROW FORMAT DELIMITED FIELDS TERMINATED BY ‘,’;

clip_image005

(4)Load Data using CSV Files

LOAD DATA LOCAL INPATH ‘c:/data/emp.csv’ OVERWRITE INTO TABLE emp;

clip_image006

LOAD DATA LOCAL INPATH ‘c:/data/sales.csv’ OVERWRITE INTO TABLE sales;

clip_image007

(5) Open HDInsight Dashboard (You will find shortcut on desktop) and click on Download

clip_image010

(6)Click on the appropriate link for your Excel.

clip_image013

(7)Configure ODBC

For configuring ODBC, Click start and type in ODBC. This will bring up the Data Sources dialog. Go to the System DSN tab and click Add. Configure it with a Data Source Name, Your Host Information, and a Username. In Single Node HDInsight Installation you don’t need to provide Username but if you are using HDInsight on Azure you can provide Username .

clip_image014

clip_image016

(8) Provider Data Source Name and Default Port is 10000

clip_image018

(9)Open Excel Sheet, select PowerPivot in ribbon and click on Manage

clip_image021

(10)Click on Get External Data

clip_image025

(11)Select – Get External Data and Data Import window and Select Others(OLEDB/ODBC)

clip_image027

(12)Click Build

clip_image029

(13)Select OLE DB Provider for ODBC

clip_image030

(14)Select DSN which we have created earlier in step number 8, Select Database (In Single Node HDInsight installation you don’t need to provider User Name and Password in current release but in case of HDInsight on Azure you need to provide User Name and Password) and follow the wizard for selecting tables in our case its EMP and Sales

clip_image031

clip_image034

clip_image035

clip_image036

(15)If you Notice Relationship between Emp and Sales is absent so with one click we can create relationship

clip_image038

clip_image040

(16)Added feature like Calculated Measure name Sales and created a KPI

clip_image041

(17)Select Pivot of your Choice from PivotTable

clip_image042

(18) Finally – you are good to go with PowerPivot Report based on Hive

clip_image044

Refer

HDInsight – What is it?

Configuring: PowerPivot for Excel 2013

Advertisement

HDInsight – What is it?

January 4, 2013 4 comments

HDInsight is Microsoft’s Hadoop-based distribution.

HDInsight comes in two flavors:

HDInsight Server for local on-premise installation of the Hadoop distribution, this will allow to build local cluster with your own Hadoop Hive able to run Hadoop Jobs, on clip_image001top of that fully integration with Microsoft BI Stack

HDInsight on Azure Service is the easiest way to deploy, manage and scale Hadoop based solution. Current release includes:

    • HDFS and Map/Reduce
    • Pig
    • Hive
    • Sqoop

HDInsight:

Categories: HDInsight Tags: , ,