We know that Apache Zeppelin is a web-based multipurpose note book. It provides an interactive Data Analysis and many more. such as Data Ingestion, Data Discovery, Data Visualization & Collaboration. In this post I will explore some basic data analysis using Zeppelin and Spark.
To enable Apache Spark and Zeppelin on Windows system you need to download and install the Sparklet on your windows system.
It’s been a couple of weeks that we have released Sparklet beta version, the Apache Spark and Zeppelin installer for windows standalone. As I was playing around with data visualization, I thought to write a blog post on it. So here are the steps to create a basic chart which are included in Zeppelin.
To run Spark and Zeppelin you need to download and install the Sparklet on your windows system. Go through the Zeppelin’s Display system to learn more about display charts.
In this post I am going to show you a Data Visualization using web page view data which is the number of web page views in every month for the year 2014. I have downloaded the data in a csv format and after a bit of cleansing the data file, it looks as below.
When you are working with Scale functions in D3, there you need a domain and range to map the data values from an input domain to an output range, which means range of possible input data values to range of possible output values.
Below is the basic example to understand the domain and range.
Suppose you have a dataset like [100, 200, 300, 400, 500, 600] and you need to visualize each data to width of one bar into a canvas of 500px width and height. Code will look like as below.
Since last couple of weeks I am learning D3, so you can say I am a beginner to the D3. Here I am going to show you a very basic Data Visualization example.