Sparklet – Apache Spark and Zeppelin installer for Windows

Tweet about this on TwitterShare on LinkedInShare on Google+Share on Facebook

Needless to mention that Apache Spark is becoming the de facto platform for big data analytics. At the same time there is a notebook revolution going on. Data scientists and others who use a notebook simply love it. A notebook provides a browser based interactive environment to write and execute code, view output, make plots and many more. IPython Notebook is no doubt leading this revolution but it only allows python code.

Apache Zeppelin is a new entrant to the league. It enables interactive data analytics. One can make beautiful data-driven, interactive and collaborative documents with SQL, Scala and more. Zeppelin is based on the concept of an interpreter that can be bound to any language or data processing backend. Basically, Zeppelin is a web based notebook server. Its backend already supports quite a few interpreters like Spark, Scala, Python, Hive, Markdown etc and many more are yet to come. That means from a single notebook you can work with different big data platform and build your analytics solution. Zeppelin tends to cater all your needs: Data Ingestion, Data Discovery, Data Analytics, Data Visualization & Collaboration. It comes with Spark/Scala as its default interpreter.

When we started using Zeppelin we just loved it. Though it is still in its the early days, it promises quite a lot. As we work mostly in Windows environment we built Zeppelin and Spark and prepared an installer so that anyone in our team can set it up with just couple of clicks. Then we realised, if the installer is so useful for us, others might also find it useful. So today, we are making it available for everyone to download from our website. With this beta release of the standalone installer, the distribution which we call Sparklet includes Spark 1.6 and Zeppelin 0.5.6. Spark, Scala and Spark SQL works well with this release and we will make the all other interpreters work in the upcoming releases.

Sparklet can be downloaded from the following URL which also includes a link to the detailed user guide to download and install.

http://mund-consulting.com/Products/Sparklet.aspx

Here is a link to the video where the creator of Zeppelin explains how Spark with Zeppelin can be used for complete data science/advanced analytics life cycle:

https://www.youtube.com/watch?v=J6Ei1RMG5Xo

We would encourage you to try out Sparklet if you are looking at using Spark/Zeppelin in Windows environment. Please let us know what you think by dropping an email with the subject line, “Sparklet Feedback” to support@mund-consulting.com.

About Sumit Mund

Sumit Mund is a big data analytics consultant with about a decade of industry experience. At Mund Consulting Ltd, he is a director and acts as the lead consultant. He is an expert in machine learning, predictive analytics, Apache Spark, Python, C#, R, and Scala; Sumit has an active interest in Artificial Intelligence. He has extensive experience working with most of Microsoft Data Analytics tools and Big Data platforms like HDInsight. He is a Certified Developer on Apache Spark and also Microsoft Certified Solution Expert (MCSE in Business Intelligence). Sumit regularly engages on social media platforms through his tweets, blogs, and LinkedIn profile, and often gives talks at industry conferences and local user group meetings.

4 thoughts on “Sparklet – Apache Spark and Zeppelin installer for Windows

  1. Hello – Any way to connect this tool to a Hadoop cluster and access the other objects like Hive tables , HDFS files etc ?

  2. Hello, the installation went well, but Zeppelin is requiring that I log in even though I have sen the zeppelin.anonymous.allowed var to true. Where can I find the login information?

Leave a Reply

Your email address will not be published. Required fields are marked *


+ 9 = fifteen