Skip to content

Latest commit

 

History

History

jupyter-notebooks

Folders and files

NameName
Last commit message
Last commit date

parent directory

..
 
 
 
 
 
 

Example Jupyter Notebooks

Overview

The Jupyter Notebook is a web application that allows you to create and share documents that contain live code, equations, visualizations and explanatory text. In PNDA, it supports exploration and presentation of data from HDFS and HBase.

Connecting

Along with PNDA cluster provisioning, Jupyter is deployed with four kernel supports: Python2 kernel, Python3 kernel, PySpark (Python2) kernel and Spark (Scala) kernel. An example Jupyter notebook is also provided with details instructions on how to rapid prototype using Jupyter PySpark kernel. In particular the example noteobook gives an example usage of platform libraries to load HDFS data and perform interactive data analytics. A sample Jupyter notebook is also provided for executing scala code using Spark kernel that allow to connect with remote Spark cluster and execute code.

By default, Jupyter is installed on Jupyter node. In order to access Jupyter portal go to: http://[cluster-name]-cdh-jupyter:8000

Quick start

Log into Jupyter portal

You should see a jupyter login page as below.

Jupyter home

Use pnda/pnda to login, you will be redirected to a notebook list view. By default the list is empty.

Jupyter notebook view

Upload example notebook

Clone example-jupyter-notebooks repository. Click the upload button and select the example notebook (example-jupyter-notebooks/notebooks/Example Platform-library PySpark Notebook.ipynb) from your local repository.

Jupyter notebook view

Open example notebook

Click the link Example Platform-library PySpark Notebook.ipynb.

Jupyter example notebook