Example Applications

This repository contains a number of example applications that can be built and run on PNDA. Each application directory contains more detailed information.

Spark Streaming

Examples of consuming data from Kafka and populating both HBase and OpenTSDB with simple Scala based Spark Streaming applications.

Spark

Example of consuming data ingested by Gobblin on a batch basis and producing Parquet datasets, optimized for consumption by Impala.

Write to parquet format (scala)
Write to parquet format (python)

Jupyter

Example of a notebook for manipulating network data.

H2O

Application that runs the H2O data science platform as an application on PNDA.

Flink Streaming

Count Words (scala) Count the words from Socket.
Count Words (python) Count the words from input file.
Flink Windows (java) host-network-data-usage illustrating Flink windows, triggers and event processing.
Count Hashtags (java) specific word count from input file illustrating metrics, counters and accumulators.

Compound Packages

An example of a package containing multiple application component types, in this case a Spark app and related Jupyter notebook.

Name		Name	Last commit message	Last commit date
Latest commit History 168 Commits
flink-batch-java-hashtagcount-metrics		flink-batch-java-hashtagcount-metrics
flink-streaming-host-network-data-usage		flink-streaming-host-network-data-usage
flink-streaming-word-count		flink-streaming-word-count
flink-wordcount-python-app		flink-wordcount-python-app
jupyter-notebooks		jupyter-notebooks
kafka-spark-opentsdb		kafka-spark-opentsdb
literary-word-count-app		literary-word-count-app
spark-batch-python		spark-batch-python
spark-batch		spark-batch
spark-streaming-python		spark-streaming-python
spark-streaming		spark-streaming
spark2-streaming-python		spark2-streaming-python
traffic-loss-analysis-app		traffic-loss-analysis-app
.gitignore		.gitignore
CHANGELOG.md		CHANGELOG.md
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Example Applications

Spark Streaming

Spark

Jupyter

H2O

Flink Streaming

Compound Packages

About

Releases

Packages

Contributors 14

Languages

License

pndaproject/example-applications

Folders and files

Latest commit

History

Repository files navigation

Example Applications

Spark Streaming

Spark

Jupyter

H2O

Flink Streaming

Compound Packages

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 14

Languages

Packages