A Simple Zeppelin Tutorial
This topic walks you through using a very simple Zeppelin notebook, to help you learn about using Zeppelin with Splice Machine.
Our Getting Started with Zeppelin page provides a very brief overview of using Zeppelin; If you’re new to Zeppelin, we strongly encourage you to visit the Zeppelin documentation site to learn about creating, modifying, and running your own Zeppelin notebooks.
Running the Tutorial Notebook
You can access this Zeppelin notebook by clicking the Basics (Spark) link under Zeppelin Tutorials on the Zeppelin Dashboard page:
Once you’ve opened the tutorial, you can run each step (each Zeppelin paragraph) by clicking the Ready button that you’ll see on the right side of each paragraph. This example includes these steps:
Click the first READY button to create the schema and a table:
Import data (in this case, TPCH1 benchmark data) into the table, then verify the data load by counting the number of records in the table:
Create indexes on the table, and then run compaction on the data, which is always a good idea after updating a large number of records:
Collect statistics, to improve query planning, and then run a query:
After the query runs, you can take advantage of Zeppelin’s built-in visualization tools to display the query results in various graphical and tabular formats.
When you click the READY button, Zeppelin runs the paragraph that loads your data and subsequently displays the Finished message.
If you see Error instead of Finished, it usually means that you’ve forgotten to set SpliceMachine interpreter as the default.
Apply Different Visualizations to Your Results
Zeppelin provides a wealth of data visualization tools you can use. In
the example below, we have modified the presentation of query results to
use different visualizations by clicking different visualization icons
in the output pane. You can define and modify the values of variables
that you use in your queries; for example, the
values in the examples below: