Difference between revisions of "Snowplow (software)"

From Wiki @ Karl Jones dot com
(See also)
Line 38:
* [[Amazon Redshift]] - a hosted data warehouse product, which is part of the larger cloud computing platform [[Amazon Web Services]].
* [[Amazon Web Services]]
- * [[Iglu repository]]
+ * [[Iglu repository]] - a store of data schemas for [[Snowplow (software)]], currently (August 2016) supporting [[JSON Schema|JSON Schemas]] only.
* [[JSON Schema]] - a [[JSON]]-based format to define the structure of JSON data for validation, documentation, and interaction control.
* [[Web application]]

Revision as of 10:53, 22 August 2016

Snowplow is a marketing and product analytics platform.

Description

According to the official website, Snowplow does three things:

  • Identifies website users, and tracks the way they engage with a website or web application;
  • Stores users' behavioral data in a scalable "event data warehouse" you control: in Amazon S3 and (optionally) Amazon Redshift or Postgres;
  • Lets you analyze that behavioral data with a wide range of tools, including big data tools (e.g. Hive, Pig, Mahout) via Amazon EMR and more traditional tools such as Tableau, R, Looker, and Chartio.

Core concepts

Snowplow is built around the following core concepts:

  • Events
  • Dictionaries and schemas
  • Contexts
  • Iglu
  • Stages in the Snowplow data pipeline
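
As a rough illustration of how events, schemas, and contexts relate, the sketch below builds an event as self-describing JSON: a payload paired with a reference to the schema that describes it, accompanied by a context that carries its own schema reference. The `iglu:vendor/name/format/version` reference form is assumed here, and the vendor `com.example`, the schema names, the field names, and the helper `self_describing` are all hypothetical.

```python
import json


def self_describing(schema_uri, data):
    """Pair a payload with the URI of the schema that describes it."""
    return {"schema": schema_uri, "data": data}


# Hypothetical event: a user clicked a link.
event = self_describing(
    "iglu:com.example/link_click/jsonschema/1-0-0",
    {"targetUrl": "https://example.com/pricing"},
)

# Hypothetical context: extra information about where the event occurred.
page_context = self_describing(
    "iglu:com.example/page/jsonschema/1-0-0",
    {"title": "Home", "section": "landing"},
)

# In this sketch an event travels with zero or more contexts attached.
tracked = {"event": event, "contexts": [page_context]}
print(json.dumps(tracked, indent=2))
```

Keeping the schema reference next to the data means a downstream stage can look up the schema and validate the payload without any out-of-band knowledge.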

Setting up Snowplow

Setting up Snowplow consists of the following steps:

  1. Set up a collector;
  2. Set up a tracker or webhook;
  3. Set up enrich;
  4. Set up alternative data stores.
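
The setup steps above mirror the path data takes through the pipeline. The toy model below shows that flow in a single process; it is only a sketch and does not use Snowplow's actual components or APIs. The class `ToyPipeline`, the function `track`, and the derived `host` field are hypothetical names chosen for illustration.

```python
from urllib.parse import urlparse


class ToyPipeline:
    """In-memory stand-in for collector, enrichment, and storage."""

    def __init__(self):
        self.raw = []     # payloads received by the collector
        self.stored = []  # enriched events loaded into the data store

    def collect(self, payload):
        # Step 1: the collector receives raw payloads.
        self.raw.append(payload)

    def enrich_and_store(self):
        # Step 3: enrichment checks each payload and adds derived fields.
        # Step 4: enriched events are loaded into a data store.
        for payload in self.raw:
            if "event" not in payload or "url" not in payload:
                continue  # this sketch simply drops malformed payloads
            host = urlparse(payload["url"]).hostname
            self.stored.append(dict(payload, host=host))
        self.raw.clear()


def track(pipeline, name, url):
    # Step 2: a tracker (or webhook) sends events to the collector.
    pipeline.collect({"event": name, "url": url})


pipeline = ToyPipeline()
track(pipeline, "page_view", "https://example.com/docs")
pipeline.enrich_and_store()
print(pipeline.stored)
```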

Iglu repository

An Iglu repository acts as a store of data schemas for Snowplow. As of August 2016, it supports JSON Schemas only.

Hosting JSON Schemas in an Iglu repository allows you to use those schemas in Iglu-capable systems such as Snowplow.
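
To make the idea of hosting and looking up schemas concrete, the following sketch resolves a schema reference against an in-memory stand-in for a repository. It assumes references of the form `iglu:vendor/name/format/version` and a `schemas/vendor/name/format/version` path layout; both should be checked against the Iglu documentation. The vendor `com.example`, the `link_click` schema, and the names `REPOSITORY`, `parse_iglu_uri`, `schema_path`, and `lookup` are hypothetical.

```python
def parse_iglu_uri(uri):
    """Split 'iglu:vendor/name/format/version' into its four parts."""
    prefix = "iglu:"
    if not uri.startswith(prefix):
        raise ValueError(f"not an Iglu URI: {uri!r}")
    parts = uri[len(prefix):].split("/")
    if len(parts) != 4 or not all(parts):
        raise ValueError(f"expected vendor/name/format/version in {uri!r}")
    vendor, name, fmt, version = parts
    return {"vendor": vendor, "name": name, "format": fmt, "version": version}


def schema_path(uri):
    """Relative path for the schema, under the assumed repository layout."""
    return "schemas/{vendor}/{name}/{format}/{version}".format(**parse_iglu_uri(uri))


# Hypothetical repository contents: one JSON Schema keyed by its path.
REPOSITORY = {
    "schemas/com.example/link_click/jsonschema/1-0-0": {
        "type": "object",
        "properties": {"targetUrl": {"type": "string"}},
        "required": ["targetUrl"],
    }
}


def lookup(uri):
    """Return the hosted schema for a reference, or None if it is not hosted."""
    return REPOSITORY.get(schema_path(uri))


print(schema_path("iglu:com.example/link_click/jsonschema/1-0-0"))
print(lookup("iglu:com.example/link_click/jsonschema/1-0-0"))
```

An Iglu-capable system would then validate incoming data against the schema it finds; validation itself is left out of this sketch.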

See also

External links