RudderStack - The smart customer data pipeline

The smart customer data pipeline


What is RudderStack?

RudderStack is a customer data pipeline tool for collecting, routing and processing data from your websites, apps, cloud tools, and data warehouse.

With RudderStack, you can build customer data pipelines that connect your whole customer data stack, and then make them smarter by triggering enrichment and activation in customer tools based on analysis in your data warehouse. Its easy-to-use SDKs and event source integrations, Cloud Extract integrations, transformations, and expansive library of destination and warehouse integrations make it simple to build customer data pipelines for both event streaming and cloud-to-warehouse ELT.

Questions? Please join our Slack channel for support on the product.

Try RudderStack Cloud Free - a free tier of RudderStack Cloud. Click here to start building a smarter customer data pipeline today, with RudderStack Cloud.

Why Use RudderStack?

  • Developer-focused: RudderStack is built API-first, so it integrates seamlessly with the tools that developers already use and love. Its backend is written in Go, with a rich UI written in React.js.

  • Warehouse-first: RudderStack treats your data warehouse as a first-class citizen among destinations, with advanced features and configurable, near real-time sync.

  • Production-ready: Companies like Mattermost, IFTTT, Torpedo, Grofers, 1mg, Nana, OnceHub, and dozens of other large companies use RudderStack to collect their events. Note: If you're using RudderStack and would like to add your name to this list, please submit a PR.

  • Extreme Scale: One of our largest installations currently sends 300 million events/day, with a peak of 40K requests/sec, via a two-node RudderStack setup.

  • High Availability: RudderStack comes with at least 99.99% uptime. We have built a sophisticated error handling and retry system that ensures your data is delivered even in the event of network partitions or destination downtime.

  • Privacy and Security: You can collect and store your customer data without sending everything to a third-party vendor. With RudderStack, you get fine-grained control over what data to forward to which analytical tool.

  • Seamless Integration: RudderStack currently supports integration with over 70 popular destination platforms such as Google Analytics, Amplitude, Hotjar, Mixpanel, Amazon S3, and more. It also supports data warehouse integrations for Snowflake, Redshift, ClickHouse, PostgreSQL, and Google BigQuery.

  • Client-side SDK Support: RudderStack offers client-side SDKs for JavaScript, Android, iOS, Unity, and more.

  • User-specified Transformation: RudderStack gives you the power to filter or transform your events before sending them to the desired destinations.
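
As a rough illustration of the user-specified transformations mentioned above, the sketch below drops one kind of event and masks a property on the rest before anything is forwarded. It is modeled on a `transformEvent(event, metadata)` entry point that returns the modified event, or nothing to drop it; treat that signature as an assumption and confirm it against the transformation documentation. The event names and the `email` property are hypothetical.

```javascript
// Sketch of a transformation: drop noisy events and mask an email property.
// Event names and property names below are hypothetical.
function transformEvent(event, metadata) {
  // Filter: returning nothing is assumed to drop the event.
  if (event.event === "Heartbeat") {
    return;
  }

  // Transform: copy the event so the input object is not mutated.
  const properties = { ...(event.properties || {}) };
  if (typeof properties.email === "string") {
    const [, domain = ""] = properties.email.split("@");
    properties.email = `***@${domain}`;
  }
  return { ...event, properties };
}

// Local usage example with made-up events.
const events = [
  { type: "track", event: "Heartbeat", properties: {} },
  { type: "track", event: "Order Completed", properties: { email: "jane@example.com", total: 42 } },
];
const forwarded = events
  .map((e) => transformEvent(e, {}))
  .filter((e) => e !== undefined);
console.log(JSON.stringify(forwarded, null, 2));
```

Returning a copy rather than mutating the input keeps the function easy to test locally before it is attached to a destination.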

Key Features

  • Segment API Compatible: RudderStack is compatible with the Segment API, so you don't need to change your app if you are using Segment. Just integrate the RudderStack SDKs into your app and your events will keep flowing as before (including to your data warehouse).

  • Processing Flexibility: With RudderStack's powerful JavaScript-based event transformation framework, you can enhance or transform your event data by combining it with your other internal data. Furthermore, since RudderStack runs inside your cloud or on-premise environment, you can easily access your production data to join with the event data.

  • Unlimited Events: The event volume-based pricing of most commercial systems is broken. With RudderStack, you can collect as much data as possible without worrying about overrunning event budgets.

  • Stand-alone System: RudderStack runs as a single Go binary whose only dependencies are a PostgreSQL server and a Node.js service.

  • Platform-independent: RudderStack is Kubernetes-native and can run on any Kubernetes cluster with our Helm charts. RudderStack is cloud-agnostic and can run on stand-alone machines in all popular cloud platforms, namely AWS, Microsoft Azure, GCP, and others.

  • High Performance: On a single m4.2xlarge AWS EC2 instance, RudderStack can process 3000 events/second.

  • Enhanced Telemetry: To help us improve RudderStack, we collect performance and diagnostic metrics about how you use RudderStack, and how it is working. No customer data is present in the metrics. For technical details, please check out our wiki page on Telemetry.

Our Customers

1mg, Grofers, IFTTT, Mattermost, Wynn Casino, Acorns, Hinge, Proposify, Barstool Sports

Get Started

The easiest way to experience RudderStack is to sign up for RudderStack Cloud Free - a completely free tier of RudderStack Cloud. Click here to get started.

You can also set up and use RudderStack on your platform of choice.

Note: If you are planning to use RudderStack in production, we strongly recommend the Kubernetes Helm charts. We update our Docker images with bug fixes and other changes much more frequently than our GitHub repo (where we release once a month).

Once you have installed RudderStack, send test events to verify the setup.
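
One way to send a test event is a direct HTTP request to the data plane. The sketch below only builds such a request. It assumes a Segment-style `track` payload posted to a `/v1/track` endpoint, with the source write key as the Basic Auth username and an empty password. The data plane URL and write key are placeholders, and `buildTrackRequest` is a hypothetical helper; check the endpoint and auth details against the HTTP API documentation before relying on them.

```javascript
// Build (but do not send) a test `track` request for a RudderStack data plane.
// DATA_PLANE_URL and WRITE_KEY are placeholders; the endpoint path and auth
// scheme are assumptions to verify against the HTTP API docs.
const DATA_PLANE_URL = "http://localhost:8080";
const WRITE_KEY = "<your-source-write-key>";

function buildTrackRequest(dataPlaneUrl, writeKey, payload) {
  // Basic Auth with the write key as username and an empty password.
  const token = Buffer.from(`${writeKey}:`).toString("base64");
  return {
    // Strip trailing slashes so the path is joined cleanly.
    url: `${dataPlaneUrl.replace(/\/+$/, "")}/v1/track`,
    options: {
      method: "POST",
      headers: {
        "Content-Type": "application/json",
        Authorization: `Basic ${token}`,
      },
      body: JSON.stringify(payload),
    },
  };
}

const request = buildTrackRequest(DATA_PLANE_URL, WRITE_KEY, {
  userId: "test-user-1",
  event: "Test Event",
  properties: { source: "readme-sketch" },
  timestamp: new Date().toISOString(),
});
console.log(request.url);

// To actually send it (Node.js 18+ ships a global fetch):
// fetch(request.url, request.options).then((res) => console.log(res.status));
```

If the request is accepted, the event should then show up for the corresponding source, which is a quick way to confirm the setup end to end.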

UI Pages

Connections Page

Events Page

Connecting a Source to a Destination

Sources Destinations

RudderStack Architecture

RudderStack has two major components, namely:

  • Control Plane: The control plane allows you to manage the configuration of your sources and destinations. There are two options for setting up the connections:

    • Managed Control Plane: This is the easiest way to manage your source and destination configurations.
    • RudderStack Config Generator: RudderStack also allows you to manage your source and destination configurations without having to sign up for and use our hosted services. Please note that you cannot create transformations or use the Live Debugger with the RudderStack Config Generator.
  • Data Plane: This is the core engine that is responsible for:

    • Receiving and buffering the event data
    • Transforming the event data into the required destination format, and
    • Relaying it to the destination
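
The three data plane responsibilities listed above can be pictured with a toy in-memory pipeline. This is only a conceptual sketch, not RudderStack's implementation; the `DataPlane` class, the destination names, and the formatter functions are all hypothetical.

```javascript
// Toy model of receive/buffer -> transform -> relay. All names are hypothetical.
class DataPlane {
  constructor(destinations) {
    this.buffer = [];                 // events waiting to be processed
    this.destinations = destinations; // { name: { format, send } }
  }

  // 1. Receive and buffer the event data.
  receive(event) {
    this.buffer.push(event);
  }

  // 2 and 3. Transform each buffered event per destination, then relay it.
  flush() {
    const pending = this.buffer;
    this.buffer = [];
    for (const event of pending) {
      for (const dest of Object.values(this.destinations)) {
        dest.send(dest.format(event));
      }
    }
    return pending.length; // number of events processed in this flush
  }
}

// Two made-up destinations that expect different payload shapes.
const delivered = { analytics: [], warehouse: [] };
const plane = new DataPlane({
  analytics: {
    format: (e) => ({ name: e.event, user: e.userId }),
    send: (payload) => delivered.analytics.push(payload),
  },
  warehouse: {
    format: (e) => [e.userId, e.event, JSON.stringify(e.properties || {})],
    send: (row) => delivered.warehouse.push(row),
  },
});

plane.receive({ userId: "u1", event: "Signup", properties: { plan: "free" } });
plane.receive({ userId: "u2", event: "Login" });
console.log(plane.flush(), delivered);
```

The point of the sketch is the separation of concerns: the same buffered event is reshaped independently for each destination, so adding a destination does not change how events are received.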

For a detailed understanding of the RudderStack architecture, please check our documentation.

A high-level view of RudderStack's architecture is shown below:

Architecture

License

RudderStack server is released under the AGPLv3 License.

Contribute

We would love to see you contribute to RudderStack. Get more information on how to contribute here.

Wiki

For more information on RudderStack's features and functionalities, make sure you check out our Wiki page.
