Stream Analytics with Materialize
Last updated
Last updated
Fluentd and Fluent bit are vendor-neutral open-source log and metric collectors that are used throughout enterprises today. The plugins already include analytic backends like Elasticsearch, Splunk, and Datadog today. One of the newest integrations with Fluentd and Fluent Bit is the new streaming database, Materialize. Built on the open-source project, Timely Dataflow, Users can use standard SQL on top of vast amounts of streaming data to build low-latency, continually refreshed views across multiple sources of incoming data. Together these projects allow you to easily collect your data wherever it may be and analyze it in real-time, or push it to downstream business visualization systems like Metabase and others.
Connect to data sources such as infrastructure, network, application, and system logs
Route important data for analysis, and remove unnecessary data
Use built-in reliability features to ensure data is collected and routed with error handling and persistent disk storage
Supports Postgres-formatted ANSI-standard SQL, allowing reuse of existing tools
Low-latency, incrementally updated views
Join data across multiple streams and datasets
If you’d like to start playing with this, here are step-by-step instructions to get setup in 15 minutes or less:
Install FluentBit or Fluentd and use any of the many input plugins to read from Syslog, Network data, Kubernetes, Docker containers, Application log files, and more
Leverage Fluent Bit or Fluentd’s ability to write to JSON with the following output configurations
Fluent Bit
Fluentd
[OUTPUT]
name file
path /var/log/fluent
format plain
match *
<match pattern>
@type file
path /var/log/fluent/myapp
</match>
Install Materialize and create a new JSON file source
Start using Materialize on top of all this real-time data