Versions | v1.0 (td-agent3)

Store Apache Logs into Minio

This article explains how to use Fluentd to aggregate and transport Apache logs to a Minio server.

Table of Contents

Prerequisites

This article requires the following services are set up correctly:

Also, if you have installed Fluentd through ruby gems (without td-agent), please install the out_s3 plugin manually:

$ sudo fluent-gem install fluent-plugin-s3

Configuration

Input settings

In this example, we use the access log file as an input source, so save the following <source> settings to /etc/td-agent/td-agent.conf:

<source>
  @type tail
  format apache2
  path /var/log/apache2/access.log
  pos_file /var/log/td-agent/apache.access.log.pos
  tag minio.apache.access
</source>

(If you are using the stand-alone version of Fluentd, use /etc/fluent/fluent.conf instead)

Before proceeding, please confirm that the access log file has a proper file permission. If the log file is not readable by the td-agent/fluentd, the rest of this article will not work.

Output settings

Now let’s add settings for storing the incoming data in your Minio server. Since Minio is compatible with Amazon Simple Storage Service (S3), we can use the out_s3 plugin to connect to the server.

<match minio.apache.**>
  @type s3
  aws_key_id ACCESS_KEY   # The access key for Minio
  aws_sec_key SECRET_KEY  # The secret key for Minio
  s3_bucket BUCKET_NAME   # The bucket to store the log data
  s3_endpoint ENDPOINT    # The endpoint URL (like "http://localhost:9000/")
  s3_region us-east-1     # See the region settings of your Minio server
  path logs/              # This prefix is added to each file
  force_path_style true   # This prevents AWS SDK from breaking endpoint URL
  time_slice_format %Y%m%d%H%M  # This timestamp is added to each file name

  <buffer time>
    @type file
    path /var/log/td-agent/s3
    timekey 60m            # Flush the accumulated chunks every hour
    timekey_wait 1m        # Wait for 60 seconds before flushing
    timekey_use_utc true   # Use this option if you prefer UTC timestamps
    chunk_limit_size 256m  # The maximum size of each chunk
  </buffer>
</match>

After adding the settings above to the conf file, please restart the Fluentd daemon.

Test the settings

Use curl to generate some log data for testing:

$ curl http://localhost/

Or you can use the Apache benchmarking tool for bulk generation:

$ ab -n 100 -c 10 http://localhost/

Wait until the data gets flushed from the buffer (you can adjust the flush interval using the timekey and timekey_wait options above). Then you will see the aggregated log data on Minio:

Learn More

Last updated: 2018-07-16 08:42:38 +0000

Versions | v1.0 (td-agent3)

If this article is incorrect or outdated, or omits critical information, please let us know. Fluentd is a open source project under Cloud Native Computing Foundation (CNCF). All components are available under the Apache 2 License.