This article explains how to use Fluentd to aggregate and transport Apache logs to a Minio server.
The following services are required to be set up correctly:
Also, if you have installed Fluentd through RubyGems (without td-agent
), please install the out_s3
plugin manually:
$ sudo fluent-gem install fluent-plugin-s3
In this example, we use the access log file as an input source, so save the following <source>
settings to /etc/td-agent/td-agent.conf
:
<source>@type tailformat apache2path /var/log/apache2/access.logpos_file /var/log/td-agent/apache.access.log.postag minio.apache.access</source>
NOTE: If you are using the standalone version of Fluentd, use /etc/fluent/fluent.conf
instead.
Before proceeding, please confirm that the access log file has proper file permission. If the log file is not readable by the td-agent
/fluentd
, the rest of this article will not work.
Now let's add settings for storing the incoming data in your Minio server. Since Minio is compatible with Amazon Simple Storage Service (S3), we can use the out_s3
plugin to connect to the server.
<match minio.apache.**>@type s3aws_key_id ACCESS_KEY # The access key for Minioaws_sec_key SECRET_KEY # The secret key for Minios3_bucket BUCKET_NAME # The bucket to store the log datas3_endpoint ENDPOINT # The endpoint URL (like "http://localhost:9000/")s3_region us-east-1 # See the region settings of your Minio serverpath logs/ # This prefix is added to each fileforce_path_style true # This prevents AWS SDK from breaking endpoint URLtime_slice_format %Y%m%d%H%M # This timestamp is added to each file name​<buffer time>@type filepath /var/log/td-agent/s3timekey 60m # Flush the accumulated chunks every hourtimekey_wait 1m # Wait for 60 seconds before flushingtimekey_use_utc true # Use this option if you prefer UTC timestampschunk_limit_size 256m # The maximum size of each chunk</buffer></match>
After adding the settings to the conf file, please restart the Fluentd daemon.
Use curl
to generate some log data for testing:
$ curl http://localhost/
Or you can use the Apache Bench for the bulk request generation:
$ ab -n 100 -c 10 http://localhost/
Wait until the data gets flushed from the buffer (you can adjust the flush interval using the timekey
and timekey_wait
options above). Then you will see the aggregated log data on Minio:
​Fluentd Architecture​
​Amazon S3 Output plugin​
If this article is incorrect or outdated, or omits critical information, please let us know. Fluentd is an open-source project under Cloud Native Computing Foundation (CNCF). All components are available under the Apache 2 License.