No release in over 3 years
Low commit activity in last 3 years
Extends the fluent-plugin-s3 compression algorithm to enable red-arrow compression.
2005
2006
2007
2008
2009
2010
2011
2012
2013
2014
2015
2016
2017
2018
2019
2020
2021
2022
2023
2024
 Dependencies

Development

Runtime

>= 1.20.0
>= 0.14.10, < 2
>= 2.0
 Project Readme

fluent-plugin-s3-arrow

Gem Version

Extends the fluent-plugin-s3 compression algorithm to enable red-arrow compression.

Installation

Requirements

RubyGems

$ gem install fluent-plugin-s3-arrow

Bundler

Add following line to your Gemfile:

gem "fluent-plugin-s3-arrow"

And then execute:

$ bundle

Configuration

Example of fluent-plugin-s3-arrow configuration.

<match pattern>
  @type s3

  # fluent-plugin-s3 configurations ...

  <format>
    @type json # This plugin currently supports only json formatter.
  </format>

  store_as arrow
  <arrow>
    format parquet
    compression gzip

    schema_from static
    <static>
      schema [
        {"name": "test_string", "type": "string"},
        {"name": "test_uint64", "type": "uint64"}
      ]
    </static>
  </arrow>
</match>

format and compression

This plugin supports multiple columnar formats and compressions by using red-arrow. Valid settings are below.

format compression
arrow gzip, zstd
feather zstd
parquet gzip, snappy, zstd

schema

Schema of columnar formats.

schema_from static

Set the schema statically.

schema_from static
<static>
  schema [
    {"name": "test_string", "type": "string"},
    {"name": "test_uint64", "type": "uint64"}
  ]
</static>
schema (required)

An array containing the names and types of the fields.

schema_from glue

Retrieve the schema from the AWS Glue Data Catalog.

schema_from glue
<glue>
  catalog test_catalog
  database test_db
  table test_table
</glue>
catalog

The name of the data catalog for which to retrieve the definition. The default value is the same as the AWS API CatalogId.

database

The name of the database for which to retrieve the definition. The default value is default.

table (required)

The name of the table for which to retrieve the definition.

License

Apache License, Version 2.0