Repository is archived
No release in over 3 years
Low commit activity in last 3 years
Dumps records to Orc format file.
2005
2006
2007
2008
2009
2010
2011
2012
2013
2014
2015
2016
2017
2018
2019
2020
2021
2022
2023
2024
2025
2026
 Dependencies

Development

~> 1.0
>= 10.0
 Project Readme

Orc output plugin for Embulk

Warning

Embulk into the maintenance mode | Embulk

Embulk got into the maintenance mode. Therefore, this plugin will also end its maintenance.

Build Status Gem Version

Overview

  • Plugin type: output
  • Load all or nothing: no
  • Resume supported: no
  • Cleanup supported: yes

Configuration

  • path_prefix: A prefix of output path. (string, required)

    • support: file, s3, s3n and s3a.
  • file_ext: An extension of output file. (string, default: .orc)

  • sequence_format: (string, default: .%03d)

  • buffer_size: Set the ORC buffer size (integer, default: 262144(256KB) )

  • strip_size: Set the ORC strip size (integer, default: 67108864(64MB) )

  • block_size: Set the ORC block size (integer, default: 268435456(256MB))

  • compression_kind: description (string, default: 'ZLIB')

    • NONE, ZLIB, SNAPPY, LZO, LZ4
  • overwrite: Overwrite if output files already exist. (boolean, default: false)

    • Support: LocalFileSystem, S3(s3, s3a, s3n)
  • default_from_timezone Time zone of timestamp columns. This can be overwritten for each column using column_options (DateTimeZone, default: UTC)

  • auth_method: name of mechanism to authenticate requests (basic, env, instance, profile, properties, anonymous, or session. default: basic)
    see: https://github.com/embulk/embulk-input-s3#configuration

    • env, basic, profile, default, session, anonymous, properties

Example

out:
  type: orc
  path_prefix: "/tmp/output"
  compression_kind: ZLIB
  overwrite:   true

ChangeLog

ver 0.3.4

  • Bump orc library to 1.5.4
  • bugfix

ver 0.3.3

  • bugfix
  • Bump orc library to 1.4.4

ver 0.3.2

  • Update orc libraries to 1.4.3

ver 0.3.0

ver 0.2.0

  • support: output to s3

    • s3n, s3a protocol

ver 0.1.0

  • initial release

Build

$ ./gradlew gem  # -t to watch change of files and rebuild continuously

SonarQube

embulk-output-orc