Strip Html Tags filter plugin for Embulk

This plugin strips HTML tags from values of specified columns.

Overview

  • Plugin type: filter

Configuration

  • columns: names of the columns to strip HTML tags from (array, default: [])

Example

This configuration strips tags from the columns foo and bar and leaves the other columns untouched.

in:
  type: file
  path_prefix: ./test.csv
  parser:
    type: csv
    charset: UTF-8
    delimiter: ","
    columns:
      - {name: foo, type: string}
      - {name: bar, type: string}
      - {name: baz, type: string}

filters:
  - type: strip_html_tags
    columns:
      - foo
      - bar

out:
  type: stdout

It converts a CSV record like this:

<a>foo</a>,<div>bar</div>,<p>baz</p>

into:

foo,bar,<p>baz</p>
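
The per-column behavior shown above can be sketched in a few lines of Python. This is only an illustration, not the plugin's actual implementation: the names `strip_tags` and `strip_columns` are hypothetical, and the use of Python's standard `html.parser` (which also decodes character references such as `&amp;`) is an assumption that may differ from what the plugin does.

```python
from html.parser import HTMLParser


class _TextExtractor(HTMLParser):
    """Collects text nodes and discards tags (hypothetical helper)."""

    def __init__(self):
        super().__init__()
        self.parts = []

    def handle_data(self, data):
        # Called for text between tags; tags themselves are ignored.
        self.parts.append(data)


def strip_tags(value):
    """Return value with HTML tags removed (illustrative only)."""
    parser = _TextExtractor()
    parser.feed(value)
    parser.close()  # flush any buffered trailing text
    return "".join(parser.parts)


def strip_columns(record, columns):
    """Strip tags only in the named columns; leave the others untouched."""
    return {
        name: strip_tags(value) if name in columns and isinstance(value, str) else value
        for name, value in record.items()
    }


record = {"foo": "<a>foo</a>", "bar": "<div>bar</div>", "baz": "<p>baz</p>"}
print(strip_columns(record, ["foo", "bar"]))
```

As in the example above, only the listed columns are rewritten; the value of baz keeps its markup because it is not named in columns.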

Build

$ ./gradlew gem