A Pentaho Kettle Plugin to clean your input stream off special character
- Plugin version: 1.1.0
- Author: Rishu Shrivastava
- Email: [email protected]
Special Character Remover is a Pentaho DI Step Plugin which aims at cleaning your input data off the Special Characters or junk values. Use pre-defined algorithms in the Step to remove special characters. Users can also use his/her custom regular expression codes to work with the input stream.
A detailed explaination has been provided in the wordpress blog post as below:
As compared to the version 1.0.0
of this plugin, please find below the list of features added to the recent release:
- Pre-defined algorithms to the plugin, which aims at minimal coding effort in cleaning up the data.
- Custom codes are also added so that the user has the flexibility to write his/her code using regex.
Follow the link here for more.
- Download the Zip file from the
Marketplace
Github folder:
- Simply copy the contents of the folder into :
/pentaho/design-tools/data-integration/plugins/steps/SpecialCharacterRemover
path of you Pentaho Installation.
-
Start/ReStart your Pentaho DI
-
Open a new Transformation
-
Open "Experimental" tab. You will find a step named "Special Character Remover".
Its done. Drag and Drop the Step and enjoy :)