Talend Exchange is the place where Talend community can share items related to Talend opensource products, such as Data Integration, Data Quality and Data Master Management. Contribution is open to any user, no specific validation is needed. As soon as you have your forum account, you automatically get a Talend Exchange account.


Version Author Released on Rating Downloads
Hadoop Configuration

Cloudera CDH5.0.X

1.0 rdubois 2014-04-17
10

Custom hadoop configuration which allows to connect to a Cloudera CDH5.0.X cluster.

Component

test

5.2.1 test001 2014-04-17
2

Component

tFileExcelWorkbookSave

5.1 jlolling 2014-04-16
638

This component is designed to work with tFileExcelWorkbookOpen and tFileExcelSheetOutput.
This component writes a workbook into a output file.
This happens after finish writing in several sheets by tFileExcelSheetOutput.
Can delete sheets (usually used as templates before)
Option to (re)evaluate all formulas added
Translation in German and French added
*NEW*: The output file name can be set without file extension. The correct extension will be added (or even fixed) automatically. The final file name can be retrieved from the return value FILENAME.

Component

tFileExcelSheetInput

5.1 jlolling 2014-04-11
1100

This component works in conjunction with tFileExcelWorkbookOpen.
This component reads a sheet identified by name or index.
Unlike the tFileInputExcel component you can specify which columns you want to read and leaf out what is not needed.
The fields can comfortably configured by excel column name or index.
It will always use the current Apache POI 3.10 libs (also for the older xls format).
The sheet name can be retrieved from the output of the new component tFileExcelSheetList.
Can read comment (alternatively to the value) and
Can use none empty value of previous row to fill empty values in current row.
Can ignore cell format errors if wanted.
*NEW*: Number format can changed for different languages/countries in advanced options.
*NEW*: Column positions can automatically be adjusted by header line
*NEW*: Can read hyperlinks (title and/or URL) configurable in the advanced settings
*NEW*: Can limit the number of rows read.
*NEW*: Can stop reading if a row is empty
*NEW*: Column position in the header row can be found with regularly expressions

For using this component, you have to update tFileExcelWorkbookOpen at first!

For all which have problems installing this component. Installing problems has in 99,9 % of the cases nothing to do with the component. This component is tested with all releases. In case of trouble installing this component, please contact me directly instead of rating the component bad for problems not caused by the component.
email: jan.lolling@gmail.com

Component

tFileExcelSheetOutput

5.1 jlolling 2014-04-11
904

This component is designed to work with the components tFileExcelWorkbookOpen and tFileExcelWorkbookSave. This component use the Apache POI library version 3.10.
The goal of this component component is to write or create a sheet with a minimum impact to the structure of the spreadsheet (it will as best it can keep references, macros, graphics and so on).
Many reports are especially designed Excel documents with lots of formulas and references and the automated report creation process have to write into a well described area and keep the rest unchanged.

This component can also write formulas (define a String column and write the formula in the English language starting with a =. To follow the row index in the formula you can write {row} into your formula and this will be replaced with the current row index.
Example for a formula string: =A{row}+G{row}

You can define for every column the target column in Excel with the Excel column name, set the target column auto sized and define the format for Date and Number columns.

There is also an option to write datasets into columns (every new row creates an new column (like if you rotate you data by 90 degree).
This component can be used in sub jobs in iterations to create or write into many sheets.

*NEW*: Sheets can be created as clone of an other sheet.
*NEW*: Freeze option added
*NEW*: Option to allow overwrite existing cells with null (default = false)

*NEW*: Can reuse from the template file the styles from first or first two row(s) for all data rows
To use this feature simply define in a file styles in a template sheet, read this file in the tFileExcelWorkbookOpen and use this as template.
*NEW*: Can remove all surplus rows

*NEW*: Can reuse conditional formats
*NEW*: improves the handling of styles and avoids overwriting styles related also to other columns
*NEW*: Can reuse the row height from the first data row for all new rows.

Please take a look into the new usage guid linked in this component detail page!

Please update always the component tFileExcelWorkbookOpen to get the current necessary library.

Component

tFileExcelWorkbookOpen

5.1 jlolling 2014-04-11
1728

This component is the base component for all other tFileExcel-components.
This component use the Apache POI library version 3.10.
This components reads a spreadsheet in a workbook or create a new empty workbook in memory.
This workbook can be filled (with sheets) by the component tFileExcelSheetOutput or can be read by tFileExcelSheetList/Input components.
Finally the component tFileExcelWorkbooksave persists the workbook in the same file as read or in a new file.
The component recognize the file format by the filename extension, it is not necessary to configure it (except if you create a new workbook).
*NEW*: can read and write xlsm files.
Bug fixed: reading rows stops if row is empty or does not exists
*NEW*: Memory saving mode for XSLX type
*NEW*: Can read password protected files
*NEW*: support for IFERROR function added
*NEW*: Support for reading hyperlinks added to the library

Component

tNotesRunAgent

5.4.3 tuanport 2014-04-10
1

Component

tNotesOutput

4.5.3 tuanport 2014-04-10
0

Component

tNotesInput

5.4.3 tuanport 2014-04-10
0

Component

tUniservRTIdentitySearch

2.3.0 dqsh_uniserv 2014-04-08
82

tUniservRTIdentitySearch allows Talend jobs to search in a DQ identity RT search index for customer data records taking into account misspellings, phonetic similarities, synonyms or missing data.

To be able to use tUniservRTIdentitySearch, the search index has to be built using the tUniservRTIdentityBulk component.

To be able to use the tUniservRTIdentitySearch component, the Uniserv DQ identity RT software must be installed.

For more information on DQ identity RT, visit http://www.uniserv.com/en/products/uniserv-data-quality-functions/identity-resolution.

Average Length With Null For SQL Server Storing Large


  • Author: yyi
  • Categories: Indicator
  • First revision date: 2012-06-12
  • Latest revision date: 2012-06-12
  • Compatible with: Data Quality releases 5.0.0, 5.0.0M2, 5.0.0M3, 5.0.0M4, 5.0.0M5, 5.0.0RC1, 5.0.0RC2, 5.0.0RC3, 5.0.1, 5.0.2, 5.1.0, 5.1.0M2, 5.1.0RC1, 5.1.1
  • Downloads: 20

About: Computes the average length of the field, counting null data as zero length values, the text filed type should be ntext or text in Microsoft SQL Server.

Revision list

expand/collapse all

Revision 1.0 20 Downloads, Released on 2012-06-12
Download revision 1.0

Compatible with: 5.1.1, 5.1.0, 5.1.0RC1, 5.1.0M2, 5.0.2, 5.0.1, 5.0.0, 5.0.0RC3, 5.0.0RC2, 5.0.0RC1, 5.0.0M5, 5.0.0M4, 5.0.0M3, 5.0.0M2

Related to http://jira.talendforge.org/browse/TDQ-5030

Reviews (0)

Be the first to review this extension!

 

Submit review
Name:*
Email:*
Title:*
Please select your rating*
Review:*


Version Author Released on Rating Downloads
Export

Product Demo

3.0 ctoum 2012-05-31
559

Product & families, with Cafepress pictures.

Data-Model

Clinical Trials: Janus Model Basics

1.0 jaymce 2010-11-22
376

This is a model of the basic of the Janus Clinical Data Repository.
http://www.fda.gov/ForIndustry/DataStandards/StudyDataStandards/ucm155327.htm

Data-Model

D* Demo Model

1.0 ctoum 2010-08-13
705

Model used in the D* Demo.

Export

Talendshop Demo

1.0 ctoum 2010-08-04
1109

Talendshop Demo (Demo Project)


55 ms