Talend Exchange is the place where the Talend community can share items related to Talend open source products, such as Data Integration, Data Quality and Master Data Management. Contribution is open to any user; no specific validation is needed. As soon as you have a forum account, you automatically get a Talend Exchange account.


Version Author Released on Rating Downloads
Component

tFileExcelSheetInput

5.1 jlolling 2014-04-11
1083

This component works in conjunction with tFileExcelWorkbookOpen.
This component reads a sheet identified by name or index.
Unlike the tFileInputExcel component, you can specify which columns you want to read and leave out what is not needed.
The fields can be conveniently configured by Excel column name or index.
It will always use the current Apache POI 3.10 libs (also for the older xls format).
The sheet name can be retrieved from the output of the new component tFileExcelSheetList.
Can read the cell comment (as an alternative to the value).
Can use the non-empty value of the previous row to fill empty values in the current row.
Can ignore cell format errors if wanted.
*NEW*: Number format can be changed for different languages/countries in the advanced options.
*NEW*: Column positions can automatically be adjusted by header line
*NEW*: Can read hyperlinks (title and/or URL) configurable in the advanced settings
*NEW*: Can limit the number of rows read.
*NEW*: Can stop reading if a row is empty
*NEW*: Column position in the header row can be found with regular expressions
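
The "fill empty values from the previous row" option in the list above can be pictured with a small sketch. This is not the component's code; it assumes that "empty" means None or a blank string and that filled values carry over across several consecutive empty rows. The function names are hypothetical.

```python
from typing import Any, List, Sequence


def is_empty(value: Any) -> bool:
    """Treat None and blank strings as empty cells (an assumption)."""
    return value is None or (isinstance(value, str) and value.strip() == "")


def fill_from_previous_row(rows: Sequence[Sequence[Any]]) -> List[List[Any]]:
    """Return a copy of rows where empty cells take the value from the row above."""
    filled: List[List[Any]] = []
    for row in rows:
        current = list(row)
        if filled:
            previous = filled[-1]
            for i, value in enumerate(current):
                # Only fill when the row above actually has a cell at this position.
                if is_empty(value) and i < len(previous):
                    current[i] = previous[i]
        filled.append(current)
    return filled


rows = [["Berlin", "Anna"], ["", "Ben"], [None, "Cara"]]
print(fill_from_previous_row(rows))
```

Because each row is compared against the already filled row above it, a value keeps propagating downward until a non-empty cell replaces it.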

To use this component, you must first update tFileExcelWorkbookOpen!

For everyone who has problems installing this component: in 99.9% of cases, installation problems have nothing to do with the component. This component is tested with all releases. If you have trouble installing it, please contact me directly instead of giving the component a bad rating for problems it did not cause.
email: jan.lolling@gmail.com

Component

tFileExcelSheetOutput

5.1 jlolling 2014-04-11
889

This component is designed to work with the components tFileExcelWorkbookOpen and tFileExcelWorkbookSave. This component uses the Apache POI library version 3.10.
The goal of this component is to write or create a sheet with minimal impact on the structure of the spreadsheet (it keeps references, macros, graphics and so on as well as it can).
Many reports are specially designed Excel documents with lots of formulas and references, and the automated report creation process has to write into a well-defined area and keep the rest unchanged.

This component can also write formulas: define a String column and write the formula in English, starting with =. To reference the current row in the formula, write {row} into your formula; it will be replaced with the current row index.
Example for a formula string: =A{row}+G{row}
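
As an illustration of the {row} placeholder, the substitution can be sketched as a plain string replacement. This is only a sketch and not the component's implementation; the function name is hypothetical, and it assumes the row index is the 1-based row number that Excel displays.

```python
def render_formula(template: str, row_index: int) -> str:
    """Replace every {row} placeholder with the given row index."""
    # Assumption: row_index is the 1-based row number shown in Excel.
    return template.replace("{row}", str(row_index))


# The example formula string from the description, rendered for rows 2 to 4.
for row in range(2, 5):
    print(render_formula("=A{row}+G{row}", row))
```

Each output row gets its own copy of the formula, so the references stay on the row being written.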

You can define for every column the target column in Excel with the Excel column name, set the target column auto sized and define the format for Date and Number columns.
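
For readers unfamiliar with how Excel column names relate to column positions, the mapping can be sketched as a base-26 conversion. The function name is hypothetical and the zero-based result is an assumption made for this sketch; the component's own indexing is not documented here.

```python
def column_name_to_index(name: str) -> int:
    """Convert an Excel column name such as 'A' or 'AB' to a zero-based index."""
    letters = name.strip().upper()
    if not letters or not all("A" <= ch <= "Z" for ch in letters):
        raise ValueError(f"not a valid Excel column name: {name!r}")
    index = 0
    for ch in letters:
        # Column names are a base-26 number without a zero digit: A=1 ... Z=26.
        index = index * 26 + (ord(ch) - ord("A") + 1)
    return index - 1


for name in ("A", "G", "Z", "AA"):
    print(name, column_name_to_index(name))
```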

There is also an option to write datasets into columns: every new row creates a new column, as if you rotated your data by 90 degrees.
This component can be used in sub jobs in iterations to create or write into many sheets.

*NEW*: Sheets can be created as a clone of another sheet.
*NEW*: Freeze option added
*NEW*: Option to allow overwriting existing cells with null (default = false)

*NEW*: Can reuse the styles from the first row or the first two rows of the template file for all data rows.
To use this feature, simply define the styles in a template sheet in a file, read this file with tFileExcelWorkbookOpen and use it as the template.
*NEW*: Can remove all surplus rows

*NEW*: Can reuse conditional formats
*NEW*: improves the handling of styles and avoids overwriting styles related also to other columns
*NEW*: Can reuse the row height from the first data row for all new rows.

Please take a look at the new usage guide linked on this component's detail page!

Please always update the tFileExcelWorkbookOpen component to get the currently required library.

Component

tFileExcelWorkbookOpen

5.1 jlolling 2014-04-11
1698

This component is the base component for all other tFileExcel-components.
This component uses the Apache POI library version 3.10.
This component reads a spreadsheet file into a workbook or creates a new empty workbook in memory.
This workbook can be filled (with sheets) by the component tFileExcelSheetOutput or can be read by tFileExcelSheetList/Input components.
Finally, the component tFileExcelWorkbookSave persists the workbook in the same file as read or in a new file.
The component recognizes the file format by the filename extension; it is not necessary to configure it (except if you create a new workbook).
*NEW*: can read and write xlsm files.
Bug fixed: reading rows stops if a row is empty or does not exist
*NEW*: Memory saving mode for XLSX type
*NEW*: Can read password protected files
*NEW*: support for IFERROR function added
*NEW*: Support for reading hyperlinks added to the library

Component

tNotesRunAgent

5.4.3 tuanport 2014-04-10
1

Component

tNotesOutput

4.5.3 tuanport 2014-04-10
0

Component

tNotesInput

5.4.3 tuanport 2014-04-10
0

Component

tUniservRTIdentitySearch

2.3.0 dqsh_uniserv 2014-04-08
82

tUniservRTIdentitySearch allows Talend jobs to search in a DQ identity RT search index for customer data records taking into account misspellings, phonetic similarities, synonyms or missing data.

To be able to use tUniservRTIdentitySearch, the search index has to be built using the tUniservRTIdentityBulk component.

To be able to use the tUniservRTIdentitySearch component, the Uniserv DQ identity RT software must be installed.

For more information on DQ identity RT, visit http://www.uniserv.com/en/products/uniserv-data-quality-functions/identity-resolution.

Component

tUniservRTIdentityOutput

2.3.0 dqsh_uniserv 2014-04-08
133

tUniservRTIdentityOutput allows Talend jobs to update an existing DQ identity RT search index that has previously been built using tUniservRTIdentityBulk by inserting data for new customers, deleting existing customer entries or modifying the attributes for existing customers.

Updates the index pool which is used for duplicate search.

To be able to use the tUniservRTIdentityOutput component, the Uniserv DQ identity RT software must be installed.

For more information on DQ identity RT, visit http://www.uniserv.com/en/products/uniserv-data-quality-functions/identity-resolution.

Component

tUniservRTIdentityBulk

2.3.0 dqsh_uniserv 2014-04-08
88

tUniservRTIdentityBulk allows Talend jobs to build a search index over a database of customer data for later use by tUniservRTIdentitySearch to efficiently and effectively retrieve addresses and identify duplicates even with misspelled or incomplete data.

Prepares the index pool for the search for duplicates.

To be able to use the tUniservRTIdentityBulk component, the Uniserv DQ identity RT software must be installed.

For more information on DQ identity RT, visit http://www.uniserv.com/en/products/uniserv-data-quality-functions/identity-resolution.

Component

tUniservRTPost

2.3.0 dqsh_uniserv 2014-04-08
126

tUniservRTPost allows Talend jobs to use the Uniserv post product for validation, correction and normalization of international addresses.

The tUniservRTPost component may either be used with the Uniserv post software and the reference tables for the respective countries installed on site or with the Uniserv SaaS offering.

For more information on Uniserv post, visit http://www.uniserv.com/en/products/uniserv-data-quality-functions/address-check. If you are interested in the Uniserv SaaS offering, go to http://www.data-quality-on-demand.com.

Version Author Released on Rating Downloads
Indicator

Order of Magnitude

1.1 scorreia 2013-04-25
121

Measures the order of magnitude of numerical data.
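
The listing does not show how the indicator is defined. One common reading of "order of magnitude" is the exponent of the largest power of ten that does not exceed the absolute value, which can be sketched as follows. The function name and the handling of zero are assumptions for this sketch.

```python
import math


def order_of_magnitude(value: float) -> int:
    """Return floor(log10(|value|)), one common definition of order of magnitude."""
    if value == 0:
        # log10(0) is undefined; rejecting zero is a choice made for this sketch.
        raise ValueError("order of magnitude is undefined for 0")
    return math.floor(math.log10(abs(value)))


for sample in (7, 1234.5, 0.05, -42000):
    print(sample, order_of_magnitude(sample))
```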

Indicator

phone_area_code_freq

1.0 scorreia 2013-04-24
29

Area codes of American phone numbers

Indicator

udi_average_yearly_income

1.0 scorreia 2013-04-24
27

Parses income ranges such as $50K - $70K and returns the average value.
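
How such a range might be parsed can be sketched with a regular expression. This is an illustration only, not the indicator's definition; it assumes that K means thousand, M means million, and that the text contains exactly two amounts. All names are hypothetical.

```python
import re

# Matches an optional dollar sign, a number, and an optional K/M suffix.
_AMOUNT = re.compile(r"\$?\s*(\d+(?:\.\d+)?)\s*([KkMm]?)")
_MULTIPLIER = {"": 1.0, "K": 1_000.0, "M": 1_000_000.0}


def average_of_range(text: str) -> float:
    """Return the midpoint of a range written like '$50K - $70K'."""
    amounts = [
        float(number) * _MULTIPLIER[suffix.upper()]
        for number, suffix in _AMOUNT.findall(text)
    ]
    if len(amounts) != 2:
        raise ValueError(f"expected two amounts in {text!r}, found {len(amounts)}")
    return sum(amounts) / 2


print(average_of_range("$50K - $70K"))
```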

Indicator

email_indicator

1.0 scorreia 2013-04-24
46

SQL LIKE clause to demo the UDI drill-down feature on email columns

Indicator

US_customers_count

1.0 scorreia 2013-04-24
27

Indicator to count the number of 'USA' values in country columns

Indicator

ISO Week Number Frequency

1.0 scorreia 2013-04-08
47

Computes the ISO week number frequency of a given date.
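
The idea behind a week number frequency can be sketched with the standard library, which exposes ISO 8601 week numbers through `date.isocalendar()`. This is only an illustration of the concept, not the indicator's definition, and the function name is hypothetical.

```python
from collections import Counter
from datetime import date
from typing import Iterable


def iso_week_frequency(dates: Iterable[date]) -> Counter:
    """Count how many dates fall into each ISO 8601 week number."""
    # isocalendar() returns (ISO year, ISO week number, ISO weekday).
    return Counter(d.isocalendar()[1] for d in dates)


sample = [date(2013, 4, 8), date(2013, 4, 14), date(2013, 4, 15)]
for week, count in sorted(iso_week_frequency(sample).items()):
    print(week, count)
```

Note that this sketch groups by week number alone, so the same week number from different years is counted together.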

ParserRule

parser_rule_for_merge

5.2.1 mzhao 2013-02-04
46

parser_rule_for_merge

Regex

pattern_for_test_merge

5.2.1 mzhao 2013-02-04
42

pattern_for_test_merge

Indicator

udi_for_test_merge

5.2.1 mzhao 2013-02-04
37

udi_for_test_merge

Regex

EmptyTextForTest

5.2.1 mzhao 2013-02-04
43

EmptyTextForTest

D* Demo Model


  • Author: ctoum
  • Categories: Data-Model
  • First revision date: 2010-08-13
  • Latest revision date: 2010-08-13
  • Compatible with: Master Data Management releases 4.0.2, 4.0.3
  • Downloads: 698

About: Model used in the D* Demo.

Revision list

Revision 1.0 698 Downloads, Released on 2010-08-13

Compatible with: 4.0.3, 4.0.2

DStar model

Reviews (1)

 Review By Bernardo Silva on March 25, 2013