Talend Exchange is the place where the Talend community can share items related to Talend open-source products, such as Data Integration, Data Quality and Master Data Management. Contribution is open to any user; no specific validation is needed. As soon as you have your forum account, you automatically get a Talend Exchange account.

Version Author Released on Rating Downloads


5.1 jlolling 2014-04-16

This component is designed to work with tFileExcelWorkbookOpen and tFileExcelSheetOutput.
This component writes a workbook to an output file.
This happens after tFileExcelSheetOutput has finished writing to the sheets.
Can delete sheets (usually ones that were used as templates before).
Option to (re)evaluate all formulas added
Translation in German and French added
*NEW*: The output file name can be set without file extension. The correct extension will be added (or even fixed) automatically. The final file name can be retrieved from the return value FILENAME.
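
As an illustration of the extension handling described above, here is a minimal sketch. The class `ExcelFileNames` and the method `withExtension` are hypothetical names chosen for this example; this is not the component's actual code, and the real rules may differ.

```java
import java.util.Locale;

// Hypothetical helper for illustration only; not taken from the component.
final class ExcelFileNames {
    private ExcelFileNames() {}

    /**
     * Returns fileName ending with the wanted extension ("xls", "xlsx" or "xlsm").
     * A missing extension is appended, a wrong Excel extension is replaced.
     */
    static String withExtension(String fileName, String extension) {
        String lower = fileName.toLowerCase(Locale.ROOT);
        String wanted = "." + extension.toLowerCase(Locale.ROOT);
        if (lower.endsWith(wanted)) {
            return fileName; // already correct, keep the name untouched
        }
        for (String known : new String[] {".xlsx", ".xlsm", ".xls"}) {
            if (lower.endsWith(known)) {
                // strip the wrong Excel extension before appending the right one
                return fileName.substring(0, fileName.length() - known.length()) + wanted;
            }
        }
        return fileName + wanted; // no Excel extension given
    }
}
```

For example, `ExcelFileNames.withExtension("report", "xlsx")` and `ExcelFileNames.withExtension("report.xls", "xlsx")` both give `report.xlsx`.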



5.1 jlolling 2014-04-11

This component works in conjunction with tFileExcelWorkbookOpen.
This component reads a sheet identified by name or index.
Unlike the tFileInputExcel component, you can specify which columns you want to read and leave out what is not needed.
The fields can be conveniently configured by Excel column name or index.
It will always use the current Apache POI 3.10 libs (also for the older xls format).
The sheet name can be retrieved from the output of the new component tFileExcelSheetList.
Can read the comment (as an alternative to the value).
Can use the non-empty value of the previous row to fill empty values in the current row.
Can ignore cell format errors if wanted.
*NEW*: Number format can be changed for different languages/countries in advanced options.
*NEW*: Column positions can automatically be adjusted by header line
*NEW*: Can read hyperlinks (title and/or URL) configurable in the advanced settings
*NEW*: Can limit the number of rows read.
*NEW*: Can stop reading if a row is empty
*NEW*: Column position in the header row can be found with regular expressions
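
Finding a column position in the header row with a regular expression could look roughly like the sketch below. `HeaderColumnFinder` and `findColumn` are hypothetical names, and the choices of case-insensitive, full-cell matching are assumptions made for this example, not documented behaviour of the component.

```java
import java.util.List;
import java.util.regex.Pattern;

// Hypothetical helper for illustration only; not taken from the component.
final class HeaderColumnFinder {
    private HeaderColumnFinder() {}

    /**
     * Returns the zero-based index of the first header cell that fully matches
     * the regular expression (case-insensitive), or -1 if no cell matches.
     */
    static int findColumn(List<String> headerCells, String regex) {
        Pattern pattern = Pattern.compile(regex, Pattern.CASE_INSENSITIVE);
        for (int i = 0; i < headerCells.size(); i++) {
            String cell = headerCells.get(i);
            if (cell != null && pattern.matcher(cell.trim()).matches()) {
                return i;
            }
        }
        return -1;
    }
}
```

With the header cells `Id`, `Customer Name` and `Order Date`, the expression `cust.*name` would resolve to index 1.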

To use this component, you must first update tFileExcelWorkbookOpen!

For everyone who has problems installing this component: in 99.9% of cases, installation problems have nothing to do with the component. This component is tested with all releases. If you have trouble installing it, please contact me directly instead of giving the component a bad rating for problems it did not cause.
email: jan.lolling@gmail.com



5.1 jlolling 2014-04-11

This component is designed to work with the components tFileExcelWorkbookOpen and tFileExcelWorkbookSave. This component uses the Apache POI library version 3.10.
The goal of this component is to write or create a sheet with minimal impact on the structure of the spreadsheet (it will, as best it can, keep references, macros, graphics and so on).
Many reports are specially designed Excel documents with lots of formulas and references, and the automated report creation process has to write into a well-defined area and keep the rest unchanged.

This component can also write formulas: define a String column and write the formula in English, starting with a =. To follow the row index in the formula, you can write {row} into your formula, and this will be replaced with the current row index.
Example for a formula string: =A{row}+G{row}
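
The {row} replacement can be pictured with the following minimal sketch. `FormulaTemplate` and `forRow` are hypothetical names, and passing the row as a 1-based Excel row number is an assumption; the component's internal handling is not shown in this description.

```java
// Hypothetical helper for illustration only; not taken from the component.
final class FormulaTemplate {
    private FormulaTemplate() {}

    /** Replaces every {row} placeholder with the given Excel row number. */
    static String forRow(String template, int excelRow) {
        return template.replace("{row}", Integer.toString(excelRow));
    }
}
```

For row 5, `FormulaTemplate.forRow("=A{row}+G{row}", 5)` yields `=A5+G5`.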

For every column, you can define the target column in Excel by its Excel column name, set the target column to auto-size, and define the format for Date and Number columns.

There is also an option to write datasets into columns: every new row creates a new column (as if you rotated your data by 90 degrees).
This component can be used in sub jobs in iterations to create or write into many sheets.
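
Writing datasets into columns amounts to transposing the incoming records. The sketch below shows the idea on plain lists; `RowsToColumns` and `transpose` are hypothetical names, and padding short records with null is an assumption of this example rather than documented component behaviour.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper for illustration only; not taken from the component.
final class RowsToColumns {
    private RowsToColumns() {}

    /**
     * Turns each incoming record into one sheet column:
     * result.get(f).get(r) holds field f of record r.
     */
    static List<List<String>> transpose(List<List<String>> records) {
        int fieldCount = 0;
        for (List<String> record : records) {
            fieldCount = Math.max(fieldCount, record.size());
        }
        List<List<String>> sheetRows = new ArrayList<>();
        for (int f = 0; f < fieldCount; f++) {
            List<String> sheetRow = new ArrayList<>();
            for (List<String> record : records) {
                // pad shorter records with null so all sheet rows have equal width
                sheetRow.add(f < record.size() ? record.get(f) : null);
            }
            sheetRows.add(sheetRow);
        }
        return sheetRows;
    }
}
```

Three records `(a, 1)`, `(b, 2)` and `(c, 3)` become the two sheet rows `a b c` and `1 2 3`.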

*NEW*: Sheets can be created as a clone of another sheet.
*NEW*: Freeze option added
*NEW*: Option to allow overwriting existing cells with null (default = false)

*NEW*: Can reuse the styles from the first row or first two rows of the template file for all data rows
To use this feature, simply define the styles in a template sheet in a file, read this file with tFileExcelWorkbookOpen and use it as the template.
*NEW*: Can remove all surplus rows

*NEW*: Can reuse conditional formats
*NEW*: improves the handling of styles and avoids overwriting styles related also to other columns
*NEW*: Can reuse the row height from the first data row for all new rows.

Please take a look at the new usage guide linked on this component's detail page!

Please always update the component tFileExcelWorkbookOpen to get the currently required library.



5.1 jlolling 2014-04-11

This component is the base component for all other tFileExcel-components.
This component uses the Apache POI library version 3.10.
This component reads a spreadsheet into a workbook or creates a new empty workbook in memory.
This workbook can be filled (with sheets) by the component tFileExcelSheetOutput or can be read by tFileExcelSheetList/Input components.
Finally, the component tFileExcelWorkbookSave persists the workbook in the same file it was read from or in a new file.
The component recognizes the file format by the filename extension; it is not necessary to configure it (except if you create a new workbook).
*NEW*: can read and write xlsm files.
Bug fixed: reading rows stops if row is empty or does not exist
*NEW*: Memory saving mode for XLSX type
*NEW*: Can read password protected files
*NEW*: support for IFERROR function added
*NEW*: Support for reading hyperlinks added to the library



5.4.3 tuanport 2014-04-10



4.5.3 tuanport 2014-04-10



5.4.3 tuanport 2014-04-10



2.3.0 dqsh_uniserv 2014-04-08

tUniservRTIdentitySearch allows Talend jobs to search in a DQ identity RT search index for customer data records taking into account misspellings, phonetic similarities, synonyms or missing data.

To be able to use tUniservRTIdentitySearch, the search index has to be built using the tUniservRTIdentityBulk component.

To be able to use the tUniservRTIdentitySearch component, the Uniserv DQ identity RT software must be installed.

For more information on DQ identity RT, visit http://www.uniserv.com/en/products/uniserv-data-quality-functions/identity-resolution.



2.3.0 dqsh_uniserv 2014-04-08

tUniservRTIdentityOutput allows Talend jobs to update an existing DQ identity RT search index that has previously been built using tUniservRTIdentityBulk by inserting data for new customers, deleting existing customer entries or modifying the attributes for existing customers.

Updates the index pool which is used for duplicate search.

To be able to use the tUniservRTIdentityOutput component, the Uniserv DQ identity RT software must be installed.

For more information on DQ identity RT, visit http://www.uniserv.com/en/products/uniserv-data-quality-functions/identity-resolution.



2.3.0 dqsh_uniserv 2014-04-08

tUniservRTIdentityBulk allows Talend jobs to build a search index over a database of customer data for later use by tUniservRTIdentitySearch to efficiently and effectively retrieve addresses and identify duplicates even with misspelled or incomplete data.

Prepares the index pool for the search for duplicates.

To be able to use the tUniservRTIdentityBulk component, the Uniserv DQ identity RT software must be installed.

For more information on DQ identity RT, visit http://www.uniserv.com/en/products/uniserv-data-quality-functions/identity-resolution.

Version Author Released on Rating Downloads


1.0 scorreia 2013-11-20

Gets information from tweets.
Extract the date/time, user, hashtags, referenced users and urls from Twitter messages.
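
A rough sketch of how hashtags, referenced users and URLs might be pulled out of a message text with regular expressions follows. The class `TweetParts` and its patterns are assumptions made for this example and are deliberately simple; they are not the extension's actual implementation, and date/time and author extraction are left out because they depend on the input format.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper for illustration only; not taken from the extension.
final class TweetParts {
    // Deliberately simple patterns; real tweets need more careful rules.
    private static final Pattern HASHTAG = Pattern.compile("#(\\w+)");
    private static final Pattern MENTION = Pattern.compile("@(\\w+)");
    private static final Pattern URL = Pattern.compile("https?://\\S+");

    private TweetParts() {}

    static List<String> hashtags(String tweet) { return findAll(HASHTAG, tweet, 1); }

    static List<String> mentions(String tweet) { return findAll(MENTION, tweet, 1); }

    static List<String> urls(String tweet) { return findAll(URL, tweet, 0); }

    /** Collects the requested capture group of every match, in order of appearance. */
    private static List<String> findAll(Pattern pattern, String text, int group) {
        List<String> found = new ArrayList<>();
        Matcher matcher = pattern.matcher(text);
        while (matcher.find()) {
            found.add(matcher.group(group));
        }
        return found;
    }
}
```

Note that these patterns would also pick up `#` or `@` inside a URL or an email address, so a production version would need to exclude those cases.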


Only alphabetical characters not empty

1.0 dcortinovis 2013-06-19

Accepts only alphabetical characters, with at least one character required (empty values are forbidden).


EMail validation via mail server

5.4/5.3 mzhao 2013-06-03

This Java UDI checks emails by sending an SMTP request to the mail server. The code sample can be found at: http://www.rgagnon.com/javadetails/java-0452.html


Frequency table of hours

2.0 scorreia 2013-04-25

This indicator helps to analyze the most frequent hours of the day that appear in date-time columns.
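
The idea of an hour frequency table can be sketched in plain Java as below. `HourFrequency` is a hypothetical name, and the real indicator presumably works on the analyzed data source rather than on an in-memory list, so treat this only as an illustration of the aggregation.

```java
import java.time.LocalDateTime;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

// Hypothetical helper for illustration only; not taken from the indicator.
final class HourFrequency {
    private HourFrequency() {}

    /** Counts how often each hour of the day (0-23) occurs; null values are skipped. */
    static Map<Integer, Long> count(List<LocalDateTime> values) {
        Map<Integer, Long> frequency = new TreeMap<>();
        for (LocalDateTime value : values) {
            if (value != null) {
                frequency.merge(value.getHour(), 1L, Long::sum);
            }
        }
        return frequency;
    }
}
```

For the timestamps 09:15, 09:45 and 14:00, the table contains hour 9 with count 2 and hour 14 with count 1.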


Sample Standard Deviation

1.1 scorreia 2013-04-25

This indicator computes the sample standard deviation of any numerical column
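
For reference, the sample standard deviation divides the sum of squared deviations from the mean by n - 1 and takes the square root. The sketch below shows that calculation; `SampleStats` is a hypothetical name and this is not the indicator's own definition.

```java
// Hypothetical helper for illustration only; not taken from the indicator.
final class SampleStats {
    private SampleStats() {}

    /** Sample variance: sum of squared deviations from the mean, divided by n - 1. */
    static double sampleVariance(double[] values) {
        if (values.length < 2) {
            throw new IllegalArgumentException("need at least two values");
        }
        double mean = 0.0;
        for (double v : values) {
            mean += v;
        }
        mean /= values.length;
        double sumOfSquares = 0.0;
        for (double v : values) {
            sumOfSquares += (v - mean) * (v - mean);
        }
        return sumOfSquares / (values.length - 1);
    }

    /** Sample standard deviation: square root of the sample variance. */
    static double sampleStdDev(double[] values) {
        return Math.sqrt(sampleVariance(values));
    }
}
```

For the values 2, 4, 4, 4, 5, 5, 7, 9 the mean is 5, the sum of squared deviations is 32, and the sample variance is 32/7.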



1.1 scorreia 2013-04-25

This indicator computes the variance of numeric columns



1.0 scorreia 2013-04-25

Evaluates the number of values that are correctly trimmed.


Week Frequency

2.0 scorreia 2013-04-25

Aggregates Date fields into weeks.
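
One possible way to aggregate dates into weeks is sketched below. The use of ISO-8601 week numbering and the `2013-W17` key format are assumptions of this example, and `WeekFrequency` is a hypothetical name; the indicator itself may define weeks differently.

```java
import java.time.LocalDate;
import java.time.temporal.IsoFields;
import java.util.List;
import java.util.Locale;
import java.util.Map;
import java.util.TreeMap;

// Hypothetical helper for illustration only; not taken from the indicator.
final class WeekFrequency {
    private WeekFrequency() {}

    /** Counts dates per ISO week, keyed as "<week-based-year>-W<week>"; nulls are skipped. */
    static Map<String, Long> count(List<LocalDate> dates) {
        Map<String, Long> frequency = new TreeMap<>();
        for (LocalDate date : dates) {
            if (date == null) {
                continue;
            }
            String key = String.format(Locale.ROOT, "%d-W%02d",
                    date.get(IsoFields.WEEK_BASED_YEAR),
                    date.get(IsoFields.WEEK_OF_WEEK_BASED_YEAR));
            frequency.merge(key, 1L, Long::sum);
        }
        return frequency;
    }
}
```

Under ISO numbering, 2013-04-22 and 2013-04-25 fall into the same week, while 2013-04-29 starts the next one.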


Duplicate Rows

2.0 scorreia 2013-04-25

This indicator counts the number of duplicate rows.
It's different from the system indicator called "duplicate count" because it counts the number of duplicate rows, not the number of duplicate values.
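
The difference between duplicate rows and duplicate values can be illustrated with the sketch below. `DuplicateCounts` is a hypothetical name, and counting each distinct row or value that occurs more than once exactly one time is an assumption of this example; the indicator's exact counting rule is not stated here.

```java
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Hypothetical helper for illustration only; not taken from the indicator.
final class DuplicateCounts {
    private DuplicateCounts() {}

    /** Number of distinct whole rows that occur more than once. */
    static long duplicateRows(List<List<String>> rows) {
        Map<List<String>, Integer> occurrences = new HashMap<>();
        for (List<String> row : rows) {
            occurrences.merge(row, 1, Integer::sum);
        }
        return moreThanOnce(occurrences);
    }

    /** Number of distinct values in one column that occur more than once. */
    static long duplicateValues(List<List<String>> rows, int column) {
        Map<String, Integer> occurrences = new HashMap<>();
        for (List<String> row : rows) {
            occurrences.merge(row.get(column), 1, Integer::sum);
        }
        return moreThanOnce(occurrences);
    }

    private static long moreThanOnce(Map<?, Integer> occurrences) {
        long count = 0;
        for (int n : occurrences.values()) {
            if (n > 1) {
                count++;
            }
        }
        return count;
    }
}
```

For the rows `(a, 1)`, `(a, 2)`, `(a, 1)`, `(b, 2)`, only the row `(a, 1)` is duplicated, whereas the second column alone has two duplicated values, `1` and `2`.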


Length Range Frequency

1.1 scorreia 2013-04-25

Gets the length ranges of data.

Groups data according to its length range.
Ranges are the following:
data of length < 10
data of length < 20
data of length < 30
data of length >= 30
null data
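
The grouping listed above can be sketched as follows. `LengthRanges` is a hypothetical name, and treating the ranges as mutually exclusive (a value of length 12 counts only under "< 20") is an assumption of this example.

```java
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical helper for illustration only; not taken from the indicator.
final class LengthRanges {
    private LengthRanges() {}

    /** Returns the label of the first range that the value's length falls into. */
    static String bucket(String value) {
        if (value == null) {
            return "null";
        }
        int length = value.length();
        if (length < 10) {
            return "< 10";
        }
        if (length < 20) {
            return "< 20";
        }
        if (length < 30) {
            return "< 30";
        }
        return ">= 30";
    }

    /** Counts values per range label, in order of first appearance. */
    static Map<String, Long> count(List<String> values) {
        Map<String, Long> frequency = new LinkedHashMap<>();
        for (String value : values) {
            frequency.merge(bucket(value), 1L, Long::sum);
        }
        return frequency;
    }
}
```

A three-character value lands in "< 10", a ten-character value in "< 20", and a missing value in "null".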

Clinical Trials: Janus Model Basics

  • Author: jaymce
  • Categories: Data-Model
  • First revision date: 2010-11-22
  • Latest revision date: 2010-11-22
  • Compatible with: Master Data Management releases 4.0.2, 4.0.3
  • Downloads: 376

About: This is a model of the basics of the Janus Clinical Data Repository.

Revision list


Revision 1.0 376 Downloads, Released on 2010-11-22

Compatible with: 4.0.3, 4.0.2

the basic Janus clinical trials data model
