Talend Exchange is the place where the Talend community can share items related to Talend open source products, such as Data Integration, Data Quality, and Master Data Management. Contribution is open to any user; no specific validation is needed. As soon as you have a forum account, you automatically get a Talend Exchange account.



Statistics

  • 597 extensions
  • 901 revisions
  • 291 contributors
  • 154192 downloads
 

Version Author Released on Rating Downloads
Routine

BRules

1.6 walkerca 2014-07-30
856

A collection of Java routines for validation, formatting, and logic for use in Talend Open Studio

Software license: Apache License, Version 2.0

http://www.apache.org/licenses/LICENSE-2.0

Component

Elasticsearch Indexing

1.0.3 pklalitjha 2014-07-29
193

Elasticsearch Indexing Talend Custom Component
================================================
Often, we need full-text search support in our applications along with analytics. Having a component allows the index to be created or updated along with the data loaded into the warehouse. So, I have created a component for indexing (create/update) in Elasticsearch. Elasticsearch is a distributed, open source full-text search engine built on top of Lucene. It is well documented and supports a REST API over JSON as well as many native language APIs for indexing and querying.

NOTE: To support add or update, at least one field in the schema must be designated as a key to identify rows as unique. Multiple columns can be designated as keys (in the case of composite primary keys).

Updated the extension to fix compatibility issues with the 1.x versions of Elasticsearch. Removed the dependent jars (lucene-core and elasticsearch), so you now need to add them from the Elasticsearch distribution. Also reduced the download size from 10MB to 15KB.

Elasticsearch v1.2+ works with JDK 7+, so if your Elasticsearch version is 1.2+, use JDK 7 u25/55/65 (recommended by ES), because this component uses the Elasticsearch Java API and the corresponding Lucene core jar.

Added support for configuring the cluster name in v1.0.3.

Component

tGanttChart

0.2 bennatigiuliano 2014-07-24
4

This component allows you to generate a Gantt Graph.

Component

tJobInstanceStart+tJobInstanceEnd+tJobDataRangeScanner

3.0 jlolling 2014-07-20
14

These 3 components are dedicated to job monitoring.
They provide the following features:
* creates a job instance ID to mark the datasets for data lineage
* holds all necessary information about a job run in a table
* keeps all context variables in a table (at the job start and at its end)
* keeps as many counters from the job as needed
* provides incremental loads
* provides many key figures about the last job run
* keeps the error messages from the job in the status table
* enables Log4J for the job with various outputs
Please refer to the linked documentation.
Release 3.0 needs slightly changed tables!
Please contact me if you have questions, and do not use the rating function to post questions.

Component

tFileExcelSheetOutput

6.4 jlolling 2014-07-14
48

This component depends on tFileExcelWorkbookOpen.
It can create and fill sheets in a very flexible way:
It can set the header with self-defined content
It can write columns with gaps
It can copy styles, conditional styles, and row heights
It can reconfigure written tables to update pivot tables
Please take a look at the documentation linked here. This component is very powerful when used with Excel templates.

Component

tPLotChart

0.1 bennatigiuliano 2014-07-11
5

This component allows you to generate a Plot Graph.

Component

tUnpivotRow

1 wzawwin 2014-07-09
42

tGoogleGeocoder

0.6 llaen 2014-07-07
274

This component takes an address from the input flow and appends lat and lon columns to the output flow for the address. Google's geocode API is used to look up the latitude and longitude for the address.

It started from the original tGoogleGeocoder component but was rewritten for API v3.
It also supports the business API (given client and private key strings).

Instructions:
The input row must have at least one column that contains an address. Select that column in the component's "Address Column" variable.
The output is the same as the input but with two new columns added (lat and lon) which will contain the latitude and longitude coordinates that Google gave for the input address.
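As a rough illustration of the lookup step, the sketch below builds a request URL for the public JSON endpoint of the Google Geocoding API from one address string. It is not the component's code: the class and method names are hypothetical, and the API key or business client/signature parameters that a real request needs are left out.

```java
import java.io.UnsupportedEncodingException;
import java.net.URLEncoder;

// Hypothetical helper; not part of tGoogleGeocoder.
class GeocodeRequestSketch {
    // Public JSON endpoint of the Google Geocoding API.
    static final String ENDPOINT = "https://maps.googleapis.com/maps/api/geocode/json";

    // Builds the lookup URL for one address; the address is URL-encoded as UTF-8.
    // Authentication parameters are deliberately omitted in this sketch.
    static String buildUrl(String address) {
        try {
            return ENDPOINT + "?address=" + URLEncoder.encode(address, "UTF-8");
        } catch (UnsupportedEncodingException e) {
            throw new IllegalStateException("UTF-8 is always supported", e);
        }
    }

    public static void main(String[] args) {
        System.out.println(buildUrl("10 Downing Street, London"));
    }
}
```

The lat and lon output columns would then be filled from the coordinates in the response; parsing the response is outside the scope of this sketch.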

Component

tFileExcelSheetCellOutput

6.3 jlolling 2014-07-03
19

This component writes into referenced cells in a sheet.
It depends on the components tFileExcelWorkbookOpen and tFileExcelWorkbookSave.
The component takes the cell references and the values/comments from an input flow (the simplest way is to use a tFixedFlowInput if the references are fixed).

Component

tCalendar

1.1 jlolling 2014-07-01
33

Data warehouses need a date dimension, and this component creates most of the needed data (also fully localised).
The schema of this component is fully commented.
The main features are:
* creates all possible date categories like day, week, month, year, quarter
* builds an integer ID for each category so that the table can serve as a dimension
* supports different years (year for the current week and a financial year)
* automatically builds a configurable number of days into the future.
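To make the idea of integer IDs per date category concrete, here is a minimal sketch that generates date dimension rows with `java.time`. The ID layouts (for example `yyyyMMdd` for the day) and all class and field names are assumptions chosen for illustration, not the schema of tCalendar. The financial year and localisation are not covered.

```java
import java.time.LocalDate;
import java.time.temporal.IsoFields;
import java.util.ArrayList;
import java.util.List;

// Hypothetical date dimension generator; ID formats are illustrative assumptions.
class DateDimensionSketch {
    static final class DateRow {
        final int dayId;      // yyyyMMdd
        final int weekId;     // ISO week-based year * 100 + ISO week number
        final int monthId;    // yyyyMM
        final int quarterId;  // yyyy * 10 + quarter (1 to 4)

        DateRow(LocalDate d) {
            dayId = d.getYear() * 10000 + d.getMonthValue() * 100 + d.getDayOfMonth();
            // The week-based year can differ from the calendar year around New Year.
            weekId = d.get(IsoFields.WEEK_BASED_YEAR) * 100
                    + d.get(IsoFields.WEEK_OF_WEEK_BASED_YEAR);
            monthId = d.getYear() * 100 + d.getMonthValue();
            quarterId = d.getYear() * 10 + d.get(IsoFields.QUARTER_OF_YEAR);
        }
    }

    // Builds one row per day, starting at 'start' and covering 'days' days.
    static List<DateRow> build(LocalDate start, int days) {
        List<DateRow> rows = new ArrayList<>();
        for (int i = 0; i < days; i++) {
            rows.add(new DateRow(start.plusDays(i)));
        }
        return rows;
    }

    public static void main(String[] args) {
        for (DateRow r : build(LocalDate.of(2014, 12, 29), 3)) {
            System.out.println(r.dayId + " " + r.weekId + " " + r.monthId + " " + r.quarterId);
        }
    }
}
```

The separate week-based year is what "year for the current week" refers to here: 29 December 2014 belongs to ISO week 1 of 2015, so its week ID and day ID carry different years.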


Statistics

  • 142 extensions
  • 206 revisions
  • 38 contributors
  • 17371 downloads
 

Version Author Released on Rating Downloads
Indicator

EMail validation via mail server

5.4/5.3 mzhao 2013-06-03
612

This Java UDI checks email addresses by sending an SMTP request to the mail server. The code sample can be found at: http://www.rgagnon.com/javadetails/java-0452.html

Indicator

Frequency table of hours

2.0 scorreia 2013-04-25
398

This indicator helps to analyze the most frequent hours of the day that appear in date-time columns.

Indicator

Sample Standard Deviation

1.1 scorreia 2013-04-25
307

This indicator computes the sample standard deviation of any numerical column
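For reference, the sample standard deviation divides the sum of squared deviations from the mean by n - 1 rather than n before taking the square root. The following is a minimal sketch of that formula in plain Java. The class and method names are hypothetical, and the indicator itself may be implemented differently (for example in SQL).

```java
// Hypothetical helper illustrating the sample standard deviation formula.
class SampleStdDevSketch {
    // Returns sqrt( sum((x - mean)^2) / (n - 1) ); needs at least two values.
    static double sampleStdDev(double[] values) {
        int n = values.length;
        if (n < 2) {
            throw new IllegalArgumentException("need at least two values");
        }
        double sum = 0.0;
        for (double v : values) {
            sum += v;
        }
        double mean = sum / n;
        double squaredDeviations = 0.0;
        for (double v : values) {
            squaredDeviations += (v - mean) * (v - mean);
        }
        return Math.sqrt(squaredDeviations / (n - 1));
    }

    public static void main(String[] args) {
        System.out.println(sampleStdDev(new double[] {1, 2, 3, 4, 5}));
    }
}
```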

Indicator

Variance

1.1 scorreia 2013-04-25
280

This indicator computes the variance of numeric columns

Indicator

Trimmed

1.0 scorreia 2013-04-25
81

Evaluates the number of values that are correctly trimmed.

Indicator

Week Frequency

2.0 scorreia 2013-04-25
311

Aggregates date fields into weeks.

Indicator

Duplicate Rows

2.0 scorreia 2013-04-25
841

This indicator counts the number of duplicate rows.
It differs from the system indicator called "duplicate count" because it counts the number of duplicate rows, not the number of duplicate values.
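The difference between duplicate rows and duplicate values can be shown with a small sketch. It assumes that "duplicate" means a distinct item that occurs more than once; the indicator's exact counting rule is not stated here, and the class and method names are hypothetical.

```java
import java.util.Arrays;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Hypothetical helper contrasting duplicate rows with duplicate column values.
class DuplicateCountSketch {
    // Counts the distinct items that occur more than once in the list.
    static <T> long countDuplicated(List<T> items) {
        Map<T, Integer> counts = new HashMap<>();
        for (T item : items) {
            counts.merge(item, 1, Integer::sum);
        }
        long duplicated = 0;
        for (int c : counts.values()) {
            if (c > 1) {
                duplicated++;
            }
        }
        return duplicated;
    }

    public static void main(String[] args) {
        // Four rows with two columns (name, city); no complete row repeats.
        List<List<String>> rows = Arrays.asList(
                Arrays.asList("Ann", "Paris"),
                Arrays.asList("Bob", "Paris"),
                Arrays.asList("Ann", "Rome"),
                Arrays.asList("Bob", "Rome"));
        List<String> names = Arrays.asList("Ann", "Bob", "Ann", "Bob");

        System.out.println("duplicate rows:   " + countDuplicated(rows));
        System.out.println("duplicate values: " + countDuplicated(names));
    }
}
```

In this sample every name appears twice, so the name column has two duplicated values, while no full row is repeated and the duplicate row count is zero.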

Indicator

Length Range Frequency

1.1 scorreia 2013-04-25
154

Groups data according to its length range.
The ranges are the following:
data of length < 10
data of length < 20
data of length < 30
data of length >= 30
null data
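The grouping above can be sketched as a simple classification function. The sketch assumes the ranges are checked in order, so that each value lands in the first range it fits (a value of length 12 counts under "length < 20" only). The labels and the class and method names are hypothetical.

```java
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical helper illustrating a length range frequency count.
class LengthRangeSketch {
    // Returns the label of the first range that the value fits into.
    static String rangeOf(String value) {
        if (value == null) {
            return "null data";
        }
        int length = value.length();
        if (length < 10) {
            return "length < 10";
        }
        if (length < 20) {
            return "length < 20";
        }
        if (length < 30) {
            return "length < 30";
        }
        return "length >= 30";
    }

    // Counts how many values fall into each range, in first-seen order.
    static Map<String, Integer> frequencies(List<String> values) {
        Map<String, Integer> result = new LinkedHashMap<>();
        for (String value : values) {
            result.merge(rangeOf(value), 1, Integer::sum);
        }
        return result;
    }

    public static void main(String[] args) {
        System.out.println(frequencies(Arrays.asList("abc", null, "0123456789", "xyz")));
    }
}
```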

Indicator

Order of Magnitude

1.1 scorreia 2013-04-25
160

Measures the order of magnitude of numerical data.
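One common definition of the order of magnitude of a non-zero number is the floor of the base-10 logarithm of its absolute value. The sketch below uses that definition as an assumption; the indicator may define or group the values differently, and the class and method names are hypothetical.

```java
// Hypothetical helper; assumes order of magnitude = floor(log10(|x|)).
class OrderOfMagnitudeSketch {
    // Zero has no order of magnitude under this definition, so it is rejected.
    static int orderOfMagnitude(double value) {
        if (value == 0.0 || Double.isNaN(value) || Double.isInfinite(value)) {
            throw new IllegalArgumentException("value must be finite and non-zero");
        }
        return (int) Math.floor(Math.log10(Math.abs(value)));
    }

    public static void main(String[] args) {
        double[] samples = {999, 1000, -4500, 0.05};
        for (double s : samples) {
            System.out.println(s + " -> " + orderOfMagnitude(s));
        }
    }
}
```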

Indicator

phone_area_code_freq

1.0 scorreia 2013-04-24
45

Area codes of American phone numbers


Statistics

  • 5 extensions
  • 7 revisions
  • 4 contributors
  • 3990 downloads
 

Version Author Released on Rating Downloads
Export

Product Demo

3.0 ctoum 2012-05-31
595

Product & families, with Cafepress pictures.

Data-Model

Clinical Trials: Janus Model Basics

1.0 jaymce 2010-11-22
401

This is a model of the basics of the Janus Clinical Data Repository.
http://www.fda.gov/ForIndustry/DataStandards/StudyDataStandards/ucm155327.htm

Data-Model

D* Demo Model

1.0 ctoum 2010-08-13
738

Model used in the D* Demo.

Export

Talendshop Demo

1.0 ctoum 2010-08-04
1128

Talendshop Demo (Demo Project)

