Talend Exchange is the place where Talend community can share items related to Talend opensource products, such as Data Integration, Data Quality and Data Master Management. Contribution is open to any user, no specific validation is needed. As soon as you have your forum account, you automatically get a Talend Exchange account.


Show

Category
Search
Version
Author
 

Statistics

  • 575 extensions
  • 1000 revisions
  • 275 contributors
  • 141383 downloads
 

Top Contributors

Version Author Released on Rating Downloads
Component

tSetContextVariable

1.0 mjackisch 2014-03-19
14

tSetContextVariable can be used to set or modify a context variable inside a job, very much like tSetGlobalVar.

Component

tJobInstanceStart+tJobInstanceEnd+tJobDataRangeScanner

2.7 jlolling 2014-03-14
172

These both components are made to:
1. track execution of jobs
2. saves counters of a job
3. saves time range or value ranges to support incremental loading
4. saves or restores context variables to support restart capabilities
5. provides an unique job instance id to use in data tables to support data lineage
6. provides a list of all job instance id of all or selected jobs which have run after the last run of the current job to support incremental loading
7. provides a large number of return values to use them in the job
8. Able to scan the flows for min and max values with the help of the new component tJobDataRangeScanner (shipped with the this components)
Please refer to the linked usage guid! In case of problems please contact jan.lolling@gmail.com

In case of install an update of this component, please refer to the release notes of the new release to know what has been changed. Sometimes it is necessary to add columns!

Component

tNetinsightInput

1.0 Steven_Crawford 2014-03-13
6

This component uses Netinsight\'s XML API to run a report and fetch the resulting file. You can save the file to the local filesystem or write it to cache to be accessed using: ((java.io.InputStream)globalMap.get(\"tNetinsightInput_1_INPUT_STREAM\"))

You can use your own custom xml file or you can use the simple UI to create a GetReportData request with standard data offset (from current date).

Component

tLOBDownload

1.0 jlolling 2014-03-04
26

This component allows to download BLOBs or CLOBs from a database input flow into files.
The file name can be configured per LOB column and can contain place holders which are filled by other columns for the flow.
The CLOB or BLOB columns are mostly mapped to the Java type Object in the flow.
The placeholders: {column-name}
The column name must be taken from the schema column case sensitive. The string representation of date typed columns use the pattern from the schema.
If you need for the place holders other values than provided by the database feel free to use t tMap to add new one or insert context variables into the template.
The place holder syntax will only be used for the values of the flow.
The component detects for its self if it is a CLOB or a BLOB. Also character column content can be downloaded as files.
For CLOB files you can specify the charset.

Component

tGoogleAnalyticsInput

1.14 jlolling 2014-02-27
1446

Lets you query Google Analytics data.
You find this component in the palette in section Business->Google.
This component use the latest GA-API version 3 (r81 v1.17.0 rc) and OAuth-API 2.0.
A configured service account is needed and must be added to your Analytics Account (+profile).
Component returns (after):
- error_message,
- amount or rows used to create result set
- flag if data are sampled
- number of lines delivered
The dimensions and metrics are defined in the notation of the Google API.
In the Advance settings you can optimize the performance by changing the fetch size and avoid problems by changing the timeout (also used for read timeout).
There is also an advanced option to reuse the client (avoid multiple logins in iterations).
Contact: jan.lolling@gmail.com
Please read the help page linked in the component detail page.
In case of invalid grant errors: please check the system time on your machine and the API grants in the Google API Console
In case of permission denied errors, please check if you have added the service account to your profiles.

*NEW*: Sampling level can be set and in case of sampling, the size and space will returned (as return values --> Outline view)
Thanks to Hans Ressing for sharing ideas and test support!
*NEW*: Has the Option Die On Error (default = true)

Component

tCalendar

1.0 jlolling 2014-02-23
43

A Business calendar with a number of fields especially for data ware house applications.
This component generates entries for:
* a start-end date range or
* starting with the start date and stop after a number of years (ends with the current day in the lastest year)
The schema is fully commented, take a look at the screenshot.
It utilize the Java build in calendar classes.
The calendar is fully localized for all countries supported by the Java environment (nearly all countries of the world).
You will find this component in the palette under Business Intelligence

You can leaf out the locale setting but I strongly suggest set it up because otherwise the result depends on the setting of the host where you run the job.

Component

tFileExcelNamedCellInput

4.1 jlolling 2014-02-21
53

This component work together with tFileExcelWorkbookOpen.
This component iterates through all named cells and returns them in a flow or as return values for iterations.
The schema:
CELL_NAME = name of the cell
CELL_VALUE = The data object
CELL_CLASS_NAME = the Java class of the data object (can be used to cast the value)
CELL_REF = The absolute position of the cell within the sheet
CELL_ROW = the row number (first row = 1)
CELL_COL = the column number (first col = 0)
CELL_INDEX = number of current cell
SHEET_NAME = name of the sheet which contains the current cell

Please beware, reading or writing named cells is not compatible withe the memory saving mode of the workbook!

Component

tRunTask

1.6 jlolling 2014-02-21
257

This components runs a task within the Talend Administration Center.
Please refer the PDF documentation linked as resource here.
This component uses only Open Source libraries.
This component helps to create much more sophisticated job chains.
* set context parameters
* configure which task cannot run at the same time
* setup the wait for the end of the job by polling or direct response
* checks if the task is technical ready to run.
* at the moment it is not possible to use the job return code because of a missing feature of the TAC web service.

Component

tRDF2RDF

1.1 fbelleau 2014-02-14
17

Convert RDF graph from any format to any format.

Project web site : https://github.com/fbelleau/talend4sw
A component brought to you by Bio2RDF project programmers.

Component

tVirtuosoClearGraph

1.0 fbelleau 2014-02-14
6

Delete all RDF triples from a graph in OpenLinks Virtuoso triplestore.

Project web site : https://github.com/fbelleau/talend4sw
A component brought to you by Bio2RDF project programmers.

Show

Category
Search
Version
Author
 

Statistics

  • 141 extensions
  • 174 revisions
  • 36 contributors
  • 15674 downloads
 

Top Contributors

Version Author Released on Rating Downloads
ParserRule

Tweets

1.0 scorreia 2013-11-20
18

get information from tweets.
Extract the date/time, user, hashtags, referenced users and urls from Twitter messages.

Regex

Only alphabetical characters not empty

1.0 dcortinovis 2013-06-19
62

Only alphabetical characters not empty.
And at least one (empty forbidden)

Indicator

EMail validation via mail server

5.4/5.3 mzhao 2013-06-03
517

This Java UDI check emails by sending a SMTP request to mail server. the code sample can be found at: http://www.rgagnon.com/javadetails/java-0452.html

Indicator

Frequency table of hours

2.0 scorreia 2013-04-25
355

This indicator helps to analyze the most frequent day hours that appear in date time columns.

Indicator

Sample Standard Deviation

1.1 scorreia 2013-04-25
268

This indicator computes the sample standard deviation of any numerical column

Indicator

Variance

1.1 scorreia 2013-04-25
249

This indicator computes the variance of numeric columns

Indicator

Trimmed

1.0 scorreia 2013-04-25
60

evaluate the number of data which are correctly trimmed

Indicator

Week Frequency

2.0 scorreia 2013-04-25
270

aggregates Date fields into weeks

Indicator

Duplicate Rows

2.0 scorreia 2013-04-25
774

this indicator counts the number of duplicate rows.
It's different from the system indicator called "duplicate count" because it counts the number of duplicate rows, not the number of duplicate values.

Indicator

Length Range Frequency

1.1 scorreia 2013-04-25
122

get length ranges of data.

group data according to their length range.
Ranges are the following:
data of length < 10
data of length < 20
data of length < 30
data of length >= 30
null data

Show

Category
Search
Version
Author
 

Statistics

  • 5 extensions
  • 7 revisions
  • 4 contributors
  • 3864 downloads
 

Top Contributors

Version Author Released on Rating Downloads
Export

Product Demo

3.0 ctoum 2012-05-31
559

Product & families, with Cafepress pictures.

Data-Model

Clinical Trials: Janus Model Basics

1.0 jaymce 2010-11-22
376

This is a model of the basic of the Janus Clinical Data Repository.
http://www.fda.gov/ForIndustry/DataStandards/StudyDataStandards/ucm155327.htm

Data-Model

D* Demo Model

1.0 ctoum 2010-08-13
698

Model used in the D* Demo.

Export

Talendshop Demo

1.0 ctoum 2010-08-04
1109

Talendshop Demo (Demo Project)


85 ms