Talend Exchange is the place where Talend community can share items related to Talend opensource products, such as Data Integration, Data Quality and Data Master Management. Contribution is open to any user, no specific validation is needed. As soon as you have your forum account, you automatically get a Talend Exchange account.


Version Author Released on Rating Downloads
Component

tMemoryMonitor

0.1 tal00000 2014-08-22
60

Monitor the memory usage of your running Job, with tMemoryMonitor

Information on the used, total and maximum (see -Xmx) memory of the Java runtime, are written to standard output or, if a warning threshold is reached, to standard error.

Options include \'Enabled\', \'Interval\' (milliseconds) and \'Warning Threshold Percent %\'.

Used in conjunction with your own diagnostic messages, this component will help you to identify where memory is being used within your Job an help you to avoid Out of Memory (OOM) Exceptions.

As well as ensuring that your Job has enough memory, you\'ll also want to make sure that you do not waste system memory. The Talend Job defaults are -Xms256MB and -Xmx1024MB. As well as some of your Jobs requiring more that 1024MB, some will require less that 256MB. This utility will allow you to observe the peak memory usage and to tune accordingly.

For more information on Java Runtime memory reporting, read http://docs.oracle.com/javase/7/docs/api/java/lang/Runtime.html.

Used memory is totalMemory - freeMemory. Percentages are based on used memory vs. maxMemory.

This component also reports the number of available processors.

Sample output.

tMemoryMonitor_1: using 43MB/246MB (max=910MB/5%) (processors=4)
tMemoryMonitor_1: using 91MB/310MB (max=910MB/10%) (processors=4)
tMemoryMonitor_1: using 260MB/449MB (max=910MB/29%) (processors=4)
tMemoryMonitor_1: using 189MB/442MB (max=910MB/21%) (processors=4)
tMemoryMonitor_1: using 259MB/454MB (max=910MB/28%) (processors=4)
tMemoryMonitor_1: using 216MB/577MB (max=910MB/24%) (processors=4)

Component

tHTMLParse

0.1 tal00000 2014-08-22
5

This component fetches and parses an HTML document with help from the excellent Java library jsoup (jsoup.org).

A number of elements are extracted from the document together with useful information including the fetch time.

Some elements, for example, Hyperlinks are made available through the jsoup class Elements, allowing you to use the power of jsoup to process these as you wish. When I have time, I\'ll write some supplemental components that will iterate through these Elements; however, it\'s simple enough to do this yourself using a component such as tJavaFlex.

The entire document is also made available through the jsoup Document class, allowing you full control over the document.

Component

tJobInstance* component collection

3.2 jlolling 2014-08-19
25

These 4 components:
- tJobInstanceStart
- tJobInstanceEnd
- tJobDataRangeScanner
- tJobInstanceLiveCheck
are dedicated to manage job monitoring.
These components provides following features:
* Creates an job instance Id to mark the datasets for data lineage
* holds all necessary information about a job run in a table
* keeps all context variables in a table (at the job start and its end)
* keeps as much as needed counters from the job
* provides incremental loads
* provides a lot of key figures about the last job run
* keeps the error messages from the job in the status table
* enables Log4J for the job with various outputs
Please refer the linked documentation
The release 3.0 needs slightly changed tables!
Please contact me in case of questions and do not use the rating function to post questions.

Component

tTimeGenerator

1.0 Sanjay Tiwari 2014-08-12
5

Component

tOpenerpOut

1.0 popeye 2014-08-12
10

Component

tOpenerp

1.0 popeye 2014-08-12
7

Component

tInfobrightOutput

3.7 T.Sasaki 2014-08-12
6

Component

tQlikviewInput

0.1 gorotman 2014-08-07
11

Reads data from QVX (QlikView data eXchange) files and pushs rows to standard flow.
It can extracts fields properties from qvx (QvxTableHeader) and creates the talend metadata xml file (ready to import into repository as generic metadata; I\'m sorry but I didn\'t found a solution to manage a custom metadata type, I read some post, blog and so but neither works fine!).
Parameters:
date format string (used for talend metadata)
charset (typically is utf-8)
data buffer size (it\'s the buffer size loaded in memory; useful if document is very large)
text buffer size (it\'s the buffer used to store text fields when reading; insert a value larger to fit you longest text value)
debug (to print debug information).

Routine

BRules

1.6 walkerca 2014-07-30
907

A collection of Java routines for validation, formatting, and logic for use in Talend Open Studio

Software license: Apache License, Version 2.0

http://www.apache.org/licenses/LICENSE-2.0

Component

Elasticsearch Indexing

1.0.3 pklalitjha 2014-07-29
242

Elasticsearch Indexing Talend Custom Component
================================================
Often, we need full text search support in our application along with analytics. Having a component will allow indexing created/updated along with
data loaded in warehouse. So, I have created a component for indexing (create/update) for Elasticsearch. Elasticsearch, is distributed, open source,
full text engine built on top of Lucene. It is well documented, supports REST API over JSON and many other native language API for indexing and querying.

NOTE: To support add or update, at least one field in schema must be designated as key to identify rows as unique. Multiple columns can be designated as key (in case of composite primary keys).

Updated extension to fix compatibility issues with 1.x version of Elasticsearch. Removed dependent jars (lucne-core and elasticsearch), so now need to add them from Elasticsearch distribution. Also reduced download size from 10MB to 15KB.

Elasticsearch v1.2+ works with JDK 7+, so if elasticsearch version is 1.2+ then use JDK 7 u25/55/65 (recommended by ES) as this component is using elasticsearch Java API and corrosponding lucene core jar.

Added support for configuring cluster name in v1.0.3

Version Author Released on Rating Downloads
Regex

Swedish personnummer with accepted foreigners

4.0 mhallam 2014-07-29
41

Other Swedish personnummer regexps do not take into account that at Swedish universities, foreigners might get special temporary personnummers with a "P", "T", or "F" instead of the first of the four last numbers.

This code fixes the problem, but does not check the validity of the date, or the last number.

Regex

Istat Code

4.0 mhallam 2014-07-29
145

22.12.0|18.24.C||22.12_D|12.56| 1A.56.1

This pattern match the Italian Istat Ateco Code (Codice Istat) updated to Istat Declaration Ateco 2004

Regex

Longitude Validation

4.0 mhallam 2014-07-29
216

-98.092391

Longitude validation (similiar to Latitude Validation) and inspired by the same code

Regex

Currency 16,3

4.0 mhallam 2014-07-29
57

Currency expression, accepts 4 commas and 4 groups of 3 numbers and 1 number before the first comma, this first number will have to be different from zero. It accepts a number of, two or three decimal. It accepts zero numbers after the point. You can change the number of groups and numbers accespts before and after the point.

Regex

Random email

2.0 scorreia 2014-07-29
152

detect random emails

Regex

hour24

4.0 mhallam 2014-07-29
58

from 0 to 23

Regex

daysOfWeek

4.0 mhallam 2014-07-29
63

Saturday ...

Regex

MonthName

4.0 mhallam 2014-07-29
112

January , February ..

Regex

hh:mm

4.0 mhallam 2014-07-29
95

hh:mm

Regex

German postal code match

2.0 fadoe 2014-07-29
160

This regular expression not only checks if the postal code is five digits long. The second digit must not be zero, because a postal code like 00123 is invalid.

Clinical Trials: Janus Model Basics


  • Author: jaymce
  • Categories: Data-Model
  • First revision date: 2010-11-22
  • Latest revision date: 2010-11-22
  • Compatible with: Master Data Management releases 4.0.2, 4.0.3
  • Downloads: 403

About: This is a model of the basic of the Janus Clinical Data Repository.
http://www.fda.gov/ForIndustry/DataStandards/StudyDataStandards/ucm155327.htm

Revision list

expand/collapse all

Revision 1.0 403 Downloads, Released on 2010-11-22
Download revision 1.0

Compatible with: 4.0.3, 4.0.2

the basic Janus clinical trials data model

Reviews (0)

Be the first to review this extension!

 

Submit review
Name:*
Email:*
Title:*
Please select your rating*
Review:*



77 ms