Three Error Handling Strategies in Talend Open Studio
Some Talend Open Studio job errors are alternate paths that, though infrequent, occur often enough to justify special programming. This programming may take the form of guard conditions: special logic that routes the special case to another subjob. For an example of this type of error, see this blog post on ETL Filter Patterns.
Other errors are related to system and network activity or are bugs. There are a few ways to handle this class of error in Talend Open Studio.
Do Nothing
For simple jobs, say an automated administrative task, you can rely on the exceptions that Talend Open Studio throws on its own. An example is a simple input-to-output job where a database failure while writing the output results in a system error. The error appears in the Run View as a red stack trace.
Simple Job with No Extra Error Handling Configured
Subjob and Component Triggers
Each subjob and component has a return code that can drive additional processing. The Subjob Ok/Error and Component Ok/Error triggers can be used to steer the error toward an error handling routine such as the tSendMail component. This example looks for a connection error (the database is off) or a file processing error (the database is on, but the table name is wrong).
Both an individual subjob and a finer-grain component can be tested. The screenshot shows two tSendMail routines being called from an OnSubjobError trigger.
Error Handling Tailored to the Subjob (or Component)
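The trigger idea can be sketched in plain Java. This is not the code Talend generates; it is a minimal illustration that assumes a subjob can be modeled as a unit of work that may throw. The names `Subjob`, `runWithTriggers`, and `sendMail` are hypothetical, and `sendMail` only records a subject line in place of a real tSendMail component.

```java
import java.util.ArrayList;
import java.util.List;

public class SubjobTriggerSketch {

    /** A unit of work that may fail, standing in for a Talend subjob (hypothetical). */
    interface Subjob {
        void run() throws Exception;
    }

    /** Hypothetical stand-in for tSendMail: records subjects instead of sending mail. */
    static final List<String> sentMail = new ArrayList<>();

    static void sendMail(String subject) {
        sentMail.add(subject);
    }

    /**
     * Runs a subjob, then fires exactly one follow-up action,
     * mimicking the OnSubjobOk and OnSubjobError triggers.
     * Returns true when the subjob completed without an exception.
     */
    static boolean runWithTriggers(Subjob subjob, Runnable onOk, Runnable onError) {
        try {
            subjob.run();
        } catch (Exception e) {
            onError.run();
            return false;
        }
        onOk.run();
        return true;
    }

    public static void main(String[] args) {
        // Subjob 1: the connection succeeds, so only its "ok" action runs.
        boolean connected = runWithTriggers(
                () -> { /* connection opened */ },
                () -> System.out.println("connected"),
                () -> sendMail("Connection error: database unreachable"));

        // Subjob 2: the load fails (wrong table name), so its own mail routine runs.
        if (connected) {
            runWithTriggers(
                    () -> { throw new Exception("invalid table name"); },
                    () -> System.out.println("loaded"),
                    () -> sendMail("File processing error: check the table name"));
        }
        System.out.println(sentMail);
    }
}
```

Each subjob carries its own error action, which is what makes the tailored messages possible. It is also why every subjob needs its own wiring and its own testing.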
Sometimes, there is a need for this level of detail. You may want to email a file that represents an intermediate stage of processing. That file exists only at certain points in the job, so not every failure handler can attach it.
tAssertCatcher
A more general strategy is to define an error handling subjob to be performed when an error -- any error -- occurs. This has the important advantage of consolidating the error handling, dramatically reducing testing. It puts the burden of testing for error conditions on Talend (where it belongs).
To implement the general strategy, use the tAssertCatcher component, which is invoked whenever any component throws an error.
A Shared Error Handler with tAssertCatcher
tAssertCatcher Config
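In ordinary Java terms, the shared handler resembles a single catch site that every component's failure reaches. The sketch below is an analogy, not Talend's generated code. `Component`, `runJob`, and `handleError` are hypothetical names, and the component labels in `main` are illustrative.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

public class SharedErrorHandlerSketch {

    /** A step that may fail, standing in for a Talend component (hypothetical). */
    interface Component {
        void run() throws Exception;
    }

    /** Everything the shared handler has seen, kept in memory for inspection. */
    static final List<String> errorLog = new ArrayList<>();

    /** The one shared handler, playing the role of the tAssertCatcher subjob. */
    static void handleError(String componentName, Exception e) {
        errorLog.add(componentName + ": " + e.getMessage());
    }

    /**
     * Runs the components in insertion order. The first failure, from any
     * component, goes to the shared handler and stops the job.
     * Returns true when every component succeeded.
     */
    static boolean runJob(Map<String, Component> components) {
        for (Map.Entry<String, Component> entry : components.entrySet()) {
            try {
                entry.getValue().run();
            } catch (Exception e) {
                handleError(entry.getKey(), e);
                return false;
            }
        }
        return true;
    }

    public static void main(String[] args) {
        Map<String, Component> job = new LinkedHashMap<>();
        job.put("connection", () -> { /* login succeeds */ });
        job.put("output", () -> { throw new Exception("unique constraint violation"); });
        runJob(job);
        System.out.println(errorLog);
    }
}
```

Because there is only one handler, there is only one error path to test, no matter how many components the job contains.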
In the following screenshot, the database component tMSSqlOutput_1 has "Die on error" set. If the flag is not set, tMSSqlOutput_1 prints a message and the tAssertCatcher is not called. This particular example caught errors from the connection component (bad login) and the tMSSqlOutput component (DB-generated unique constraint violation and invalid insert of identity column).
An Example with Database Components
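The effect of the "Die on error" flag can be pictured with a small Java sketch. It assumes the behavior described above: with the flag set, the failure propagates to the shared catcher; without it, the component only prints a message. The class and method names are hypothetical, and the simulated failure stands in for a real database error.

```java
import java.util.ArrayList;
import java.util.List;

public class DieOnErrorSketch {

    /** Messages that reached the shared handler (the tAssertCatcher role). */
    static final List<String> caughtByHandler = new ArrayList<>();

    /**
     * Simulates an output component whose write always fails.
     * With dieOnError set, the exception is rethrown to the caller;
     * otherwise it is printed and swallowed, and the job carries on.
     */
    static void writeRow(boolean dieOnError) {
        try {
            throw new IllegalStateException("unique constraint violation");
        } catch (IllegalStateException e) {
            if (dieOnError) {
                throw e;
            }
            System.err.println("output component: " + e.getMessage());
        }
    }

    /** Runs the job; returns true when no error reached the shared handler. */
    static boolean runJob(boolean dieOnError) {
        try {
            writeRow(dieOnError);
            return true;
        } catch (RuntimeException e) {
            caughtByHandler.add(e.getMessage());
            return false;
        }
    }

    public static void main(String[] args) {
        System.out.println("flag off, job reported ok: " + runJob(false));
        System.out.println("flag on, job reported ok: " + runJob(true));
        System.out.println("handler saw: " + caughtByHandler);
    }
}
```

With the flag off, the job reports success even though the write failed, which is why the flag matters when you rely on a shared handler.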
Let Talend Work
Handling system errors is different from handling the alternate paths and conditions that arise while coding a Talend job. Sometimes, you'll have a specific error routine for a specific system error condition. But where possible, let Talend throw the system errors and catch them with a tAssertCatcher.