Talend from the Roots

"More than 80% of all organizations are running a significant amount of batch jobs involving bulk integration of data."
Gartner Magic Quadrant for Data Integration Tools report, 2016

  • Does your organization need an ETL tool that can handle all your data integration and transformation-related jobs efficiently?
  • Are you a beginner ETL developer who needs software without the hassle of complex coding?

Worry no more!

Talend is one of the leading open source ETL and data integration solutions. Its interactive, intuitive drag-and-drop user interface enables rapid development of data transformation processes. Its product line covers big data, cloud, data integration, data management, Master Data Management, data quality, data preparation and enterprise application integration, all backed by a set of coding-friendly tools.

In this series, we'll learn how to use Talend Open Studio effectively, whether you are a complete beginner or an advanced developer. We'll help you better understand your favorite ETL software by designing simple to intermediate jobs based on real-life applications, with tips and tricks along the way, so that data transformation becomes less of a hassle and more of a fun-filled learning experience.

So, let’s start, shall we?

Today we'll cover installing Talend Open Studio on your Linux, Windows or Mac work machine. This guide uses the 64-bit Talend Open Studio for ESB, version 6.1.1.

Prerequisites

When you move your office to a new location, you check for certain prerequisites like the locality, the space and the environment, don't you? Similarly, software needs certain prerequisites in place before installation so that it can run smoothly and without errors. For Talend these are:

  1. Java
  2. Database client software

Java – Talend's development environment is based on Eclipse and generates Java code, so a JDK (which includes a JRE) must be installed on the work machine beforehand.

Database client software – Since data integration and transformation involve working with databases and data warehouses, having database client software already installed on the local machine is useful. Install any DB client of your choice: Oracle, SQL Server, Sybase, etc.
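To confirm the Java prerequisite before going further, a quick check along these lines may help. It is only a sketch: `check_tool` is a helper name made up for this post, and it only tests whether the commands are on the PATH, not which Java version they belong to.

```shell
#!/bin/sh
# Report whether a command is available on the PATH.
# Prints "<name>: found" or "<name>: missing"; returns non-zero when missing.
# check_tool is a hypothetical helper, not part of Talend or Java.
check_tool() {
    if command -v "$1" >/dev/null 2>&1; then
        echo "$1: found"
    else
        echo "$1: missing"
        return 1
    fi
}

# java is the runtime launcher; javac is the compiler that ships with the JDK.
for tool in java javac; do
    check_tool "$tool" || echo "  install a JDK before running the Talend installer"
done
```

If both lines report "found", running `java -version` shows which release is installed.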

Downloading Talend

Visit the following link and download the Talend Open Studio for ESB .zip file by clicking the "Download free tool" button.

http://www.talend.com/download/talend-open-studio/#t3

Note that the .zip file contains binaries for ALL platforms (Linux/Unix, Windows and MacOS). For Talend Open Studio for ESB, both Talend Studio and Talend Runtime are bundled together.

Extraction and installation

You’ll find a compressed file, of the format: TOS_ESB-YYYYYYYY_YYYY-VA.B.C.zip

  1. Extract the zip file to a directory of your choice.
  2. Run the installer for your platform from the list below and follow the on-screen instructions:

For Windows

  • Talend-Tools-Installer-rYYYY-VA.B.C-windows-installer.exe for 32-bit and 64-bit Windows users.

For Linux

  • Talend-Tools-Installer-rYYYY-VA.B.C-linux-installer.run and Talend-Tools-Installer-rYYYYY-VA.B.C-linux64-installer.run for 32-bit and 64-bit Linux users respectively.

For MacOS

  • Talend-Tools-Installer-rYYYYY-VA.B.C-osx-installer.app, found in the .tar file called Talend-Installer-Tools-rYYYYY-VA.B.C-osx-installer.app.tar, for MacOS users.

Note: YYYY is the revision number and A.B.C is the revision level (Major.Minor.Patch).
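If you script your installs, the naming pattern above makes it easy to read the version straight from the archive name. The snippet below is a small sketch using plain POSIX parameter expansion; `archive_version` is a made-up helper and the file name is only an illustrative example of the pattern, so substitute the name of the file you actually downloaded.

```shell
#!/bin/sh
# Extract the A.B.C version from an archive name of the form
# TOS_ESB-<timestamp>-VA.B.C.zip using POSIX parameter expansion.
# archive_version is a hypothetical helper written for this post.
archive_version() {
    name=${1##*/}        # drop any leading directory
    name=${name%.zip}    # drop the .zip extension
    echo "${name##*-V}"  # keep what follows the last "-V"
}

# Illustrative file name only; use your own download here.
archive_version "TOS_ESB-20151214_1327-V6.1.1.zip"

# Extracting to a directory of your choice would then look like:
#   unzip "TOS_ESB-20151214_1327-V6.1.1.zip" -d "$HOME/talend"
```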

Configure the memory settings

If you want to tune the memory allocated to the JVM, you only need to edit the .ini file that sits next to the executable you launch. In the file names below, DI/DQ/BD is shorthand for the product code, so the actual file starts with TOS_DI, TOS_DQ or TOS_BD. For example:

For Talend Open Studio on Windows, edit the file: TOS_DI/DQ/BD-win32-x86.ini

For Talend Open Studio on Linux, edit the file: TOS_DI/DQ/BD-linux-gtk-x86.ini

The default values are:

 -vmargs -Xms40m -Xmx500m -XX:MaxPermSize=128m

If your computer is low on memory (for example, 512 MB), you can reduce the allocation as follows:

-vmargs -Xms40m -Xmx256m -XX:MaxPermSize=64m
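Editing the file by hand is perfectly fine, but if you prefer to script the change, something like the following sketch could work. `set_max_heap` is a helper invented for this post, and the demonstration runs on a throwaway temporary file holding the default values quoted above rather than on a real Talend .ini file. Back up your own .ini before trying it.

```shell
#!/bin/sh
# Rewrite the -Xmx (maximum heap) value in a launcher .ini file.
# Usage: set_max_heap <ini-file> <size>   e.g. set_max_heap studio.ini 256m
# set_max_heap is a hypothetical helper, not a Talend command.
set_max_heap() {
    ini=$1
    size=$2
    tmp=$(mktemp) || return 1
    # Replace the -Xmx<digits><unit> token on each line with the requested size.
    if sed "s/-Xmx[0-9][0-9]*[kKmMgG]/-Xmx$size/" "$ini" > "$tmp"; then
        cat "$tmp" > "$ini"
        status=$?
    else
        status=1
    fi
    rm -f "$tmp"
    return "$status"
}

# Demonstration on a temporary file containing the default values.
demo=$(mktemp)
printf '%s\n' '-vmargs -Xms40m -Xmx500m -XX:MaxPermSize=128m' > "$demo"
set_max_heap "$demo" 256m
cat "$demo"
rm -f "$demo"
```

Only the -Xmx value is touched; -Xms and -XX:MaxPermSize are left as they were, so adjust those by hand if needed.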

 

Running Talend Open Studio

After a successful installation, your Talend ESB is ready to use. To launch it:

On Windows

  • Double-click the .exe file to launch Talend Open Studio.

On Unix-like systems

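  • Add execution rights to the desired "TOS_ESB*" binary before launching it.

On a standard Linux system, the commands look like this (shown for the 32-bit Data Integration binary; substitute the binary that matches your product and architecture):

$ chmod +x TOS_DI-linux-gtk-x86

$ ./TOS_DI-linux-gtk-x86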
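If you are not sure which Linux binary matches your machine, a sketch like the one below can work it out from `uname -m`. Two things here are assumptions: `launcher_suffix` is a made-up helper, and the `linux-gtk-x86_64` suffix for 64-bit machines follows the usual Eclipse naming rather than anything quoted in this guide, so check it against the files in your extracted directory.

```shell
#!/bin/sh
# Map a machine architecture (as printed by `uname -m`) to a launcher suffix.
# launcher_suffix is a hypothetical helper; the x86_64 suffix is an assumption
# based on common Eclipse launcher naming.
launcher_suffix() {
    case "$1" in
        x86_64|amd64) echo "linux-gtk-x86_64" ;;
        *)            echo "linux-gtk-x86" ;;
    esac
}

# Assumed product prefix; change it to TOS_DI, TOS_DQ or TOS_BD as needed.
launcher="TOS_ESB-$(launcher_suffix "$(uname -m)")"
echo "Launcher for this machine: $launcher"

# To start the Studio from the extracted directory:
#   chmod +x "$launcher" && "./$launcher"
```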

On Mac OS X

  • Unpack the TOS_DI/DQ/BD-macosx-cocoa.app folder and launch Talend Open Studio.

Conclusion

So we've taken our first step on the journey to learn and understand Talend Open Studio. Today we were introduced to a tool that can help you solve your ETL problems quickly and give your productivity a much-needed boost.

Stay tuned and in no time you'll be handling data transformation with ease. In our next session, we'll learn about metadata and schemas and develop our first Talend job!


Also, if you are an advanced ETL developer, check out our post about Realtime Response using Talend ESB to learn a pro tip!

What do you think about our post? Do you have any ideas or queries?

Let us know in the comments.