Difference between Talend Open Studio for Data Integration and Talend Open Studio for Big Data

StackOverflow https://stackoverflow.com/questions/19303654

  •  30-06-2022

Question

I'm confused by the different products that Talend offers.

It seems that Talend Open Studio for Data Integration and Talend Open Studio for Big Data are the same product but just with different components installed by default. Is that right? If I use Talend Open Studio for Data Integration and download the components I need from Talend Exchange (e.g. for Google BigQuery), is it effectively the same product?


Solution

Aside from the different connectors, they differ on one key point: the code they generate.

Talend Open Studio for Data Integration generates Java code, while Talend Open Studio for Big Data can generate MapReduce code as well as Java code.

See here for an example of Talend's Big Data offering showing how to generate MapReduce jobs.

OTHER TIPS

It's not only about components:

From: http://www.talend.com/products/data-integration

Features: Job Designer, 450+ Connectors, Versioning

From: http://www.talend.com/products/big-data

Features: Job Designer, Components for HDFS, HBase, HCatalog, Hive, Pig, Sqoop, Hadoop Job Scheduler, NoSQL Support

I'm not sure, but I don't think official components can be found on the Talend Exchange website. There you will only find community-contributed components.

In Talend Open Studio for Data Integration you can create a Standard Job: you use the DI components directly, and Java code is generated behind the scenes.

In Talend Open Studio for Big Data you also have the option to create a Map/Reduce Job, for which Map/Reduce code (actually Java Map/Reduce code) is generated behind the scenes. In a Map/Reduce Job, every component runs both map and reduce tasks on the Talend server. For example, on a tMap component you can see map and reduce task bars below the component while it executes.
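To make the "map" and "reduce" terminology concrete, here is a minimal sketch of the two phases in plain Java, using a word count as the example. This is not the code Talend generates and it does not use the Hadoop API; the class name MapReduceSketch and its methods are hypothetical, and both phases run in a single process purely to show the shape of the computation.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

// Hypothetical illustration only: NOT code generated by Talend, no Hadoop API involved.
class MapReduceSketch {

    // Map phase: emit a (word, 1) pair for every word in one input line.
    static List<Map.Entry<String, Integer>> map(String line) {
        List<Map.Entry<String, Integer>> pairs = new ArrayList<>();
        for (String word : line.toLowerCase().split("\\s+")) {
            if (!word.isEmpty()) {
                pairs.add(Map.entry(word, 1));
            }
        }
        return pairs;
    }

    // Reduce phase: sum the values emitted for each key.
    static Map<String, Integer> reduce(List<Map.Entry<String, Integer>> pairs) {
        Map<String, Integer> totals = new TreeMap<>();
        for (Map.Entry<String, Integer> pair : pairs) {
            totals.merge(pair.getKey(), pair.getValue(), Integer::sum);
        }
        return totals;
    }

    // Run the map phase over every line, then reduce all emitted pairs.
    static Map<String, Integer> run(List<String> lines) {
        List<Map.Entry<String, Integer>> emitted = new ArrayList<>();
        for (String line : lines) {
            emitted.addAll(map(line));
        }
        return reduce(emitted);
    }

    public static void main(String[] args) {
        // prints {big=1, data=2, integration=1}
        System.out.println(run(List.of("big data", "data integration")));
    }
}
```

A Standard Job, by contrast, would correspond to ordinary sequential Java without this split into two phases.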

Talend has different Open Studio products for different requirements.

Talend OS for Data Integration - Caters to Data Integration/ETL requirements. It does not include Big Data-specific components.

Talend OS for Big Data - Caters to Big Data requirements. You can connect to Hadoop, Hive and other Big Data technologies. Apart from generating Java code, it can also generate MapReduce code.

You also have Talend OS for ESB, MDM & Data Quality which cater to different requirements.

Talend does not differentiate between connectors. Spark is the differentiator between TOS and the licensed version.

Open Studio does not support Spark, in either batch or real-time mode.

Licensed under: CC-BY-SA with attribution
Not affiliated with StackOverflow