Do not input private or sensitive data. View Qlik Privacy & Cookie Policy.
Skip to main content

Announcements
Join us at Qlik Connect 2026 in Orlando, April 13–15: Register Here!

Machine learning components are not available in Studio

No ratings
cancel
Showing results for 
Search instead for 
Did you mean: 
TalendSolutionExpert
Contributor II
Contributor II

Machine learning components are not available in Studio

Last Update:

Feb 9, 2024 1:22:49 PM

Updated By:

Jamie_Gregory

Created date:

Apr 1, 2021 6:09:49 AM

Talend Version   6.3.1

Summary

Machine learning components are not available in Studio
Additional Versions 
ProductBig Data
ComponentComponents
Problem Description

To use Talend machine learning components, you need one of these licenses:

  • Talend Big Data Platform (with just a Talend Big Data license, machine learning components are not available)
  • Talend Real-time Big Data Platform
  • Talend Big Data Fabric

A Talend Real-time Big Data Platform license is active in Talend Studio, TAC 6.3.1. However, after creating a Big Data batch Job (Spark) for a remote project, the machine learning components are not available in Studio.

Problem root cause

A Talend Real-time Big Data Platform license (or Data Fabric license) allows you to have machine learning components available for use in a Spark Big Data batch or streaming Job. With a Real-time Big Data Platform license, you can create two types of projects and users in TAC:

  • Data Integration/ESB
  • Data Quality

A Data Quality project type enables Big Data features such as machine learning, but a Data Integration/ESB project type does not. This is due to the remote project type defined in TAC being Data Integration/ESB, and the Job being created in this project.

Solution or Workaround

The solution consists of ensuring that:

  1. The Talend license used in Talend Studio/TAC is Talend Big Data Platform, Talend Real-time Big Data Platform, or Talend Data Fabric license.
  2. The Talend Job using machine learning components is a Spark Job (Big Data streaming/batch), since machine learning components rely on the Spark MLib libraries.
  3. If the Talend Real-time Big Data Platform license is activated, and the Talend Job belongs to a remote project, the project is of the Data Quality type, and Talend Studio connects to TAC as a Data Quality type user.
JIRA ticket number 
Version history
Last update:
‎2024-02-09 01:22 PM
Updated by: