Solved: Re: Reading CSV files with different schemas - Qlik Community

RMotta2408 · ‎2023-05-23

Hello everyone,

I want to develop a Job that reads CSV files from a folder and always displays the results in the same column order.

The problem is, there's no fixed schema for these files.

I mean, the files may come with different schemas:

Column_A, Column_B, Column_C, Column_D
Column_B, Column_D, Column_A, Column_C
Column_D, Column_B, Column_C, Column_A

... any possible combination.

My question is: can the Job be dynamic enough so that, no matter the order of the columns, the files always get read and the results are displayed in the right order (A, B, C, D)?

Thank you so much,

Rui

Anonymous · ‎2023-05-24

Hello

You need to use a dynamic schema. Your requirement is similar to the one described in this KB article.

Regards

Shong

RMotta2408 · ‎2023-05-24

Hi @Shicong Hong ,

I will have a look RIGHT NOW!

Thank you

RMotta2408 · ‎2023-05-24

The link is inaccessible. 😮

Anonymous · ‎2023-05-24

@Rui Motta, sorry, I didn't notice that the article is only visible to internal staff. Let me think about what I can do; otherwise I will do my best to provide an example.

RMotta2408 · ‎2023-05-24

Hi there,

I've found a solution.

For each CSV file tFileList finds, I do this:

1- tFileInputDelimited

schema: column01 (dynamic).

2- tExtractDynamicFields

input schema: column01 (dynamic).
output schema: columnA, columnB, columnC, columnD.

3- tDBOutput

This way, no matter which columns the CSV file contains or in what order, my Job only picks up the values of these 4 columns. If one or more of them are missing from a file, the Job sets that column to null.
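
For anyone who wants to see the idea outside Talend Studio, here is a minimal Python sketch of the same by-name mapping. It is only an illustration of the logic, not what the Talend components do internally. The function name project_rows and the sample file contents (including the "extra" column) are made up for the example.

```python
import csv
import io

# Target column order; mirrors the output schema listed in step 2 of the post.
TARGET_COLUMNS = ["columnA", "columnB", "columnC", "columnD"]


def project_rows(csv_text, target_columns=TARGET_COLUMNS):
    """Yield each data row as a list in target order, matching columns by header name."""
    reader = csv.DictReader(io.StringIO(csv_text), skipinitialspace=True)
    for row in reader:
        # dict.get returns None for a column the file does not contain;
        # columns that are not in target_columns are simply ignored.
        yield [row.get(name) for name in target_columns]


# Hypothetical inputs: one file with reordered columns,
# one file missing columnC and carrying an unrelated extra column.
file_1 = "columnB,columnD,columnA,columnC\nb1,d1,a1,c1\n"
file_2 = "columnD,extra,columnA,columnB\nd2,x2,a2,b2\n"

rows_1 = list(project_rows(file_1))
rows_2 = list(project_rows(file_2))
print(rows_1)
print(rows_2)
```

The key point is that values are looked up by header name rather than by position, so the column order in each file stops mattering.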

Hope this helps someone.

Thank you.

Anonymous · ‎2023-05-24

That is a good solution! Thank you for sharing, @Rui Motta.

Regards

Shong

RMotta2408 · ‎2023-05-25

My pleasure.

Rui

Talend Data Integration

Talend Data Preparation

Talend Studio

v8.x