Do not input private or sensitive data. View Qlik Privacy & Cookie Policy.
Skip to main content

Announcements
Discover how organizations are unlocking new revenue streams: Watch here
cancel
Showing results for 
Search instead for 
Did you mean: 
mahesh_Dev
Contributor III
Contributor III

Unstructured to structure?

Hi Everyone,

I had a file see below

A.B.cc.abc.76= kal jahs jahs
B.c.ggg.7282 = kahvevv 827
Haj.gggh.8282777.8282626 = jwvbwh 282

How to achieve as structure format by using talend

Output
Column 1 Column 2 Column 3    Column 4
A                B                cc             Kal jahs
B                c                72              Kah vevv


How should we achieve and what should the flow
Could you help me

Labels (4)
1 Solution

Accepted Solutions
fdenis
Master
Master

you can use tExtractDelimitedFields with "."
then tExtractDelimitedFields with "=" on columne4

did you try it?

View solution in original post

6 Replies
fdenis
Master
Master

Use tExtractDelimitedFields with "." as separator
then use tMap to get your fields
Anonymous
Not applicable

Are you sure your example is correct? It doesn't appear to follow a logical structure. For example, lines 1 and 2 appear to have values separated by . and =.....

 

A.B.cc.abc.76= kal jahs jahs
B.c.ggg.7282 = kahvevv 827
Haj.gggh.8282777.8282626 = jwvbwh 282

 

But your output for row 1 seems to take the 1st, 2nd and 3rd values separated by ".", then takes the first 8 characters after "=" (counting the spaces). Your output for row 2 takes the 1st, 2nd and the first 2 characters of the 4th values separated by ".". It then seems to get the first 8 characters after the "=" and adds a space after the first 3 of those.

 

The 3rd row is ignored.


Output
Column 1 Column 2 Column 3    Column 4
A                B                cc             Kal jahs
B                c                72              Kah vevv

 

I think you need to elaborate significantly on the rules and mappings as at the moment, there is no apparent logic to what you want.

mahesh_Dev
Contributor III
Contributor III
Author

Hi

 

Similar data which i had is below

 

thejob.work_data.working_on_data.name = abcabc

thejob2.work_on_data1.working_data_on_on.description = abc22

thejob3_work_data3.name = hbscc

 

 

i need the output like below

column1      column2                  column3                      column4         column5

thejob           work_data                working_ondata           name             abcabc

thejob2         work_on_data1        working_data_on         description       abc22

thejob3         work_data3               null                               name             hbscc

 

how do we achieve it like 

i am writing regular expression i am not able to take eveything

 

fdenis
Master
Master

you can use tExtractDelimitedFields with "."
then tExtractDelimitedFields with "=" on columne4

did you try it?
fdenis
Master
Master

does it solve your problem?
mahesh_Dev
Contributor III
Contributor III
Author

yes i solved my problem thanks for helping 0683p000009MACn.png