Skip to main content
Announcements
Join us at Qlik Connect for 3 magical days of learning, networking,and inspiration! REGISTER TODAY and save!
cancel
Showing results for 
Search instead for 
Did you mean: 
DTiwari818
Contributor
Contributor

i am not able to extract the delimited fields for some rows

Hello Community,

I iam trying to break the rows from this <BEITEM> tag by using tExtractDelimitedFields but what is happening is, only the 1st row is breaked and remaining rows are ignored.

Below is the data which i am trying to parse

<BEITEM>

INNSA170540481310201611RAW ASAFOETIDA (XL)RAW ASAFOETIDA1500KGS130190131249875IRFERAIDOON SHAMSAIEFSH100BARQ SQUARE,BANDAR ABBAS, IRANIRIRIRN.A.STCNR2FS~THE Food Safety Standard Authority Of India (FSSAI) may inspect, examine, test and other control before giving NOC/Clearance for food items of animal, plant or other origin as provided under relevant legal provision and other restriction if any based upon declared composition and other relevant parameter as declared or determined.This is LOW RISK INTERDICTION under 5% CATEGORY.------row number 1

INNSA170540481310201612RAW ASAFOETIDA (SHIR)RAW ASAFOETIDA1200KGS130190131090800IRAL-JAWI TRADING (L.L.C)FSH100P.O.BOX 12844,DUBAI (U.A.E.)AEIRIRN.A.STCNR2FS~THE Food Safety Standard Authority Of India (FSSAI) may inspect, examine, test and other control before giving NOC/Clearance for food items of animal, plant or other origin as provided under relevant legal provision and other restriction if any based upon declared composition and other relevant parameter as declared or determined.This is LOW RISK INTERDICTION under 5% CATEGORY.-----row no 2

</BEITEM>

 

there are some non printable special character which you can view in the attached document if you open it with notepad++ or VSCode. So what i did is i have replaced these characters with the | and using it as seperator in the textractdelimited fiels .

 

By using the same logic i have parsed the previous tags and they just worked fine

Please suggest me some solution for it

Thanks

Labels (2)
3 Replies
Anonymous
Not applicable

Hello,

How did you set the separator in tExtractDelimitedFields component? Please note that tExtractDelimitedFields uses JAVA String split function, and it only accept the regex pattern.

Best regards

Sabrina

DTiwari818
Contributor
Contributor
Author

i am using "\\|" as a field separator

 

Anonymous
Not applicable

Hello,

Where does your input come from and which talend input component you are using to try to reads data from it? tFileInputFullRow?

Could you please post your current job design screenshot on community which will be helpful for us to address this issue.

Please mask your sensitive data.

Best regards

Sabrina