Unlock a world of possibilities! Login now and discover the exclusive benefits awaiting you.
Hello Community,
I iam trying to break the rows from this <BEITEM> tag by using tExtractDelimitedFields but what is happening is, only the 1st row is breaked and remaining rows are ignored.
Below is the data which i am trying to parse
<BEITEM>
INNSA170540481310201611RAW ASAFOETIDA (XL)RAW ASAFOETIDA1500KGS130190131249875IRFERAIDOON SHAMSAIEFSH100BARQ SQUARE,BANDAR ABBAS, IRANIRIRIRN.A.STCNR2FS~THE Food Safety Standard Authority Of India (FSSAI) may inspect, examine, test and other control before giving NOC/Clearance for food items of animal, plant or other origin as provided under relevant legal provision and other restriction if any based upon declared composition and other relevant parameter as declared or determined.This is LOW RISK INTERDICTION under 5% CATEGORY.------row number 1
INNSA170540481310201612RAW ASAFOETIDA (SHIR)RAW ASAFOETIDA1200KGS130190131090800IRAL-JAWI TRADING (L.L.C)FSH100P.O.BOX 12844,DUBAI (U.A.E.)AEIRIRN.A.STCNR2FS~THE Food Safety Standard Authority Of India (FSSAI) may inspect, examine, test and other control before giving NOC/Clearance for food items of animal, plant or other origin as provided under relevant legal provision and other restriction if any based upon declared composition and other relevant parameter as declared or determined.This is LOW RISK INTERDICTION under 5% CATEGORY.-----row no 2
</BEITEM>
there are some non printable special character which you can view in the attached document if you open it with notepad++ or VSCode. So what i did is i have replaced these characters with the | and using it as seperator in the textractdelimited fiels .
By using the same logic i have parsed the previous tags and they just worked fine
Please suggest me some solution for it
Thanks
Hello,
How did you set the separator in tExtractDelimitedFields component? Please note that tExtractDelimitedFields uses JAVA String split function, and it only accept the regex pattern.
Best regards
Sabrina
i am using "\\|" as a field separator
Hello,
Where does your input come from and which talend input component you are using to try to reads data from it? tFileInputFullRow?
Could you please post your current job design screenshot on community which will be helpful for us to address this issue.
Please mask your sensitive data.
Best regards
Sabrina