Skip to main content
Announcements
NEW: Seamless Public Data Sharing with Qlik's New Anonymous Access Capability: TELL ME MORE!
cancel
Showing results for 
Search instead for 
Did you mean: 
raadwiptec
Creator II
Creator II

text data splitting into columns and rows!

Hi all,

could somebody suggest a solution.

the source is a flat file and data is as below


id=transit secs="2010-03-07 11:36:44" route=6 mk=10 username=sam type=on schedule

id=peek secs="2010-03-07 11:36:44" route=9 mk=10 username=sam type=on schedule

id=dest secs="2010-03-07 11:36:44" route=10 mk=10 username=sam type=on schedule

id= arr secs="2010-03-07 11:36:44" route=12 mk=10 username=sam type=on schedule


for example:-

id=transit secs="2010-03-07 11:36:44" route=6 mk=10 username=sam type=on schedule. I am trying split each of them as a column

for ex

id =transit -- so id is a column here and 'transit' is the value

secs="2010-03-07 11:36:44" - so here I want to make 'secs' as the column and 2010-03-07 11:36:44" this is the value and this value may change accordingly with different timestamps and dates in this column

in qlikview if iam loading this I would like to have as below

id  ¦   secs ¦

22 Replies
raadwiptec
Creator II
Creator II
Author

hi claran,

As you mentioned can you please share how do I load each row as Data . so in my example if iam trying to load the file containing the below data as a atable

id=transit secs="2010-03-07 11:36:44" route=6 mk=10 username=sam type=on schedule

id=peek secs="2010-03-07 11:36:44" route=9 mk=10 username=sam type=on schedule

id=dest secs="2010-03-07 11:36:44" route=10 mk=10 username=sam type=on schedule

id= arr secs="2010-03-07 11:36:44" route=12 mk=10 username=sam type=on schedule

I have 2 options.

while loading the file if I give an option of none.. my columns look like @1,@2,@3

if I give option of embedded labels I get my column as

id=transit secs="2010-03-07 11:36:44" route=6 mk=10 username=sam type=on schedule

gautik92
Specialist III
Specialist III

use textbetween

raadwiptec
Creator II
Creator II
Author

hi claran

as you said in ur earlier post iam not able to load @1:n -how do I do this?

  1. TableName: 
  2. LOAD [@1:n] AS data 
  3. FROM 
  4. [test.txt] 
  5. (fix, codepage is 1252); 

I basically tried to concatenate the tables like below

TableName:
LOAD

@1&@2&@3&@4&@5&@6&@7&@8&@9&@10&@11 as data

FROM xxxx.txt

After which I tried to load the CleanData. But there iam getting the values also concatenated.

Digvijay_Singh

Based on amazing sugestion from ciaran.mcgowan‌ , I created the sample qvw. I think we shouldn't worry how data is loaded, let it be one line per record as suggested-

Input:

LOAD @1:n as data

FROM

(fix, codepage is 1252);

CleanData:

LOAD

SubField(SubField(data, 'id=', 2), 'secs=', 1) AS ID,

SubField(SubField(data, 'secs=', 2), 'route=', 1) AS Secs,

SubField(SubField(data, 'route=', 2), 'mk=', 1) AS Route,

SubField(SubField(data, 'mk=', 2), 'username=', 1) AS mk,

SubField(SubField(data, 'username=', 2), 'type=', 1) AS Username,

Subfield(data, 'type=', 2) AS Type

Resident  Input;

subfield.PNG

I just included mk also as a column in last suggested script by ciaran.mcgowan

Here are the qvw and sample file.

raadwiptec
Creator II
Creator II
Author

hi Ciaran,

very nice.. Iam getting good results.. I need to test with the complete file. I will let you know. can you please tell me the code ...how subfield is managing this?

thanks digvijay..

the problem with my load earlier was

(fix, codepage is 1252); I was using --

(txt, codepage is 1252, no labels, delimiter is spaces, msq); where I wan not able to use the :n values

ciaran_mcgowan
Partner - Creator III
Partner - Creator III

Hi raadwiptec,

Glad to hear it. Subfield works by splitting up a field into smaller pieces by looking through a field for a specified value, once it finds it, it will return everything to the left or right of it.

If you had a string called FRUIT that contained the following vaule: "Apple-Orange-Grape-Pear-Banana" it can be split up using the "-" as a separator like this:

Subfield(FRUIT,'-',1) AS Fruit1, will return Apple

Subfield(FRUIT,'-',2) AS Fruit2 will return Orange

Subfield(FRUIT,'-',3) AS Fruit3, will return Grape

Subfield(FRUIT,'-',4) AS Fruit4, will return Pear

Subfield(FRUIT,'-',5) AS Fruit5, will return Banana


In your case, I used Subfield twice so it will search for everything to the left of the desired text and everything to the right and return what's in between the two:


SubField(SubField(data, 'id=', 2), 'secs=', 1) AS ID


Breakdown:


Original: "id=transit secs="2010-03-07 11:36:44" route=6 mk=10 username=sam type=on schedule"


SubField(data, 'id=', 2) would return everything to the right of the "id=" text

     Result: "transit secs="2010-03-07 11:36:44" route=6 mk=10 username=sam type=on schedule"


SubField(data, 'secs=', 1) would return everything to the left of the "secs=" text

     Result: "id=transit "


Combining the two tells it for first search for everything to the right of "id=" and then, from the result of that, everything to the left of the "secs=" which is why SubField(SubField(data, 'id=', 2), 'secs=', 1) will return transit for the above example.


raadwiptec
Creator II
Creator II
Author

is it possible to take the year ,month and date from the secs column - ="2010-03-07 11:36:44"

as it is in quotes .. it is not recognizing if I give

load

year(Secs)  as year

resident clean Data

Digvijay_Singh

You can remove " by purgechar and then interpret it as Timestamp.

Timestamp#(PurgeChar(SubField(SubField(data, 'secs=', 2), 'route=', 1),'"'),'YYYY-MM-DD hh:mm:ss') AS Secs, 

Colin-Albert

>In your case, I used Subfield twice so it will search for everything to the left of the desired text and everything to the right and return what's in between the two:


Hi Ceiran,

The textbetween() function can do this eliminating the need to use two subfield functions


     textbetween(data, 'id=', 'secs=', ) as Secs

ciaran_mcgowan
Partner - Creator III
Partner - Creator III

Good to know!

Thanks.