Do not input private or sensitive data. View Qlik Privacy & Cookie Policy.
Skip to main content

Announcements
Qlik Connect 2026! Turn data into bold moves, April 13 -15: Learn More!
cancel
Showing results for 
Search instead for 
Did you mean: 
Anonymous
Not applicable

Apache Access Log Parsing using Regular Expression

Hello, 

I have an Apache Access Log (200MB) in the format : 

127.0.0.1 - - [05/Feb/2012:17:11:55 +0000] "GET / HTTP/1.1" 200 140 "-" "Mozilla/5.0 (Windows NT 6.1; WOW64) AppleWebKit/535.19 (KHTML, like Gecko) Chrome/18.0.1025.5 Safari/535.19"

With following Components : tFileInputDelimited (choosing the log file) -> tMap-> tMysqlOutput I was able to export the Logs to my Database. However, I would like to split the values available in the log for instance into separate column in the format: 

127.0.0.1 (col1)
05/Feb/2012:17:11:55 +0000 (col2)
GET / HTTP/1.1 (col3)
200 (col4)
140 (col5)
Mozilla/5.0 (Windows NT 6.1; WOW64) (col6)

According to the other similar threads, this is achievable by using regular expressions in the tMap. But I am very new to Talend and regular expressions parsing in Talend. Could anyone please help me how I could achieve this? 

 

Labels (2)
1 Solution

Accepted Solutions
akumar2301
Specialist II
Specialist II

try tApacheLogInput

View solution in original post

1 Reply
akumar2301
Specialist II
Specialist II

try tApacheLogInput