<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Fasta file in Talend in Talend Data Catalog</title>
    <link>https://community.qlik.com/t5/Talend-Data-Catalog/Fasta-file-in-Talend/m-p/2335267#M1396</link>
    <description>&lt;P&gt;Hello,&lt;/P&gt;
&lt;P&gt;How did you row separactor and field separator in input component?&lt;/P&gt;
&lt;P&gt;From your requirement, we can create an input schema where you can take the row separator and field separator according to your Fasta file in and then use tMap component to pick the desired output columns.&lt;/P&gt;
&lt;P&gt;Best regards&lt;/P&gt;
&lt;P&gt;Sabrina&lt;/P&gt;</description>
    <pubDate>Mon, 15 Jan 2018 08:01:43 GMT</pubDate>
    <dc:creator>Anonymous</dc:creator>
    <dc:date>2018-01-15T08:01:43Z</dc:date>
    <item>
      <title>Fasta file in Talend</title>
      <link>https://community.qlik.com/t5/Talend-Data-Catalog/Fasta-file-in-Talend/m-p/2335266#M1395</link>
      <description>&lt;P&gt;Dear Talend, I am having a problem to read a fasta file from talend.&lt;BR /&gt;I am still new to talend open studio for big data.&lt;/P&gt; 
&lt;P&gt;&lt;BR /&gt;Th&lt;FONT size="3"&gt;e Fasta file is as Follows:&lt;/FONT&gt;&lt;/P&gt; 
&lt;P&gt;&lt;FONT size="1 2 3 4 5 6 7"&gt;&amp;gt;FAM138A ENST00000417324 1:35138-35736(-)&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1 2 3 4 5 6 7"&gt;atgctgctgactatagagacaaagtctcactatgttgctcaggctggtcttgaactcctggcctcaagcgatcctcccac&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1 2 3 4 5 6 7"&gt;ctcagcctcccaaagtgttgggattatagacatgagccactgcacctggccgaccttgggcaagttcttaaacccttcaa&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1 2 3 4 5 6 7"&gt;agcctcatttttctccaatcacaaaagggaaagatggtaatattttccccaccaaattcttgtcggatgccctcacagaa&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1 2 3 4 5 6 7"&gt;ttgagattatgtacgtaa&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1 2 3 4 5 6 7"&gt;&amp;gt;ENSG00000197490 ENST00000359752 1:37397-54936(+)&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1 2 3 4 5 6 7"&gt;atgttgctcaccttatgggcagggtctcactatgttgctgaggctggtctcaaactcctgacctcaagcaatctgtctgc&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1 2 3 4 5 6 7"&gt;ttcagcctcccaagtagctgagaatacagggacaagccattgcacctga&lt;/FONT&gt;&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;&lt;FONT size="3"&gt;I have&amp;nbsp; tried to use several input components like tFileInputDelimited, tFileInputMSDelimited and so on but i dont know a standard way to read the fasta file from talend.&lt;BR /&gt;I have also tried to used some process component like tMap, tJavaRow and tJavaFlex. But i could not get the output i want.&lt;/FONT&gt;&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;&lt;FONT size="3"&gt;My objective is to extract each information from the fasta file and store it in a csv file.&lt;/FONT&gt;&lt;/P&gt; 
&lt;P&gt;&lt;FONT size="3"&gt;Can someone help me, i am stuck with that for more than 2 weeks.&lt;/FONT&gt;&lt;/P&gt; 
&lt;P&gt;&lt;FONT size="3"&gt;&lt;BR /&gt;The output should be as followed:&lt;/FONT&gt;&lt;/P&gt; 
&lt;P&gt;&lt;FONT size="2"&gt;&lt;FONT size="1 2 3 4 5 6 7"&gt;FAM138A; ENST00000417324 1:35138-35736(-); atgctgctgactatagagacaaagtctcactatgttgctcaggctggtcttgaactcctggcctcaagcgatcctcccacctcagcctcccaaagtgttgggattatagacatgagccactgcacctggccgaccttgggcaagttcttaaacccttcaaagcctcatttttctccaatcacaaaagggaaagatggtaatattttccccaccaaattcttgtcggatgccctcacagaattgagattatgtacgtaa&lt;/FONT&gt;&lt;/FONT&gt;&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;&lt;FONT size="2"&gt;&lt;FONT size="1 2 3 4 5 6 7"&gt;FAM138A;ENST00000417324;1:35138-35736(-);atgctgctgactatagagacaaagtctcactatgttgctcaggctggtcttgaactcctggcctcaagcgatcctcccacctcagcctcccaaagtgttgggattatagacatgagccactgcacctggccgaccttgggcaagttcttaaacccttcaaagcctcatttttctccaatcacaaaagggaaagatggtaatattttccccaccaaattcttgtcggatgccctcacagaattgagattatgtacgtaa&lt;/FONT&gt;&lt;/FONT&gt;&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Thu, 11 Jan 2018 08:12:46 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Data-Catalog/Fasta-file-in-Talend/m-p/2335266#M1395</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2018-01-11T08:12:46Z</dc:date>
    </item>
    <item>
      <title>Re: Fasta file in Talend</title>
      <link>https://community.qlik.com/t5/Talend-Data-Catalog/Fasta-file-in-Talend/m-p/2335267#M1396</link>
      <description>&lt;P&gt;Hello,&lt;/P&gt;
&lt;P&gt;How did you row separactor and field separator in input component?&lt;/P&gt;
&lt;P&gt;From your requirement, we can create an input schema where you can take the row separator and field separator according to your Fasta file in and then use tMap component to pick the desired output columns.&lt;/P&gt;
&lt;P&gt;Best regards&lt;/P&gt;
&lt;P&gt;Sabrina&lt;/P&gt;</description>
      <pubDate>Mon, 15 Jan 2018 08:01:43 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Data-Catalog/Fasta-file-in-Talend/m-p/2335267#M1396</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2018-01-15T08:01:43Z</dc:date>
    </item>
  </channel>
</rss>

