<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: is there a Talend function I could extract a string pattern from a txt file? in Talend Studio</title>
    <link>https://community.qlik.com/t5/Talend-Studio/is-there-a-Talend-function-I-could-extract-a-string-pattern-from/m-p/2330145#M99241</link>
    <description>&lt;P&gt;Use tFileInputFullRow to read the text file line by line, then use tFilterRow (in advanced mode) to keep only the lines that start with a fixed string such as "Series Accession":&lt;/P&gt;&lt;P&gt;input_row.line.startsWith("Series Accession")&lt;/P&gt;&lt;P&gt;After filtering, you are left with lines such as:&lt;/P&gt;&lt;P&gt;Series Accession: GSE152999 ID: 200152999&lt;/P&gt;&lt;P&gt;You then only need to write a little Java code to extract the data you need.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Regards&lt;/P&gt;&lt;P&gt;Shong&lt;/P&gt;</description>
    <pubDate>Wed, 07 Apr 2021 03:54:24 GMT</pubDate>
    <dc:creator>Anonymous</dc:creator>
    <dc:date>2021-04-07T03:54:24Z</dc:date>
    <item>
      <title>is there a Talend function I could extract a string pattern from a txt file?</title>
      <link>https://community.qlik.com/t5/Talend-Studio/is-there-a-Talend-function-I-could-extract-a-string-pattern-from/m-p/2330140#M99236</link>
      <description>&lt;P&gt;Dear all, &lt;/P&gt;&lt;P&gt;Thanks for your help. I am a beginner and I would like to extract all words that start with "GGD" from a txt document. Of course, I could do this in Java, but I am asking whether there is a Talend function that could do this easily.&lt;/P&gt;&lt;P&gt;Best, &lt;/P&gt;&lt;P&gt;Xinhui&lt;/P&gt;</description>
      <pubDate>Sat, 16 Nov 2024 00:29:24 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/is-there-a-Talend-function-I-could-extract-a-string-pattern-from/m-p/2330140#M99236</guid>
      <dc:creator>Xinhui</dc:creator>
      <dc:date>2024-11-16T00:29:24Z</dc:date>
    </item>
    <item>
      <title>Re: is there a Talend function I could extract a string pattern from a txt file?</title>
      <link>https://community.qlik.com/t5/Talend-Studio/is-there-a-Talend-function-I-could-extract-a-string-pattern-from/m-p/2330141#M99237</link>
      <description>&lt;P&gt;Hi &lt;/P&gt;&lt;P&gt;You can read the content line by line, or read the whole content as a single string, and then extract the data using a regex. Can you show us an example of the txt document, so that we can help you further?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Regards&lt;/P&gt;&lt;P&gt;Shong&lt;/P&gt;</description>
      <pubDate>Fri, 05 Mar 2021 05:02:54 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/is-there-a-Talend-function-I-could-extract-a-string-pattern-from/m-p/2330141#M99237</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2021-03-05T05:02:54Z</dc:date>
    </item>
    <item>
      <title>Re: is there a Talend function I could extract a string pattern from a txt file?</title>
      <link>https://community.qlik.com/t5/Talend-Studio/is-there-a-Talend-function-I-could-extract-a-string-pattern-from/m-p/2330142#M99238</link>
      <description>&lt;P&gt;Thanks! I just read your message. I have a file like the one below. I would like to extract every GSE number, such as GSE160804 in "Accession: GSE160804". I used tFileInputDelimited with ":" as the field separator, then tFilterRow in advanced mode with input_row.columnName1.startsWith("Series"). Unfortunately, it does not work. Could you help me?&lt;/P&gt;&lt;P&gt;----------------------------&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;1. Integrated analysis of DNA methylation and gene expression profiles identified S100A9 as a potential biomarker in ulcerative colitis&lt;/P&gt;&lt;P&gt;(Submitter supplied) In this research,&amp;nbsp;90 differential expression mRNAs (DEMs), 72 differential expression lncRNAs (DELs) and biological functions and pathway were identified in ulcerative colitisby (UC) integrated analysis. Potential therapeutic target for treatment was preliminary verified by qRT-PCR experiment and bioinformatics analysis.&lt;/P&gt;&lt;P&gt;Organism:	Homo sapiens&lt;/P&gt;&lt;P&gt;Type:		Expression profiling by array; Non-coding RNA profiling by array&lt;/P&gt;&lt;P&gt;Platform: GPL20115 6 Samples&lt;/P&gt;&lt;P&gt;FTP download: GEO (TXT) ftp://ftp.ncbi.nlm.nih.gov/geo/series/GSE160nnn/GSE160804/&lt;/P&gt;&lt;P&gt;Series		Accession: GSE160804	ID: 200160804&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;2. Induced organoids derived from patients with ulcerative colitis recapitulate the colitic reactivity&lt;/P&gt;&lt;P&gt;(Submitter supplied) We report the application of single nucleus RNA-seq technology for transcriptomic-wide profiling of the induced organoids derived from patients with ulcerative colitis (iHUCO) and normal induced organoids derived from the healthy colon (iHNO), along with their parental fibroblasts. By comapring the nucleus profiles of both iHUCOs and their parental fibroblasts (UC FBs) to iHNOs and normal fibroblasts (NL FBs), we found unique signatures exclusive to the UC samples but not the controls. 
more...&lt;/P&gt;&lt;P&gt;Organism:	Homo sapiens&lt;/P&gt;&lt;P&gt;Type:		Expression profiling by high throughput sequencing&lt;/P&gt;&lt;P&gt;Platform: GPL24676 11 Samples&lt;/P&gt;&lt;P&gt;FTP download: GEO (MTX, TSV) ftp://ftp.ncbi.nlm.nih.gov/geo/series/GSE152nnn/GSE152999/&lt;/P&gt;&lt;P&gt;SRA Run Selector: https://www.ncbi.nlm.nih.gov/Traces/study/?acc=PRJNA641142&lt;/P&gt;&lt;P&gt;Series		Accession: GSE152999	ID: 200152999&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Tue, 06 Apr 2021 09:30:09 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/is-there-a-Talend-function-I-could-extract-a-string-pattern-from/m-p/2330142#M99238</guid>
      <dc:creator>Xinhui</dc:creator>
      <dc:date>2021-04-06T09:30:09Z</dc:date>
    </item>
    <item>
      <title>Re: is there a Talend function I could extract a string pattern from a txt file?</title>
      <link>https://community.qlik.com/t5/Talend-Studio/is-there-a-Talend-function-I-could-extract-a-string-pattern-from/m-p/2330143#M99239</link>
      <description>&lt;P&gt;Hi, if your GSE numbers always have 6 digits, you can use the regex "(GSE\\d{6})" in a tFileInputRegex, followed by a tUniqRow to remove duplicate values.&lt;/P&gt;&lt;P&gt;Send me Love and Kudos&lt;/P&gt;</description>
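A minimal Java sketch of the same regex idea outside the Talend components, for readers who want to test the pattern on its own. It is not the poster's job design: the class name GseExtractor and the method extractUnique are hypothetical, the pattern uses GSE\d+ instead of a fixed six digits (an assumption, in case accession lengths vary), and duplicates are dropped roughly the way tUniqRow would. Matcher.results() needs Java 9 or later.

```java
import java.util.regex.MatchResult;
import java.util.regex.Pattern;

// Hypothetical helper class; the name is not part of Talend.
class GseExtractor {
    // "GSE" followed by one or more digits, bounded on both sides so that
    // directory names such as "GSE160nnn" are not picked up.
    private static final Pattern GSE = Pattern.compile("\\bGSE\\d+\\b");

    // Returns each distinct accession once, in order of first appearance.
    static String[] extractUnique(String text) {
        return GSE.matcher(text).results()
                .map(MatchResult::group)
                .distinct()
                .toArray(String[]::new);
    }

    public static void main(String[] args) {
        String text =
                "FTP download: GEO (TXT) ftp://ftp.ncbi.nlm.nih.gov/geo/series/GSE160nnn/GSE160804/\n"
                + "Series\t\tAccession: GSE160804\tID: 200160804\n"
                + "Series\t\tAccession: GSE152999\tID: 200152999\n";
        // Prints: GSE160804, GSE152999
        System.out.println(String.join(", ", extractUnique(text)));
    }
}
```

The accession appears twice per record in the sample file (once in the FTP path, once on the Series line), which is why the de-duplication step matters.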
      <pubDate>Tue, 06 Apr 2021 10:00:14 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/is-there-a-Talend-function-I-could-extract-a-string-pattern-from/m-p/2330143#M99239</guid>
      <dc:creator>gjeremy1617088143</dc:creator>
      <dc:date>2021-04-06T10:00:14Z</dc:date>
    </item>
    <item>
      <title>Re: is there a Talend function I could extract a string pattern from a txt file?</title>
      <link>https://community.qlik.com/t5/Talend-Studio/is-there-a-Talend-function-I-could-extract-a-string-pattern-from/m-p/2330144#M99240</link>
      <description>&lt;P&gt;Thanks! Really helpful. Unfortunately, the length is not fixed, and I would also like to extract information such as "homo sapiens" and "119 samples". More generally, I would like to extract the information that follows each ":". Do you have any suggestions?&lt;/P&gt;&lt;P&gt;Thanks and best, Xinhui!&lt;/P&gt;</description>
      <pubDate>Tue, 06 Apr 2021 11:09:55 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/is-there-a-Talend-function-I-could-extract-a-string-pattern-from/m-p/2330144#M99240</guid>
      <dc:creator>Xinhui</dc:creator>
      <dc:date>2021-04-06T11:09:55Z</dc:date>
    </item>
    <item>
      <title>Re: is there a Talend function I could extract a string pattern from a txt file?</title>
      <link>https://community.qlik.com/t5/Talend-Studio/is-there-a-Talend-function-I-could-extract-a-string-pattern-from/m-p/2330145#M99241</link>
      <description>&lt;P&gt;Use tFileInputFullRow to read the text file line by line, then use tFilterRow (in advanced mode) to keep only the lines that start with a fixed string such as "Series Accession":&lt;/P&gt;&lt;P&gt;input_row.line.startsWith("Series Accession")&lt;/P&gt;&lt;P&gt;After filtering, you are left with lines such as:&lt;/P&gt;&lt;P&gt;Series Accession: GSE152999 ID: 200152999&lt;/P&gt;&lt;P&gt;You then only need to write a little Java code to extract the data you need.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Regards&lt;/P&gt;&lt;P&gt;Shong&lt;/P&gt;</description>
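The "little Java code" is not given in the thread. One possible sketch follows, under stated assumptions: the class GeoLineParser and both method names are hypothetical, and the logic could live in a Talend routine or a tJavaRow. The pattern allows any whitespace between "Series" and "Accession:" because the sample file shown earlier in the thread appears to separate them with tabs, in which case a filter on startsWith("Series") may be the safer choice.

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper class; the name is not part of Talend.
class GeoLineParser {
    // "Series", any whitespace (spaces or tabs), "Accession:", the accession,
    // then "ID:" and the numeric ID.
    private static final Pattern SERIES_LINE =
            Pattern.compile("Series\\s+Accession:\\s*(GSE\\d+)\\s+ID:\\s*(\\d+)\\s*");

    // Returns {accession, id} for a Series line, or null when the line does not match.
    static String[] parseSeriesLine(String line) {
        Matcher m = SERIES_LINE.matcher(line);
        if (!m.matches()) {
            return null;
        }
        return new String[] { m.group(1), m.group(2) };
    }

    // Returns the trimmed text after the first ':' (for lines such as
    // "Organism: Homo sapiens"), or null when the line has no colon.
    static String valueAfterColon(String line) {
        int colon = line.indexOf(':');
        return colon == -1 ? null : line.substring(colon + 1).trim();
    }

    public static void main(String[] args) {
        String[] series = parseSeriesLine("Series\t\tAccession: GSE152999\tID: 200152999");
        // Prints: GSE152999 200152999
        System.out.println(series[0] + " " + series[1]);
        // Prints: Homo sapiens
        System.out.println(valueAfterColon("Organism:\tHomo sapiens"));
    }
}
```

Splitting on the first colon only keeps values that themselves contain colons, such as the FTP URL, in one piece.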
      <pubDate>Wed, 07 Apr 2021 03:54:24 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/is-there-a-Talend-function-I-could-extract-a-string-pattern-from/m-p/2330145#M99241</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2021-04-07T03:54:24Z</dc:date>
    </item>
  </channel>
</rss>

