<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Limit rows for Pig in Talend Studio</title>
    <link>https://community.qlik.com/t5/Talend-Studio/Limit-rows-for-Pig/m-p/2357828#M123028</link>
    <description>The retail demo dataset has information about customer orders. The data for each order contains the postal code of the customer. In this example you run Pig Latin statements to return the top ten postal codes by revenue.</description>
    <pubDate>Sat, 30 Jun 2018 12:28:04 GMT</pubDate>
    <dc:creator>Anonymous</dc:creator>
    <dc:date>2018-06-30T12:28:04Z</dc:date>
    <item>
      <title>Limit rows for Pig</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Limit-rows-for-Pig/m-p/2357826#M123026</link>
      <description>&lt;P&gt;Hello, I am using tPigLoad and would like to limit the output to just the first 10 rows. How can I go about doing so?&lt;/P&gt;
&lt;P&gt;I have tried creating auto-incrementing keys, but they seem to be possible only in tMap and not in tPigMap.&lt;BR /&gt;I have also tried tPigCode to limit the rows, but I can't figure out what to put in there.&lt;/P&gt;</description>
      <pubDate>Sat, 16 Nov 2024 08:10:39 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Limit-rows-for-Pig/m-p/2357826#M123026</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2024-11-16T08:10:39Z</dc:date>
    </item>
    <item>
      <title>Re: Limit rows for Pig</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Limit-rows-for-Pig/m-p/2357827#M123027</link>
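      <description>Hey,
&lt;BR /&gt;Have you looked at the Apache Pig LIMIT operator? It takes a relation and a row count, for example: first_ten = LIMIT my_relation 10; (the relation names here are placeholders). Note that without a preceding ORDER BY, Pig does not guarantee which rows are returned.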
&lt;BR /&gt;Apache Pig is an abstraction over MapReduce. It is a tool/platform used to analyze large data sets by representing them as data flows. Pig is generally used with Hadoop; all the data manipulation operations in Hadoop can be performed using Apache Pig.
&lt;BR /&gt;
&lt;BR /&gt;To write data analysis programs, Pig provides a high-level language known as Pig Latin. This language provides various operators, and programmers can also develop their own functions for reading, writing, and processing data.
&lt;BR /&gt;
&lt;BR /&gt;To analyze data using Apache Pig, programmers write scripts in Pig Latin. These scripts are converted internally to Map and Reduce tasks: a component known as the Pig Engine accepts Pig Latin scripts as input and converts them into MapReduce jobs.
&lt;BR /&gt;Regards,
&lt;BR /&gt;Julie</description>
      <pubDate>Fri, 01 Jun 2018 02:39:02 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Limit-rows-for-Pig/m-p/2357827#M123027</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2018-06-01T02:39:02Z</dc:date>
    </item>
    <item>
      <title>Re: Limit rows for Pig</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Limit-rows-for-Pig/m-p/2357828#M123028</link>
      <description>The retail demo dataset has information about customer orders. The data for each order contains the postal code of the customer. In this example you run Pig Latin statements to return the top ten postal codes by revenue.</description>
      <pubDate>Sat, 30 Jun 2018 12:28:04 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Limit-rows-for-Pig/m-p/2357828#M123028</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2018-06-30T12:28:04Z</dc:date>
    </item>
  </channel>
</rss>

