Apache Pig Tutorial on Apache Pig Limit Operator

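The LIMIT operator is used to get a limited number of tuples from a relation. If the requested number is equal to or greater than the number of tuples in the relation, all tuples are returned.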

Syntax

Given below is the syntax of the LIMIT operator.

grunt> result = LIMIT relation_name number_of_tuples;

Example

Assume that we have a file named student_details.txt in the HDFS directory /pig_data/ as shown below.

student_details.txt

001,rajiv,reddy,21,9848022337,hyderabad
002,siddarth,battacharya,22,9848022338,kolkata
003,rajesh,khanna,22,9848022339,delhi 
004,preethi,agarwal,21,9848022330,pune 
005,trupthi,mohanthy,23,9848022336,bhuwaneshwar 
006,archana,mishra,23,9848022335,chennai 
007,komal,nayak,24,9848022334,trivendram 
008,bharathi,nambiayar,24,9848022333,chennai

We have loaded this file into Pig with the relation name student_details as shown below.

grunt> student_details = LOAD 'hdfs://localhost:9000/pig_data/student_details.txt' USING PigStorage(',')
   AS (id:int, firstname:chararray, lastname:chararray, age:int, phone:chararray, city:chararray);

Now, let's take four tuples from the relation student_details and store them in another relation named limit_data using the LIMIT operator as shown below.

grunt> limit_data = LIMIT student_details 4;

Verification

Verify the relation limit_data using the DUMP operator as shown below.

grunt> DUMP limit_data;

Output

It will produce the following output, displaying the contents of the relation limit_data.

(1,rajiv,reddy,21,9848022337,hyderabad) 
(2,siddarth,battacharya,22,9848022338,kolkata) 
(3,rajesh,khanna,22,9848022339,delhi) 
(4,preethi,agarwal,21,9848022330,pune)
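
LIMIT on its own does not guarantee which tuples are returned, and the Pig documentation notes that the result can change from one run to the next. To request a particular set of tuples, one option is to sort the relation with ORDER BY before applying LIMIT. The following is a minimal sketch, assuming the relation student_details loaded above; the relation names ordered_data and top4_by_age are made up for illustration.

```pig
-- Sort the students by age, oldest first (ordered_data is a hypothetical name).
grunt> ordered_data = ORDER student_details BY age DESC;

-- Keep only the first four tuples of the sorted relation.
grunt> top4_by_age = LIMIT ordered_data 4;

-- Display the result.
grunt> DUMP top4_by_age;
```

With the sample data above, this should return the two students aged 24 and the two aged 23. The order among students of the same age is not fixed by the sort key, so it may differ between runs.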
