<STDIN> in List Context

One previously seen operation that returns a different value in a list context is <STDIN>. As described earlier, <STDIN> returns the next line of input in a scalar context. However, in a list context, it returns all remaining lines up to end of file. Each line is returned as a separate element of the list. For example:

	@a = <STDIN>; # read standard input in a list context

When the input is coming from a file, this will read the rest of the file. But how can there be an end-of-file when the input comes from the keyboard? On Unix and similar systems, including Linux and Mac OS X, you'll normally type a Control-D¹ to indicate to the system that there's no more input; the special character itself is never seen by Perl,² even though it may be echoed to the screen. On DOS/ Windows systems, use Ctrl-Z instead.³ You'll need to check the documentation for your system or ask your local expert, if it's different from these.

If the person running the program types three lines, then presses CTRL-D (to indicate "end of file"), the array ends up with three elements. Each element will be a string that ends in a newline, corresponding to the three newline-terminated lines entered.

Wouldn't it be nice if, having read those lines, you could chomp the newlines all at once? It turns out that if you give chomp a list of lines, it will remove the newlines from each item in the list. For example:

	@lines = <STDIN>; # read all the lines
	chomp(@lines);	# discard all the newline characters

But the more common way to write that is with code similar to what we used earlier:

	chomp(@lines = <STDIN>); # read the lines, not the newline

Although you're welcome to write your code either way in the privacy of your own cubicle, most Perl programmers will expect the second, more compact, notation.

It may be obvious to you (but it's not obvious to everyone) that once these lines of input have been read, they can't be re-read.⁴ Once you've reached end-of-file, there's no more input out there to read.

And what happens if the input is coming from a 400MB log file? The line input operator reads all of the lines, gobbling up lots of memory.⁵ Perl tries not to limit you in what yo can do, but the other users of your system (not to mention your system administrator) are likely to object. If the input data is large, you should generally find a way to deal with it without reading it all into memory at once.

Next: Exercises

OR

Main Menu for this topic

[1] This is merely the default; it can be changed by the stty command. But it's pretty dependable - we've never seen a Unix system where a different character was used to mean end-of-file from the keyboard.

Return to the page from whence you came

[2] It's the OS that "sees" the control key and reports "end of file" to the application.

Return to the page from whence you came

[3] There's a bug affecting some ports of Perl for DOS / Windows where the first line of output to the terminal following the use of Control-Z is obscured. On these systems, you can work around this problem by simply printing a blank line ("\n") after reading the input.

Return to the page from whence you came

[4] Well, yes, if the input is from a source upon which you can seek, then you'll be able to go back and read again. But that's not what we're talking about here.

Return to the page from whence you came

[5] Typically, that's much more memory than the size of the file, too. That is, a 400MB file will typically take up at least a full gigabyte of memory when read into an array. This is because Perl will generally waste memory to save time. This is a good tradeoff; if you're short of memory, you can buy more, if you're short of time, you're hosed.

Return to the page from whence you came