When you want to read data from a text file, you create a file object using the

Here's the official Python documentation on reading and writing from files. But before reading that, let's dive into the bare minimum that I want you to know.

Let's just go straight to a code example. Pretend you have a file named example.txt in the current directory. If you don't, just create one, and then fill it with these lines and save it:

hello world
and now
I say
goodbye

Here's a short snippet of Python code to open that file and print out its contents to screen – note that this Python code has to be run in the same directory that the

myfile = open("example.txt")
for line in myfile:
    print(line)
myfile.close()
1 file exists in.

myfile = open("example.txt")
txt = myfile.read()
print(txt)
myfile.close()

Did that seem too complicated? Here's a less verbose version:

myfile = open("example.txt")
print(myfile.read())
myfile.close()

Here's how to read that file, line-by-line, using a for-loop:

myfile = open("example.txt")
for line in myfile:
    print(line)
myfile.close()

(Note: If you're getting a FileNotFoundError already – that's almost to be expected. Keep reading!)

Still seem too complicated? Well, there's no getting around the fact that at the programmatic layer, opening a file is distinct from reading its contents. Not only that, we also have to manually close the file.

Now let's take this step-by-step.

To open a file, we simply use the

myfile = open("example.txt")
for line in myfile:
    print(line)
myfile.close()
2 method and pass in, as the first argument, the filename:

myfile = open("example.txt")

That seems easy enough, so let's jump into some common errors.

How to mess up when opening a file

Here is likely the most common error you'll get when trying to open a file.

FileNotFoundError: [Errno 2] No such file or directory: 'SOME_FILENAME'

In fact, I've seen students waste dozens of hours trying to get past this error message, because they don't stop to read it. So, read it: What does

myfile = open("example.txt")
for line in myfile:
    print(line)
myfile.close()
3 mean?

Try putting spaces where the capitalization occurs:

  File Not Found Error

You'll get this error because you tried to open a file that simply doesn't exist. Sometimes, it's a simple typo, trying to

myfile = open("example.txt")
for line in myfile:
    print(line)
myfile.close()
2 a file named
myfile = open("example.txt")
for line in myfile:
    print(line)
myfile.close()
5 but accidentally misspelling it as
myfile = open("example.txt")
for line in myfile:
    print(line)
myfile.close()
6.

But more often, it's because you know a file exists under a given filename, such as

myfile = open("example.txt")
for line in myfile:
    print(line)
myfile.close()
5 – but how does your Python code know where that file is? Is it the
myfile = open("example.txt")
for line in myfile:
    print(line)
myfile.close()
5 that exists in your Downloads folder? Or the one that might exist in your Documents folder? Or the thousands of other folders on your computer system?

That's a pretty complicated question. But the first step in not wasting your time is that if you ever see this error, stop whatever else you are doing. Don't tweak your convoluted for-loop. Don't try to install a new Python library. Don't restart your computer, then re-run the script to see if the error magically fixes itself.

The error

myfile = open("example.txt")
for line in myfile:
    print(line)
myfile.close()
3 occurs because you either don't know where a file actually is on your computer. Or, even if you do, you don't know how to tell your Python program where it is. Don't try to fix other parts of your code that aren't related to specifying filenames or paths.

How to fix a FileNotFoundError

Here's a surefire fix: make sure the file actually exists.

Let's start from scratch by making an error. In your system shell (i.e. Terminal), change to your Desktop folder:

$ cd ~/Desktop

Now, run ipython:

$ ipython

And now that you're in the interactive Python interpreter, try to open a filename that you know does not exist on your Desktop, and then enjoy the error message:

>>> myfile = open("whateverdude.txt")
myfile = open("example.txt")
txt = myfile.read()
print(txt)
myfile.close()
0

Now manually create the file on your Desktop, using Sublime Text 3 or whatever you want. Add some text to it, then save it.

myfile = open("example.txt")
txt = myfile.read()
print(txt)
myfile.close()
1

Look and see for yourself that this file actually exists in your Desktop folder:

When you want to read data from a text file, you create a file object using the

OK, now switch back to your interactive Python shell (i.e. ipython), the one that you opened after changing into the Desktop folder (i.e.

myfile = open("example.txt")
0). Re-run that
myfile = open("example.txt")
for line in myfile:
    print(line)
myfile.close()
2 command, the one that resulted in the FileNotFoundError:

>>> myfile = open("whateverdude.txt")

Hopefully, you shouldn't get an error.

But what is that object that the

myfile = open("example.txt")
2 variable points to? Use the
myfile = open("example.txt")
3 method to figure it out:

myfile = open("example.txt")
txt = myfile.read()
print(txt)
myfile.close()
3

And what is that? The details aren't important, other than to point out that

myfile = open("example.txt")
2 is most definitely not just a string literal, i.e.
myfile = open("example.txt")
5.

Use the Tab autocomplete (i.e. type in

myfile = open("example.txt")
6) to get a list of existing methods and attributes for the
myfile = open("example.txt")
2 object:

myfile = open("example.txt")
txt = myfile.read()
print(txt)
myfile.close()
4

Well, we can do a lot more with files than just

myfile = open("example.txt")
8 from them. But let's focus on just reading for now.

Assuming the

myfile = open("example.txt")
2 variable points to some kind of file object, this is how you read from it:

myfile = open("example.txt")
txt = myfile.read()
print(txt)
myfile.close()
5

What's in that

FileNotFoundError: [Errno 2] No such file or directory: 'SOME_FILENAME'
0 variable? Again, use the
myfile = open("example.txt")
3 function:

myfile = open("example.txt")
txt = myfile.read()
print(txt)
myfile.close()
6

It's just a string. Which means of course that we can print it out:

myfile = open("example.txt")
txt = myfile.read()
print(txt)
myfile.close()
7

Or count the number of characters:

myfile = open("example.txt")
txt = myfile.read()
print(txt)
myfile.close()
8

Or print it out in all-caps:

myfile = open("example.txt")
txt = myfile.read()
print(txt)
myfile.close()
9

And that's all there's to reading from a file that has been opened.

Now onto the mistakes.

How to mess up when reading from a file

Here's a very, very common error:

myfile = open("example.txt")
print(myfile.read())
myfile.close()
0

The error output:

myfile = open("example.txt")
print(myfile.read())
myfile.close()
1

Take careful note that this is not a FileNotFoundError. It is an AttributeError – which, admittedly, is not very clear – but read the next part:

myfile = open("example.txt")
print(myfile.read())
myfile.close()
2

The error message gets to the point: the

myfile = open("example.txt")
5 object – i.e. a string literal, e.g. something like
FileNotFoundError: [Errno 2] No such file or directory: 'SOME_FILENAME'
3 does not have a
FileNotFoundError: [Errno 2] No such file or directory: 'SOME_FILENAME'
4 attribute.

Revisiting the erroneous code:

myfile = open("example.txt")
print(myfile.read())
myfile.close()
0

If

FileNotFoundError: [Errno 2] No such file or directory: 'SOME_FILENAME'
5 points to "example.txt", then
FileNotFoundError: [Errno 2] No such file or directory: 'SOME_FILENAME'
5 is simply a
myfile = open("example.txt")
5 object.

In other words, a file name is not a file object. Here's a clearer example of errneous code:

myfile = open("example.txt")
print(myfile.read())
myfile.close()
4

And to beat the point about the head:

myfile = open("example.txt")
print(myfile.read())
myfile.close()
5

Why is this such a common mistake? Because in 99% of our typical interactions with files, we see a filename on our Desktop graphical interface and we double-click that filename to open it. The graphical interface obfuscates the process – and for good reason. Who cares what's happening as long as my file opens when I double-click it!

Unfortunately, we have to care when trying to read a file programmatically. Opening a file is a discrete operation from reading it.

  • You open a file by passing its filename – e.g.
    myfile = open("example.txt")
    for line in myfile:
        print(line)
    myfile.close()
    
    1 – into the
    myfile = open("example.txt")
    for line in myfile:
        print(line)
    myfile.close()
    
    2 function. The
    myfile = open("example.txt")
    for line in myfile:
        print(line)
    myfile.close()
    
    2 function returns a file object.
  • To actually read the contents of a file, you call that file object's read() method.

Again, here's the code, in a slightly more verbose fashion:

myfile = open("example.txt")
print(myfile.read())
myfile.close()
6

The file object also has a

  File Not Found Error
1 method, which formally cleans up after the opened file and allows other programs to safely access it. Again, that's a low-level detail that you never think of in day-to-day computing. In fact, it's something you probably will forget in the programming context, as not closing the file won't automatically break anything (not until we start doing much more complicated types of file operations, at least…). Typically, as soon as a script finishes, any unclosed files will automatically be closed.

However, I like closing the file explicitly – not just to be on the safe side – but it helps to reinforce the concept of that file object.

One of the advantages of getting down into the lower-level details of opening and reading from files is that we now have the ability to read files line-by-line, rather than one giant chunk. Again, to read files as one giant chunk of content, use the

myfile = open("example.txt")
8 method:

myfile = open("example.txt")
print(myfile.read())
myfile.close()
7

It doesn't seem like such a big deal now, but that's because

myfile = open("example.txt")
for line in myfile:
    print(line)
myfile.close()
1 probably contains just a few lines. But when we deal with files that are massive – like all 3.3 million records of everyone who has donated more than $200 to a single U.S. presidential campaign committee in 2012 or everyone who has ever visited the White House – opening and reading the file all at once is noticeably slower. And it may even crash your computer.

If you've wondered why spreadsheet software, such as Excel, has a limit of rows (roughly 1,000,000), it's because most users do want to operate on a data file, all at once. However, many interesting data files are just too big for that. We'll run into those scenarios later in the quarter.

For now, here's what reading line-by-line typically looks like:

myfile = open("example.txt")
for line in myfile:
    print(line)
myfile.close()

Because each line in a textfile has a newline character (which is represented as

  File Not Found Error
4 but is typically "invisible"), invoking the print() function will create double-spaced output, because print() adds a newline to what it outputs (i.e. think back to your original
  File Not Found Error
5 program).

To get rid of that effect, call the

  File Not Found Error
6 method, which belongs to
myfile = open("example.txt")
5 objects and removes whitespace characters from the left and right side of a text string:

myfile = open("example.txt")
print(myfile.read())
myfile.close()
9

And of course, you can make things loud with the good ol'

  File Not Found Error
8 function:

myfile = open("example.txt")
print(myfile.read())
myfile.close()
9

That's it for now. We haven't covered how to write to a file (which is a far more dangerous operation) – I save that for a separate lesson. But it's enough to know that when dealing with files as a programmer, we have to be much more explicit and specific in the steps.