eolas/zk/IO_in_Python.md

202 lines
5.3 KiB
Markdown
Raw Normal View History

2023-03-07 07:39:05 +00:00
---
tags: [python]
---
# IO in Python
## The open() object
The built-in `open()` function creates a **file object** that allows us to read,
write and append to files.
2023-03-07 07:39:05 +00:00
The general syntax is as follows:
```py
file_object = open(<file_name>, <access_mode>)
```
`<file_name>` is obviously a path to the file you want to read, create or
modify. The `<access_mode>` denotes the mode in which to open the file. The most
frequently used are:
2023-03-07 07:39:05 +00:00
- `r`
- read
- `w`
- write
- `a`
- append
### All access modes
In addition we have the following access modes
## Reading files
Once a file object has been intialised with `open()` there are several ways in
which the content can be read:
2023-03-07 07:39:05 +00:00
| Read method | Behaviour |
| ----------- | ---------------------------------------------------------------------------------------------------------------------------------------- |
| `read` | Return the entire contents of the file as a single string. |
| `readline` | Read the contents of a file a line at a time. You would combine this with a loop so that you can do something with each individual line. |
| `readlines` | Return a list of all the lines in a file. Each line will be an element in the list. |
### Read
`read` reads the entire contents of a file and returns it as a single string.
`read()` reads the entire file into memory at once, so it may not be suitable
for very large files that exceed the available memory of your system. In such
cases, you can use the `readline()` method to read the file line by line
instead.
2023-03-07 07:39:05 +00:00
The basic syntax is as follows:
```py
# Open the file for reading
file = open('filename.txt', 'r')
# Read the entire contents of the file
contents = file.read()
# We could also limit the read to a number of characters:
contents = file.read(100)
# Close the file
file.close()
```
Once we have the file contents stored, we can then interact with it. The
standard way of doing this is to parse it line by line. For example, say we were
reading a CSV:
2023-03-07 07:39:05 +00:00
```py
lines = content.split(',')
for line in lines:
if line: # if the line is not empty
# do something with line
```
### Readline
> The readline() method in Python is used to read a single line from a file. It
> is typically used when you want to process a file line by line, rather than
> reading the entire file into memory at once.
2023-03-07 07:39:05 +00:00
`readline()` returns the line including the newline character at the end of the
line, so you may need to strip this character off using the strip() method
before processing the line further.
2023-03-07 07:39:05 +00:00
```python
# Open the file for reading
file = open('filename.txt', 'r')
# Loop through the file, reading one line at a time
line = file.readline()
while line:
# Strip off the newline character
line = line.strip()
# Do something with the line
print(line)
# Read the next line
line = file.readline()
# Close the file
file.close()
```
### Readlines
The `readlines()` method is used to read all the lines of a file and return them
as a list of strings, where each element of the list is a line from the file.
2023-03-07 07:39:05 +00:00
```py
# Open the file for reading
file = open('filename.txt', 'r')
# Read all the lines of the file and store them in a list
lines = file.readlines()
# Loop through the lines and print them to the console
for line in lines:
print(line)
# Close the file
file.close()
```
### Error handling
Obviously file access can raise errors - typically when the file you want to
access does not exist (i.e. a `FileNotFoundError`
2024-02-17 11:57:44 +00:00
[exception](Error_handling_in_Python.md)).
We can manage this scenario with
2024-02-17 11:57:44 +00:00
[exception handlers](Error_handling_in_Python.md):
2023-03-07 07:39:05 +00:00
```py
try:
with open('filename.txt', 'r') as file:
contents = file.readlines()
for line in lines:
print(line)
except FileNotFoundError as err:
print("File does not exist")
print(err)
```
## Close and "with as"
You notice that once we have finished with our I/O operation, we must call
`file.close()` to terminate the process. This removes the reference to the file
from memory.
2023-03-07 07:39:05 +00:00
A more pythonic and concise way of reading files and closing them is to use
`with...as` syntax. When this phrasing is used, a self-contained context is
created for the I/O operation that closes the file automatically.
2023-03-07 07:39:05 +00:00
```py
with open('filename.txt', 'r') as file:
contents = file.read()
print(contents)
```
2023-03-08 07:11:40 +00:00
## Writing to files
Again we create a file object with `open()` and this time use the `write`
method:
2023-03-08 07:11:40 +00:00
```py
# Open file in write mode
file = open("example.txt", "w")
# Write some text to the file
file.write("Hello, this is an example text written using Python.")
# Close the file
file.close()
```
> Note that in the above example, if the file does not already exist, it will
> create it. If it does exist, it will overwrite its contents with the new data.
> So we use `write` to create new files as well as to write to existing files.
2023-03-08 07:11:40 +00:00
## Renaming and deleting files
We hace to use another built-in module to rename and delete files: `os`.
To rename an existing file:
```py
import os
os.rename('original-file-name.txt', 'new-file-name.txt')
```
To delete a file:
```py
import os
os.remove('file-name.txt')
```