Skip to main content

How to read a JA Bank transaction CSV file into Pandas in Python

·

Transactions in a JA Bank account are available for download as a CSV file. Then Pandas reads it for processing.

Which encoding should I use? If I open the CSV file with macOS app TextMate, it automatically selects the encoding: Japanese - SHIFT JIS.

So I try:

df = pd.read_csv(file_name, encoding="shift_jis")

A raw numerical value in the CSV file is:

¥7,000

But Pandas would read it as:

\7,000

According to this post, there are multiple encodings for Japanese. I tried both cp932 and shift_jis_2004. The former has the same incorrect result. But the latter one works:

¥7,000

This is the working code:

df = pd.read_csv(file_name, encoding="shift_jis_2004")