Python CSV error: line contains NULL byte

I’m working with some CSV files, with the following code:

reader = csv.reader(open(filepath, "rU"))
try:
    for row in reader:
        print 'Row read successfully!', row
except csv.Error, e:
    sys.exit('file %s, line %d: %s' % (filename, reader.line_num, e))

And one file is throwing this error:

file my.csv, line 1: line contains NULL byte

What can I do? Google seems to suggest that it may be an Excel file that’s been saved as a .csv improperly. Is there any way I can get round this problem in Python?

== UPDATE ==

Following @JohnMachin’s comment below, I tried adding these lines to my script:

print repr(open(filepath, 'rb').read(200)) # dump 1st 200 bytes of file
data = open(filepath, 'rb').read()
print data.find('\x00')
print data.count('\x00')

And this is the output I got:

'\xd0\xcf\x11\xe0\xa1\xb1\x1a\xe1\x00\x00\x00\x00\x00\x00\x00\x00\ .... <snip>
8
13834

So the file does indeed contain NUL bytes.

TypeError: list indices must be integers or slices, not str
Writing a pandas DataFrame to CSV file
Writing a pandas DataFrame to CSV file
IndexError: too many indices for array
ValueError : I/O operation on closed file
Pandas: ValueError: cannot convert float NaN to integer
csv.Error: iterator should return strings, not bytes
Python – Reading and writing csv files with utf-8 encoding
Python Error io.UnsupportedOperation: not readable
load csv into 2D matrix with numpy for plotting
Dump a NumPy array into a csv file
(unicode error) ‘unicodeescape’ codec can’t decode bytes in position 2-3: truncated \UXXXXXXXX escape
How to load a tsv file into a Pandas DataFrame?
Difference between writerow() and writerows() methods of Python csv module
Create a .csv file with values from a Python list
Convert XML to CSV file
TypeError: list indices must be integers or slices, not str
Convert from CSV to array in Python
ValueError: cannot index with vector containing NA / NaN values
(unicode error) ‘unicodeescape’ codec can’t decode bytes in position 2-3: truncated \UXXXXXXXX escape
Creating a dictionary from a csv file?
Python import csv to list
ValueError: x and y must be the same size
OSError: Initializing from file failed on csv in Pandas
append new row to old csv file python
Writing a dictionary to a csv file with one line for every ‘key: value’
Python CSV Error: sequence expected
AttributeError: ‘float’ object has no attribute ‘split’4
How to add pandas data to an existing csv file?
convert csv file to list of dictionaries
_csv.Error: field larger than field limit (131072)
CSV new-line character seen in unquoted field error
Writing Python lists to columns in csv
Error in Reading a csv file in pandas[CParserError: Error tokenizing data. C error: Buffer overflow caught – possible malformed input file.]
How to read a CSV file from a URL with Python?
How to import a csv-file into a data array?
SyntaxError: unexpected EOF while parsing
How do I lowercase a string in Python?
How do I copy a file in Python?
How can I reverse a list in Python?
Manually raising (throwing) an exception in Python
How do I copy a file in Python?
can’t multiply sequence by non-int of type ‘float’
Difference between del, remove, and pop on lists
How can I reverse a list in Python?
How to use the pass statement
How to use filter, map, and reduce in Python 3
What does enumerate() mean?
Searching the student-t distribution table for values using python
How to declare an array in Python?
Does Python have a ternary conditional operator?
Use Gif Logo For Loading Screen In Kivy
Praw & Discord.py: The bot keep sending the same meme. I want the bot to send different meme whenever it is asked
Pig Latin Translator
What is the difference between Python’s list methods append and extend?
How can I make a time delay in Python? [duplicate]
Python – TypeError: ‘int’ object is not iterable
TypeError: ‘int’ object is not subscriptable
sphinx.ext.autodoc: Keeping names of constants in signature
are there dictionaries in javascript like python?
How do you round UP a number?
Understanding slice notation
Iterating over dictionaries using ‘for’ loops
How to define a two-dimensional array?
how to sort pandas dataframe from one column
Why am I seeing “TypeError: string indices must be integers”?
How do you round UP a number?
Understanding slice notation
TypeError: only integer scalar arrays can be converted to a scalar index with 1D numpy indices array
How do I update\upgrade pip itself from inside my virtual environment?
How to open a file using the open with statement
How to emulate a do-while loop?
How do I update\upgrade pip itself from inside my virtual environment?
How to comment out a block of code in Python [duplicate]
Microsoft Visual C++ 14.0 is required (Unable to find vcvarsall.bat)
Using “with open() as file” method, how to write more than once? [duplicate]
Why there is no do while loop in python
How do I get a substring of a string in Python?
How do I sort a dictionary by value?
ImportError: DLL load failed: The specified module could not be found
How do I sort a dictionary by value?
How to prettyprint a JSON file?
What does the “yield” keyword do?
ImportError: DLL load failed: The specified module could not be found
Replacements for switch statement in Python?
How to install pip with Python 3?
What is the difference between rw+ and r+
What does ** (double star/asterisk) and * (star/asterisk) do for parameters?
Renaming column names in Pandas
How to reset index in a pandas dataframe? [duplicate]
pip not recognised as an internal or external command
Correct way to write line to file?
Python: Find in list
Does Python have a string ‘contains’ substring method?
Is there a “not equal” operator in Python?
IndexError: list index out of range and python
How to read a file line-by-line into a list?
Delete a column from a Pandas DataFrame
strip(char) on a string
Python – TypeError: ‘int’ object is not iterable

Related Posts:

Leave a Comment Cancel reply