Your file contains a UTF-8 BOM at the beginning.
To get rid of it, first decode your file contents to unicode.
fp = open("file.txt")
data = fp.read().decode("utf-8-sig").encode("utf-8")
But it is better not to encode it back to utf-8, and to work with unicode text instead. There is a good rule: decode all your input text data to unicode as soon as possible, and work only with unicode; and encode the output data to the required encoding as late as possible. This will save you from many headaches.
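As a minimal sketch of that rule, assuming the input arrives as raw bytes: decode once at the boundary, do all processing on unicode text, and encode only when writing out. The helper names decode_input and encode_output and the sample data are hypothetical, chosen only for illustration.

```python
import codecs

def decode_input(raw_bytes):
    # "utf-8-sig" decodes UTF-8 and drops a leading BOM if one is present.
    return raw_bytes.decode("utf-8-sig")

def encode_output(text):
    # Encode as late as possible, only at the output boundary.
    return text.encode("utf-8")

# Hypothetical sample input: a BOM followed by one comma-separated line.
raw = codecs.BOM_UTF8 + u"first,second\n".encode("utf-8")

text = decode_input(raw)                 # unicode text, BOM removed
fields = text.strip().split(u",")        # all processing happens on unicode
out = encode_output(u";".join(fields))   # bytes again, ready to be written
```

The "utf-8-sig" codec also accepts input without a BOM, so the same decode step works for both kinds of file.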
To read bigger files in a certain encoding, use io.open or codecs.open.
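A rough sketch of reading a larger file line by line with io.open, so the whole file never has to sit in memory at once. The function name read_lines and the temporary demo file are assumptions made for illustration; passing encoding="utf-8-sig" makes the BOM disappear during decoding.

```python
import io
import os
import tempfile

def read_lines(path, encoding="utf-8-sig"):
    # io.open decodes while reading, so each line is already unicode text.
    with io.open(path, encoding=encoding) as fp:
        for line in fp:
            yield line

# Demo setup: write a small file that starts with a UTF-8 BOM.
fd, path = tempfile.mkstemp(suffix=".txt")
os.close(fd)
with io.open(path, "wb") as fp:
    fp.write(b"\xef\xbb\xbfalpha\nbeta\n")

lines = list(read_lines(path))
os.remove(path)
```

Iterating over the file object reads one line at a time, which is what keeps memory use flat for big inputs.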
Use str.strip() or str.rstrip() to get rid of the newline character \n.
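The difference between the two calls can be seen on a small, made-up sample line: strip() trims whitespace on both sides, rstrip() trims only the right side, and rstrip("\n") removes just the trailing newline.

```python
line = u"  value one \n"

both_sides = line.strip()          # leading and trailing whitespace removed
right_only = line.rstrip()         # only trailing whitespace removed
newline_only = line.rstrip(u"\n")  # only the trailing newline removed
```

Use rstrip("\n") when leading or trailing spaces are meaningful and only the line ending should go.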