UTF-8 encoding problems when exporting posts

I have a WordPress installation whose MySQL database uses the utf8_general_ci collation. I’m trying to export it using WordPress’ export tool, but the resulting XML breaks some special characters such as “é” and “ê”, turning “é” into “Ã©”, for example.

The XML file is downloaded to a Windows machine and then scp’ed to a Linux machine. On both platforms the encoding is off.

I checked the resulting XML with “file” and it’s telling me it is a UTF-8 file.

What am I missing?

Thanks!

1 comment

  1. The file is likely correct, but the program that you’re viewing it in is probably interpreting it with ISO-8859-1 / latin1 encoding.

    The “Ã©” pairing is something I have often seen when viewing a UTF-8 encoded “é” in latin1: its two bytes, 0xC3 and 0xA9, are displayed as the separate characters “Ã” and “©”.

    Be sure that the program you are viewing the file in is set to view in UTF-8 encoding.
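
A minimal sketch of the effect described in the comment above, in Python. This is only an illustration of the byte-level mix-up, not part of WordPress or its export tool, and the variable names are made up for the example.

```python
# The export stores "é" as UTF-8, which takes two bytes.
utf8_bytes = "é".encode("utf-8")         # b'\xc3\xa9'

# A viewer that wrongly assumes ISO-8859-1 / latin1 shows one
# character per byte, which produces the "Ã©" pair.
misread = utf8_bytes.decode("latin-1")   # 'Ã©'

# Reading the very same bytes as UTF-8 gives the original character.
correct = utf8_bytes.decode("utf-8")     # 'é'
```

The bytes are identical in both cases; only the viewer's interpretation differs. That is why “file” can report UTF-8 while the text still looks broken on screen.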