ä is what you get when you take ä encoded as UTF-8 and interpret it as ISO-8859-1. So what probably happened, is that some text that was encoded as UTF-8 was treated as if it was ISO-8859-1/windows1252 and (unessearily) converted to UTF-8.

Common causes of this sort of thing:
- Very very old wiki from before MediaWiki adopted UTF-8 that wasn't upgraded properly. (I think MW adopted UTF-8 before MediaWiki 1.5, so it would have to be truly ancient).
- Restoring a DB from backup with some wrong options related to charset
- converting the charset of DB columns if they were originally mislabeled.

If its the entire DB that is broken, I think the easiest fix might be to take a DB dump, and use the iconv command line tool to convert UTF-8 -> windows-1252 (To undo one layer of conversion) and then import the result as if it was UTF-8.

--
brian.


On Thu, Sep 8, 2022 at 3:13 AM Zoran Dori <zorandori4444@gmail.com> wrote:
Hello,
I'm working on one wiki which shows characters in a weird way. UTF-8 is used for encoding, so I believe that it isn't an issue.

You can take a look here Statik A – Sub Bavaria (sub-bavaria.de), so you can better understand what I'm talking about.

Could you please point me to something that I should look for, so I can fix this issue?
Wiki was previously on version 1.31, I've upgraded it to 1.38.

Thanks for your help and understanding!

Best regards,
Zoran
_______________________________________________
MediaWiki-l mailing list -- mediawiki-l@lists.wikimedia.org
To unsubscribe send an email to mediawiki-l-leave@lists.wikimedia.org
https://lists.wikimedia.org/postorius/lists/mediawiki-l.lists.wikimedia.org/