Hoi,
In many ways the codes you provide are confusing. What I would like you
to do is match the codes that you provide with the ISO-639-3 codes. The
use of zh or zho is problematic, typically Mandarin (cmn) is meant when
people talk about Chinese. Chinese (zho) can be subdevided in the
following languages:
* Gan Chinese [gan
<http://www.sil.org/iso639-3/documentation.asp?id=gan>]
* Hakka Chinese [hak
<http://www.sil.org/iso639-3/documentation.asp?id=hak>]
* Huizhou Chinese [czh
<http://www.sil.org/iso639-3/documentation.asp?id=czh>]
* Jinyu Chinese [cjy
<http://www.sil.org/iso639-3/documentation.asp?id=cjy>]
* Mandarin Chinese [cmn
<http://www.sil.org/iso639-3/documentation.asp?id=cmn>]
* Min Bei Chinese [mnp
<http://www.sil.org/iso639-3/documentation.asp?id=mnp>]
* Min Dong Chinese [cdo
<http://www.sil.org/iso639-3/documentation.asp?id=cdo>]
* Min Nan Chinese [nan
<http://www.sil.org/iso639-3/documentation.asp?id=nan>]
* Min Zhong Chinese [czo
<http://www.sil.org/iso639-3/documentation.asp?id=czo>]
* Pu-Xian Chinese [cpx
<http://www.sil.org/iso639-3/documentation.asp?id=cpx>]
* Wu Chinese [wuu
<http://www.sil.org/iso639-3/documentation.asp?id=wuu>]
* Xiang Chinese [hsn
<http://www.sil.org/iso639-3/documentation.asp?id=hsn>]
* Yue Chinese [yue
<http://www.sil.org/iso639-3/documentation.asp?id=yue>]
This list does NOT provide you with all the language spoken in China
These languagecodes can then have a script associated with them eg
cmn-Hans for simplified Chinese. When there are dialects within a
language they can be identified as well.
Using zh is imho utterly confusing and associating them with countries
does not help at all. More than one language is spoken in Taiwan and
therefore zh-hans-tw does not cut it. When you replace traditional with
simplified, within one language I can understand what you are doing.
However when you in essence start moving across languages and is that
not the implication when you talk about things having different
terminology are you then not trying to provide translations ?
Thanks,
GerardM
Generally the language codes ahould be all right,
however that should be more
correct form to using the language tags, by defination of language tags, IANA
(
http://www.iana.org/assignments/language-tags ):
zh: Chinese, only generally "Chinese" or Zhong Wen (中文)
zh-hans: Simplified Chinese language
zh-hant: Traditional Chinese language
zh-cn: Chinese language used in Mainland China
zh-hk: Chinese language used in Hong Kong
zh-mo: Chinese language used in Macau
zh-sg: Chinese language used in Singapore
zh-tw: Chinese language used in Taiwan
zh-hans-cn: Chinese language used in Mainland China, Simplified Chinese
zh-hans-hk: Chinese language used in Hong Kong, Simplified Chinese
zh-hans-mo: Chinese language used in Macau, Simplified Chinese
zh-hans-sg: Chinese language used in Singapore, Simplified Chinese
zh-hans-tw: Chinese language used in Taiwan, Simplified Chinese
zh-hant-cn: Chinese language used in Mainland China, Trititional Chinese
zh-hant-hk: Chinese language used in Hong Kong, Trititional Chinese
zh-hant-mo: Chinese language used in Macau, Trititional Chinese
zh-hant-sg: Chinese language used in Singapore, Trititional Chinese
zh-hant-tw: Chinese language used in Taiwan, Trititional Chinese
Don't forget there are many writing scripts besides the Simplified/Tratitional
scripts and localised scripts ( and don't dorget there's have script language
for Maacau too :) ).
regards
Man