面试题：Python字符串大小写转换的国际化编码考量

思路

解码与编码：在处理不同编码的字符串时，首先要将字节串解码成Python的str类型（Unicode），处理完成后再编码回所需的字节串。
字符大小写转换：使用Python内置的字符串方法upper()和lower()进行大小写转换，因为这些方法对Unicode字符有很好的支持。

关键代码片段

import codecs


def convert_case(input_bytes, encoding='utf - 8'):
    # 解码字节串为Unicode字符串
    decoded_str = codecs.decode(input_bytes, encoding)
    # 进行大小写转换
    converted_str = decoded_str.swapcase()
    # 编码回字节串
    encoded_bytes = codecs.encode(converted_str, encoding)
    return encoded_bytes


# 示例使用
input_utf8 = b'Hello, World! 你好，世界！'
result_utf8 = convert_case(input_utf8, 'utf - 8')
print(result_utf8)

input_utf16 = b'\xff\xfeH\x00e\x00l\x00l\x00o\x00,\x00 \x00W\x00o\x00r\x00l\x00d\x00!\x00 \xe4\xbd\xa0\x00e5\xa5\xbd\x00,\x00 \xe4\xb8\x96\x00e7\x95\x8c\x00!'
result_utf16 = convert_case(input_utf16, 'utf - 16')
print(result_utf16)

面试题：Python字符串大小写转换的国际化编码考量

知识考点

面试题答案

思路

关键代码片段