A thorough fix for garbled Chinese text when reading txt files on Android (auto-detect the file's encoding and decode accordingly)

Original: http://blog.csdn.net/handsomedylan/article/details/6138400

import java.io.BufferedInputStream;
import java.io.BufferedReader;
import java.io.FileInputStream;
import java.io.IOException;
import java.io.InputStreamReader;

// Reads a text file and decodes it with the charset indicated by its
// byte order mark (BOM). Files without a BOM are assumed to be GBK.
public String convertCodeAndGetText(String str_filepath) {
    StringBuilder text = new StringBuilder();
    try (BufferedInputStream in = new BufferedInputStream(
            new FileInputStream(str_filepath))) {
        in.mark(4);
        byte[] first3bytes = new byte[3];
        // Read the first three bytes of the file to detect its encoding.
        // A very short file may return fewer than three bytes.
        int n = in.read(first3bytes);
        in.reset();

        String charset;
        int bomLength;
        if (n >= 3 && first3bytes[0] == (byte) 0xEF
                && first3bytes[1] == (byte) 0xBB
                && first3bytes[2] == (byte) 0xBF) { // UTF-8 BOM
            charset = "UTF-8";
            bomLength = 3;
        } else if (n >= 2 && first3bytes[0] == (byte) 0xFF
                && first3bytes[1] == (byte) 0xFE) { // UTF-16 little-endian BOM
            charset = "UTF-16LE";
            bomLength = 2;
        } else if (n >= 2 && first3bytes[0] == (byte) 0xFE
                && first3bytes[1] == (byte) 0xFF) { // UTF-16 big-endian BOM
            charset = "UTF-16BE";
            bomLength = 2;
        } else { // no BOM: assume GBK
            charset = "GBK";
            bomLength = 0;
        }
        // Skip the BOM so it does not end up in the text as U+FEFF.
        for (int i = 0; i < bomLength; i++) {
            in.read();
        }

        BufferedReader reader = new BufferedReader(new InputStreamReader(in, charset));
        String str;
        while ((str = reader.readLine()) != null) {
            text.append(str).append('\n');
        }
    } catch (IOException e) { // also covers FileNotFoundException
        e.printStackTrace();
    }
    return text.toString();
}
The code is not difficult. If you find it useful, feel free to upvote.
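
One case the method above does not cover: a UTF-8 file saved without a BOM falls into the final branch and is decoded as GBK, which garbles any Chinese text in it. Below is a minimal sketch of one possible way to handle that, assuming the whole file has already been read into a byte array. It tries a strict UTF-8 decode first and falls back to GBK only when the bytes are not valid UTF-8. The class name `Utf8OrGbk` and the method `decode` are hypothetical and not part of the original post.

```java
import java.nio.ByteBuffer;
import java.nio.charset.CharacterCodingException;
import java.nio.charset.Charset;
import java.nio.charset.CharsetDecoder;
import java.nio.charset.CodingErrorAction;
import java.nio.charset.StandardCharsets;

// Hypothetical helper (not from the original post): decodes BOM-less bytes
// as UTF-8 when they are valid UTF-8, otherwise as GBK.
final class Utf8OrGbk {

    static String decode(byte[] data) {
        // REPORT makes the decoder throw on invalid input instead of
        // silently substituting replacement characters.
        CharsetDecoder strictUtf8 = StandardCharsets.UTF_8.newDecoder()
                .onMalformedInput(CodingErrorAction.REPORT)
                .onUnmappableCharacter(CodingErrorAction.REPORT);
        try {
            return strictUtf8.decode(ByteBuffer.wrap(data)).toString();
        } catch (CharacterCodingException e) {
            // Not valid UTF-8: fall back to GBK, as the original code does.
            return new String(data, Charset.forName("GBK"));
        }
    }

    public static void main(String[] args) {
        String sample = "\u4e2d\u6587"; // the two characters for "Chinese"
        byte[] utf8 = sample.getBytes(StandardCharsets.UTF_8);
        byte[] gbk = sample.getBytes(Charset.forName("GBK"));
        System.out.println(decode(utf8).equals(sample)); // prints true
        System.out.println(decode(gbk).equals(sample));  // prints true
    }
}
```

This is a heuristic, not a guarantee: a GBK file whose bytes happen to form valid UTF-8 would be misdetected, although that is uncommon for real text. Pure ASCII content decodes identically either way.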

Time: 2024-10-18 02:37:48
