我有以下网址:
https://mantis.server.company/download/test/0022450-umlauts_öä_üüü_and_special_chars_%&$#.pdf
以前无法对字符串进行编码。我只需处理这个字符串(我知道它不是有效的URL字符串),以便打开该路径后面的文件。
String url = "https://mantis-daun.server.company/download/test/0022450-umlauts_öä_üüü_and_special_chars_%&$#.pdf";
try {
url = URLDecoder.decode(url, "UTF-8");
URL myConnection = new URL(url);
URLConnection connectMe = myConnection.openConnection();
// Only for error processing
HttpURLConnection httpConn = (HttpURLConnection) connectMe;
InputStream is;
if (httpConn.getResponseCode() >= 400) {
is = httpConn.getErrorStream();
} else {
is = httpConn.getInputStream();
}
BufferedReader rd = new BufferedReader(new InputStreamReader(is));
String line;
while ((line = rd.readLine()) != null)
{
System.out.println("-----" + line);
}
rd.close();
InputStream in = connectMe.getInputStream();
BufferedInputStream bin = new BufferedInputStream(in);
byte[] buffer = new byte[(int)connectMe.getContentLength()];
int fi = 0;
while(fi<buffer.length) {
fi = fi + bin.read(buffer, fi, buffer.length - fi);
}
bin.close();
} catch (MalformedURLException e) {
e.printStackTrace();
} catch (IOException e) {
e.printStackTrace();
}
通过这种方法,我得到:
Exception in thread "main" java.lang.IllegalArgumentException: URLDecoder: Illegal hex characters in escape (%) pattern - For input string: "&$"
at java.net.URLDecoder.decode(URLDecoder.java:173)
at org.mssql.main.MSSQLAccess.main(MSSQLAccess.java:34)
关于url = url.replaceAll("%", "%25");
,我得到:
-----<!DOCTYPE HTML PUBLIC "-//IETF//DTD HTML 2.0//EN">
-----<html><head>
-----<title>400 Bad Request</title>
-----</head><body>
-----<h1>Bad Request</h1>
-----<p>Your browser sent a request that this server could not understand.<br />
-----</p>
-----<hr>
java.io.IOException: Server returned HTTP response code: 400 for URL: https://mantis-daun.server.company/download/test/0022450-umlauts_öä_üüü_and_special_chars_%&$#.pdf
-----<address>Apache/2.2.9 (Debian) PHP/5.2.6-1+lenny16 with Suhosin-Patch mod_ssl/2.2.9 OpenSSL/0.9.8o Server at mantis-daun.server.company Port 443</address>
-----</body></html>
at sun.reflect.NativeConstructorAccessorImpl.newInstance0(Native Method)
at sun.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAccessorImpl.java:39)
at sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:27)
at java.lang.reflect.Constructor.newInstance(Constructor.java:513)
at sun.net.www.protocol.http.HttpURLConnection$6.run(HttpURLConnection.java:1491)
at java.security.AccessController.doPrivileged(Native Method)
at sun.net.www.protocol.http.HttpURLConnection.getChainedException(HttpURLConnection.java:1485)
at sun.net.www.protocol.http.HttpURLConnection.getInputStream(HttpURLConnection.java:1139)
at sun.net.www.protocol.https.HttpsURLConnectionImpl.getInputStream(HttpsURLConnectionImpl.java:234)
at org.mssql.main.MSSQLAccess.main(MSSQLAccess.java:51)
Caused by: java.io.IOException: Server returned HTTP response code: 400 for URL: https://mantis-daun.server.company/download/test/0022450-umlauts_öä_üüü_and_special_chars_%&$#.pdf
at sun.net.www.protocol.http.HttpURLConnection.getInputStream(HttpURLConnection.java:1436)
at java.net.HttpURLConnection.getResponseCode(HttpURLConnection.java:379)
at sun.net.www.protocol.https.HttpsURLConnectionImpl.getResponseCode(HttpsURLConnectionImpl.java:318)
at org.mssql.main.MSSQLAccess.main(MSSQLAccess.java:39)
如果我想在普通浏览器中打开"URL“,我也会得到一个"400:坏请求”。
那么,是否有一种方法来处理带有umlauts和特殊字符的字符串,以便它可以作为"URL“使用?
也许服务器设置也有问题?
发布于 2013-03-19 00:45:02
首先,正如Xavjer所指出的,您需要对URL进行编码。接下来,拆分URL并只对路径的“文本”部分进行编码是有意义的。域名没有编码(如果您有非拉丁域名,则必须按照Punycode进行编码),还必须保留路径分隔符(当您对URL进行整体编码时,情况并非如此)。因此,您只对“下载”、“测试”和filename+extension部件进行编码。
发布于 2013-03-19 00:40:54
好吧,您尝试解码url,但是实际上应该对进行编码,以满足您的需要。它实际上崩溃了,因为它试图解码%&$,这是无效的十六进制符号.
编码将导致: https%3A%2F%2Fmantis-daun.server.company%2Fdownload%2Ftest%2F0022450-umlauts_%C3%B6%C3%A4_%C3%BC%C3%BC%C3%BC_and_special_chars_%25%26%24%23.pdf
https://stackoverflow.com/questions/15494718
复制