At least on my setup on kapsi.fi, ikiwiki always shows 404 Not Found right after I add a page with non-ASCII characters in its name. The page does exist, though: it is visible afterwards at its correctly URL-encoded address, and the blog post is inlined correctly on the feed page.

Apparently ikiwiki.info does not respond with a 404 in the same situation. Should the character encoding be set in the wiki config?

The same thing happens after editing the page. Here's an example:

 * page name displayed in 404: http://mcfrisk.kapsi.fi/skiing/posts/Iso-Sy%F6te%20Freeride%202011%20Teaser.html?updated
 * page name in the blog feed: http://mcfrisk.kapsi.fi/skiing/posts/Iso-Sy%C3%B6te%20Freeride%202011%20Teaser.html

The difference is in the word Iso-Syöte: the 404 URL encodes ö as %F6
(ISO-8859-1), while the feed URL encodes it as %C3%B6 (UTF-8). Perhaps the
browser also plays a part; I use Iceweasel from Debian unstable with
default settings.
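The two URLs in the list above differ only in how the letter ö is percent-encoded. As a rough illustration (a standalone Python sketch, not anything ikiwiki itself runs; the variable names are made up for this example), the same page name can be encoded both ways with the standard library:

```python
from urllib.parse import quote

# Page name from the report; only the "ö" is outside ASCII.
page = "Iso-Syöte Freeride 2011 Teaser.html"

# Percent-encode the same name with two different character encodings.
as_utf8 = quote(page, encoding="utf-8")
as_latin1 = quote(page, encoding="iso-8859-1")

print(as_utf8)    # Iso-Sy%C3%B6te%20Freeride%202011%20Teaser.html
print(as_latin1)  # Iso-Sy%F6te%20Freeride%202011%20Teaser.html
```

The first form matches the URL in the blog feed and the second matches the URL shown in the 404, which suggests the name is being encoded as ISO-8859-1 somewhere on the way to the redirect.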

> I remember seeing this problem twice before, and both times it was caused
> by a bug in the *web server* configuration. I think in at least one case it
> was due to an Apache rewrite rule that did a redirect and mangled the
> correct encoding.

> I recommend you check there. If you cannot find the problem with your web
> server, I recommend you get an HTTP protocol dump while saving the page,
> and post it here for analysis. You could use tcpdump, or one of the
> browser plugins that allow examining the HTTP protocol. --[[Joey]]

The server runs Debian 5.0.8, but I don't have access to the Apache configs. Here's the TCP stream from Wireshark, with the cookie data removed; the page name is testiä.html. My guess is that the page name is in UTF-8, but in the redirect after the POST it is sent to the browser encoded as ISO-8859-1.

        POST /ikiwiki.cgi HTTP/1.1
        Host: mcfrisk.kapsi.fi
        User-Agent: Mozilla/5.0 (X11; U; Linux i686; en-US; rv:1.9.1.16) Gecko/20110107 Iceweasel/3.5.16 (like Firefox/3.5.16)
        Accept: text/html,application/xhtml+xml,application/xml;q=0.9,*/*;q=0.8
        Accept-Language: en-us,en;q=0.5
        Accept-Encoding: gzip,deflate
        Accept-Charset: ISO-8859-1,utf-8;q=0.7,*;q=0.7
        Keep-Alive: 300
        Connection: keep-alive
        Referer: http://mcfrisk.kapsi.fi/ikiwiki.cgi
        Cookie: XXXX
        Content-Type: multipart/form-data; boundary=---------------------------138059850619952014921977844406
        Content-Length: 1456

        -----------------------------138059850619952014921977844406
        Content-Disposition: form-data; name="_submitted"

        2
        -----------------------------138059850619952014921977844406
        Content-Disposition: form-data; name="do"

        edit
        -----------------------------138059850619952014921977844406
        Content-Disposition: form-data; name="sid"

        93c956725705aa0bbdff98e57efb28f4
        -----------------------------138059850619952014921977844406
        Content-Disposition: form-data; name="from"


        -----------------------------138059850619952014921977844406
        Content-Disposition: form-data; name="rcsinfo"

        5419fbf402e685643ca965d577dff3dafdd0fde9
        -----------------------------138059850619952014921977844406
        Content-Disposition: form-data; name="page"

        testi..
        -----------------------------138059850619952014921977844406
        Content-Disposition: form-data; name="type"

        mdwn
        -----------------------------138059850619952014921977844406
        Content-Disposition: form-data; name="editcontent"

        test
        -----------------------------138059850619952014921977844406
        Content-Disposition: form-data; name="editmessage"


        -----------------------------138059850619952014921977844406
        Content-Disposition: form-data; name="_submit"

        Save Page
        -----------------------------138059850619952014921977844406
        Content-Disposition: form-data; name="attachment"; filename=""
        Content-Type: application/octet-stream


        -----------------------------138059850619952014921977844406--
        HTTP/1.1 302 Found
        Date: Wed, 02 Feb 2011 19:45:49 GMT
        Server: Apache/2.2
        Location: /testi%E4.html?updated
        Content-Length: 0
        Keep-Alive: timeout=5, max=500
        Connection: Keep-Alive
        Content-Type: text/plain

        GET /testi%E4.html?updated HTTP/1.1
        Host: mcfrisk.kapsi.fi
        User-Agent: Mozilla/5.0 (X11; U; Linux i686; en-US; rv:1.9.1.16) Gecko/20110107 Iceweasel/3.5.16 (like Firefox/3.5.16)
        Accept: text/html,application/xhtml+xml,application/xml;q=0.9,*/*;q=0.8
        Accept-Language: en-us,en;q=0.5
        Accept-Encoding: gzip,deflate
        Accept-Charset: ISO-8859-1,utf-8;q=0.7,*;q=0.7
        Keep-Alive: 300
        Connection: keep-alive
        Referer: http://mcfrisk.kapsi.fi/ikiwiki.cgi
        Cookie: XXXX

        HTTP/1.1 404 Not Found
        Date: Wed, 02 Feb 2011 19:45:55 GMT
        Server: Apache/2.2
        Content-Length: 279
        Keep-Alive: timeout=5, max=499
        Connection: Keep-Alive
        Content-Type: text/html; charset=iso-8859-1

        <!DOCTYPE HTML PUBLIC "-//IETF//DTD HTML 2.0//EN">
        <html><head>
        <title>404 Not Found</title>
        </head><body>
        <h1>Not Found</h1>
        <p>The requested URL /testi..html was not found on this server.</p>
        <hr>
        <address>Apache/2.2 Server at mcfrisk.kapsi.fi Port 80</address>
        </body></html>

Fetching the page directly, with the UTF-8-encoded URL, has worked every time:

        GET /testi%C3%A4.html HTTP/1.1
        Host: mcfrisk.kapsi.fi
        User-Agent: Mozilla/5.0 (X11; U; Linux i686; en-US; rv:1.9.1.16) Gecko/20110107 Iceweasel/3.5.16 (like Firefox/3.5.16)
        Accept: text/html,application/xhtml+xml,application/xml;q=0.9,*/*;q=0.8
        Accept-Language: en-us,en;q=0.5
        Accept-Encoding: gzip,deflate
        Accept-Charset: ISO-8859-1,utf-8;q=0.7,*;q=0.7
        Keep-Alive: 300
        Connection: keep-alive
        Cookie: XXXX
        If-Modified-Since: Wed, 02 Feb 2011 19:45:54 GMT
        If-None-Match: "1b518d-7c0-49b51e5a55c5f"
        Cache-Control: max-age=0

        HTTP/1.1 304 Not Modified
        Date: Wed, 02 Feb 2011 20:01:43 GMT
        Server: Apache/2.2
        Connection: Keep-Alive
        Keep-Alive: timeout=5, max=500
        ETag: "1b518d-7c0-49b51e5a55c5f"
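For anyone wanting to check a dump like the ones above without reading hex by eye, here is a small Python sketch. It is only an illustration: the helper names `guess_path_encoding` and `reencode_as_utf8` are hypothetical and not part of ikiwiki or Apache, and it assumes the path was percent-encoded as either UTF-8 or ISO-8859-1. It classifies the path from the `Location` header and shows what the UTF-8 form would have been.

```python
from urllib.parse import quote, unquote_to_bytes


def guess_path_encoding(path: str) -> str:
    """Return 'ascii', 'utf-8' or 'iso-8859-1' for a percent-encoded URL path.

    Assumption: the path was encoded as UTF-8 or ISO-8859-1. Every byte
    sequence decodes as ISO-8859-1, so that is only used as the fallback.
    """
    raw = unquote_to_bytes(path)
    if all(byte < 0x80 for byte in raw):
        return "ascii"
    try:
        raw.decode("utf-8")
        return "utf-8"
    except UnicodeDecodeError:
        return "iso-8859-1"


def reencode_as_utf8(path: str) -> str:
    """Turn a percent-encoded ISO-8859-1 path into percent-encoded UTF-8."""
    text = unquote_to_bytes(path).decode("iso-8859-1")
    return quote(text, safe="/", encoding="utf-8")


redirect_path = "/testi%E4.html"    # path from the 302 Location header
direct_path = "/testi%C3%A4.html"   # path from the GET that works

print(guess_path_encoding(redirect_path))  # iso-8859-1
print(guess_path_encoding(direct_path))    # utf-8
print(reencode_as_utf8(redirect_path))     # /testi%C3%A4.html
```

Under those assumptions, the redirect target is classified as ISO-8859-1 while the working URL is UTF-8, which is consistent with the guess that the name gets re-encoded somewhere between the CGI and the `Location` header.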