Re: UTF-8 in mail headers (namely FROM) sent by bugzilla
authorDaniel Kahn Gillmor <dkg@fifthhorseman.net>
Mon, 29 Jul 2013 19:46:50 +0000 (15:46 +2000)
committerW. Trevor King <wking@tremily.us>
Fri, 7 Nov 2014 17:56:12 +0000 (09:56 -0800)
1c/c40d5591f06374c3ef47832dddc77296a9c78b [new file with mode: 0644]

diff --git a/1c/c40d5591f06374c3ef47832dddc77296a9c78b b/1c/c40d5591f06374c3ef47832dddc77296a9c78b
new file mode 100644 (file)
index 0000000..dc34e17
--- /dev/null
@@ -0,0 +1,128 @@
+Return-Path: <dkg@fifthhorseman.net>\r
+X-Original-To: notmuch@notmuchmail.org\r
+Delivered-To: notmuch@notmuchmail.org\r
+Received: from localhost (localhost [127.0.0.1])\r
+       by olra.theworths.org (Postfix) with ESMTP id 518D6431FC3\r
+       for <notmuch@notmuchmail.org>; Mon, 29 Jul 2013 12:46:56 -0700 (PDT)\r
+X-Virus-Scanned: Debian amavisd-new at olra.theworths.org\r
+X-Spam-Flag: NO\r
+X-Spam-Score: 0\r
+X-Spam-Level: \r
+X-Spam-Status: No, score=0 tagged_above=-999 required=5 tests=[none]\r
+       autolearn=disabled\r
+Received: from olra.theworths.org ([127.0.0.1])\r
+       by localhost (olra.theworths.org [127.0.0.1]) (amavisd-new, port 10024)\r
+       with ESMTP id xAeRoB3lmb3H for <notmuch@notmuchmail.org>;\r
+       Mon, 29 Jul 2013 12:46:51 -0700 (PDT)\r
+Received: from che.mayfirst.org (che.mayfirst.org [209.234.253.108])\r
+       by olra.theworths.org (Postfix) with ESMTP id 20BBB431FC7\r
+       for <notmuch@notmuchmail.org>; Mon, 29 Jul 2013 12:46:51 -0700 (PDT)\r
+Received: from [192.168.23.229] (dsl254-070-154.nyc1.dsl.speakeasy.net\r
+       [216.254.70.154])\r
+       by che.mayfirst.org (Postfix) with ESMTPSA id 95CC6F984;\r
+       Mon, 29 Jul 2013 15:46:48 -0400 (EDT)\r
+Message-ID: <51F6C6AA.7010001@fifthhorseman.net>\r
+Date: Mon, 29 Jul 2013 15:46:50 -0400\r
+From: Daniel Kahn Gillmor <dkg@fifthhorseman.net>\r
+User-Agent: Mozilla/5.0 (X11; Linux x86_64;\r
+       rv:17.0) Gecko/20130630 Icedove/17.0.7\r
+MIME-Version: 1.0\r
+To: David Bremner <david@tethera.net>\r
+Subject: Re: UTF-8 in mail headers (namely FROM) sent by bugzilla\r
+References: <08cb1dcd-c5db-4e33-8b09-7730cb3d59a2@gmail.com>\r
+       <871u6psjwr.fsf@ericabrahamsen.net>\r
+       <5712cc41-d0ce-4ed3-af1c-37cf639dd9c0@gmail.com>\r
+       <87y58xv71x.fsf@zancas.localnet>\r
+In-Reply-To: <87y58xv71x.fsf@zancas.localnet>\r
+X-Enigmail-Version: 1.5.1\r
+Content-Type: multipart/signed; micalg=pgp-sha512;\r
+       protocol="application/pgp-signature";\r
+       boundary="----enig2LHNJNKRNVBGUMGFBMJAV"\r
+Cc: notmuch@notmuchmail.org\r
+X-BeenThere: notmuch@notmuchmail.org\r
+X-Mailman-Version: 2.1.13\r
+Precedence: list\r
+List-Id: "Use and development of the notmuch mail system."\r
+       <notmuch.notmuchmail.org>\r
+List-Unsubscribe: <http://notmuchmail.org/mailman/options/notmuch>,\r
+       <mailto:notmuch-request@notmuchmail.org?subject=unsubscribe>\r
+List-Archive: <http://notmuchmail.org/pipermail/notmuch>\r
+List-Post: <mailto:notmuch@notmuchmail.org>\r
+List-Help: <mailto:notmuch-request@notmuchmail.org?subject=help>\r
+List-Subscribe: <http://notmuchmail.org/mailman/listinfo/notmuch>,\r
+       <mailto:notmuch-request@notmuchmail.org?subject=subscribe>\r
+X-List-Received-Date: Mon, 29 Jul 2013 19:46:56 -0000\r
+\r
+This is an OpenPGP/MIME signed message (RFC 4880 and 3156)\r
+------enig2LHNJNKRNVBGUMGFBMJAV\r
+Content-Type: text/plain; charset=UTF-8\r
+Content-Transfer-Encoding: quoted-printable\r
+\r
+On 07/23/2013 07:39 AM, David Bremner wrote:\r
+> Franz Fellner <alpine.art.de@gmail.com> writes:\r
+>=20\r
+>>\r
+>> OK, thx. So every app needs to get patched to display those strings\r
+>> properly? Any chance this could be done directly in libnotmuch?  I\r
+>> grepped for "2047" inside te "emacs" subtree, but found nothing (had\r
+>> the hope for a comment for the workaround). Would be interesting to\r
+>> see how this is done, so I can at least try to create a patch (though\r
+>> my ruby is quite basic).\r
+>=20\r
+> In general notmuch relies on libgmime for rfc2047 parsing.  I'm not sur=\r
+e\r
+> of all the details now, but some of the filtering does happen in the\r
+> CLI, not the lib.  You could start by looking at\r
+> gmime-filter-headers.[ch] in the top directory.\r
+\r
+I agree this should be handled properly by gmime.  If it turns out that\r
+the library is misbehaving (i.e. that notmuch is using it sensibly and\r
+we're still getting bad data out of well-formed strings), it should be\r
+reported and fixed there.\r
+\r
+Just a note that other MUAs are struggling with this sort of thing too:\r
+\r
+http://blog.steve.org.uk/international_character_sets_and_encodings_are_h=\r
+ard_.html\r
+\r
+Steve Kemp (author of lumail) has good engineering skills and instincts;\r
+anyone actively working on trying to get this fixed "right" within\r
+notmuch (or underlying libraries) could probably drop him an e-mail and\r
+collaborate.  With a decent diagnostic of the specific problems and use\r
+cases, plus a recommendation for where the fix should be and how it\r
+should be done, the two projects together could probably exert\r
+sufficient influence on underlying libraries and toolchains to get them\r
+to address any issues.\r
+\r
+sorry to just provide links and not any actual analysis and code.\r
+\r
+       --dkg\r
+\r
+\r
+------enig2LHNJNKRNVBGUMGFBMJAV\r
+Content-Type: application/pgp-signature; name="signature.asc"\r
+Content-Description: OpenPGP digital signature\r
+Content-Disposition: attachment; filename="signature.asc"\r
+\r
+-----BEGIN PGP SIGNATURE-----\r
+Version: GnuPG v1.4.14 (GNU/Linux)\r
+Comment: Using GnuPG with Icedove - http://www.enigmail.net/\r
+\r
+iQJ8BAEBCgBmBQJR9saqXxSAAAAAAC4AKGlzc3Vlci1mcHJAbm90YXRpb25zLm9w\r
+ZW5wZ3AuZmlmdGhob3JzZW1hbi5uZXRFQjk2OTEyODdBN0FEREUzNzU3RDkxMUVB\r
+NTI0MDFCMTFCRkRGQTVDAAoJEKUkAbEb/fpcyk0QANMkTyqXqpFZ8sBaKhw1rY/P\r
+SzyCDFYWaTXhPMFDacCEmU6bG8OB1/0VLvPjKwcXs40UHgtxu2aNswJhoyuP0lry\r
+MYZCo/zb3Tup5m6jSXbMaZxvjHZ1kjsOUNR0mvMOJoAva81mbubG15I4obmWSldb\r
+sR4/vYqjGfO8Q69wRzJpJBUPM3qReKUDd8fr0/frI6DR4UYB7QusASLaEp7d/WgY\r
+MBqPD0DIw9nF1FOZ2xX1kdNbtNQvOdK3VBL9793k4VGJMfv/qzkfn3TPHwQMwCNP\r
+OlhjHPUDr1fhZKiFp4xa7dPFIvWEppEdyeNaQ3SLIEV7GHw2Qw1yGubnneh3IEOg\r
+VFXOnmD+eurxyHM6r9KEzmRK4K9X5CA6abLOcNYb6R6FFPZ2BDWZiOGWiTgsyJsl\r
+T/6DlZzubBTg/s3Wj0YKPVCq28/xyt7XT8zw9kMM+7eKNww2boqkZruvFIwFhcx1\r
+Bet8bualMUQasp/B/MN3Z2AQZJD+FoPeYIbqQytkhRBlbIiDbABKINdqnmpMabd1\r
+3/ah9gw/X4/s2/euAY1TzwiIgAfgj30ZD/gn4wXTIZ7sC1ITo/VV9Iyf29X/oHKf\r
+DzCOa0WR9YG5ML0OYDSJqxJEaLomJaAtH3TGzQ/Lcl+A+6dS9+Fik8lhvKarDOxM\r
+nCAIMQidKBxWcXRdAzkf\r
+=VNoq\r
+-----END PGP SIGNATURE-----\r
+\r
+------enig2LHNJNKRNVBGUMGFBMJAV--\r