From e648d24aa899ec2a7983ccb0f15c0a57741f4e53 Mon Sep 17 00:00:00 2001 From: David Bremner Date: Wed, 17 Feb 2016 08:38:09 +2000 Subject: [PATCH] encoding of message-ids --- 5d/79c340780b6613a838bfb0a2041dbd3075db54 | 58 +++++++++++++++++++++++ 1 file changed, 58 insertions(+) create mode 100644 5d/79c340780b6613a838bfb0a2041dbd3075db54 diff --git a/5d/79c340780b6613a838bfb0a2041dbd3075db54 b/5d/79c340780b6613a838bfb0a2041dbd3075db54 new file mode 100644 index 000000000..73da1c496 --- /dev/null +++ b/5d/79c340780b6613a838bfb0a2041dbd3075db54 @@ -0,0 +1,58 @@ +Return-Path: +X-Original-To: notmuch@notmuchmail.org +Delivered-To: notmuch@notmuchmail.org +Received: from localhost (localhost [127.0.0.1]) + by arlo.cworth.org (Postfix) with ESMTP id F329A6DE141B + for ; Tue, 16 Feb 2016 04:38:19 -0800 (PST) +X-Virus-Scanned: Debian amavisd-new at cworth.org +X-Spam-Flag: NO +X-Spam-Score: -0.307 +X-Spam-Level: +X-Spam-Status: No, score=-0.307 tagged_above=-999 required=5 tests=[AWL=0.244, + RP_MATCHES_RCVD=-0.55, SPF_PASS=-0.001] autolearn=disabled +Received: from arlo.cworth.org ([127.0.0.1]) + by localhost (arlo.cworth.org [127.0.0.1]) (amavisd-new, port 10024) + with ESMTP id 3zhaJlOONEfc for ; + Tue, 16 Feb 2016 04:38:13 -0800 (PST) +Received: from fethera.tethera.net (fethera.tethera.net [198.245.60.197]) + by arlo.cworth.org (Postfix) with ESMTPS id AF4B46DE02C9 + for ; Tue, 16 Feb 2016 04:38:13 -0800 (PST) +Received: from remotemail by fethera.tethera.net with local (Exim 4.84) + (envelope-from ) + id 1aVesi-0002It-RF; Tue, 16 Feb 2016 07:37:28 -0500 +Received: (nullmailer pid 23250 invoked by uid 1000); + Tue, 16 Feb 2016 12:38:09 -0000 +From: David Bremner +To: notmuch@notmuchmail.org +Subject: encoding of message-ids +User-Agent: Notmuch/0.21+26~g9404723 (http://notmuchmail.org) Emacs/24.5.1 + (x86_64-pc-linux-gnu) +Date: Tue, 16 Feb 2016 08:38:09 -0400 +Message-ID: <87si0svnim.fsf@zancas.localnet> +MIME-Version: 1.0 +Content-Type: text/plain +X-BeenThere: notmuch@notmuchmail.org +X-Mailman-Version: 2.1.20 +Precedence: list +List-Id: "Use and development of the notmuch mail system." + +List-Unsubscribe: , + +List-Archive: +List-Post: +List-Help: +List-Subscribe: , + +X-List-Received-Date: Tue, 16 Feb 2016 12:38:20 -0000 + + +I spent a little time this morning staring at the code, and it seems +that all of the message-ids are parsed via g_mime_decode_text, which +deals with RFC2047 encodings and makes guesses at decoding 8bit +characters. In practice this means that in the notmuch database all +headers are UTF-8. Since message-id's are supposed to be printable ascii +[at least in rfc5322], this seems like not such a terrible decision, but +I wonder if we should document this potential conversion somewhere? + +d + -- 2.26.2