Re: Unicode Paths
authorMartin Owens <doctormo@gmail.com>
Thu, 15 Sep 2011 16:52:30 +0000 (12:52 +2000)
committerW. Trevor King <wking@tremily.us>
Fri, 7 Nov 2014 17:39:27 +0000 (09:39 -0800)
bb/7f4d32183509f962b609d0e7ebbc71a6a90e72 [new file with mode: 0644]

diff --git a/bb/7f4d32183509f962b609d0e7ebbc71a6a90e72 b/bb/7f4d32183509f962b609d0e7ebbc71a6a90e72
new file mode 100644 (file)
index 0000000..7587cbc
--- /dev/null
@@ -0,0 +1,83 @@
+Return-Path: <doctormo@gmail.com>\r
+X-Original-To: notmuch@notmuchmail.org\r
+Delivered-To: notmuch@notmuchmail.org\r
+Received: from localhost (localhost [127.0.0.1])\r
+       by olra.theworths.org (Postfix) with ESMTP id 3E117431FD0\r
+       for <notmuch@notmuchmail.org>; Thu, 15 Sep 2011 10:29:41 -0700 (PDT)\r
+X-Virus-Scanned: Debian amavisd-new at olra.theworths.org\r
+X-Spam-Flag: NO\r
+X-Spam-Score: -0.799\r
+X-Spam-Level: \r
+X-Spam-Status: No, score=-0.799 tagged_above=-999 required=5\r
+       tests=[DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1,\r
+       FREEMAIL_FROM=0.001, RCVD_IN_DNSWL_LOW=-0.7] autolearn=disabled\r
+Received: from olra.theworths.org ([127.0.0.1])\r
+       by localhost (olra.theworths.org [127.0.0.1]) (amavisd-new, port 10024)\r
+       with ESMTP id p1xWhkqsSM-F for <notmuch@notmuchmail.org>;\r
+       Thu, 15 Sep 2011 10:29:40 -0700 (PDT)\r
+Received: from mail-qw0-f46.google.com (mail-qw0-f46.google.com\r
+       [209.85.216.46]) (using TLSv1 with cipher RC4-SHA (128/128 bits))\r
+       (No client certificate requested)\r
+       by olra.theworths.org (Postfix) with ESMTPS id D6624431FB6\r
+       for <notmuch@notmuchmail.org>; Thu, 15 Sep 2011 10:29:40 -0700 (PDT)\r
+Received: by qwj8 with SMTP id 8so1114713qwj.5\r
+       for <notmuch@notmuchmail.org>; Thu, 15 Sep 2011 10:29:39 -0700 (PDT)\r
+DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma;\r
+       h=subject:from:to:cc:in-reply-to:references:content-type:date\r
+       :message-id:mime-version:x-mailer:content-transfer-encoding;\r
+       bh=wqnyt7s7pToiZ1Fkn23fLxUYm0+b6VXoE0xF3BeNHb8=;\r
+       b=A2qbKasEWgDRfUJY6UaARI4gD6ievoiMKq1QOzc141aXfQyfcR/kRQO8psv+EvwD5V\r
+       WoNsapPp+Ioom7zEIs70BgyDxWU9dDiNqInXG/0fTfuQntz9AHJUJWwlfOfD5pT0Am+Z\r
+       ihzUQ0oNrp54xM/Slls8ZF/uwkv/JTEKZ5kMM=\r
+Received: by 10.229.86.135 with SMTP id s7mr1157401qcl.257.1316107779138;\r
+       Thu, 15 Sep 2011 10:29:39 -0700 (PDT)\r
+Received: from [192.168.1.190] (pool-68-163-190-45.bos.east.verizon.net.\r
+       [68.163.190.45])\r
+       by mx.google.com with ESMTPS id do8sm4274515qab.17.2011.09.15.10.29.36\r
+       (version=SSLv3 cipher=OTHER); Thu, 15 Sep 2011 10:29:37 -0700 (PDT)\r
+Subject: Re: Unicode Paths\r
+From: Martin Owens <doctormo@gmail.com>\r
+To: Kan-Ru Chen <kanru@kanru.info>\r
+In-Reply-To: <8739fzwxfv.fsf@isil.kanru.info>\r
+References: <1315972539.2201.11.camel@delen> <8739fzwxfv.fsf@isil.kanru.info>\r
+Content-Type: text/plain; charset="UTF-8"\r
+Date: Thu, 15 Sep 2011 12:52:30 -0400\r
+Message-ID: <1316105550.2201.21.camel@delen>\r
+Mime-Version: 1.0\r
+X-Mailer: Evolution 2.30.3 \r
+Content-Transfer-Encoding: 7bit\r
+Cc: Notmuch developer list <notmuch@notmuchmail.org>\r
+X-BeenThere: notmuch@notmuchmail.org\r
+X-Mailman-Version: 2.1.13\r
+Precedence: list\r
+List-Id: "Use and development of the notmuch mail system."\r
+       <notmuch.notmuchmail.org>\r
+List-Unsubscribe: <http://notmuchmail.org/mailman/options/notmuch>,\r
+       <mailto:notmuch-request@notmuchmail.org?subject=unsubscribe>\r
+List-Archive: <http://notmuchmail.org/pipermail/notmuch>\r
+List-Post: <mailto:notmuch@notmuchmail.org>\r
+List-Help: <mailto:notmuch-request@notmuchmail.org?subject=help>\r
+List-Subscribe: <http://notmuchmail.org/mailman/listinfo/notmuch>,\r
+       <mailto:notmuch-request@notmuchmail.org?subject=subscribe>\r
+X-List-Received-Date: Thu, 15 Sep 2011 17:29:41 -0000\r
+\r
+It looks like the python variables do include null, my investigations\r
+show that the problem also effects tag names.\r
+\r
+The symptoms can be seen when trying to use the python interface and\r
+using unicode tag names or paths. Instead of seeing 'mytag1' we see 'm'\r
+and instead of '/my/path/to/mail' we see '/' thus causing issues were\r
+the db amusingly was trying to write to root.\r
+\r
+I'll see if there is a way to remove the nulls from the strings in the\r
+python bindings.\r
+\r
+Martin,\r
+\r
+On Wed, 2011-09-14 at 12:38 +0800, Kan-Ru Chen wrote:\r
+> I think as long as the path does not contain embedded null character\r
+> then it is safe. Most posix filesystem does not allow null character\r
+> in\r
+> the filename so you cannot use UTF-16 or UTF-32 to encode the unicode\r
+> path. \r
+\r