1 Return-Path: <gmn-notmuch@m.gmane.org>
\r
2 X-Original-To: notmuch@notmuchmail.org
\r
3 Delivered-To: notmuch@notmuchmail.org
\r
4 Received: from localhost (localhost [127.0.0.1])
\r
5 by olra.theworths.org (Postfix) with ESMTP id BC5B6429E20
\r
6 for <notmuch@notmuchmail.org>; Mon, 21 Mar 2011 19:04:27 -0700 (PDT)
\r
7 X-Virus-Scanned: Debian amavisd-new at olra.theworths.org
\r
11 X-Spam-Status: No, score=0 tagged_above=-999 required=5 tests=[none]
\r
13 Received: from olra.theworths.org ([127.0.0.1])
\r
14 by localhost (olra.theworths.org [127.0.0.1]) (amavisd-new, port 10024)
\r
15 with ESMTP id KXjvDmGUB36o for <notmuch@notmuchmail.org>;
\r
16 Mon, 21 Mar 2011 19:04:26 -0700 (PDT)
\r
17 Received: from lo.gmane.org (lo.gmane.org [80.91.229.12])
\r
18 by olra.theworths.org (Postfix) with ESMTP id 8D41D431FD0
\r
19 for <notmuch@notmuchmail.org>; Mon, 21 Mar 2011 19:04:26 -0700 (PDT)
\r
20 Received: from list by lo.gmane.org with local (Exim 4.69)
\r
21 (envelope-from <gmn-notmuch@m.gmane.org>) id 1Q1qxC-0003BK-U8
\r
22 for notmuch@notmuchmail.org; Tue, 22 Mar 2011 03:04:14 +0100
\r
23 Received: from c-71-237-233-41.hsd1.or.comcast.net ([71.237.233.41])
\r
24 by main.gmane.org with esmtp (Gmexim 0.1 (Debian))
\r
26 for <notmuch@notmuchmail.org>; Tue, 22 Mar 2011 03:04:14 +0100
\r
27 Received: from mueen by c-71-237-233-41.hsd1.or.comcast.net with local (Gmexim
\r
28 0.1 (Debian)) id 1AlnuQ-0007hv-00
\r
29 for <notmuch@notmuchmail.org>; Tue, 22 Mar 2011 03:04:14 +0100
\r
30 X-Injected-Via-Gmane: http://gmane.org/
\r
31 To: notmuch@notmuchmail.org
\r
32 From: Mueen Nawaz <mueen@nawaz.org>
\r
33 Subject: Re: Questions about importing mail (mbox)
\r
34 Date: Mon, 21 Mar 2011 19:02:45 -0700
\r
36 Message-ID: <87hbavlxoa.fsf@fester.com>
\r
37 References: <87bp15m9oz.fsf@fester.com>
\r
38 <87zkooo88x.fsf@A7GMS.i-did-not-set--mail-host-address--so-tickle-me>
\r
40 Content-Type: text/plain; charset=us-ascii
\r
41 X-Complaints-To: usenet@dough.gmane.org
\r
42 X-Gmane-NNTP-Posting-Host: c-71-237-233-41.hsd1.or.comcast.net
\r
43 Cancel-Lock: sha1:UCoiTIrgg/2RtGaCCdMlXQ7uo5o=
\r
44 X-BeenThere: notmuch@notmuchmail.org
\r
45 X-Mailman-Version: 2.1.13
\r
47 List-Id: "Use and development of the notmuch mail system."
\r
48 <notmuch.notmuchmail.org>
\r
49 List-Unsubscribe: <http://notmuchmail.org/mailman/options/notmuch>,
\r
50 <mailto:notmuch-request@notmuchmail.org?subject=unsubscribe>
\r
51 List-Archive: <http://notmuchmail.org/pipermail/notmuch>
\r
52 List-Post: <mailto:notmuch@notmuchmail.org>
\r
53 List-Help: <mailto:notmuch-request@notmuchmail.org?subject=help>
\r
54 List-Subscribe: <http://notmuchmail.org/mailman/listinfo/notmuch>,
\r
55 <mailto:notmuch-request@notmuchmail.org?subject=subscribe>
\r
56 X-List-Received-Date: Tue, 22 Mar 2011 02:04:27 -0000
\r
58 Pieter Praet <pieter@praet.org> writes:
\r
59 > It would've been a no-brainer if you'd been using Maildir all along
\r
60 > (mbox is evil incarnate), but...
\r
62 Sure, but mbox is too convenient.
\r
64 > I'd suggest keeping your original mbox file safe in git [1], and
\r
65 > consistently commiting every step of the way, so even if messages were
\r
66 > to get lost in translation, you still have a way to get them back, with
\r
67 > negligible storage overhead (just remember to "git gc --aggressive
\r
68 > --prune=now" when you're finished).
\r
70 I think you misunderstood me. A part of me suspects this has something
\r
71 to do with my not explaining myself, but who's to say?<G>
\r
73 I'm experimenting with notmuch, and if I can translate everything I
\r
74 currently do in mutt to notmuch, then I'll just dump mutt. The set of
\r
75 mboxes I have will remain archived, but for all future incoming email,
\r
76 I'll switch to MH or MailDir. So I don't actually need to put my old
\r
77 mboxes under revision control - I just need to save them somewhere.
\r
79 > For the actual conversion to Maildir (and any type of mail fetching in
\r
80 > general), I'd suggest using FDM [2], you'll never look back.
\r
82 Thanks - will take a look.
\r
84 > Regarding the significant discrepancy between processed and added files
\r
85 > in Notmuch: Could be dupes (e.g. mail to/cc/bcc yourself or mailing
\r
86 > lists, ending up in both Inbox and Sent), which are automatically
\r
87 > suppressed by Notmuch.
\r
89 It definitely was dupes. I didn't realize that notmuch did not keep
\r
92 So I wrote a Python script to go through the mboxes and do a count of
\r
93 only unique messages. Problem? I have over 1000 emails that don't have a
\r
94 Message-ID header (case invariant search). I could go over why that is,
\r
95 but suffice it to say that I hate Microsoft.<G>
\r
97 Once I remove all dupes, I get to within 300-400 of the count that
\r
98 notmuch provides. The remaining 1000+ emails do contain some dupes, and
\r
99 I can't find a convenient way to get an accurate count of unique emails
\r
100 from them, but at least now I'm in the ballpark, and a lot more
\r
103 Incidentally, one reason I didn't realize dupes were the reason is that
\r
104 I did a search for a word in one email I had and notmuch did not find
\r
105 it - so I assumed it had not been indexed. Later on, I realized I had
\r
106 written a partial word and discovered that notmuch does find it if I
\r
107 type the full word.
\r
109 What am I doing wrong? Can't notmuch handle partial word matches? Do I
\r
110 need to specify an option to get that to work?
\r
112 Anyway, thanks for the help - I'll investigate further.
\r