From: David Bremner
To: Tom Prince, Austin Clements
Cc: Notmuch Mail
Subject: Re: More ideas about logging.
Date: Tue, 20 Dec 2011 16:25:53 -0400
Message-ID: <87k45r9ei6.fsf@convex-new.cs.unb.ca>
In-Reply-To: <87r501k4ub.fsf@loki.hocat.ca>
References: <87obv9i7y3.fsf@zancas.localnet> <20111216040722.GC12245@mit.edu> <87hb0xbug7.fsf@zancas.localnet> <87r501k4ub.fsf@loki.hocat.ca>
User-Agent: Notmuch/0.10.2+93~g518d4ef (http://notmuchmail.org) Emacs/23.3.1 (x86_64-pc-linux-gnu)
List-Id: "Use and development of the notmuch mail system."

On Sun, 18 Dec 2011 13:22:20 -0700, Tom Prince wrote:
> On Sun, 18 Dec 2011 14:34:00 -0400, David Bremner wrote:
> > The more worrying part is disk usage; the tag tree for 200k messages
> > uses 400k inodes, and 836M of apparent disk usage (according to du) the
> > same tags in "sup" format take 11M. Maybe this could be useful if
> > combined with some scheme to only dump tags not covered by maildir (for
> > those using maildir flag synching already)
>
> Well, it would seem natural to re-use the nmbug logic here, and just use
> a bare git repo for this. One would need a way to sync and merge the
> tag-tree automatically anyway. I admit I haven't tried nmbug yet, but it
> seems that nmbug, switched from sync just notmuch:: to syncing
> everything but notmuch:: would be a sensible way to sync tags?

I was mainly interested in whether some guarantee of atomicity could be
given in a simple way. The git update-index approach doesn't really
make that kind of guarantee. Probably this is tolerable for a
human-initiated "dump" process; not so much for other uses.
Furthermore, much of the motivation for both mtimes and logging is to
make incremental dumping possible, in order to avoid the time needed
for a full dump. This experiment was also to see how feasible it was to
insert some "mkdir+creat" in the notmuch-tag critical path.

Since a few people have mentioned this, I should confess that there are
(at least) 2 performance bugs lurking in nmbug that make it probably
not yet suitable for large scale tag syncing.

1) I did not get the merging working with only the index, so nmbug
   currently makes a temporary checkout to do the merge.

2) Transferring tags from the git repo to xapian is currently quite
   slow because it does one call to git tag for each tag, rather than
   constructing an input for "notmuch restore".

I _think_ both of these are fixable in principle. Maybe somebody with
better git internals knowledge than I would like to take a look at
(1). (2) is just a SimpleMatterOfProgramming (TM). Patches, as they
say, are welcome ;).

d
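
For anyone tempted by (2), here is a rough sketch of the batching idea, not nmbug's actual code. It assumes the tags have already been read out of the git repo as (message-id, tag) pairs, and that the "sup" dump format is one line per message of the form `msgid (tag1 tag2)`. The names build_restore_input and restore_tags are made up for illustration.

```python
import subprocess
from collections import defaultdict


def build_restore_input(pairs):
    """Group (message_id, tag) pairs into one line per message.

    Assumed output format ("sup" style): msgid (tag1 tag2 ...)
    """
    tags_by_id = defaultdict(set)
    for message_id, tag in pairs:
        tags_by_id[message_id].add(tag)

    lines = []
    for message_id in sorted(tags_by_id):
        tags = " ".join(sorted(tags_by_id[message_id]))
        lines.append(f"{message_id} ({tags})")
    return ("\n".join(lines) + "\n") if lines else ""


def restore_tags(pairs):
    """Feed all tags to a single "notmuch restore" process on stdin,
    instead of starting one process per tag."""
    data = build_restore_input(pairs)
    if data:
        subprocess.run(["notmuch", "restore"], input=data, text=True, check=True)
```

The point is only that the cost becomes one process start plus one pass over the database, rather than one process per tag. Whether the tags listed for a message should replace or add to the existing ones is a separate question this sketch does not address.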