Return-Path: X-Original-To: notmuch@notmuchmail.org Delivered-To: notmuch@notmuchmail.org Received: from localhost (localhost [127.0.0.1]) by olra.theworths.org (Postfix) with ESMTP id B82C5431FBC for ; Sat, 12 Dec 2009 04:01:00 -0800 (PST) X-Virus-Scanned: Debian amavisd-new at olra.theworths.org Received: from olra.theworths.org ([127.0.0.1]) by localhost (olra.theworths.org [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id n263XsGBQLYN for ; Sat, 12 Dec 2009 04:01:00 -0800 (PST) Received: from gw03.mail.saunalahti.fi (gw03.mail.saunalahti.fi [195.197.172.111]) by olra.theworths.org (Postfix) with ESMTP id A7880431FAE for ; Sat, 12 Dec 2009 04:00:59 -0800 (PST) Received: from djcbsoftware.nl (a88-112-254-208.elisa-laajakaista.fi [88.112.254.208]) (using TLSv1 with cipher ADH-AES256-SHA (256/256 bits)) (No client certificate requested) by gw03.mail.saunalahti.fi (Postfix) with ESMTP id 54E2F2165C2 for ; Sat, 12 Dec 2009 14:00:53 +0200 (EET) Received: from cthulhu.mindcrime.djcbsoftware.nl (localhost [127.0.0.1]) by djcbsoftware.nl (Postfix) with ESMTP id AF77439C0D8 for ; Sat, 12 Dec 2009 13:36:24 +0200 (EET) Date: Sat, 12 Dec 2009 13:36:24 +0200 Message-ID: <87r5r0ctl3.wl%djcb@djcbsoftware.nl> From: Dirk-Jan C. Binnema To: "notmuch@notmuchmail org" Mail-Reply-To: djcb@djcbsoftware.nl User-Agent: Wanderlust/2.15.6 (Almost Unreal) Emacs/23.1 Mule/6.0 (HANACHIRUSATO) Organization: DJCBSoftware MIME-Version: 1.0 (generated by SEMI 1.14.6 - "Maruoka") Content-Type: text/plain; charset=US-ASCII Subject: [notmuch] Subject: [PATCH] update the check whether a dir entry should be ignored. X-BeenThere: notmuch@notmuchmail.org X-Mailman-Version: 2.1.12 Precedence: list Reply-To: djcb@djcbsoftware.nl List-Id: "Use and development of the notmuch mail system." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Sat, 12 Dec 2009 12:01:00 -0000 Hi all, This is a draft patch which hopefully improves the check whether a dir entry should be ignored for that. It adds one feature: if you put a file '.noindex' in a dir, the whole dir will be ignored for indexing. I find this very useful for removing e.g. folders with spam messages from the indexing. There is one maybe controversial change, namely that it ignores all dot-dirs; this works fine for .notmuch and .nnmaildir (gnus), but maybe there is some valid use case for having mail in dot-dirs. Maybe one of the IMAP-servers does this? Not sure. Anyway, I can change that part. If the overall approach is considered OK, I can make a new patch Best wishes, Dirk. --- notmuch-new.c | 48 ++++++++++++++++++++++++++++++++++-------------- 1 files changed, 34 insertions(+), 14 deletions(-) diff --git a/notmuch-new.c b/notmuch-new.c index 9d20616..28f69bc 100644 --- a/notmuch-new.c +++ b/notmuch-new.c @@ -109,6 +109,30 @@ is_maildir (struct dirent **entries, int count) return 0; } + +static int +ignore_dir_entry (const char* path, struct dirent *entry) +{ + char noindex[4096]; /* any path will fit */ + + /* ignore everything starting with a dot; this covers hidden + * files, as well as special dir (. and ..), but also things like + * gnus .nnmaildir or .notmuch */ + if (entry->d_name[0] == '.') + return 1; + + /* we also check if dir contains a file called '.noindex'; if so, + * we ignore this directory; alloca would be suitable here, if not + * for the portability. */ + snprintf (noindex, sizeof(noindex), "%s/%s/.noindex", path, entry->d_name); + if (access (noindex, F_OK) == 0) + return 1; + + return 0; /* don't ignore */ +} + + + /* Examine 'path' recursively as follows: * * o Ask the filesystem for the mtime of 'path' (path_mtime) @@ -181,21 +205,17 @@ add_files_recursive (notmuch_database_t *notmuch, if (path_mtime <= path_dbtime && entry->d_type == DT_REG) continue; - /* Ignore special directories to avoid infinite recursion. - * Also ignore the .notmuch directory. - */ - /* XXX: Eventually we'll want more sophistication to let the - * user specify files to be ignored. */ - if (strcmp (entry->d_name, ".") == 0 || - strcmp (entry->d_name, "..") == 0 || - (entry->d_type == DT_DIR && - (strcmp (entry->d_name, "tmp") == 0) && - is_maildir (namelist, num_entries)) || - strcmp (entry->d_name, ".notmuch") ==0) - { - continue; - } + /* ignore tmp Maildirs, for obvious reasons */ + if (entry->d_type == DT_DIR && + (strcmp (entry->d_name, "tmp") == 0) && + is_maildir (namelist, num_entries)) + continue; + + /* ignore special directories and files */ + if (ignore_dir_entry (path, entry)) + continue; + next = talloc_asprintf (notmuch, "%s/%s", path, entry->d_name); if (stat (next, st)) { -- 1.6.3.3