Message-ID: <4B194DB5.7010403@ev-en.org>
Date: Fri, 04 Dec 2009 19:58:13 +0200
From: Baruch Even
To: Karl Wiberg
Cc: notmuch
References: <1259840063-sup-1478@sam.mediasupervision.de> <871vjbh98x.fsf@yoom.home.cworth.org>
Subject: Re: [notmuch] Notmuch's search view sucks

Karl Wiberg wrote:
> On Fri, Dec 4, 2009 at 1:29 AM, Carl Worth wrote:
>> And a step beyond that would support different languages for
>> different emails, but that sounds like something "hard" to identify.
>
> But probably not as hard as identifying spam. It could probably be
> done with a simple Bayesian filter counting word frequencies---but
> it'd be much better if somebody else had already solved the problem,
> since this smells suspiciously like something that ought to be a
> separate project and put in a library ... does anyone know if such a
> project already exists? I know Google can do it ...
>
> It'd be very cool to have notmuch automatically tag messages according
> to what language they're in.

What we should have is an interface to run an external program to
classify a message when it's newly introduced, and another that runs
when tags are changed, so that machine learning can be made to work
when the user changes tags.

Baruch
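
To make the idea concrete, here is a rough sketch of the kind of external classifier this could drive: a naive Bayes filter over word frequencies, as Karl suggests. Everything in it is an assumption for illustration only. The class name LanguageGuesser, the methods train and classify, and the use of a tag as the training label are hypothetical, and nothing here calls notmuch. The point is the shape of the two entry points: classify would run when a message is newly introduced, and train would run when the user changes a tag.

```python
import math
import re
from collections import Counter, defaultdict

# Runs of Unicode letters only (no digits, no underscore).
WORD_RE = re.compile(r"[^\W\d_]+")


def tokenize(text):
    """Split text into lowercase words."""
    return [w.lower() for w in WORD_RE.findall(text)]


class LanguageGuesser:
    """Hypothetical naive Bayes classifier over word frequencies."""

    def __init__(self):
        self.word_counts = defaultdict(Counter)  # label -> word -> count
        self.doc_counts = Counter()              # label -> messages seen

    def train(self, label, text):
        """Would be called when the user sets a tag on a message."""
        self.word_counts[label].update(tokenize(text))
        self.doc_counts[label] += 1

    def classify(self, text):
        """Would be called for a new message; returns a label or None."""
        words = tokenize(text)
        vocab = set()
        for counts in self.word_counts.values():
            vocab.update(counts)
        total_docs = sum(self.doc_counts.values())

        best_label, best_score = None, float("-inf")
        for label, counts in self.word_counts.items():
            total_words = sum(counts.values())
            # Log prior: share of training messages carrying this label.
            score = math.log(self.doc_counts[label] / total_docs)
            for w in words:
                # Add-one smoothing so unseen words do not zero the score.
                score += math.log((counts[w] + 1) / (total_words + len(vocab)))
            if score > best_score:
                best_label, best_score = label, score
        return best_label
```

With a handful of messages trained per label, classify returns whichever label makes the words of the new message most likely. A real setup would also need to persist the counts between runs, which this sketch leaves out.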