Return-Path: X-Original-To: notmuch@notmuchmail.org Delivered-To: notmuch@notmuchmail.org Received: from localhost (localhost [127.0.0.1]) by olra.theworths.org (Postfix) with ESMTP id 5B449431FBF; Fri, 4 Dec 2009 10:01:49 -0800 (PST) X-Virus-Scanned: Debian amavisd-new at olra.theworths.org Received: from olra.theworths.org ([127.0.0.1]) by localhost (olra.theworths.org [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id LnvbFEz22dUk; Fri, 4 Dec 2009 10:01:48 -0800 (PST) Received: from yoom.home.cworth.org (localhost [127.0.0.1]) by olra.theworths.org (Postfix) with ESMTP id B1989431FAE; Fri, 4 Dec 2009 10:01:48 -0800 (PST) Received: by yoom.home.cworth.org (Postfix, from userid 1000) id 1AA192542FB; Fri, 4 Dec 2009 10:01:48 -0800 (PST) From: Carl Worth To: Aaron Ecay , notmuch@notmuchmail.org In-Reply-To: <4b18f807.47c2f10a.479e.1f0e@mx.google.com> References: <1259840063-sup-1478@sam.mediasupervision.de> <871vjbh98x.fsf@yoom.home.cworth.org> <4b18f807.47c2f10a.479e.1f0e@mx.google.com> Date: Fri, 04 Dec 2009 10:01:47 -0800 Message-ID: <87d42ufwj8.fsf@yoom.home.cworth.org> MIME-Version: 1.0 Content-Type: multipart/signed; boundary="=-=-="; micalg=pgp-sha1; protocol="application/pgp-signature" Subject: Re: [notmuch] Notmuch's search view sucks X-BeenThere: notmuch@notmuchmail.org X-Mailman-Version: 2.1.12 Precedence: list List-Id: "Use and development of the notmuch mail system." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Fri, 04 Dec 2009 18:01:49 -0000 --=-=-= Content-Transfer-Encoding: quoted-printable On Fri, 04 Dec 2009 06:52:38 -0500, Aaron Ecay wrote: > The same algorithm is implemented in C here: > http://www.mnogosearch.org/guesser/ >=20 > Licensed under the GPL and includes presets for ~50 languages. That indeed does look very interesting, (at least what I can get from google's cache of the website, as the server seems to be down just now). Oh, but I can just "apt-get source mnogosearch" and find src/mguesser.c and src/guesser.c at least. > A potential drawback is that it doesn't handle raw HTML very well, > according to the documentation. Shouldn't really be an issue. Notmuch will already want to de-tagify HTML before indexing anyway. =2DCarl --=-=-= Content-Type: application/pgp-signature -----BEGIN PGP SIGNATURE----- Version: GnuPG v1.4.10 (GNU/Linux) iD8DBQFLGU6L6JDdNq8qSWgRAhuTAJ4oCor2dBDIV7wsbnM2FJ3lb/VrzACfVwAt 4q4wIhnx5quZ58g5pJksOhM= =P796 -----END PGP SIGNATURE----- --=-=-=--