From: Serge Z
To: notmuch@notmuchmail.org
Subject: Searching through different charsets
Date: Wed, 22 Feb 2012 21:10:41 +0400
Message-ID: <20120222171041.11455.92079@localhost>
User-Agent: alot/0.21+
List-Id: "Use and development of the notmuch mail system."

Hello!

I've got the following problem: fetched emails can be in different
encodings, and a search term typed in one encoding (the system default)
does not match the same term in another encoding.

The solution, as I see it, could be to preprocess each incoming email to
"normalize" it and its encoding, so that the indexer handles emails in the
system encoding only. Could you please suggest something?

Another issue (less important, but still wanted) is searching through HTML
messages without matching HTML tags. This problem looks solvable with a
properly configured run-mailcap. Is there such a solution anywhere?

Thanks.
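
To make the two requests above concrete, here is a minimal sketch of the
preprocessing step, assuming Python's standard library is acceptable. It
decodes each text part using the charset the message declares, so the
result is a single Unicode string regardless of the original encoding, and
it drops HTML tags from text/html parts. The names normalized_text and
html_to_text are hypothetical, and nothing here is notmuch's own API or its
actual indexing behaviour; how the output would be handed to the indexer is
outside the sketch.

```python
from email import message_from_bytes, policy
from html.parser import HTMLParser


class _TextExtractor(HTMLParser):
    """Collect text nodes, skipping the contents of <script> and <style>."""

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self._chunks = []
        self._skip = 0  # nesting depth inside script/style elements

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip:
            self._skip -= 1

    def handle_data(self, data):
        if not self._skip:
            self._chunks.append(data)

    def text(self):
        # Join chunks with a space and collapse whitespace runs.
        # Caveat: a word split by inline markup (e.g. "<b>Hel</b>lo")
        # comes out as two words with this simple approach.
        return " ".join(" ".join(self._chunks).split())


def html_to_text(html):
    """Return the visible text of an HTML string, without tags."""
    parser = _TextExtractor()
    parser.feed(html)
    parser.close()
    return parser.text()


def normalized_text(raw_bytes):
    """Return all text/* parts of a raw message as one Unicode string."""
    msg = message_from_bytes(raw_bytes, policy=policy.default)
    parts = []
    for part in msg.walk():
        if part.is_multipart() or part.get_content_maintype() != "text":
            continue
        # get_content() undoes the transfer encoding and decodes the
        # bytes with the charset named in the part's Content-Type.
        content = part.get_content()
        if part.get_content_type() == "text/html":
            content = html_to_text(content)
        parts.append(content.strip())
    return "\n".join(parts)
```

A message whose Content-Type names a charset Python does not know would
raise LookupError in get_content(); a real filter would need to decide how
to handle that case.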