From: Jani Nikula Date: Mon, 23 Feb 2015 18:18:56 +0000 (+0200) Subject: Re: [PATCH 2/3] doc: add material on stemming and wildcards X-Git-Url: http://git.tremily.us/?a=commitdiff_plain;h=e6f8e02e60e1bdb04cbcce166b4d453c096a7bc9;p=notmuch-archives.git Re: [PATCH 2/3] doc: add material on stemming and wildcards --- diff --git a/3a/3dafd83b29ae61029a0f007c4e530286d7e66e b/3a/3dafd83b29ae61029a0f007c4e530286d7e66e new file mode 100644 index 000000000..09ee0f3c3 --- /dev/null +++ b/3a/3dafd83b29ae61029a0f007c4e530286d7e66e @@ -0,0 +1,125 @@ +Return-Path: +X-Original-To: notmuch@notmuchmail.org +Delivered-To: notmuch@notmuchmail.org +Received: from localhost (localhost [127.0.0.1]) + by olra.theworths.org (Postfix) with ESMTP id 60902431FDE + for ; Mon, 23 Feb 2015 10:26:58 -0800 (PST) +X-Virus-Scanned: Debian amavisd-new at olra.theworths.org +X-Spam-Flag: NO +X-Spam-Score: 1.738 +X-Spam-Level: * +X-Spam-Status: No, score=1.738 tagged_above=-999 required=5 + tests=[DNS_FROM_AHBL_RHSBL=2.438, RCVD_IN_DNSWL_LOW=-0.7] + autolearn=disabled +Received: from olra.theworths.org ([127.0.0.1]) + by localhost (olra.theworths.org [127.0.0.1]) (amavisd-new, port 10024) + with ESMTP id fUiwS2hpzH3b for ; + Mon, 23 Feb 2015 10:26:55 -0800 (PST) +Received: from mail-wg0-f52.google.com (mail-wg0-f52.google.com + [74.125.82.52]) (using TLSv1 with cipher RC4-SHA (128/128 bits)) (No client + certificate requested) by olra.theworths.org (Postfix) with ESMTPS id + 00293431FDA for ; Mon, 23 Feb 2015 10:26:54 -0800 + (PST) +Received: by wggx12 with SMTP id x12so450957wgg.11 + for ; Mon, 23 Feb 2015 10:18:40 -0800 (PST) +X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; + d=1e100.net; s=20130820; + h=x-gm-message-state:from:to:subject:in-reply-to:references + :user-agent:date:message-id:mime-version:content-type; + bh=HnX0CXs9d/+kMmMx1NokqzwdmsAGaM9Os7MFsUcWOJI=; + b=YDLPeVxecOWKKxlIDaFDw2XTX61KhNH4vxhxMZi6IolZNOzXl2ZANj7ev3a5fsOYV/ + dthcDG8o1Uc29TFN3n/1SmewN0bno5jS666xBYP3jAFzDN+HeQKWSk1vqxS+qQ4UxVwL + gE+/0KhF4zWgpNcp0r0Y1msnAquCn4dmjO4RyzRNvOzbDuAsVfkodKupN+m8AnRG7LDH + tJ5GM9a6jw+9+ctbMezEZ6YHvyylcJetbIc/7yMN58L3KwCwfiQJHG4uRigyWYQKpGGT + T1SE+Rpc1qqdMmAJzMmPbdr1b8pYK+4/l9nTB00wFPbuQ7WBlALSX6BLD66TOV/zsHjU + Nyuw== +X-Gm-Message-State: + ALoCoQm7g0Ft/4lctnoZNKGzEIsiQYQdYAMQZC78JlMt29dTOFrX0+TGTuf4HNGpo4xSG+xNCE8z +X-Received: by 10.180.78.3 with SMTP id x3mr23358077wiw.86.1424715519965; + Mon, 23 Feb 2015 10:18:39 -0800 (PST) +Received: from localhost (mobile-internet-bcee3b-76.dhcp.inet.fi. + [188.238.59.76]) + by mx.google.com with ESMTPSA id m4sm56659472wjb.25.2015.02.23.10.18.38 + (version=TLSv1.2 cipher=ECDHE-RSA-AES128-GCM-SHA256 bits=128/128); + Mon, 23 Feb 2015 10:18:39 -0800 (PST) +From: Jani Nikula +To: David Bremner , David Bremner , + notmuch@notmuchmail.org +Subject: Re: [PATCH 2/3] doc: add material on stemming and wildcards +In-Reply-To: <1421585971-29949-2-git-send-email-david@tethera.net> +References: <87h9vojsa0.fsf@nikula.org> + <1421585971-29949-1-git-send-email-david@tethera.net> + <1421585971-29949-2-git-send-email-david@tethera.net> +User-Agent: Notmuch/0.19+49~g2049205 (http://notmuchmail.org) Emacs/24.4.1 + (x86_64-pc-linux-gnu) +Date: Mon, 23 Feb 2015 20:18:56 +0200 +Message-ID: <8761as5w4v.fsf@nikula.org> +MIME-Version: 1.0 +Content-Type: text/plain +X-BeenThere: notmuch@notmuchmail.org +X-Mailman-Version: 2.1.13 +Precedence: list +List-Id: "Use and development of the notmuch mail system." + +List-Unsubscribe: , + +List-Archive: +List-Post: +List-Help: +List-Subscribe: , + +X-List-Received-Date: Mon, 23 Feb 2015 18:26:58 -0000 + +On Sun, 18 Jan 2015, David Bremner wrote: +> This is lightly massaged from the searching page on the wiki. +> --- +> doc/man7/notmuch-search-terms.rst | 28 ++++++++++++++++++++++++++++ +> 1 file changed, 28 insertions(+) +> +> diff --git a/doc/man7/notmuch-search-terms.rst b/doc/man7/notmuch-search-terms.rst +> index 52f6804..3b27053 100644 +> --- a/doc/man7/notmuch-search-terms.rst +> +++ b/doc/man7/notmuch-search-terms.rst +> @@ -129,6 +129,34 @@ operators, but will have to be protected from interpretation by the +> shell, (such as by putting quotation marks around any parenthesized +> expression). +> +> +Stemming +> +-------- +> + +> +**Stemming** in notmuch means is that these searches + +Unnecessary "is"? + +> + +> +:: +> + +> + notmuch search detailed +> + notmuch search details +> + notmuch search detail +> + +> +will all return identical results, because Xapian first "reduces" the +> +term to the common stem (here 'detail') and then performs the search. +> + +> +There are two ways to turn this off: a search for a capitalized word +> +will be performed unstemmed, so that one can search for "John" and not +> +get results for "Johnson"; phrase searches are also unstemmed (see +> +below for details). Stemming is currently only supported for +> +English. Words in other languages will be performed unstemmed unless + +Maybe end the sentence here, and drop this line: + +> +somebody teaches Xapian how to perform stemming for that language. +> + +> +Wildcards +> +--------- +> + +> +It is possible to use a trailing '\*' as a wildcard. A search for +> +'wildc\*' will match 'wildcard', 'wildcat', etc. +> + +> + +> Boolean and Probabilistic Prefixes +> ---------------------------------- +> +> -- +> 2.1.4