From: david@tethera.net
To: notmuch@notmuchmail.org
Cc: David Bremner
Subject: [Patch v8 07/18] unhex_and_quote: new function to quote hex-decoded queries
Date: Fri, 21 Dec 2012 09:08:16 -0400
Message-Id: <1356095307-22895-7-git-send-email-david@tethera.net>
In-Reply-To: <1356095307-22895-1-git-send-email-david@tethera.net>
References: <1356095307-22895-1-git-send-email-david@tethera.net>
X-Mailer: git-send-email 1.7.10.4
List-Id: "Use and development of the notmuch mail system."
X-List-Received-Date: Fri, 21 Dec 2012 13:08:41 -0000

From: David Bremner

Space delimited tokens are hex decoded and then quoted according to
Xapian rules. Prefixes and '*' are passed through unquoted, as is
anything that hex-decoding would not change.
---
 tag-util.c | 81 ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
 1 file changed, 81 insertions(+)

diff --git a/tag-util.c b/tag-util.c
index f89669a..46aab4e 100644
--- a/tag-util.c
+++ b/tag-util.c
@@ -56,6 +56,87 @@ illegal_tag (const char *tag, notmuch_bool_t remove)
     return NULL;
 }
 
+/* Input is a hex encoded string, presumed to be a query for Xapian.
+ *
+ * Space delimited tokens are decoded and quoted, with '*' and prefixes
+ * of the form "foo:" passed through unquoted.
+ */
+static tag_parse_status_t
+unhex_and_quote (void *ctx, char *encoded, const char *line_for_error,
+                 char **query_string)
+{
+    char *tok = encoded;
+    size_t tok_len = 0;
+    char *buf = NULL;
+    size_t buf_len = 0;
+    tag_parse_status_t ret = TAG_PARSE_SUCCESS;
+
+    *query_string = talloc_strdup (ctx, "");
+
+    while ((tok = strtok_len (tok + tok_len, " ", &tok_len)) != NULL) {
+
+        size_t prefix_len;
+        char delim = *(tok + tok_len);
+
+        *(tok + tok_len++) = '\0';
+
+        prefix_len = hex_invariant (tok, tok_len);
+
+        if ((strcmp (tok, "*") == 0) || prefix_len >= tok_len - 1) {
+
+            /* pass some things through without quoting or decoding.
+             * Note for '*' this is mandatory.
+             */
+
+            if (! (*query_string = talloc_asprintf_append_buffer (
+                       *query_string, "%s%c", tok, delim))) {
+
+                ret = line_error (TAG_PARSE_OUT_OF_MEMORY,
+                                  line_for_error, "aborting");
+                goto DONE;
+            }
+
+        } else {
+            /* potential prefix: one for ':', then something after */
+            if ((tok_len - prefix_len > 2) && *(tok + prefix_len) == ':') {
+                if (! (*query_string = talloc_strndup_append (*query_string,
+                                                              tok,
+                                                              prefix_len + 1))) {
+                    ret = line_error (TAG_PARSE_OUT_OF_MEMORY,
+                                      line_for_error, "aborting");
+                    goto DONE;
+                }
+                tok += prefix_len + 1;
+                tok_len -= prefix_len + 1;
+            }
+
+            if (hex_decode_inplace (tok) != HEX_SUCCESS) {
+                ret = line_error (TAG_PARSE_INVALID, line_for_error,
+                                  "hex decoding of token '%s' failed", tok);
+                goto DONE;
+            }
+
+            if (double_quote_str (ctx, tok, &buf, &buf_len)) {
+                ret = line_error (TAG_PARSE_OUT_OF_MEMORY,
+                                  line_for_error, "aborting");
+                goto DONE;
+            }
+
+            if (! (*query_string = talloc_asprintf_append_buffer (
+                       *query_string, "%s%c", buf, delim))) {
+                ret = line_error (TAG_PARSE_OUT_OF_MEMORY,
+                                  line_for_error, "aborting");
+                goto DONE;
+            }
+        }
+    }
+
+  DONE:
+    if (ret != TAG_PARSE_SUCCESS && *query_string)
+        talloc_free (*query_string);
+    return ret;
+}
+
 tag_parse_status_t
 parse_tag_line (void *ctx, char *line,
                 tag_op_flag_t flags,
-- 
1.7.10.4
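
For readers who want to try the token handling described in the commit
message outside of notmuch, here is a rough standalone sketch. It is not
the patch's code: every name in it (put_char, sketch_hex_decode,
sketch_unhex_and_quote) is hypothetical, it uses a fixed output buffer
instead of talloc, and it makes two simplifying assumptions: that "%xx"
is the hex escape form, and that only alphanumeric characters count as
unchanged by hex decoding (the real hex_invariant may accept a different
set). Quoting is done by wrapping the decoded text in double quotes and
doubling any embedded double quote.

```c
#include <ctype.h>
#include <stdlib.h>
#include <string.h>

/* Append one character, always leaving room for the terminating NUL. */
static int
put_char (char *out, size_t out_size, size_t *pos, char c)
{
    if (*pos + 1 >= out_size)
        return -1;
    out[(*pos)++] = c;
    out[*pos] = '\0';
    return 0;
}

/* Hypothetical helper: decode "%xx" escapes in place.
 * Returns 0 on success, -1 on a malformed escape. */
static int
sketch_hex_decode (char *s)
{
    char *dst = s;

    while (*s != '\0') {
        if (*s == '%') {
            char hex[3];

            if (! isxdigit ((unsigned char) s[1]) ||
                ! isxdigit ((unsigned char) s[2]))
                return -1;
            hex[0] = s[1];
            hex[1] = s[2];
            hex[2] = '\0';
            *dst++ = (char) strtol (hex, NULL, 16);
            s += 3;
        } else {
            *dst++ = *s++;
        }
    }
    *dst = '\0';
    return 0;
}

/* Sketch of the per-token logic: '*' and tokens that decoding would not
 * change (assumed here: purely alphanumeric) pass through; a leading
 * "foo:" prefix is copied unquoted; the rest is decoded and quoted.
 * Returns 0 on success, -1 on bad input or if out is too small. */
static int
sketch_unhex_and_quote (const char *encoded, char *out, size_t out_size)
{
    size_t pos = 0, i;
    int ret = -1;
    char *copy, *tok;

    if (out_size == 0)
        return -1;
    out[0] = '\0';

    copy = malloc (strlen (encoded) + 1);
    if (copy == NULL)
        return -1;
    strcpy (copy, encoded);

    for (tok = copy;; ) {
        size_t len = strcspn (tok, " ");
        char delim = tok[len];
        char *body = tok, *p;
        size_t run = 0;

        tok[len] = '\0';
        while (isalnum ((unsigned char) tok[run]))
            run++;

        if (strcmp (tok, "*") == 0 || run == len) {
            for (i = 0; i < len; i++)
                if (put_char (out, out_size, &pos, tok[i]))
                    goto DONE;
        } else {
            /* potential prefix: name, ':', then something after it */
            if (run > 0 && tok[run] == ':' && run + 1 < len) {
                for (i = 0; i <= run; i++)
                    if (put_char (out, out_size, &pos, tok[i]))
                        goto DONE;
                body = tok + run + 1;
            }
            if (sketch_hex_decode (body) != 0)
                goto DONE;
            if (put_char (out, out_size, &pos, '"'))
                goto DONE;
            for (p = body; *p != '\0'; p++) {
                if (*p == '"' && put_char (out, out_size, &pos, '"'))
                    goto DONE;
                if (put_char (out, out_size, &pos, *p))
                    goto DONE;
            }
            if (put_char (out, out_size, &pos, '"'))
                goto DONE;
        }

        if (delim == '\0')
            break;
        if (put_char (out, out_size, &pos, delim))
            goto DONE;
        tok += len + 1;
    }
    ret = 0;

  DONE:
    free (copy);
    return ret;
}
```

Under those assumptions, calling sketch_unhex_and_quote on the input
"tag:a%20b *" with a 64-byte buffer leaves tag:"a b" * in the buffer,
while a token with a malformed escape such as "a%zz" makes it return -1.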