Return-Path: X-Original-To: notmuch@notmuchmail.org Delivered-To: notmuch@notmuchmail.org Received: from localhost (localhost [127.0.0.1]) by olra.theworths.org (Postfix) with ESMTP id 86384429E20 for ; Sat, 29 Jan 2011 12:20:10 -0800 (PST) X-Virus-Scanned: Debian amavisd-new at olra.theworths.org X-Spam-Flag: NO X-Spam-Score: -0.09 X-Spam-Level: X-Spam-Status: No, score=-0.09 tagged_above=-999 required=5 tests=[DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, RCVD_IN_DNSWL_NONE=-0.0001, T_MIME_NO_TEXT=0.01] autolearn=disabled Received: from olra.theworths.org ([127.0.0.1]) by localhost (olra.theworths.org [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id ImrtHTDYy+W0 for ; Sat, 29 Jan 2011 12:20:10 -0800 (PST) Received: from homiemail-a17.g.dreamhost.com (caiajhbdcaid.dreamhost.com [208.97.132.83]) by olra.theworths.org (Postfix) with ESMTP id 01ABC431FB6 for ; Sat, 29 Jan 2011 12:20:09 -0800 (PST) Received: from homiemail-a17.g.dreamhost.com (localhost [127.0.0.1]) by homiemail-a17.g.dreamhost.com (Postfix) with ESMTP id 3E6D77A8063; Sat, 29 Jan 2011 12:20:08 -0800 (PST) DomainKey-Signature: a=rsa-sha1; c=nofws; d=SSpaeth.de; h=from:to:subject :in-reply-to:references:date:message-id:mime-version: content-type; q=dns; s=sspaeth.de; b=fn+nG5UbGGphsU8ldJgpT3uAmNk OsaU2nxbhUCdHNapPMNXZVeDcvFEdvUmD2bff4YI3aArwfsJebqHIEw3AGi2pihb AKP7Cxu9HelxEDosgeXd9ASYtpI98rgdi+uJPbPgu4WIj/UcNVc4epzOngV9FZZa IjHU9bIdL9zrw4aM= DKIM-Signature: v=1; a=rsa-sha1; c=relaxed; d=SSpaeth.de; h=from:to :subject:in-reply-to:references:date:message-id:mime-version: content-type; s=sspaeth.de; bh=4OzE+j+f8YcjuHCCAR3EerxBRvA=; b=N e4U4DMjDSld7s1O92R9CgXV9V7eKZ4OewruZ9PJ+SgY6pqzOGmtfGJVkBq97E0dL xqoHERVjG6sbSqCmybiULOqaZd39oI1nxxA0N670nECxpE4ZM+x1memPIAMV6/nO 0ceKYEnJD1ihUdDf86vA6lgQgbABSNgKGJMV7LE6UU= Received: from spaetzbook.sspaeth.de (unknown [84.55.218.22]) (Authenticated sender: fax@sspaeth.de) by homiemail-a17.g.dreamhost.com (Postfix) with ESMTPA id 751CC7A805C; Sat, 29 Jan 2011 12:20:06 -0800 (PST) Received: by spaetzbook.sspaeth.de (sSMTP sendmail emulation); Sat, 29 Jan 2011 21:20:02 +0100 From: Sebastian Spaeth To: Jesse Rosenthal , notmuch@notmuchmail.org Subject: Re: A tool for printing from notmuch In-Reply-To: References: <87r5bvmqgy.fsf@SSpaeth.de> User-Agent: Notmuch/0.5-37-g3863e88 (http://notmuchmail.org) Emacs/23.1.1 (x86_64-pc-linux-gnu) Date: Sat, 29 Jan 2011 21:20:02 +0100 Message-ID: <87ipx7mphp.fsf@SSpaeth.de> MIME-Version: 1.0 Content-Type: multipart/signed; boundary="=-=-="; micalg=pgp-sha1; protocol="application/pgp-signature" X-BeenThere: notmuch@notmuchmail.org X-Mailman-Version: 2.1.13 Precedence: list List-Id: "Use and development of the notmuch mail system." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Sat, 29 Jan 2011 20:20:10 -0000 --=-=-= On Sat, 29 Jan 2011 15:09:14 -0500, Jesse Rosenthal wrote: > So BS is the best I could find for this job No doubt. I once tried to scrape http://theeconomist.com. It has so broken html that all parsers broke down. BeautifulSoup at least made it through and didn't completely fail. so I agree it is the best thing for surely broken html email Sebastian --=-=-= Content-Type: application/pgp-signature -----BEGIN PGP SIGNATURE----- Version: GnuPG v1.4.10 (GNU/Linux) iEYEARECAAYFAk1EdnIACgkQVYX1jMgnoGLyFACfZSYIbZCRQpqXBwnYNYV5DSjy uckAn2TaQpa62P0X+v/SzNO/pK+OxxA+ =TMn1 -----END PGP SIGNATURE----- --=-=-=--