Packs (add-ons) for SWI-Prolog

Package "readability_parser"

Title: Readability Parser API
Latest version: 0.0.4
SHA1 sum: 93e2c909d5a323ccf8d78ba5c89e42c46a7fe6b4
Author: Michael Hendricks <michael@ndrix.org>
Maintainer: Michael Hendricks <michael@ndrix.org>
Packager: Michael Hendricks <michael@ndrix.org>
Home page: http://packs.ndrix.com/readability_parser/index.html
Download URL: http://packs.ndrix.com/readability_parser/readability_parser-0.0.4.tgz
Requires: uri_qq

Details by download location

Version  SHA1                                      #Downloads  URL
0.0.4    93e2c909d5a323ccf8d78ba5c89e42c46a7fe6b4  23          http://packs.ndrix.com/readability_parser/readability_parser-0.0.4.tgz
0.0.3    8545491c76c9dd9df8618a41c14f00f2592db8ae  3           http://packs.ndrix.com/readability_parser/readability_parser-0.0.3.tgz
0.0.2    00608b55d397fa7ab416c47af75bf5a8bf3e8b19  3           http://packs.ndrix.com/readability_parser/readability_parser-0.0.2.tgz
0.0.1    219a653bd9277c62f46c4f2276e1bcdc58d14d2c  3           http://packs.ndrix.com/readability_parser/readability_parser-0.0.1.tgz

Synopsis

:- use_module(library(readability_parser)).
?- build_agent("f861ea4...", Agent),
   parse(Agent, 'http://foo.com/article.html', Response).
Response = _{ author: "John Doe"
            , content: "A long time ago ..."
            , title: "A Fairy Tale"
            , word_count: 372
            ...
            }.

Description

Access Readability's parser API for extracting article content from an HTML page.
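
The response shown in the synopsis is an SWI-Prolog 7 dict, so its fields can be read with the standard get_dict/3. The following is a minimal sketch, not part of the pack: article_summary/2 is a hypothetical helper name, and it assumes only the keys that appear in the synopsis (title, author, word_count).

```prolog
%% article_summary(+Response:dict, -Summary:atom) is semidet.
%
%  Hypothetical helper (not provided by readability_parser).
%  Builds a one-line summary from a response dict. Fails if any of the
%  assumed keys (title, author, word_count) is missing.
article_summary(Response, Summary) :-
    get_dict(title, Response, Title),
    get_dict(author, Response, Author),
    get_dict(word_count, Response, Words),
    format(atom(Summary), "~w by ~w (~w words)", [Title, Author, Words]).
```

Given a Response obtained from parse/3 as in the synopsis, calling article_summary(Response, Summary) would bind Summary to an atom combining those three fields. Using get_dict/3 rather than functional dot notation means a missing key leads to plain failure instead of an exception.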

Changes in this Version

  • Work around Readability SSL weirdness

Installation

Using SWI-Prolog 7.1.5 or later:

?- pack_install(readability_parser).
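
The version requirement above can be checked before installing. This is a rough sketch under stated assumptions: swipl_supports_pack/0 and ensure_readability_parser/0 are hypothetical names, not part of the pack. It relies on the `version` Prolog flag, which SWI-Prolog encodes as 10000*Major + 100*Minor + Patch, and on pack_property/2 from library(prolog_pack).

```prolog
:- use_module(library(prolog_pack)).

%% swipl_supports_pack is semidet.
%
%  Hypothetical helper. True when the running SWI-Prolog is 7.1.5 or later.
swipl_supports_pack :-
    current_prolog_flag(version, V),   % 10000*Major + 100*Minor + Patch
    V >= 70105.

%% ensure_readability_parser is det.
%
%  Hypothetical helper. Installs the pack only when no version of it is
%  registered yet. pack_install/1 needs network access and may prompt.
ensure_readability_parser :-
    (   pack_property(readability_parser, version(_))
    ->  true
    ;   pack_install(readability_parser)
    ).
```

A typical use would be the query `?- swipl_supports_pack, ensure_readability_parser.`, which fails early on an older SWI-Prolog instead of attempting the install.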

This module uses semantic versioning.

Source code is available and pull requests are accepted at http://github.com/mndrix/readability_parser

author
- Michael Hendricks <michael@ndrix.org>
license
- BSD

Contents of pack "readability_parser"

Pack contains 4 files holding a total of 3.7K bytes.