Apache Any23 Dev Site Apache Software Foundation

Apache Any23 - Anything To Triples - Live Service Demo

Parses Microformats, RDFa, Microdata, RDF/XML, Turtle, N-Triples, JSON-LD, and N-Quads.

Download and install Any23: visit the Developers Site and the Documentation.



API quick reference


Compact API

HTTP GET requests can be made to IRIs of the shape

http://.../format/input-uri

The response is the input document converted to the desired output format.
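As a rough illustration of the URL shape above, the following sketch joins a service base, an output format identifier, and an input IRI. The base `http://any23.example/` is a hypothetical placeholder for the elided service address, `compact_url` is an illustrative helper, and appending the input IRI without percent-encoding is an assumption that may not hold for IRIs containing query strings.

```python
def compact_url(base_url: str, fmt: str, input_uri: str) -> str:
    """Build a compact API IRI of the shape <base>/<format>/<input-uri>.

    Assumption: the input IRI is appended as-is, without percent-encoding.
    """
    return f"{base_url.rstrip('/')}/{fmt}/{input_uri}"


if __name__ == "__main__":
    # "http://any23.example/" is a hypothetical placeholder, not a real endpoint.
    print(compact_url("http://any23.example/", "best", "http://example.com/doc.html"))
```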

Form-style GET API

HTTP GET requests can be made to the IRI http://.../ with the following query parameters:

uri: IRI of an input document.
format: Desired output format; defaults to best.
validation-mode: The validation level to be applied to the input. Possible values:
    none (no validation applied);
    validate (apply validation and produce a validation report if the annotate flag is enabled);
    validate+fix (apply validation, try to fix detection issues, and produce a validation report if the annotate flag is enabled).
annotate: If specified, the output RDF will contain extractor-specific scope comments. Possible values: on/off.
report: If specified, a full XML report is produced containing extraction and validation issues in addition to the produced metadata. Possible values: on/off.
openie: If specified, the Open Information Extraction (Open IE) system will be activated (default off). Possible values: on/off.

A request whose URL is formed as described above returns an HTTP 302 redirect to http://...any23/format.

The response is the input document converted to the desired output format.
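The query parameters above could be assembled along these lines. This is a minimal sketch: `form_get_url` is an illustrative helper and `http://any23.example/` is a hypothetical placeholder for the elided service address. Note that `urlencode` percent-encodes the `+` in `validate+fix`, which otherwise would be read as a space in a query string.

```python
from urllib.parse import urlencode


def form_get_url(base_url, uri, fmt="best", options=None):
    """Build a form-style GET URL with the documented query parameters.

    `options` may carry validation-mode, annotate, report and openie.
    """
    params = {"uri": uri, "format": fmt}
    params.update(options or {})
    return f"{base_url.rstrip('/')}/?{urlencode(params)}"


if __name__ == "__main__":
    # Hypothetical base URL; substitute the address of the actual service.
    print(form_get_url(
        "http://any23.example/",
        "http://example.com/",
        options={"validation-mode": "validate+fix", "annotate": "on"},
    ))
```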

Direct POST API

HTTP POSTing a document body to http://.../format will convert the document to the specified output format.
The media type of the input has to be specified in the Content-Type HTTP header. Depending on the servlet container, a Content-Length header specifying the length of the input document in bytes might also be required.
Typical media types for supported input formats are:

Input format    Media type
HTML            text/html
RDF/XML         application/rdf+xml
Turtle          text/turtle
N-Triples       text/nt
N-Quads         text/nq
TriX            application/trix

Example POST request:

POST /rdfxml HTTP/1.0
Host: example.com
Content-Type: text/turtle
Content-Length: 174

@prefix foaf: <http://xmlns.com/foaf/0.1/> .

[] a foaf:Person;
    foaf:name "John X. Foobar";
    foaf:mbox_sha1sum "cef817456278b70cee8e5a1611539ef9d928810e";
    .
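A request like the one above might be prepared with Python's standard library as follows. This is only a sketch: `build_direct_post` is an illustrative helper, `http://any23.example/` is a hypothetical placeholder for the elided service address, and the request is built but not sent. The Content-Length header is computed from the encoded body rather than hard-coded.

```python
from urllib.request import Request

# Turtle document from the example request above.
TURTLE_DOC = """@prefix foaf: <http://xmlns.com/foaf/0.1/> .

[] a foaf:Person;
    foaf:name "John X. Foobar";
    foaf:mbox_sha1sum "cef817456278b70cee8e5a1611539ef9d928810e";
    .
"""


def build_direct_post(base_url: str, fmt: str, body: str, media_type: str) -> Request:
    """Prepare a POST of `body` to <base>/<format> with explicit headers."""
    data = body.encode("utf-8")
    return Request(
        url=f"{base_url.rstrip('/')}/{fmt}",
        data=data,
        headers={
            "Content-Type": media_type,
            # Length in bytes of the encoded document.
            "Content-Length": str(len(data)),
        },
        method="POST",
    )


if __name__ == "__main__":
    # Hypothetical base URL; pass the request to urllib.request.urlopen to send it.
    req = build_direct_post("http://any23.example/", "rdfxml", TURTLE_DOC, "text/turtle")
    print(req.get_method(), req.full_url)
```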

Form-style POST API

A document body can also be converted by HTTP POSTing form data to http://.../.
The Content-Type HTTP header must be set to application/x-www-form-urlencoded. The following parameters are supported:

type: Media type of the input; see the table above. If not present, auto-detection will be attempted.
body: Document body to be converted.
format: Desired output format; defaults to best.
validation-mode: The validation level to be applied to the input. Possible values:
    none (no validation applied);
    validate (apply validation and produce a validation report if the annotate flag is enabled);
    validate+fix (apply validation, try to fix detection issues, and produce a validation report if the annotate flag is enabled).
annotate: If specified, the output RDF will contain extractor-specific scope comments. Possible values: on/off.
report: If specified, a full XML report is produced containing extraction and validation issues in addition to the produced metadata. Possible values: on/off.
openie: If specified, the Open Information Extraction (Open IE) system will be activated (default off). Possible values: on/off.
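The form fields above could be encoded roughly as shown here. This is a sketch under stated assumptions: `build_form_post` is an illustrative helper, `http://any23.example/` is a hypothetical placeholder for the elided service address, and the request is built but not sent. The `type` field is omitted when no media type is given, leaving detection to the service.

```python
from urllib.parse import urlencode
from urllib.request import Request


def build_form_post(base_url, body, fmt="best", media_type=None, options=None):
    """Prepare a form-style POST carrying the documented parameters."""
    fields = {"body": body, "format": fmt}
    if media_type is not None:
        fields["type"] = media_type
    # Optional parameters: validation-mode, annotate, report, openie.
    fields.update(options or {})
    return Request(
        url=base_url.rstrip("/") + "/",
        data=urlencode(fields).encode("ascii"),
        headers={"Content-Type": "application/x-www-form-urlencoded"},
        method="POST",
    )


if __name__ == "__main__":
    # Hypothetical base URL; pass the request to urllib.request.urlopen to send it.
    req = build_form_post(
        "http://any23.example/",
        "[] a <http://xmlns.com/foaf/0.1/Person> .",
        fmt="rdfxml",
        media_type="text/turtle",
        options={"report": "on"},
    )
    print(req.data.decode("ascii"))
```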

Output formats

Supported output format identifiers are:

Error reporting

Processing errors are indicated via HTTP status codes and brief text/plain error messages. The following status codes can be returned:

Code                          Reason
200 OK                        Success
400 Bad Request               Missing or malformed input parameter
404 Not Found                 Malformed request IRI
406 Not Acceptable            None of the media types specified in the Accept header are supported
415 Unsupported Media Type    Document body with unsupported media type was POSTed
501 Not Implemented           Extraction from input was successful, but yielded zero triples
502 Bad Gateway               Input document from a remote server could not be fetched or parsed
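A client might translate these status codes into readable messages with a small lookup, as in the sketch below. The names `STATUS_REASONS` and `describe_status` are illustrative, and the fallback message for unlisted codes is an assumption about how a client could choose to behave, not documented service behavior.

```python
# Reasons copied from the status code table above.
STATUS_REASONS = {
    200: "Success",
    400: "Missing or malformed input parameter",
    404: "Malformed request IRI",
    406: "None of the media types specified in the Accept header are supported",
    415: "Document body with unsupported media type was POSTed",
    501: "Extraction from input was successful, but yielded zero triples",
    502: "Input document from a remote server could not be fetched or parsed",
}


def describe_status(code: int) -> str:
    """Return a readable message for a status code returned by the service."""
    reason = STATUS_REASONS.get(code)
    if reason is None:
        # Assumption: codes outside the table are reported generically.
        return f"{code}: status code not listed in the table"
    return f"{code}: {reason}"


if __name__ == "__main__":
    for code in (200, 501, 503):
        print(describe_status(code))
```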

Report Format

The XML report format is subject to change. The current content is described in the section Any23 Service.


Apache Any23 v.2.3 (2021-02-02 16:55:59+0000)

Any23 project homepage | Hosted at Apache Software Foundation