Microformat Extraction | AYLIEN

Microformat Extraction

Extract Meaning from HTML

Microformats are simple conventions or entities that are used on web pages, to describe a specific type of information; Contact info, Reviews, Products, People, Events, etc.

Microformats are often included in the HTML of pages on the web to add semantic information about that page. They make it easier for machines and software to scan, process and understand webpages.

AYLIEN Microformat Extraction allows users to detect, parse and extract embedded Microformats when they are present on a page.


Microformat Extraction Example:

This endpoint can be used to extract Microformats from a webpage. As an example take the following webpage: http://tomorris.org

GET /microformats?url=http://tommorris.org/

Yields the following result:

{
  "hCards": [{
    "location": {
      "id": "4aef7ad915718d57463d6f3405c3e6e3c1a58d85",
      "latitude": "50.8212025",
      "longitude": "-0.1431826"
    },
    "photo": "/images/me.jpg",
    "email": "tom@tommorris.org",
    "fullName": "Tom Morris",
    "structuredName": {
      "id": "df28ebe7466663f0e03e93e0d70ed976654e6a6c",
      "givenName": "Tom",
      "familyName": "Morris"
    },
    "logo": "/images/me.jpg",
    "id": "a8854a8fd9bdd9add4108a39179810c661bc4232"
  }]
}