Article Extraction

Extract what matters.

While web pages hold a lot of valuable information, they are often crowded with links, advertisements, and other irrelevant texts. To pinpoint the important information on a web page can often take a bit of effort.

Article extraction helps to automatically remove navigation links, ads and more undesired content from a web page and extract what matters.

Focus on what matters and disregard what doesn’t. Remove all clutter and extract the main text and media from an article or URL.

Extract the main body of an article including embedded media such as links, images, videos etc. from any URL or Webpage.

Article Extraction Example:

Let’s say we want to analyze an image and assign appropriate tags automatically:

GET /extract?url=