Home Products
Kapow Web Data Server PDF Print E-mail

The Vanguard of Data Access for the enterprise and public web data

The Kapow Web Data Server is the vanguard and the only enterprise platform in the market that can wrap any existing website or web application into data feeds or programmatic interfaces (API’s) - all with no coding. Kapow Technologies partners with customers to ensure they learn from experts, reduce project risks, and lower implementation costs. “Kapow “robots” use standard web technology to automate the navigation and interaction with a web site or web application, providing access to the underlying data and business logic and support the existing and in-place security mechanisms such as SSL.”

Web Data Server

The Kapow Web Data Server excels above “web scrapers” and “web crawlers” by combining:

  • An intuitive point-and-click IDE with surgical data extraction accuracy
  • Full support for dynamic HTML e.g. JavaScript and AJAX
  • High resilience against changes on each targeted web page.

The Kapow Web Data Server includes three core components:

  1. Client IDE (Interactive Development Environment) Tool—RoboMaker is a cutting edge visual-scripting application that enables business analysts and developers to build, test and maintain robots with ease. All interaction is defined into visual process flow steps, i.e. the robot, which then executes to automate the interaction with a web site. Web site navigation and data extraction is easily accomplished using point-and-click.

  2. RoboServer—A highly scalable server that executes robots in a multi-threaded, multi-server environment for maximum throughput and performance.

  3. A simple system configuration application for project handling, proxy setting, license administration and more.

Access Unlimited Web Data with Precise Extraction:

Kapow Technologies’ server performance is significantly more scalable than “web scraping” products using FireFox, IE and/or Safari. Its unique, high performance, IE compliant “headless” HTML parser and JavaScript engine ensures that the Kapow Web Data Server can handle any interactive JavaScript and AJAX based web sites.

The Kapow Web Data Server provides real-time access to the underlying database behind a web site, including data tables, data structures and data types—all through the existing web interface with no need for direct database or API access. Any human web-browser interaction—including load web page, fill out form, click submit, scroll down page, copy-paste-content—can be instructed into a robot that can automate the interaction with a web page or entire web application to extract data or give programmatic access to its underlying business logic.

Since the Kapow Web Data Server wraps an application at the web presentation layer, it can also leverage the valuable business logic at the presentation layer, like data verification, user login, different presentation formats, etc.

Enrich Web Data through Custom Transformation Rules:

During the data extraction process, the Kapow Web Data Server can perform comprehensive transformation and “cleansing” logic to ensure noise-free data quality. This enriches the data completeness for higher quality data analysis and better results in the target application, beyond any other product on the market.

Kapow Technologies performs the transformation—or cleansing—of the data, e.g. converting dates on web pages into standard formats, removing or replacing specified HTML tags, and cleaning text and extra spaces from the content. The Kapow Web Data Server separates content and presentation data, removes and replaces HTML with semantic XML using powerful pattern matching capabilities, and coverts HTML to RSS compliant XML for syndication. It can even execute JavaScript code to expose the next level of rich data transformation.

Serve Noise Free Web Data to File, Database, or Service Formats:

Kapow Web Data Server – Standard Edition delivers output in standard formats like XML, CSV and RSS for easy consumption by any data and feed consuming product. Through the Database Module and API Module, outputs are delivered to an SQL database or SOAP and REST services.