Main menu
The Data Quality Market -
The data quality market for the calendar year 2010 was worth around $873 million, of which software sales and maintenance accounted for around $724 million. The overall figure includes the professional services arms of data quality vendors, but excludes the (substantial) revenues of systems integrators and consultancies involved with data quality initiatives. This represents 9% growth over 2009, reflecting some recovery in the economy. Financial services in particular has shown significant growth in 2010, presumably reflecting the increased focus on regulation in banking, and new regulatory initiatives such as Solvency II in insurance.
Perhaps the most significant reshaping of the industry over the last year has been the clearer distinction between companies offering a broad platform capability, of which data quality is an integral part, and the pure-
There is a growing interest in providing data quality capabilities in real time, as a "data quality firewall", often via web services, with data quality capabilities being called up from within other applications. Some vendors can provide data quality through the cloud rather than requiring on-
As companies expand globally and Asia's economies continue to develop, support for non-
Data volumes continue to grow, and more vendors have responded by re-
The industry has traditionally focused on customer name and address data, yet a continuing theme of our client research is how customer name and address is not the only priority for customers. Product data in particular is viewed as a major problem for enterprises, and is much more complex and less structured than name and address data. Companies such as Silver Creek (bought by Oracle), Datactics and Inquera specialize in this area, while some other vendors are data domain-
Data governance is rapidly entering the mainstream, and beginning to have an influence on the data quality industry. As master data management continues to grow, data governance initiatives have been set up to complement these MDM projects, and the scope of these initiatives usually includes data quality. Data quality vendors are therefore seeing demand to support data governance initiatives, and many have added some functionality in this area, such as support for data stewards. Increasingly the link between master data management and data quality in being recognised, with many MDM vendors using OEMs of data quality vendors to round out their offerings, or in some cases buying data quality vendors outright.
Data quality remains a fragmented market. Since many vendors specialize in name and address verification, companies with local knowledge have built up offerings that track people and companies that move address in order to avoid badly targeted mailing and ensure optimal bulk rates for mass mailing (such as Satori Software). Others have built up deep local knowledge of particular markets, such as Uniserv in continental Europe.
Significant enrichment of postal address data is possible via geocoding. This technique allows customers to not just check the postal code of an address, but to see it displayed on a map, and to display such things as its distance from nearby stores, the demographics of the area, its political constituency, or even whether the address lies within a flood plain. Pitney Bowes Business Insight (with its Mapinfo technology) is one vendor that offers particularly comprehensive enrichment capabilities, but many others are now doing so too.
One intriguing development was the entrance of Google into the market with its Google Refine desktop (open source) product in late 2010. Although existing vendors will doubtless deride it as lightweight, it has profiling, de-
The diagram that follows shows the major data quality vendors, displayed on three dimensions. See later for definitions of these. The largest vendors of data quality in terms of revenue are Experian QAS, SAP, Informatica, IBM, Trillium and DataFlux.
It is important to understand that this is a high-
As part of the landscape process, each vendor was asked to provide at least eight reference customers (some provided over 25 references) which were surveyed to determine their satisfaction with the data quality software of the vendor. The happiest customers based on this survey were those of X88, IBM, DataFlux, Talend, Informatica and Datactics, followed closely by the customers of Trillium, Active Prime, and HelpIT.
Main Vendors
Below is a list of the main data quality vendors.
Vendor |
Brief Description |
Website |
Address Doctor |
Vendor that specializes in providing wide coverage of name and address information; now owned by Informatica. |
|
Ataccama |
Prague- |
|
Active Prime |
California- |
|
Business Data Quality |
UK- |
|
Capscan |
London- |
|
Datactics |
UK- |
|
Datanomic |
Cambridge- |
|
DataFlux |
Part of SAS, one of the leading players in data quality. |
|
DataQualityFirst |
US start- |
|
Datiris |
Colorado vendor of data profiling technology. |
|
Datras |
Munich- |
|
DQ Global |
UK data quality and address verification software. |
|
Exeros |
California- |
|
Experian QAS |
UK- |
|
| The search engine giant now does data quality. |
||
HelpIT |
"UK/US- |
|
Human Inference |
Dutch data quality vendor. |
|
IBM |
Data quality software from the industry giant. |
|
Informatica |
California- |
|
Infogix |
Illinois- |
|
Infoglide |
US vendor specializing in identity resolution. |
|
Infoshare |
UK data quality specialising in the public sector market. |
|
Inquera |
Israeli company with innovative approach to product data quality using machine- |
|
Innovative Systems |
Long established Pittsburgh- |
|
Intelligent Search |
Identity management company now with a more general data quality capability. |
|
Melissa Data |
California- |
|
Netrics |
New Jersey vendor of impressively accurate matching software. Now owned by Tibco. |
|
Omikron |
German data quality vendor with strong Asian language capabilities. |
|
Pitney Bowes Business Insight |
The data quality vendor formerly known as Group 1 Software, part of Pitney Bowes Inc. |
|
Postcode Anywhere |
UK vendor of web- |
|
SAP |
The software giant is a major data quality player. |
|
Satori Software |
Seattle- |
|
Silver Creek Systems |
Colorado- |
|
Talend |
Paris- |
|
Trillium |
Part of Harte Hanks, one of the leading data quality vendors. |
|
Uniserv |
Large German data quality vendor. |
|
X88 |
Recent UK market entrant specializing in data discovery. |
Other vendors of data quality software include:
Ciant (www.ciant.com)
Research Methodology
The Information Difference Landscape diagram shows three dimensions of a vendor:
Market strength
Technology
Customer base.
"Market strength" is made up of a weighted set of five factors: revenues, growth, financial strength, geographic scope and partner network. Each of these individual elements is scored, the total producing the "market strength" figure. Similarly "technology" is made up of four factors: "technology breadth" (the coverage of the vendors in various data quality areas as illustrated below), the longevity of the software in the market, analyst perception of the product via briefings, and customer feedback from reference customers (this has a high weighting), which we surveyed. In each case the scoring is on a scale of 0 (worst) to 6 (best).
Vendors were asked to submit answers to various questions via a questionnaire. Vendors were interviewed directly by an analyst and their software demonstrated and assessed. Reference customers were surveyed to give their experience of the software of each vendor. The technology functions which the vendors were asked about are as shown below. These are drawn from the Information Difference vendor functionality model; if you are interested in more detail on this then please contact The Information Difference.
Functional Areas
