How We Build It

01 CONTENT ACQUISITION

We crawl millions of different sources every day, from PR news and company blogs to job openings.

02 CONTENT CATEGORIZATION

Using classification models we categorize useful content and identify meaningful texts.

03 INFORMATION EXTRACTION

Our proprietary models extract various entities (organizations, divisions, persons, financing types, products etc.) and relationships between them.

04 ENTITY RESOLUTION

Organization entities are linked to unique IDs (with domains) in our database for further manipulations.

05 NORMALIZATION & DEDUPLICATION PROCESS

Entities are normalized with a set of rule-based approaches and then sent to the deduplication system.

06 MANUAL APPROVAL

We employ multiple data analysts who monitor and verify data on a daily basis to ensure the data is of highest quality.

Example Datasets

Companies with new products in the last month

by emerging technology
Loading...
Loading...

Companies that expanded operations in the last month

by state

Companies that will change leadership in Q1 2018

by job function
Loading...

Interested how your organization can leverage these insights?