Loading...

Great, let's set up a demo!

We'd Love to Hear from You

F.A.Q.

  • What is PredictLeads and who does your data benefit?

    PredictLeads specializes in providing structured, actionable data on companies. Our datasets enable:

    • VCs and Private Equity firms: To track company growth, monitor portfolios, and identify promising investment opportunities.
    • Sales Enablement platforms: To enrich company profiles and deliver relevant sales triggers to their clients.
    • Job Boards: To expand listings by incorporating fresh job openings directly from company websites.
    • Sales and Marketing Teams: To identify new leads and write personalized outreach with relevant triggers.
    • Investment Firms: To perform due diligence and assess growth signals.
    • Market Intelligence Companies: To track sector trends, analyze competitor activity, and uncover new opportunities with accurate and actionable data.
    • Risk Mitigation: To find vendor/client relationships and identify where companies have production operations, helping assess multi-tier supply chain risks.
    • Competitive Intelligence platforms: Monitor competitor activity by tracking news mentions, hiring trends, and when they add new client logos on their website.
  • What datasets do you provide?

    We offer the following datasets:

    1. News Events: Tracks 29 categories, including product launches, funding rounds, partnerships, and C-level changes.
    2. Job Openings: Captures job listings directly from company career pages.
    3. Technologies: Identifies technologies companies use via HTML, JavaScript, DNS records and job descriptions.
    4. Key Customers: Reveals vendor/client/investor/integration relationships through image recognition on logos.
    5. Website Evolution: Monitors website content changes.

    All our datasets and their information are found here.

  • Do you have historical data?

    Yes. Here’s our historical coverage:

    • News Events: Since 2016; retroactively includes events referenced in later articles.
    • Job Openings: Since 2018.
    • Technologies: Since 2018.
    • Key Customers: Since 2019.
    • Website Evolution: Since 2019.
  • Is your data point-in-time?

    Yes. All our data includes “first_seen_at” and “last_seen_at” attributes. This allows you to track when a data point was first identified and how recently was observed providing a clear temporal context for every signal we deliver.

  • What are your accuracy rates?

    Our accuracy rate is above 95% across all datasets. PredictLeads ensures this high standard by sourcing data directly from primary public sources such as company websites, press releases, and companies job subpages. Additionally, our dedicated Quality Assurance team reviews the data regularly, and our systems are designed to minimize duplication and errors.

  • How often is your data updated?

    Our refresh rate ranges from twice every day to once every two weeks, depending on a website’s activity level. For job openings, open listings are refreshed every 36 hours.

  • How can your data be accessed?

    We offer multiple delivery options:

    1. Flat Files: Delivered in JSON format, on a daily, weekly, or monthly basis.
    2. API: Real-time access to data via endpoints.
    3. Webhooks: Automatically push updates such as jobs or news events to your system in real time.
  • What are your sources?

    PredictLeads gathers data exclusively from publicly available sources such as company websites, press releases, news articles, and blogs. We respect robots.txt files and avoid gated content.

  • What technologies do you track?

    We monitor technologies across various categories, including CRM, marketing, analytics, payment systems, and more. More information about the technology dataset can be found here.

  • Do you cover non-English sites?

    Our coverage is global, but it is slightly biased toward websites that have an English version.

    For job openings, we actively crawl listings in French, German, Spanish, Portuguese, Italian, Dutch, Swedish, Danish, and other languages as well, ensuring coverage across multilingual job postings.

  • How do you ensure data quality?

    Our quality assurance measures include:

    • Daily QA Reviews: PredictLeads Quality Assurance team checks on a daily basis some few hundred records for each of the datasets to ensure ongoing quality.
    • Continuous System Monitoring: PredictLeads uses systems to track crawling performance and data freshness. Designated developers are notified if something is amiss.
    • Status Page: PredictLeads tracks availability of the datasets via APIs also through our status page: https://status.predictleads.com/.
    • Anomaly Detection System is being used to detect potential anomalies in the data that are then reviewed by PredictLeads Quality Assurance team.
    • De-duplication: for each dataset the de-duplication logic is implemented to avoid repeat records and maintain signal clarity.
  • How many companies do you track?

    PredictLeads currently tracks over 92M companies worldwide.

    Updated: May 2025.

  • Are you GDPR and CPPA compliant?

    Yes, we are GDPR and CPPA compliant and we ensure that we do not collect any personal identifiable information (PII). We strictly crawl public data and respect all applicable regulations.

  • How does your dataset compare with competitors?

    News Events Dataset

    Tracks 29 distinct event categories with high granularity and a strong signal-to-noise ratio. These categories focus on relevant news, like product launches and funding rounds, while filtering out generic PR content to minimize noise.

    Job Openings Dataset

    Extracts job openings directly from company websites, unlike most competitors sourcing from aggregators like Indeed or Monster. Because companies tend to keep their own career pages most up to date, this approach provides fresh, accurate, and de-duplicated data, enhancing reliability and reducing noise.

    Technologies Dataset

    Tracks which technologies companies are using via HTML, JavaScript, DNS records and job descriptions, capturing both publicly visible and subtle "behind the firewall" technology signals.

    Key Customers Dataset

    Identifies 200+ million company relationships like partners, clients, vendors, and investors through image recognition on logos.