Module 06

Revenue Data Products

Corporate revenue split data for 10,300+ public companies across 47 markets. AI-extracted from financial filings, NAICS-mapped to 6-digit precision, continuously updated.

~10,300
Public companies covered globally
~99%
Global equity by market capitalisation
25,806
Financial documents processed
47
Markets (23 developed + 24 emerging)
The Data Gap

Segment labels ≠ industry exposure.

Companies report revenues in ways that suit their own narrative — geographic breakdowns, brand segments, customer type. None of this tells an investor what industry the revenue actually comes from. alphaX extracts the underlying economic activity and maps it to NAICS industry codes, regardless of how the company chose to report.

Our Methodology

AI extraction. Human-quality output.

Multilingual PDFs, tables, and narrative disclosures are processed by an AI agent trained specifically on financial report structures. Extracted segments are validated, normalised, and mapped to 6-digit NAICS codes. Annual, quarterly, and half-year cadences are supported.

Processing Pipeline
01
Financial Reports
Annual · Quarterly · Half-year
02
Pre-processing
Multilingual PDF · Table extraction
03
AI Extraction
Segment names · Values · %
04
Revenue Splits
Validated & normalised
05
NAICS Mapping
6-digit precision · QC reviewed
Case Studies
Case Study 1
Evolution Mining Ltd — Product-based reporting

Evolution Mining reports revenues by metal type: gold, silver, copper. alphaX maps each metal to distinct NAICS codes — gold & silver to NAICS 212220 (primary activity), copper to NAICS 212230. This reveals commodity-level industry exposure for investors.

Insight: Geographic labels ≠ industry exposure. Product labels reveal commodity-level risk that balance sheets obscure.
Case Study 2
Airbnb, Inc — Geography-based reporting

Airbnb reports geographically — North America, EMEA, APAC — but all regions represent identical economic activity. alphaX's AI identifies the underlying business (traveler accommodation) and maps 100% of revenue to NAICS 721199.

Insight: We don't regurgitate segment labels. We classify the underlying economic activity to reveal true industry exposure.

Underlying Activity Classification

We classify the economic activity behind every revenue line — not just the label a company chose to use. The result is cross-company comparability that company-reported data cannot provide.

6-Digit NAICS Precision

Industry classification at the maximum available granularity — enabling precise sector exposure analysis, peer comparison, and regulatory reporting.

Continuous Updates

New filings are processed as they are released. The dataset reflects the most recent published financial information for all 10,300+ covered companies.

Explore All Modules