From wafer fab to finished device, the chip industry spans dozens of partners that never share a schema. It is the Bronze-Silver-Gold problem at planetary scale.
The chip shortage was not only a capacity problem. It was a visibility problem — nobody could see the true state of demand and supply across a chain that spans a dozen companies and not one shared schema.
A single modern chip can pass through more than a thousand process steps and cross multiple continents before it reaches a finished device. A design house hands a layout to a foundry. The foundry runs hundreds of fab steps and generates terabytes of sensor and yield data. Wafers move to an OSAT partner for assembly and test. Finished parts flow through distributors, contract manufacturers, and finally the OEM. Every one of those hand-offs is a company boundary — and a data boundary. None of them share a schema.
Integrating that chain into a coherent, real-time picture of supply and demand is one of the hardest data engineering problems in any industry. The data is heterogeneous in format, inconsistent in update frequency, and incomplete in ways that are not random: the parts of the chain with the least visibility are often the ones that matter most when something breaks. It is the Bronze-Silver-Gold problem at its most extreme.
Fab process tooling emits high-frequency sensor telemetry in proprietary equipment formats. Manufacturing execution systems record lot genealogy and yield in schemas that differ by site. Suppliers report lead times over EDI feeds with varying levels of integration and reliability. Distributors share point-of-sale and inventory positions in spreadsheets and portals. Demand signals from OEMs arrive as forecasts that are revised constantly and, during a crunch, are quietly inflated. Each source is internally sensible and mutually unintelligible.
The Bronze layer is the raw collection of all of it: equipment logs, MES extracts, supplier EDI, distributor reports, and OEM forecasts, landed as-is. To anyone who has not built the pipelines, it does not look like one dataset at all.
The Silver layer is where the value is created: lot and part identifiers are reconciled across systems, units and time zones are harmonised, yield and test results are normalised to a common event schema, and every record carries lineage back to its source. This is the unglamorous engineering that most organisations underfund — and then wonder why their analytics disagree.
The Gold layer is purpose-built for decisions: a capacity-and-allocation model for planners, a yield-analytics surface for process engineers, a demand-signal view that strips out double-ordering, and a supplier-risk index for procurement. Each is designed around the specific decision it supports and the person who makes it.
When the 2021–2023 shortage hit, the most damaging dynamic was not raw capacity — it was the bullwhip effect amplified by blindness. Buyers, unable to see true allocation, placed duplicate orders across multiple suppliers to hedge. Suppliers, reading inflated demand, planned against signals that were partly phantom. No single party could see the real state of the chain, because the data that would have revealed it was trapped in incompatible systems at every boundary. The companies that navigated it best were the ones that had already invested in stitching their supply data together.
Every high-value AI use case in semiconductors — yield prediction, predictive maintenance on tooling, demand forecasting, automated defect classification — depends on the same foundation: governed, lineage-tracked, harmonised data. Models do not fail at the algorithm layer. They fail because the yield data is defined three ways, the equipment telemetry is unlabelled, and the demand history is contaminated by orders that were never real.
The semiconductor industry builds the most precise machines humanity has ever made, and still struggles to answer a simple question across its own supply chain: what do we actually have, and what do we actually need? The bottleneck is rarely the silicon. It is the data layer that sits between a dozen partners who each speak their own dialect. The firms that win the next cycle will be the ones that treat their data supply chain with the same engineering rigour they apply to a 3-nanometre process — explicit lineage, honest uncertainty, and a Gold layer built for the decisions that matter. That is the foundation AI needs. Everything else is a demo.