Sometime in the next month, an engineer at an aerospace company will open a blank Excel spreadsheet, type "Criterion" in cell A1, and begin building a Pugh matrix from scratch. They will manually list evaluation criteria. They will manually assign weights based on a conversation they had in a conference room that nobody documented. They will manually score each design alternative using a scale they invented that morning. They will arrive at a winner, copy the table into PowerPoint, present it at a design review, and save the file to a SharePoint folder where it will never be opened again.
Eighteen months later, a new engineer will join the program and ask why the team chose concept B over concept A. Nobody will remember. The spreadsheet, if found, will contain numbers without context — weights without justification, scores without rationale, a conclusion without reasoning. The trade study will have produced a decision but destroyed the knowledge that supported it.
This is not a failure of process. The engineers followed the methodology correctly. It is a failure of tooling — and it has been the same failure, unchanged, for forty-five years.
Good Frameworks, Terrible Tools
The engineering profession has excellent decision-making frameworks. The Pugh matrix, published by Stuart Pugh in 1981, provides a structured method for comparing design alternatives against a reference concept using qualitative scoring. The Analytic Hierarchy Process (AHP), developed by Thomas Saaty in 1980, decomposes complex decisions into pairwise comparisons and derives mathematically consistent priority weights. Weighted scoring matrices, Kepner-Tregoe analysis, Quality Function Deployment — each of these methods has been refined over decades and taught in every engineering program.
The tools for executing these methods have not been refined at all. They are spreadsheets. In 1981, when Pugh published his method, the state-of-the-art tool for running a Pugh matrix was a table drawn on paper. In 2026, the state-of-the-art tool for running a Pugh matrix is a table drawn in Excel. The methodology matured. The tooling froze.
| Criterion | Wt. | Angular Contact (Reference) | Ceramic Hybrid | Air Bearing |
|---|---|---|---|---|
| Speed Rating | 5 | 0 | +1 | +2 |
| Load Capacity | 4 | 0 | 0 | −2 |
| Contamination Resistance | 5 | 0 | +1 | +2 |
| Unit Cost | 3 | 0 | −1 | −2 |
| Lead Time | 2 | 0 | −1 | −2 |
| Maintenance Interval | 3 | 0 | +1 | +1 |
| Weighted Total | 0 | +5 | +2 |
The matrix above looks decisive. Ceramic hybrid wins. But the decision is only as good as the inputs, and every input is vulnerable. Who decided the weights? A conversation in a meeting that wasn't recorded. Why isn't "vibration damping" a criterion? Nobody thought of it — or someone did and was overruled, and neither fact was documented. What happens if the cost weight increases from 3 to 5? Nobody ran the sensitivity analysis. What specification data supports the +1 contamination score for ceramic hybrid? It came from a datasheet someone read last week and paraphrased from memory.
Every one of these failure modes is a tooling problem, not a methodology problem. Pugh's method explicitly calls for systematic criteria identification, rigorous scoring against specification data, and iterative refinement. The spreadsheet doesn't enforce any of this. It accepts whatever you type and calls it a trade study.
Zero AI Trade Study Tools Exist
This is not an exaggeration. Across the entire landscape of AI-powered engineering tools — enterprise platforms, startup products, open-source projects, research prototypes — not a single purpose-built AI trade study tool exists. No product automates Pugh matrices. No product assists with AHP pairwise comparisons. No product runs sensitivity analysis on criterion weights. No product generates documented decision rationale from trade study results.
Caltech's AI-Assisted Model-Based Systems Engineering workshop teaches students to run AI-assisted trade studies — using ad-hoc ChatGPT workflows combined with Cameo, manually orchestrated by the engineer. The demand is clear enough that a premier engineering school built a course around it. But there is no product. The workflow is duct tape.
Consider what an AI-assisted trade study could actually do. An engineer describes the decision context: "I need to select a bearing type for a high-speed spindle application — 15,000 RPM, 500N radial load, clean room environment." An intelligent tool could suggest evaluation criteria the engineer might have missed based on the application domain — vibration damping, thermal expansion coefficient, outgassing rate for clean room compatibility. It could pull specification data from manufacturer datasheets to populate performance ratings with traceable sources rather than paraphrased memory. It could run Monte Carlo sensitivity analysis across the weights to show whether the decision is robust or whether a small change in priorities flips the winner. And it could generate a natural-language decision rationale suitable for a design review — not a table of numbers, but a documented argument.
None of this is technically difficult. Weighted scoring and AHP eigenvector calculations are undergraduate math. Monte Carlo sampling is a few lines of Python. The AI components — criteria suggestion, data extraction, rationale generation — are exactly what LLMs do well. The missing ingredient is not technology. It's someone building the product.
Trade Studies Are Disconnected From the Model
Even if a better trade study tool existed, it would solve only half the problem if it operated in isolation. The real pathology is that trade studies exist outside the system model. An engineer selects a bearing type in a spreadsheet, then manually creates a node in the architecture model, then manually writes requirements that reflect the choice, then manually updates the interface definitions. The decision and the model are connected only through the engineer's memory and discipline.
When the trade study is disconnected from the model, three things break. First, traceability dies — there is no link between "we chose ceramic hybrid bearings" and the system model node that represents the bearing assembly. You cannot navigate from the model element to the decision that created it. Second, revisitation is blind — when requirements change eighteen months later and someone asks whether the bearing decision should be reconsidered, they have to find the original spreadsheet, understand the context, re-evaluate the criteria, and check whether the runner-up alternatives are still viable. If the trade study lived in the model, all of this would be immediate. Third, the pruned alternatives vanish — the concepts that were evaluated and rejected disappear entirely, along with the reasoning for rejection. Future engineers don't just lack the answer to "why did we pick B?" — they lack the answer to "what else was considered?"
What a Model-Native Trade Study Looks Like
Imagine the trade study not as a spreadsheet that exists alongside the system model, but as a structured operation within it. The engineer identifies a node — "Bearing Assembly" — and says "run a trade study." The tool already knows the parent subsystem, the sibling components, the interface requirements, the performance parameters. It suggests evaluation criteria derived from the system context and the application domain. The engineer adjusts, adds, removes. The tool helps populate scores from specification data. The engineer validates, overrides where judgment matters. The weights are assigned through explicit pairwise comparison rather than arbitrary numbering, and the tool checks them for consistency.
The result is not a spreadsheet. It is a decision record attached to the model node, containing the full set of alternatives (including the rejected ones, preserved as pruned branches), the evaluation criteria with their justifications, the scoring with traceable sources, the sensitivity analysis showing decision robustness, and a generated rationale narrative suitable for design review documentation. When someone asks "why ceramic hybrid?" eighteen months from now, the answer is one click away — not buried in a SharePoint folder, not lost in an email thread, not dependent on the memory of an engineer who has moved on.
The technical requirements for building this are not exotic. The decision-analysis math is well-understood. The AI components — criteria suggestion, specification extraction, rationale generation — are standard LLM capabilities. The data model requirement is the interesting one: the trade study must be a governed operation that produces a reviewable result, not a freeform document that exists outside the engineering workflow. The decision must be proposed, reviewed, and accepted through the same governance process as any other model change.
Stuart Pugh gave engineers a rigorous methodology in 1981. Thomas Saaty gave engineers a mathematically consistent weighting method in 1980. What nobody has given engineers — in forty-five years — is a tool that implements these methods with the intelligence, traceability, and governance that the decisions they support actually deserve. The spreadsheet is not a tool for engineering decisions. It's the absence of one.