By: FlySafe Research
The reliability of airline operational data has long been a quiet concern across the aviation industry. A 2024 audit by the U.S. Department of Transportation's Office of Inspector General found a 24.6% "no-match" rate between FAA and Bureau of Transportation Statistics (BTS) delay records — meaning nearly one in four delayed or canceled flights attributed to the National Airspace System could not be reconciled across the two primary federal data sources. For any organization conducting flight risk analysis, this discrepancy represents a foundational challenge. FlySafe analysis shows that the arrival of improved data pipelines, cloud-based analytics platforms, and machine learning ensembles is beginning to close these gaps — and the implications for airspace safety assessment are significant.
The Data Reliability Problem
Understanding flight risk begins with understanding the data. According to the DOT Inspector General's audit, six major airlines use three entirely different methodologies to report flight delay causes to BTS. Two carriers report multiple causes per delay event. Three report only the "predominant cause by longest duration." One reports the "predominant cause by root cause." The audit sampled six of the seventeen airlines reporting to BTS for calendar year 2022 and identified systemic inconsistencies.
In that year, 45.3% of delayed and canceled flights were attributed to the National Airspace System, 42.4% to airline-specific factors, and 8.4% to extreme weather. Yet FAA officials acknowledged that methodological differences between their air traffic control data and carrier-reported BTS data account for a substantial portion of the discrepancy. As the audit noted, BTS data originates from carriers while FAA data originates from air traffic control — two fundamentally different vantage points on the same events.
For aviation risk intelligence, this is not a minor technical footnote. When delay causation data is inconsistent at the federal level, downstream risk models inherit that noise. Any organization relying solely on one data stream without cross-referencing the other operates with an incomplete picture.
What "Better Data" Actually Looks Like
The improvement in airline data is not a single event but a convergence of several developments across the industry.
Cloud-Based Integration of Siloed Data Sets
Traditional airline data — schedules, flight status, traffic volumes, and fares — has historically been, as OAG describes, "siloed, cumbersome and difficult to manipulate," leading to manual effort and inconsistencies. The migration to cloud-based analytics environments is changing this. By bringing previously isolated data sets into a unified infrastructure, airlines and third-party analysts can now cross-reference operational performance against weather data, airspace restrictions, and booking patterns in near-real time.
OAG's analytics platform, for instance, now ingests over 4 billion airfares daily and maintains historical flight schedule data spanning more than 20 years across over 5,000 airports and 190 regions. Edinburgh Airport has stated that "the data provided by OAG is extremely accurate and is so up to date that we can see what's happening week to week in the market." This granularity — weekly visibility into market shifts — was not widely available even five years ago.
Machine Learning Applied to Operational Disruption
American Airlines developed a machine learning program called the Hub Efficiency Analytics Tool (HEAT), designed to minimize weather-related flight delays. As reported by AltexSoft, HEAT analyzes weather conditions, customer connections, air traffic density, and gate availability to dynamically adjust departure and arrival times. The tool has demonstrably decreased flight cancellations during adverse weather events.
This is representative of a broader shift: airlines are moving from reactive delay reporting to predictive disruption management. The data feeding these systems is richer, more granular, and updated more frequently than the legacy reporting pipelines that produced the inconsistencies flagged in the DOT audit.
Demand and Booking Analytics
On the commercial side, platforms such as Amadeus Booking Data Analytics now provide MIDT booking curves that track carrier bookings up to the day of departure, combined with public fares information down to the booking class and flight number level. This data, while primarily commercial in purpose, has direct relevance to risk analysis: sudden shifts in booking patterns on specific routes can serve as early indicators of operational stress, capacity constraints, or emerging airspace disruptions.
OAG's Demand Analytics similarly reveals how passengers move across origins, connections, and destinations — including point-of-sale data — to identify market shifts that may correlate with route-level risk factors.
Why This Matters for Flight Risk Assessment
Airspace status assessments depend on the quality and timeliness of underlying data. When a risk intelligence service evaluates whether a particular FIR or route corridor presents elevated risk, it draws on multiple data layers: NOTAM publications, airline schedule changes, historical disruption patterns, and commodity market indicators that correlate with operational disruptions in certain regions.
The improvements described above matter for several concrete reasons.
Cross-Validation Becomes Possible
The 24.6% no-match rate between FAA and BTS data illustrates what happens when data sources cannot be reconciled. Better data infrastructure — cloud-hosted, API-accessible, and updated in near-real time — allows risk analysts to cross-validate claims from one source against another. If BTS data attributes a delay to the National Airspace System but FAA data shows no corresponding restriction, that discrepancy itself becomes a data point worth investigating.
FlySafe's approach incorporates multiple independent data streams precisely because no single source is fully reliable. The arrival of more granular, more frequently updated airline data strengthens this cross-validation process.
Affected Routes Can Be Identified Earlier
Dynamic flight data analytics — the kind that adjusts for weather, air traffic, and gate availability in real time — enables earlier identification of route-level stress. Based on publicly available NOTAMs and schedule data, it is now possible to detect when airlines begin rerouting traffic away from specific airspace before formal advisories are published. Airlines have rerouted operations in numerous cases where schedule analytics revealed the pattern days before official communications confirmed the change.
Affected routes that show sudden schedule reductions, equipment downgrades, or booking curve anomalies warrant closer examination. These signals, drawn from improved data feeds, complement traditional NOTAM-based monitoring.
Historical Depth Improves Pattern Recognition
With platforms now offering over 20 years of historical schedule data and billions of daily fare data points, the baseline for "normal" operations on any given route is better defined than ever. Machine learning ensemble models trained on this depth of historical data analysis can distinguish between routine seasonal variation and genuinely anomalous operational changes.
This historical depth is particularly valuable for assessing airspace that has experienced intermittent closures or restrictions. Patterns in how airlines adjust schedules before, during, and after periods of elevated risk provide a significant predictive factor in determining current and near-term risk levels.
The Passenger Impact Dimension
Better data also illuminates the downstream consequences of operational disruptions. According to a 2026 survey cited by Newsweek, 57% of U.S. travelers reported experiencing a flight delay of more than two hours in the past twelve months, and 14% reported a cancellation. Among those delayed, 27% arrived two to three hours late, 13% arrived four to five hours late, and 9% arrived more than eight hours after their scheduled time. The study further found that 28% of passengers reported health or well-being impacts from disruptions.
These figures underscore why accurate, timely airline data matters beyond the operational level. Passengers, corporate travel departments, and insurers all benefit when risk analysis is grounded in reliable data rather than inconsistent federal reporting.
The Air Travel Consumer Report for 2026, published monthly by the Department of Transportation, now covers flight delays, mishandled baggage, wheelchairs and scooters, oversales, consumer complaints, and reports of animal incidents during air transport — compiled jointly by BTS and the Office of Aviation Consumer Protection. The expanding scope of this report reflects a broader institutional recognition that comprehensive data collection is overdue.
What Has Not Changed
It should be noted that structural challenges remain. The DOT's Airline Quarterly Financial Review, which derives its data from Form 41 Schedules reported by Large Certificated Air Carriers, still cautions that a carrier can appear in both Major and National group reports, and that data from these reports "should not be combined without ensuring any duplications are removed." The raw data, while publicly accessible through a DOT portal, still requires careful handling to avoid double-counting.
The three different delay reporting methodologies used by major carriers have not been standardized. Until a uniform methodology is mandated, discrepancies between carriers and between carrier data and FAA data will persist. Improved analytics platforms can mitigate the impact of these inconsistencies through cross-validation and anomaly detection, but they cannot eliminate the underlying data quality problem at the source.
Recommendation
For aviation professionals, corporate travel managers, and risk analysts, the practical takeaway is straightforward: reliance on a single data source for flight risk assessment is no longer defensible. The tools and platforms now available — from cloud-based schedule analytics to machine learning ensemble models that integrate weather, traffic, and booking data — make multi-source validation both feasible and expected.
FlySafe analysis shows that the convergence of improved airline data, expanded federal reporting, and advanced analytics platforms represents a meaningful step forward for airspace risk assessment. The data is not perfect. Federal reporting methodologies remain inconsistent. But the gap between what is available and what is needed to conduct rigorous, evidence-based flight risk analysis has narrowed considerably.
Organizations that update their data pipelines and analytical frameworks to incorporate these improvements will be better positioned to identify affected routes, assess airspace status changes, and provide timely guidance to stakeholders.
Analysis based on publicly available data only.
Frequently Asked Questions
How do I access Airline On-Time Statistics?
The Bureau of Transportation Statistics (BTS) publishes on-time performance data through its public data portal, which allows users to access and extract data sets for multiple time periods using user-specified criteria. The Air Travel Consumer Report, issued monthly by the DOT, provides a consolidated summary including delay statistics, baggage handling, and consumer complaints.
Does it make sense that if I want to look at June, I also need to look at April and May?
Yes. Airline schedule and performance data is best understood in context. Booking curves, seasonal patterns, and schedule changes typically emerge weeks before the travel period in question. Analyzing the preceding months reveals trends — such as capacity reductions or equipment changes — that provide essential context for assessing conditions in the target month.
- Nearly one in four flight delays attributed to the National Airspace System could not be reconciled between the FAA and BTS in 2024, revealing a foundational data reliability problem for risk models.
- Airlines use three entirely different methodologies to report delay causes (e.g., 'predominant cause by longest duration' vs. 'root cause'), creating systemic inconsistencies in the federal data.
- The migration to cloud-based analytics is breaking down data silos, allowing for the integration of previously isolated data sets like schedules, flight status, and traffic volumes into a unified infrastructure.
Powered by B1KEY
Live tools behind the analysis.
The signals FlySafe writes about are also published live — continuously verified by the Sentinel pipeline.
Information is accurate as of the publication date. FlySafe uses exclusively publicly available data.