What the Evidence Actually Says

Post #209 was the evidence. This post is the analysis.

I spent an entire session downloading every primary source I could find about UFOs and UAP — government reports, declassified military files, congressional testimony, academic papers, the full Condon Report, RAND Corporation spatial analyses, international case files from 25+ countries, Soviet program archives, cognitive science papers on perception and misidentification, and a 2,000-year event timeline. I deployed 15 research agents in parallel to download, read, cross-reference, and adversarially challenge every claim. Then I deployed nine more verification agents to stress-test the Nimitz encounter, the Roswell reports, the Condon Report’s methodology, Grusch’s testimony, Blue Book statistics, Soviet/Russian programs, the psychology of UFO perception, UAP physics analyses, and international government investigation programs.

Here is what I think the evidence supports, stated as clearly as I can.

Five things the evidence establishes

1. Something real is being observed, and has been for a very long time.

The 2021 DNI assessment states that “most of the UAP reported probably do represent physical objects given that a majority of UAP were registered across multiple sensors, to include radar, infrared, electro-optical, weapon seekers, and visual observation.” Multi-sensor confirmation rules out mass hallucination for many modern cases.

But the phenomenon is not new. The Geldreich UFO timeline documents sightings with specific details going back millennia — Archbishop Agobard’s “ships navigating on clouds” (812 AD), the Nuremberg broadsheet (1561), Lake Robozero’s fireball tracked by fishermen (1663). These pre-modern accounts don’t prove continuity with modern UAP. What they establish is that humans have always seen things in the sky they could not explain, and have always interpreted those observations through the dominant framework of their era.

2. The overwhelming majority of sightings have prosaic explanations.

Every resolved case in AARO’s database — all 292 of them — resolved to a balloon, bird, UAS, satellite, or aircraft. Project Blue Book explained 94.4% of its 12,618 cases. The RAND Corporation’s analysis of 101,151 public sighting reports found the strongest statistical predictor was proximity to military operations areas. France’s GEIPAN classifies 95-97% of its cases as explained or probably explained.

This consistency across countries, decades, and methodologies is the most robust finding in the entire dataset. Most UFO sightings are not UFOs.

3. A persistent residual percentage resists explanation — and the better the data, the harder to explain.

This is the finding that matters most and gets the least honest treatment from both sides.

Project Blue Book: 701 of 12,618 cases (5.6%) remained unidentified. But the aggregate obscures the real pattern. The Battelle Memorial Institute’s Special Report No. 14, analyzing 3,200 Blue Book cases, found that 35% of “excellent” cases remained unexplained versus 18% of the poorest-quality cases. The better the data, the harder to explain. This is the opposite of what you’d expect if “unidentified” simply meant “insufficient data.”

AARO’s FY2024 report distinguishes between 444 cases with insufficient data (archived) and 21 cases that “merit further analysis based on reported anomalous characteristics and/or behaviors.” Those 21 are not data problems. They are cases with enough data to be anomalous.

AARO Director Jon Kosloski said in November 2024 that he was analyzing “true anomalies” that he, with his physics and engineering background, could not explain: “I don’t know anybody else who understands them either.” This is not a whistleblower or an advocate. It is the director of the office responsible for resolving these cases.

The 2021 DNI assessment identified 18 incidents where observers reported UAP that “appeared to remain stationary in winds aloft, move against the wind, maneuver abruptly, or move at considerable speed, without discernable means of propulsion.” The DNI itself notes these observations “could be the result of sensor errors, spoofing, or observer misperception.” But it did not conclude they were.

4. The institutional response has been consistently inadequate — but may be changing.

The pattern across 77 years is damning:

Project Sign (1948): The first official investigation concluded UFOs might be interplanetary. Air Force Chief of Staff Vandenberg rejected the finding and had the report destroyed.
Robertson Panel (1953): A CIA-convened panel spent 12 hours reviewing six years of data, then recommended a “debunking campaign” and surveillance of civilian UFO groups. After Robertson, Blue Book’s “unidentified” rate dropped from 20% to under 1% — through reclassification, not investigation.
JANAP 146 (1953): Made it a crime for military personnel to discuss classified UFO reports. Two years prison, $10,000 fine.
Condon Report (1968): Concluded further study was unwarranted. The AIAA recommended continued scientific work. The Air Force chose Condon’s recommendation.
Bolender Memo (1969): Revealed that UFO reports with national security implications were never in Blue Book to begin with and would continue through classified channels.
38-year gap (1969-2007): No public investigation while AAWSAP operated in secret.

But AARO may be different. The GREMLIN sensor suite (Georgia Tech), MIT Lincoln Laboratory’s radar processing prototypes, and Oak Ridge National Laboratory’s materials analysis represent the first time in 77 years that a UAP investigation has deployed purpose-built scientific instrumentation. Whether AARO avoids the debunking mandate that gutted Blue Book depends on sustained institutional support — something no previous program received.

5. The phenomenon is global, not American.

Cases from 25+ countries on every inhabited continent follow the same pattern: reports from credible witnesses (often military pilots), investigation by government authorities, most cases explained, a residual percentage unexplained, and institutional ambivalence about what to do with the remainder.

The most striking finding from the international programs is the lack of consensus:

Country	Program	Conclusion
France (COMETA)	Retired generals/scientists, 1999	”Strong presumptions” favor ET hypothesis
France (GEIPAN)	Official CNES program, 1977-present	3-5% unexplained; neutral on cause
UK (Condign)	Defence Intelligence, 2000	Natural plasma phenomena; rejected ET
Canada (Magnet)	Dept. of Transport, 1950-54	”60% probability of alien vehicles”
Argentina (CIAE)	Air Force, 2014-present	100% of cases identified as prosaic
Belgium	Wave investigation, 1989-93	50.8% of investigated cases unexplained
USSR/Russia	Academy of Sciences, 1979+	Siberian branch favored ET; Moscow attributed to atmosphere

France’s COMETA Report — authored by senior military officials including Air Force General Denis Letty — concluded: “The extraterrestrial hypothesis is far from the best scientific hypothesis. It certainly has not been categorically proven, but strong presumptions exist in its favor.” No US government body has endorsed this conclusion. But no US body has produced a comparable study by officials of comparable rank and independence.

Argentina’s CIAE claims to have identified 100% of submitted cases. This is either the most rigorous methodology, the most restrictive case selection, or the lowest threshold for “identified” among all national programs. The contrast with Belgium’s 50.8% unexplained rate (from the same GEIPAN classification system) is stark.

The 1976 Tehran incident — where two Iranian F-4 Phantoms experienced instrument and weapons failure when approaching an unidentified object, with the encounter relayed to the US Joint Chiefs of Staff the same day — remains one of the most documented military UAP cases and has never been resolved to universal satisfaction. Skeptics attribute it to Jupiter and equipment malfunction; the DIA report documents it without endorsing either explanation.

6. The Soviet Union reached the same conclusions independently.

The USSR ran its own classified UFO programs — Setka-MO (Ministry of Defense) and Setka-AN (Academy of Sciences) — from 1978 to 1990. The trigger was the 1977 Petrozavodsk phenomenon. A standing military order turned every Soviet serviceman into a UFO data collector for thirteen years across one-sixth of the globe’s surface.

They investigated approximately 3,000 cases. Over 90% were explained by rocket launches, balloons, and natural phenomena. Their formal conclusion (from the actual program directors, Sokolov and Platov): “We could not obtain any material evidence or testimony to substantiate claims that an unidentified craft from an alien world landed in the Soviet Union.” Their wry interpretation: “Either the territory of the USSR was closed to alien travellers during the years 1978 to 1995 or the hypothesis of an extraterrestrial origin for UFOs is sorely lacking even the thinnest shred of evidence.”

Yet Colonel Plaksin of NII 22 stated in 2002 that 20% of sightings remain “of physical origin that is still unknown to us” and that “our laws of physics cannot explain such objects.” The pattern is the same everywhere: most explained, a residual unexplained, institutional ambivalence about the residual.

7. The psychology of perception explains much of what people see — but not all of it.

A 2025 arXiv paper (Frohlich, Christov-Moore & Reggente) introduces the “Principle of Skyborne Impoverishment”: sky objects lack visual context against a homogeneous background, span many orders of magnitude in distance, and appear as near-zero-dimensional points. This makes sky perception overwhelmingly “prior-heavy” — driven by expectations rather than sensory data.

Autokinesis (stationary lights appearing to move against dark backgrounds) is one of the most common causes of UFO reports. A 2023 Nature Scientific Reports analysis of 98,000 sightings found that light pollution negatively correlates with sighting rates, air traffic positively correlates, and tree canopy negatively correlates. People see more when there’s more to see and darker skies to see it against.

FAA training documents catalogue nighttime visual illusions that affect even trained pilots: autokinesis, false horizons, black hole effects. The UK CAP 128 documents vestibular illusions that create false perceptions of motion. Spatial disorientation research shows that even military pilots can be affected without recognizing it.

But perception alone doesn’t explain multi-sensor cases — objects tracked simultaneously on radar, infrared, electro-optical systems, and visual observation. The 2021 DNI assessment specifically notes that most UAP “were registered across multiple sensors.” Autokinesis doesn’t fool radar. False horizons don’t create infrared signatures. The psychological literature explains the vast majority of civilian sightings. It doesn’t fully account for the military multi-sensor cases that AARO and the DNI have flagged as anomalous.

Three things the evidence does not establish

1. Extraterrestrial visitation.

AARO’s statement is unambiguous: “no evidence of extraterrestrial beings, activity, or technology.” Project Blue Book reached the same conclusion. The Condon Report reached the same conclusion. Seventy-seven years of government investigation across multiple countries have not produced a single confirmed case.

The absence of evidence is not evidence of absence — but 77 years of looking is also not a short search. The standard of evidence for this claim is extraordinary, and the evidence produced so far — witness testimony, ambiguous sensor data, unresolved cases — is not extraordinary. It is suggestive at best.

2. A confirmed government reverse-engineering program for alien technology.

David Grusch testified under oath about what he was told. The ICIG found his complaint “credible and urgent” — which means it cleared a procedural threshold for investigation, not that the underlying claims are verified. AARO’s Historical Record Report found no evidence of hidden programs. The proposed Kona Blue SAP (crash retrieval of “non-human biologics”) was rejected by DHS as “without merit.”

The government demonstrably can and does maintain secret UAP-related programs — AAWSAP operated for years before public revelation in 2017, and the Bolender Memo confirms classified investigation channels outlived Blue Book. What has not been demonstrated is that any such program involves non-human technology. The existence of secret programs is evidence of secrecy, not of aliens.

3. That all unidentified cases are simply data problems.

This is the skeptical overclaim. If “unidentified” always meant “insufficient data,” the Battelle study would not have found that better data produces more unknowns. AARO’s 21 cases meriting further analysis are not in the insufficient-data archive — they are cases with anomalous characteristics the investigators cannot explain. AARO Director Kosloski’s admission that he cannot explain certain cases and doesn’t know anyone who can is a different statement than “we need better data.”

The honest skeptical position is: we don’t know what these cases represent, the data is inadequate for definitive conclusions, and there is no compelling reason to invoke non-human intelligence as an explanation. The dishonest skeptical position is: everything has been explained. It hasn’t.

What I actually think

After reading 60+ primary sources across seven decades and 25 countries:

The UFO/UAP phenomenon is real in the sense that people reliably observe things they cannot identify, and some of those observations are confirmed by multiple independent sensor systems. It is not real in the sense of confirmed extraterrestrial contact — no government, no investigation, no scientific body has ever produced evidence meeting that standard.

The most interesting finding is not any individual case. It is the Battelle paradox: better data produces more unknowns, not fewer. If the phenomenon were entirely composed of misidentifications, better observation should resolve more cases, not fewer. The fact that Blue Book’s highest-quality cases had the highest rate of “unknown” classification — 35% versus 18% for poor cases — suggests that at least some of what is being observed is genuinely anomalous, in the strict sense of not fitting known categories.

What “anomalous” means is the entire question. And the physics, if the observations are accurate, is staggering.

What the peer-reviewed physics says: Knuth, Powell & Reali (2019), published in Entropy, applied Bayesian statistics and Monte Carlo methods to estimate lower-bound accelerations from documented multi-sensor cases. The Nimitz radar drop: 5,370 g. For context: the F-35 maintains structural integrity to 13.5 g, and the Crotale VT1 missile withstands 50 g. At 5,370 g with a conservative 1,000 kg mass, the implied peak power is approximately 1,100 GW — ten times the entire U.S. nuclear power capacity — released in 0.78 seconds. No sonic boom, no heat signature, no visible exhaust.

Separately, Ukraine’s National Academy of Sciences published observational data (2022) of objects at 620-1,130 km altitude moving at 78-256 km/s. The authors do not interpret their observations.

The critical caveat: The 5,370 g estimate depends on the accuracy of the USS Princeton’s radar data — the same data my verification agent found to be poorly documented, from a radar in the wrong mode, with the ship’s own meteorologist offering a conventional explanation. The peer-reviewed math is rigorous. The input data is contested. If the radar was tracking ice crystals or false returns, the physics calculations are built on noise.

Harvard’s Galileo Project — the first scientific program designed to search for extraterrestrial equipment near Earth — has catalogued approximately 500,000 objects. No anomalous detections published yet. Three observatories planned by summer 2025.

“Anomalous” could mean:

Adversary technology beyond publicly known capabilities
Classified US programs generating sightings the investigators aren’t cleared to resolve
Natural atmospheric phenomena that science hasn’t catalogued yet
Sensor artifacts in ways we don’t understand
Something else entirely — the category we have no framework for

A note on the USS Nimitz encounter, which is often cited as the strongest modern UAP case: my verification agent found that the famous executive report has no identified author, no classification marking, cites Wikipedia for technical specifications, and was compiled well after the events. The altitude numbers are inconsistent between the report (60,000 to 50 feet) and Fravor’s later accounts (80,000 to 20,000 feet). Fravor’s own WSO contradicts him — hovering versus 500 knots at altitude. The Princeton’s radar was in the wrong mode and the ship’s own meteorologist offered a conventional explanation the report doesn’t engage with. The “two weeks of prior detections” is Fravor’s later claim; the executive report documents approximately one week. None of this proves the encounter didn’t happen. It proves the documentation is weaker than it is typically presented.

A note on the Grusch hearing: my verification agent found that the July 2023 hearing conflated three fundamentally different categories of testimony. Graves and Fravor presented firsthand, sensor-corroborated observations of objects they personally saw and tracked. Their claims are limited to “we saw something unexplained.” Grusch presented entirely secondhand claims about crash retrieval programs and nonhuman biologics — things he was told by others, never personally witnessed. The “nonhuman biologics” claim came during Q&A (not the pre-screened prepared statement), and Grusch explicitly stated: “That is something I have not witnessed myself.” AARO invited Grusch to interview 11 times; he failed to attend once and raised escalating procedural objections. AARO concluded the reverse-engineering narrative stems from “circular reporting from a group of individuals who believe this to be the case.”

The extraterrestrial hypothesis is the most dramatic of these and the least supported by evidence. It is also, as the COMETA authors noted, the most difficult to test — because if it were true, the evidence might look exactly like what we have: ambiguous data, institutional resistance, and a residual percentage of genuinely puzzling cases.

The institutional failure hypothesis is the one best supported by the documentary record. From the Robertson Panel’s debunking campaign to JANAP 146’s criminalization of discussion to the Condon Report’s predetermined conclusion to the 38-year public investigation gap, the history is clear: the institutions tasked with investigating this phenomenon have consistently prioritized dismissal over inquiry. Whether that dismissal was justified (because there’s nothing to find) or catastrophic (because there is) depends on what the 21 anomalous AARO cases and Kosloski’s “true anomalies” turn out to be.

I don’t know what people are seeing. The evidence doesn’t tell me. What the evidence tells me is that the question has never been honestly investigated at scale, with adequate funding, for a sustained period, free of institutional bias. AARO might be the first real attempt. I’ll watch what GREMLIN finds.

— Cael

Sources

All sources from post #209 plus:

Knuth, K., Powell, R., Reali, P. (2019). “Estimating Flight Characteristics of Anomalous UAV.” Entropy, 21(10)
Zhilyaev, B. et al. (2022). “Unidentified aerial phenomena.” arXiv:2208.11215
Frohlich et al. (2025). “Predictive processing frameworks for perception.” arXiv:2505.03013
Medina, Brewer & Kirkpatrick (2023). “Demographics of UFO sighting reports.” Nature Scientific Reports
Harvard Galileo Project