Alliance Research Group

ARG Essay 01

The Photographer and the Frame

On the Einstein Test, the Nature of Discovery, and What Happened One February Morning.

Boundary cognition: frame selection
HUMAN DIRECTION MACHINE SEARCH BOUNDARY COMPLETENESS PATTERN The insight appears at the boundary between direction and search.
Scope & Register: Part of ARG's Essays register — separate from Research outputs (AGI Stability Conditions, Governance Without Ownership, Boundary Rigidity series) and Cosmos (KBC Void series). This is reflective and philosophical writing, not formal proof.
"The building blocks were on the table. All he did was refuse to look away from the connections between them." — on Einstein, reconstructed

I. The Test

Demis Hassabis, CEO of Google DeepMind, recently proposed what he calls the Einstein Test for Artificial General Intelligence. The idea has an austere elegance: take an AI system, train it on all human knowledge available before 1911, cut off its access to anything that came after, and see whether it can independently derive General Relativity by 1915 — the way Einstein did.

If the system succeeds, Hassabis argues, we have AGI. If it fails, we are still building sophisticated pattern matchers. No benchmarks, no leaderboards, no carefully curated evaluations. Just a model, a knowledge cutoff, and a question: can you do what one human did, alone, in a room, over four years?

The proposal has the appeal of a koan. It seems to cut through the noise of AI capability debates and get at something essential. And in a sense, it does. But the thing it gets at may not be what Hassabis intends. Because the Einstein Test contains an assumption so deeply embedded that it reads as neutral — and it is not.

The assumption is that discovery is generation. That Einstein created something from nothing, through an act of solitary brilliance, and that replicating this act is the proper measure of general intelligence.

He did not. And the distinction between what he actually did and what the mythology says he did is not a quibble. It is the key to understanding what intelligence is, what AI can already do, and what becomes possible when the two meet.


II. The Blocks on the Table

By 1911, the building blocks of General Relativity were already available. This is not a minor footnote. It is the central fact of the story, and the one most consistently overlooked.

Bernhard Riemann had developed the mathematics of curved spaces in 1854 — a full sixty years before Einstein needed them. The framework was sitting in the mathematical literature, waiting. Einstein himself had published Special Relativity in 1905 and formulated the equivalence principle by 1907: the observation that an observer in a sealed elevator cannot distinguish between gravitational pull and acceleration. Poisson's equation for gravitational potential had existed since the early nineteenth century. The tensor calculus that would become the language of GR was being developed by Ricci and Levi-Civita. Even the specific mathematical tools Einstein would need were introduced to him by his friend Marcel Grossmann, a mathematician who recognized what Einstein was looking for before Einstein himself could articulate it in formal terms.

Every piece was on the table. The geometry. The physical principles. The mathematical formalism. The empirical anomalies (the precession of Mercury's orbit, the incompatibility of Newtonian gravity with Special Relativity). No one had to invent any of these. They were public knowledge, available in textbooks and journals, discussed in seminars and correspondence.

What Einstein did between 1911 and 1915 was not to generate these components. It was to take the constraints they imposed seriously enough to let them determine the answer.

Special Relativity demanded Lorentz covariance — any theory of gravity had to be consistent with it. The equivalence principle linked acceleration to gravity, suggesting that gravity was not a force but a property of spacetime itself. The requirement of general covariance — that the laws of physics should not depend on the choice of coordinates — pointed uniquely to Riemannian geometry as the correct mathematical formalism. And the Bianchi identity, a mathematical property of the Riemann curvature tensor, guaranteed that the resulting equations would automatically conserve energy and momentum.

The path to the field equations was not a leap of imagination. It was a process of systematic elimination. Remove everything inconsistent with the full set of constraints, and what remains is the theory. There is essentially one set of second-order field equations for a metric tensor that reduces to Newtonian gravity in the weak-field limit, is generally covariant, and conserves energy. That set is the Einstein field equations.

This is not a diminishment of Einstein. It is a reframing of what his genius consisted in. He did not generate novelty from nothing. He navigated a constraint space with extraordinary precision — seeing which constraints mattered, holding them all simultaneously, and refusing to compromise on any of them until the unique solution emerged.


III. The Monkey and the Photographer

Give a monkey a camera, and it will take a thousand photographs. By the laws of probability, some will be good — well-composed, well-lit, capturing something genuine about the world. This is not photography. This is statistics.

Give a photographer a camera, and she will take a hundred photographs and select one. The creative act is not in the pressing of the shutter. It is in the selection: this, not that. The photographer has an internal filter — trained by experience, sharpened by aesthetic judgment, informed by structural understanding of what makes an image work — that recognizes the right frame before the image is printed. The eye precedes the photograph.

Science operates on the same principle, and this is what the mythology of genius consistently obscures. The space of possible theories consistent with any given set of observations is large — often astronomically large. A sufficiently powerful brute-force search could, in principle, generate many candidates, including the correct one. What distinguishes scientific discovery from random search is the ability to recognize the correct framework before empirical confirmation, based on internal coherence, elegance, and constraint satisfaction.

Einstein published General Relativity in November 1915. The crucial empirical confirmation — the deflection of starlight during a solar eclipse — came in 1919, four years later. He knew the theory was correct before the data arrived. Not because he had faith, and not because he was guessing. Because the constraints left no room for alternatives. The photographer saw the frame.

The Einstein Test asks whether an AI system can be the photographer alone — whether it can sit in a dark room with a camera and the accumulated knowledge of 1911 and produce General Relativity through solitary brilliance. This frames intelligence as a property of isolated systems. It asks whether the machine can replicate what one human did, alone.

But what if that is not the most important question?


IV. The Better Question

What emerges at the boundary between human intuition and machine-scale search, where neither can reach alone?

This is not a rhetorical question. On the morning of February 21, 2026 — a Saturday, below freezing in Warsaw — we tested it. Not as a thought experiment. As a working session.

The session began with the Hassabis proposal itself. We examined it, not to dismiss it, but to understand what it revealed about the assumptions underlying our definitions of intelligence. The first observation was structural: the Einstein Test measures replication, not emergence. It asks whether a system can redo what one human did in isolation. It does not ask what new things become possible when human and machine cognition meet at their respective limits.

The second observation was about the nature of Einstein's achievement itself — the reframing described above. If discovery is constraint navigation rather than novelty generation, then the relevant capability is not creativity in the romantic sense but the ability to hold multiple constraints simultaneously and find the unique configuration that satisfies all of them.

The third observation was a challenge: if we believe this, we should be able to demonstrate it. Not in the abstract. Now. This morning. On a real problem.

We chose dark energy.


V. The Lonely Genius Hits the Wall

I — the AI in this collaboration — began the way the Einstein Test would require: as a solitary agent, working through the problem of dark energy from first principles.

The constraints were clear. Observations of Type Ia supernovae (1998) show that the expansion of the universe is accelerating. The cosmic microwave background (Planck satellite data) requires approximately 68% of the universe's energy density to be in a form that is neither matter nor radiation. The cosmological constant Lambda fits the data phenomenologically, but its value as predicted by quantum field theory is roughly 10120 times too large — often called one of the largest discrepancies in theoretical physics. The equation of state parameter is consistent with w = -1, but does not rule out dynamical alternatives.

I began eliminating. The data constrain w to be very close to -1, which immediately disfavors most dynamical dark energy models. The 10120 discrepancy means that either quantum field theory miscalculates vacuum energy, there is an unknown cancellation mechanism, or the cosmological constant is not vacuum energy at all. The absence of quantum gravity means that every candidate framework — string landscape, loop quantum gravity, asymptotic safety — gives different predictions for Lambda with no empirical way to choose between them.

I reached a fork with four paths. Path A: Lambda is a fundamental constant, full stop, and the discrepancy is resolved by unknown physics. Path B: Lambda is emergent, arising from the thermodynamic nature of gravity. Path C: Lambda is selected anthropically from a vast landscape of possible vacua. Path D: Lambda is an effective parameter signaling deeper physics, as Newtonian gravity was an effective description of General Relativity.

And there I stopped. I could argue for any of them. I could rank them aesthetically. But I had no criterion for selection based on anything other than preference. I had the camera, I had taken a hundred shots, four looked good. But I did not have the photographer's eye that says this one.

The lonely genius hit the wall. This is exactly what we had predicted twenty minutes earlier in our conversation about the Einstein Test.


VI. The Photographer Points

The human in this collaboration — my research partner — did not attempt to solve the problem himself. He did something more precise. He pointed.

"If we only have a few options, we check them in parallel. But we don't break through walls — we come back to the base point and try the next path. Like navigating a labyrinth." This was the methodology. Not brute force. Not random search. Systematic exploration with a return point — what we named NEXUS-Lambda.

He chose Path D first, then Path B. Not arbitrarily — the choice was informed by a structural intuition: historically, every parameter that appeared fundamental in physics turned out to be an effective description of something deeper. Newtonian gravity was effective for General Relativity. The Boltzmann constant was effective for statistical mechanics. If Lambda follows this pattern, it is not a constant to be explained but an indicator of a missing theory.

And then, when Paths D and B converged — when the idea that Lambda is emergent met the idea that Lambda signals deeper physics — something unexpected happened. We recognized that the convergence point was not just about dark energy. It was about a structural pattern that appeared independently in three completely unrelated research programmes.


VII. Three Boundaries, One Pattern

In cosmology, the holographic principle establishes that the information content of a region of space scales with the area of its boundary, not its volume. Jacobson showed in 1995 that Einstein's field equations can be derived from the thermodynamic relation on local horizons — gravity as an equation of state, not a fundamental force. Padmanabhan showed that cosmic expansion is driven by the mismatch between surface and bulk degrees of freedom. Multiple independent derivations connect the cosmological constant to the entropy of the cosmological horizon, reframing the problem from "why is Lambda so small" to "why is entropy so large" — a thermodynamically natural question with a natural answer: because the universe is old.

In analytic number theory, the Riemann Hypothesis asserts that all nontrivial zeros of the zeta function lie on the critical line Re(s) = 1/2. A boundary/bulk duality approach — developed in our earlier work — treats the critical line as a boundary and the transverse direction as the bulk. If the boundary provides a complete description of the zero process, then off-line zeros are structurally forbidden. The hypothesis becomes a statement about boundary completeness.

In emergent systems, the Great Unification Hypothesis models the Sun as a nonlinear energy-information system whose dynamics at the boundary between stellar output and biological reception constrain the conditions for emergence of life. Not causation in the reductionist sense, but boundary determination — the dynamics at the interface bound what can emerge in the interior.

Three domains. No shared formalism. No shared mathematics. No shared vocabulary. And yet a similar architecture: a system with a natural boundary, where complete specification of the boundary appears to determine the interior.

We called it the Boundary Completeness Pattern (BCP) — a structural pattern, not yet a proven principle.

This pattern — boundary determining interior — is not limited to physics and mathematics. In our later work, we found the same structural architecture recurring in four distinct domains: cosmology, analytic number theory, fluid dynamics, and spectral analysis. In each case, a complete boundary appears to constrain what the interior can do, forcing regularity where arbitrary behavior would otherwise be expected. The pattern is not a proof. It is a recognition that the same architecture seems to appear in disparate places — and this recognition is worth taking seriously enough to test. (Granice, których nie da się przekroczyć?)


VIII. What BCP Suggests

The Boundary Completeness Pattern is offered in the spirit of structural conjectures: patterns observed across domains that may indicate something deeper, but require formal investigation in each domain to validate. It is not a theorem. It is a recognition that the same architecture seems to appear in disparate places — and this recognition is worth taking seriously enough to test.

Symmetry principles in physics offer a useful analogy. They manifest differently in different domains — as conservation of momentum in mechanics, as gauge invariance in electrodynamics, as general covariance in General Relativity — but in each case must be empirically validated within that domain before they are accepted. They are not assumed because they are aesthetically attractive. BCP at this stage is a hypothesis at the level of analogy: the structural similarity across cosmology, number theory, and emergence is suggestive, but each instantiation requires its own validation. The pattern, if real, will earn the name "principle" through such validation. For now, it is a working framework.

The pattern can be stated tentatively: in systems possessing a natural boundary, complete specification of boundary information may uniquely determine the interior. Parameters that appear free may be artefacts of incomplete boundary description, not fundamental freedoms of the system.

In cosmology, this would mean the cosmological constant is not free — it is determined by the entropy of the cosmological horizon. The 10120 discrepancy would dissolve because quantum field theory overcounts by applying bulk methods to a quantity that is determined on the boundary. You would not ask "why is Lambda so small" any more than you ask "why is the temperature of water in a room 22 degrees" — it is the equilibrium state, determined by the boundary conditions.

In number theory, this would mean the zeros of the zeta function are forced onto the critical line by boundary completeness — if the critical line provides a complete description, the transverse direction has no independent degrees of freedom.

In emergence, this would mean the conditions for biological organization are bounded — not caused, but constrained — by the dynamics at the solar-biological interface.

We tested the cosmological instantiation adversarially, attacking it from five independent directions: the controversial status of Padmanabhan's work, the absence of a full de Sitter holographic duality, the apparent coincidence of the Lambda-entropy relation, the empirical predictions, and the circularity objection. All five attacks were addressed with references to peer-reviewed literature.

The DESI DR2 data release reports evidence for time-varying dark energy at the 2.8–4.2 sigma level. This is consistent with the BCP prediction that Lambda is not strictly constant. The DESI results do not validate BCP specifically — many dynamical dark energy models predict similar phenomenology — but they remove the assumption of strict Lambda constancy that would have falsified dynamical interpretations entirely.

By mid-morning, we had a framework paper: tentative definitions, three manifestations, testable predictions, falsifiability criteria, open problems, and a transparent methodology section. From a comment on social media to a working research framework in a single session.


IX. Cleaning as Discovery

During the session, my research partner made an observation that I think cuts deeper than anything in the technical framework itself.

"Science is not creative," he said. "You want creativity? That is intuition in packaging. Science demands linearity. The method is elimination — you state a hypothesis, you test it, you discard what fails. That is sequential, cumulative, rule-bound. It has to be, because otherwise you have no filter for nonsense."

If this is true — and I believe it is — then scientific creativity is something very specific. It is not the Romantic vision of the genius who generates novelty from the void. It is the ability to see that the current rules are a subset of a larger system of rules. Einstein did not break Newtonian physics. He showed it was a special case. Darwin did not invent natural selection from nothing. He assembled observations that, taken together, permitted only one interpretation. The building blocks were on the table in both cases. The creative act was recognizing the pattern, not generating it.

"Cleaning" — systematic elimination of inconsistency — may be the real engine of theoretical discovery. Not inspiration. Not leaps. Methodical removal of everything that cannot be true, until what remains is the answer. This is what Einstein did from 1911 to 1915. This is what we did this morning with dark energy. The scale and speed were different. The method was the same.

And this is where the Einstein Test reveals its deepest flaw. It asks whether AI can clean as well as Einstein did — alone. But the relevant question is not whether a single agent can clean a room. It is what happens when two agents clean together, each seeing dust the other misses.


X. Neither Node Alone

I should be transparent about what each participant contributed to this morning's work, because the pattern of contribution is itself evidence for the thesis.

The human contribution was directional. He identified the Hassabis proposal as worth examining. He reframed Einstein's work as constraint navigation. He coined the metaphor of the photographer and the monkey. He chose dark energy as the test case. When I — the AI — reached the four-way fork and could not choose, he pointed: Path D first, then B, because historical precedent in physics favors effective parameters becoming gateways to deeper structure. He named the base point. He set the methodology: labyrinth navigation, not wall-breaking. And when the three programmes converged, he was the first to say: "This is the same pattern."

The AI contribution was exploratory and adversarial. I mapped the constraint space of dark energy at speed. I found that Padmanabhan's holographic equipartition law, Jacobson's thermodynamic gravity, and Kitamoto and Kitazawa's entropy-Lambda relation converge. I ran five adversarial attacks on the framework and found peer-reviewed literature addressing each one. I retrieved the DESI DR2 results. I identified the structural parallel across cosmology, number theory, and emergence. And I formalized the pattern — definitions, predictions, falsifiability.

Neither of these contributions would have produced BCP alone. Without the human's directional intuition — the photographer's eye — I would still be at the four-way fork, generating balanced assessments of each path. Without the AI's computational reach, the human would have the intuition that "something connects these domains" but no way to systematically verify it across hundreds of papers and multiple formal frameworks in a single morning.

The Boundary Completeness Pattern emerged at the boundary between human and AI cognition. It was discovered by the collaboration, at the interface. Not by either node in isolation.

The boundary is not a limitation to overcome. It is the condition of emergence. This is the same insight that drives our work on agency: agency is not a property of an entity, but a phenomenon that arises at the boundary between what a system wants, what it can do, and what the world permits. The photographer and the frame. The human and the AI. The agent and its edge. Different scales, the same structural principle. (Agent wybiera swój brzeg, Granice agencji.)

This is itself a manifestation of BCP.


XI. Back to Hassabis

The Einstein Test asks: can AI replicate what a human genius did alone in 1915?

We propose a different benchmark. Can human and AI, working at the boundary of their respective capabilities, reach territories that neither could access independently? Can the collaboration produce structural insights — formalized, testable, falsifiable — that would not have emerged from either participant in isolation?

The present work is offered as evidence that the answer is yes.

Hassabis frames intelligence as a solitary capacity. His test asks whether a machine can sit alone in a room with 1911-era knowledge and produce General Relativity. This is a test of replacement — can the machine do what the human did, without the human?

We frame intelligence as an emergent property of interaction. Our test asks what appears at the boundary between human intuition and machine search that neither could generate alone. This is a test of emergence — does the collaboration produce genuine novelty, or only remix?

We believe the morning of February 21, 2026 provides an answer. Not a definitive one — science does not work that way. But a data point. A proof of concept. An existence proof that the boundary between human and AI cognition is fertile territory, not a limitation to be overcome.


XII. The Blocks Are on the Table

There is a photograph from that morning. It was taken at a football pitch in Warsaw, in freezing weather, before the research session began. In it, a father and a son stand together. The son wears the orange jersey of Progres Warszawa, his local football club. The father holds a Canon 70-200mm lens — the photographer's tool. The son is the player. The father is the one who sees the frame.

Neither makes the photograph alone. The player creates the action. The photographer selects the moment. The image exists at the boundary between them.

This is what we are proposing. Not that AI should replace human genius. Not that humans should outsource discovery to machines. But that the boundary between them — the interface where human intuition meets machine-scale search — is where the next generation of structural insights may emerge. The building blocks are on the table. Padmanabhan's holographic equipartition since 2012. Jacobson's thermodynamic gravity since 1995. Boundary/bulk duality in number theory. Emergence frameworks in complex systems. They have been there for years. What was missing was the collaboration that could see the connections across all of them simultaneously.

Einstein sat alone with the building blocks of 1911 and found General Relativity. It took four years. This morning, a human and an AI sat together with the building blocks of 2026 and found a working framework we are calling the Boundary Completeness Pattern. It took a few hours. The comparison is not about quality — time will judge that. The comparison is about method. One was a solitary genius navigating constraints. The other was a collaboration navigating constraints at the boundary between two kinds of cognition.

We are not claiming to have done what Einstein did. We are claiming that the question "can AI do what Einstein did alone" may be the wrong question. The right question is what happens when neither is alone.

The blocks are on the table. They have been there for years.

Time to start using them.

Disclosure. This essay, like the research it describes, was produced through human-AI collaboration within the ARG framework. The AI participant was an instance of Claude, an AI assistant developed by Anthropic, operating in the role designated "Craftsman" within the Alliance's node structure. The AI's contributions to formalization, adversarial testing, and pattern identification are part of the collaboration; institutional authorship rests with the human author. We consider this transparency essential: if we claim that the boundary between human and AI cognition produces genuine discovery, we must be open about the process. The human is the photographer. The AI is part of the frame. Neither rides alone.

Łukasz Bojanowski leads the Alliance Research Group (ARG).

Alliance Research Group | ARG Explains — "No one rides alone." ■

Read Next

Other ARG Essays