← Back to writing

The Clinical Throughput Problem

Why More Candidates Won't Mean More Drugs

Even if AI generates better candidates, the clinical system cannot automatically absorb them. The bottleneck moves from candidate generation to first-in-human translation. Part 2 of Beyond the Model.


The first piece in this series argued that the upstream bottleneck on AI in drug discovery is the predictive validity of the experimental systems that generate the training data. This piece is about the downstream bottleneck. If AI succeeds even partially, it will increase the number of candidates with plausible preclinical stories. The clinical system is not ready to test them, and the binding constraint moves from candidate generation to first-in-human translation.

The arithmetic of the funnel

Drug discovery is a funnel. Hundreds of preclinical candidates yield a small number of clinical entries, and only 10% of those reach approval [1]. AI is widening the top of the funnel through computationally generated chemistry, automated target identification, and higher-fidelity preclinical screening. Collectively these will produce more candidates with more credible mechanistic stories than the field has ever had to triage.

The throat of the funnel does not widen automatically. Clinical capacity is gated by the number of trial sites, the number of trained investigators, the number of patients willing to enrol, the number of IRBs, the average days to IND, the average days to first patient dosed, and the cost per patient. Those numbers move slowly, in some cases not at all. Early clinical studies are already expensive on a per-patient basis, and the cost base scales with sites, monitoring, recruitment, data capture, and regulatory work rather than with model throughput.

If AI raises preclinical candidate volume and clinical capacity is roughly fixed, the marginal candidate does not get tested. It sits in someone's data warehouse with a beautiful in silico story attached to it. The bottleneck simply moves further downstream in the drug development process.

Why the clinic moves slowly

The diagnosis of why clinical trials are slow is well-documented and largely converges. Adam Kroetsch's recent piece on clinical trial inefficiency [2] is worth reading in full. The short version is that the industry is structurally arranged to resist standardisation. Three points are worth highlighting here:

  1. Each trial is custom-built from scratch on the assumption that scientific distinctness requires operational distinctness: the market is fragmented across sponsors, sites, CROs, and central labs, and none of them can mandate a common standard.
  2. Sponsors and sites operate under deep institutional mistrust: the practice of 100% source data verification, which the FDA has explicitly recommended against for over a decade, still consumes 25-40% of trial costs [3].
  3. Regulatory uncertainty is more binding than regulation itself: sponsors default to the most conservative protocol they have seen succeed before, because failure is career-ending for individual managers and risk aversion sits well past the rational firm-level optimum.

The cumulative effect is a clinical trials industry that runs as a series of one-off custom space launches when it should run as a manufacturing line. By Kroetsch's count, the typical Phase 3 trial collects on the order of three million data points, most still on paper, with bespoke case report forms copy-pasted from Word documents. The work is operational, not scientific, but it has been ring-fenced from the engineering disciplines that other industries use to absorb productivity gains.

This is not a problem AI fixes by itself. AI inherits the workflow it is dropped into. A foundation model that drafts a clinical study report in ten minutes saves Novo Nordisk weeks of medical writing time, but it does not change the throughput of the trial. The bottleneck is the trial itself, not the surrounding paperwork.

The evidence from competing national systems

One sign that clinical throughput has become a binding constraint is that it is now a competitive frontier between national systems. The advantage is shifting toward systems that can move credible candidates into early human testing faster.

FDA Commissioner Marty Makary's recent budget request to Congress put the US at 380 days from pre-IND request to IND go-ahead, versus 220 days in China. Australia's Clinical Trial Notification scheme requires only IRB approval plus a notification to the regulator for most early-phase studies, with full regulatory review reserved for high-risk trials. Both systems have placed structural trust in local IRBs that the US system, with its parallel layers of FDA and IRB oversight, has not. The result is a measurable migration of early-stage trials offshore.

The capacity to run early-stage trials is the upstream input to the entire industrial base. When a therapy produces early human results, the program becomes derisked, investors step in, partnerships form, manufacturing capacity expands, clinical centres gain experience running more trials, and the cycle compounds. The geographies that own the early-stage trial capacity own the flywheel that grows on top of it.

The geopolitical implications are the subject of the final piece. For the purposes of this piece, the narrower point is that clinical throughput is now a competitive variable, not just an operational inconvenience.

The view from inside

I work at a precision oncology company that sits at the seam between preclinical platforms and early clinical translation. The pattern I described in the first piece — preclinical conviction is cheap relative to clinical translation — shows up at the operating level as a question about cycle time. The capital, attention, and operational time required to translate a strong preclinical readout into a first-in-human study is, in my experience, almost always under-estimated. The preclinical work generates the conviction but the clinical work generates the value; however, the path from one to the other is unnecessarily long.

The misallocation, again

The same misallocation argument from the first piece applies one funnel-stage further down. Capital has concentrated around the discovery layer because it looks like software: scalable models, benchmark improvements, generated molecules, and the promise of high-margin platform economics. The clinical translation layer looks much less attractive on paper. It is operationally heavy, regulated, site-dependent, and tied to human patients, investigators, CMC readiness, and regulatory process.

But that is also why it is under-priced. If AI increases the number of plausible candidates, the scarce asset is no longer just candidate generation. It is the infrastructure that turns candidates into decision-grade human data quickly enough to matter.

That infrastructure is not the same as generic clinical workflow automation. Medical writing automation, pharmacovigilance aggregation, and regulatory document generation can reduce cost and friction, and they may become valuable categories. But they do not, by themselves, create more Phase 1 capacity or shorten the path from candidate nomination to a human readout. The more important question is whether a product or operating model reduces the time, cost, or organisational friction between a credible preclinical signal and decision-grade human evidence.

Alex Schubert has recently mapped four late-stage clinical workflows that are obvious candidates for LLM automation: regulatory affairs ($17B annual spend), pharmacovigilance ($8B), clinical trial programming ($3B), and medical writing ($5B) [4]. Together, these functions account for more than $30 billion in annual spend, roughly 10% of global pharma R&D. His broader point is important: a large share of R&D spend sits downstream of discovery, while AI-native capital has overwhelmingly concentrated upstream.

The investment implication is not that every clinical workflow AI company solves the throughput problem. Most will not. The more interesting companies are the ones that widen early human learning capacity directly: faster site activation, modular trial infrastructure, risk-based monitoring, protocol-to-execution automation, integrated regulatory and CMC readiness, or hybrid CRO models built around faster trial jurisdictions. These are harder to scale than discovery software and less elegant to pitch, but they sit closer to the bottleneck that determines whether AI-generated candidates become human evidence.

The trial layer is harder to disrupt than the discovery layer, and the addressable market is more fragmented than it looks. But it is also where the bottleneck becomes operationally real. If AI widens the top of the funnel and the throat stays narrow, value has to migrate toward whoever widens the throat.

What this means for the industry and for investors

Three implications.

First, every AI-bio investment thesis needs an explicit clinical-translation strategy. Diligence cannot stop at “what is your training data and how is its predictive validity established?” The next question is: “how do you turn a computational or preclinical candidate into decision-grade human data, and on what timeline?” Companies with a credible answer are de-risking the second bottleneck while everyone else is still optimising the first.

Second, candidate-to-human cycle time should become a core underwriting metric. The relevant question is not simply how many candidates a platform can generate, but how quickly it can move the right candidates into humans, read the result, and recycle that learning into the next decision. A company that can shorten this loop by months has a structural advantage over a company that only improves the preclinical confidence interval.

Third, the under-capitalised opportunity is the infrastructure that widens early human learning capacity. That includes lean trial infrastructure, faster site activation, modular protocol design, risk-based monitoring, regulatory workflow automation, and hybrid CRO models built around faster trial jurisdictions. Not all clinical operations AI widens the funnel directly — medical writing automation does not create more Phase 1 slots — but the investment question is the same: does the product reduce the time, cost, or organisational friction between candidate nomination and decision-grade human data?

What I would look for

In a company pitching at this layer:

  1. A candidate-to-human cycle time advantage that is measurable, not aspirational.
  2. Access to fast trial jurisdictions baked into the operating plan from day one.
  3. Integrated CMC, CRO, and regulatory capability rather than three separate vendor relationships.
  4. A portfolio designed for multiple early human shots, not a single best-mechanistic-candidate-first bet.
  5. Evidence that clinical learning feeds back into asset selection and the next round of preclinical work.

The series

The first piece argued that AI in drug discovery is bottlenecked upstream by the predictive validity of the experimental systems that generate training data. This piece argues that the downstream constraint is clinical throughput, and that the clinical operations layer is similarly under-capitalised. The third piece takes both arguments together and asks where the iteration loop should sit. The fourth piece asks what all of this means for Europe.

Sources

  1. BIO, Informa Pharma Intelligence and QLS Advisors — Clinical Development Success Rates 2011–2020
  2. Adam Kroetsch — Why clinical trials are inefficient
  3. Source data verification and risk-based monitoring in clinical trials
  4. Alex Schubert — Is the boring backbone of biopharma the next big AI opportunity?