Reasoned Rule Mining: Calibrated LLM Classification for Quant VC

Paper
Reasoned Rule Mining: Precision-Optimised Weighted LogProb Classification.
Authors
Jack Preuveneers (University of Oxford, Department of Engineering Science), Yigit Ihlamur (Vela Research).
Venue
IEEE CSCloud / SecureFinAI, 2025 (shorter version). IEEE Xplore.
Status
Commendation Award, ICIM 2026, Oxford (in press). Non-provisional patent in progress.
Research program
Part of Think-Reason-Learn at Vela, the quant VC research program.

What Reasoned Rule Mining contributes to quant VC

Quant VC is the application of quantitative, reproducible, empirically validated methods to venture capital decision-making. Reasoned Rule Mining (RRM) is the architecture in Vela's quant VC program that addresses a specific problem: large language models produce poorly calibrated outputs, and those outputs cannot be trusted as decision signals without principled probability calibration. RRM is a two-stage framework that converts LLM outputs into calibrated, auditable classifiers suitable for the asymmetric cost structure of quant VC screening.

On a reviewed subset of 8,000 founders with 160 positives drawn from a pool of 9,892, RRM achieves 24.5% precision at 15% recall, with F0.5 = 0.217. Against the dataset's 2% prevalence, this is a 12.3x uplift in precision. The compute-aware cascade reduces expensive second-stage evaluations by 91.3%, turning a brute-force LLM classifier into a selective one without sacrificing the precision-first operating point.

What is quant VC, and where does RRM fit?

Quant VC treats venture capital as a rare-event prediction problem that can be modeled, measured, and improved with the same rigor that quantitative finance brings to credit risk or quantitative medicine brings to diagnostic screening. Quant VC requires quantitative scoring against honest baselines, reproducible methodology, and interpretability that allows every prediction to be audited.

RRM sits at the statistical core of Vela's quant VC architecture. GPTree introduced the idea that LLMs could generate the structure of a decision system. Random Rule Forest showed that LLMs are most effective as feature generators combined with simple voting rules. RRM goes further: it treats the conversion of an LLM into a reliable quant VC classifier as fundamentally a calibration problem, and it builds the mathematical machinery (Platt scaling, log-odds fusion, conditional-independence decomposition, threshold optimization) needed to solve it. For quant VC firms that need probability estimates they can defend to a partner meeting or an LP committee, this is the paper that provides the framework.

How does RRM predict founder success?

RRM operates in three stages.

Stage 1: Rule mining from reasoning traces. The LLM is prompted to produce a stepwise analysis for each training example, where each example is a founder profile paired with its true outcome (successful or unsuccessful). Token-level log-probabilities are recorded alongside the reasoning text. A structured, prompt-driven parser then distills each rationale into a compact IF-THEN predicate about the founder's personal attributes. The rules are scored by perplexity, and only rules below the 95th-percentile threshold are retained. This perplexity filter is a principled guard against LLM hallucination: rules the model generated with low confidence are dropped before they can contaminate the decision policy. Surviving rules are deduplicated and consolidated into a structured, human-readable policy document.
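As a concrete illustration, the perplexity filter can be sketched in a few lines of Python. This is a minimal reading of the step, not the paper's implementation; the rule dictionaries, the helper names, and the strict-inequality cutoff are assumptions.

```python
import math

def perplexity(token_logprobs):
    """Perplexity of a generated rule from its token log-probabilities:
    exp of the negative mean log-probability per token."""
    return math.exp(-sum(token_logprobs) / len(token_logprobs))

def percentile(values, q):
    """Simple nearest-rank percentile (q in [0, 100])."""
    ordered = sorted(values)
    idx = min(len(ordered) - 1, int(round(q / 100 * (len(ordered) - 1))))
    return ordered[idx]

def filter_rules(rules):
    """Keep only rules whose perplexity falls below the 95th-percentile
    cutoff; high-perplexity (low-confidence) rules are dropped."""
    scores = [perplexity(r["token_logprobs"]) for r in rules]
    cutoff = percentile(scores, 95)
    return [r for r, s in zip(rules, scores) if s < cutoff]
```

The key design point is that filtering happens before deduplication and policy assembly, so low-confidence generations never reach the decision policy.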

Stage 2: Cascade inference. At prediction time, a lightweight rules-first pass screens most founders using the mined policy. The LLM is run as a best-of-three ensemble with consensus voting and confidence thresholding. Only founders classified as “High” with at least 95% confidence are promoted to the second stage, where a stricter harsh re-evaluation pass produces refined probabilities. This cascade is compute-aware: easy cases resolve cheaply, hard cases get more reasoning budget. In the paper, the cascade reduces Stage 2 calls from 8,000 to 698, a 91.3% compute reduction.
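One plausible reading of the routing logic, sketched in Python. The majority-vote rule, the mean-confidence aggregation, and the return labels are assumptions beyond what the paper states.

```python
def cascade_route(profile, stage1, n_votes=3, conf_threshold=0.95):
    """Stage-1 screen: run the lightweight rules-first classifier as a
    best-of-n ensemble, then promote to the expensive Stage 2 only when
    the consensus label is 'High' and mean confidence clears 95%."""
    votes = [stage1(profile) for _ in range(n_votes)]   # (label, confidence)
    n_high = sum(1 for label, _ in votes if label == "High")
    mean_conf = sum(conf for _, conf in votes) / n_votes
    if n_high * 2 > n_votes and mean_conf >= conf_threshold:
        return "stage2"              # borderline positive: spend reasoning budget
    return "resolved_negative"       # resolved cheaply by the rules-first pass
```

Under this routing, the vast majority of profiles never pay Stage 2 inference cost, which is the source of the reported 91.3% reduction.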

Stage 3: Numerical adjudication. Raw LLM log-probabilities are calibrated to proper posteriors using Platt scaling. Calibrated evidence from both stages is fused on the log-odds scale, which under conditional independence decomposes additively as log[P(y=1|x) / P(y=0|x)] = log(π/(1−π)) + ℓ₁(x) + ℓ₂(x), where π is the class prior and ℓ₁, ℓ₂ are the calibrated log-likelihood-ratio contributions of the two stages. When stage scores are correlated, a meta-logistic combiner (analogous to stacked generalization) learns the stage weights. The final operating threshold is selected by scanning the ranked posteriors and maximizing F0.5.
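The adjudication arithmetic can be made concrete with a small sketch. Assuming Platt parameters a and b have already been fitted on held-out data, each stage's calibrated posterior contributes its log-odds shift away from the prior; the function names are illustrative.

```python
import math

def platt(score, a, b):
    """Platt scaling: map a raw stage score to a calibrated
    posterior via a fitted sigmoid(a * score + b)."""
    return 1.0 / (1.0 + math.exp(-(a * score + b)))

def logit(p):
    """Log-odds of a probability."""
    return math.log(p / (1.0 - p))

def fuse(prior, p_stage1, p_stage2):
    """Additive log-odds fusion under conditional independence:
    prior log-odds plus each stage's evidence, expressed as the
    shift of its calibrated posterior log-odds away from the prior."""
    z = (logit(prior)
         + (logit(p_stage1) - logit(prior))
         + (logit(p_stage2) - logit(prior)))
    return 1.0 / (1.0 + math.exp(-z))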

The entire framework is domain-agnostic. Quant VC is the motivating use case, but RRM applies to any high-stakes binary screening problem with class imbalance and asymmetric error costs.

How accurate is RRM?

RRM was evaluated on a reviewed subset of 8,000 founders drawn from a pool of 9,892 profiles collected from Crunchbase and LinkedIn. The subset contains 160 labeled successes, for a base rate of approximately 2%, close to the real-world US unicorn formation rate. Success is defined as a $500M or greater IPO, acquisition, or total funding.

On this dataset, RRM reports the following results:

  • Best-threshold mode (recall at least 0.1): Precision = 24.5%, Recall = 15.0%, F0.5 = 0.217. 12.3x uplift over the 2% base rate.
  • Precalibrated mode (out-of-sample): Precision = 17.3%, Recall = 13.2%, F0.5 = 0.162. 8.65x uplift.
  • Single-model LLM baseline (no pipeline): Precision = 4.5%, Recall = 18.0%, F0.5 = 0.053.

The single-model LLM baseline underperforms RRM on every metric that matters for quant VC: 5.4x lower precision, 4x lower F0.5, and no ability to tune the operating point. Precision at the cost of recall is the correct trade-off for a quant VC firm, where the cost of funding a bad founder is much higher than the cost of missing a good one, and RRM's design delivers exactly that.

Current Vela production models built on RRM and its siblings in the Think-Reason-Learn family reach 19% to 38% precision at the same real-world unicorn base rate of roughly 1.9%, a 10x to 20x lift.

Why calibration is the core quant VC problem

Most attempts to use LLMs as quant VC classifiers fail in a specific and predictable way. An LLM's raw output token probabilities are not proper posteriors; they are sensitive to prompt wording, decoding strategy, and class priors the model never saw. A “top-token equals class” heuristic produces classifications that look confident but are systematically miscalibrated. In a rare-event setting like quant VC (2% base rate in this paper), this miscalibration destroys precision at exactly the operating point that matters.

RRM's central insight is that converting an LLM into a reliable quant VC classifier is fundamentally a calibration problem, not a prompt-engineering problem. The framework treats raw LLM log-probabilities as stage-specific scores that must be mapped to proper posteriors through Platt scaling, then fused with other evidence on the log-odds scale under explicit conditional-independence assumptions. Threshold selection becomes an optimization problem with a stated utility function (F0.5), not a guess.

This is the general recipe the paper proposes for turning any generative LLM into a dependable, precision-optimized classifier. It is why RRM generalizes to screening, triage, hiring, and grant review, and it is why the framework has methodological value beyond quant VC itself.
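The threshold-optimization piece of that recipe can be sketched directly: scan every distinct posterior as a candidate cutoff and keep the one that maximises F0.5. The function names are illustrative, not the paper's code.

```python
def f_beta(precision, recall, beta=0.5):
    """F-beta score; beta=0.5 weights precision twice as heavily as recall."""
    if precision == 0.0 and recall == 0.0:
        return 0.0
    b2 = beta * beta
    return (1 + b2) * precision * recall / (b2 * precision + recall)

def best_threshold(posteriors, labels, beta=0.5):
    """Scan candidate thresholds over the ranked posteriors and return
    the cutoff (and score) maximising F-beta."""
    best_t, best_f = 1.0, -1.0
    for t in sorted(set(posteriors)):
        preds = [p >= t for p in posteriors]
        tp = sum(1 for pr, y in zip(preds, labels) if pr and y)
        fp = sum(1 for pr, y in zip(preds, labels) if pr and not y)
        fn = sum(1 for pr, y in zip(preds, labels) if not pr and y)
        prec = tp / (tp + fp) if tp + fp else 0.0
        rec = tp / (tp + fn) if tp + fn else 0.0
        f = f_beta(prec, rec, beta)
        if f > best_f:
            best_t, best_f = t, f
    return best_t, best_f
```

Swapping beta changes the stated utility function: a triage application that fears misses more than false alarms would scan with beta > 1 instead.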

Why selectivity beats brute force in quant VC inference

A second contribution of RRM is the compute-aware cascade. In a standard LLM classifier, every founder incurs the same inference cost, regardless of how obvious the case is. RRM instead handles the majority of founders with a lightweight rules-first screen and reserves the expensive harsh-evaluation stage for borderline cases whose stage-one confidence falls below 95%.

This matters for quant VC firms operationally. A fund evaluating thousands of founders per year cannot afford to run a full reasoning cascade on every profile, and most founders are clear negatives that can be resolved with a short rule check. In the paper's evaluation, only 698 of 8,000 founders (8.7%) required the expensive Stage 2 harsh re-evaluation. The remaining 91.3% were resolved by the rules-first pass. This is an order-of-magnitude compute reduction with no loss of precision at the chosen operating point.

What makes RRM auditable for quant VC decisions

Every RRM prediction rests on three components, all of which a quant VC partner can inspect and edit.

The rule policy. The mined IF-THEN predicates are human-readable and expressed in terms of the founder's personal attributes. An expert can remove any rule that seems implausible, biased, or unaligned with the firm's investment criteria, and the downstream calibration and fusion steps operate only on the vetted rule set. No opaque proxy can be reintroduced by the statistical layer.

The calibrated probability. Because RRM outputs calibrated posteriors rather than raw LLM confidence, the firm can set a meaningful operating threshold and report calibration curves to LPs. A 25% posterior means a 25% posterior, and a reliability diagram can be produced to confirm it.

The fusion logic. The log-odds fusion step is an additive sum under conditional independence, or a learned stacked combiner when stages are correlated. Either way, the contribution of each stage is explicit and editable. A partnership can inspect how much of a classification is coming from rule-based evidence versus second-stage harsh evaluation, and adjust the weights if the mixture is wrong.

This is quant VC infrastructure that remains legible to non-technical experts. That legibility is the condition Vela's research program treats as non-negotiable.
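The learned branch of the fusion logic amounts to a small logistic regression over the stage log-odds. A minimal sketch with plain gradient descent follows; the learning rate, epoch count, and function name are assumptions, not the paper's configuration.

```python
import math

def fit_meta_combiner(stage_logits, labels, lr=0.1, epochs=2000):
    """Stacked logistic combiner: learn per-stage weights (and a bias)
    over the stage log-odds for when the conditional-independence
    assumption fails. Plain batch gradient descent on log-loss."""
    n_stages = len(stage_logits[0])
    w = [0.0] * n_stages
    b = 0.0
    for _ in range(epochs):
        gw = [0.0] * n_stages
        gb = 0.0
        for x, y in zip(stage_logits, labels):
            z = b + sum(wi * xi for wi, xi in zip(w, x))
            p = 1.0 / (1.0 + math.exp(-z))
            err = p - y                      # gradient of log-loss wrt z
            for i in range(n_stages):
                gw[i] += err * x[i]
            gb += err
        w = [wi - lr * gi / len(labels) for wi, gi in zip(w, gw)]
        b -= lr * gb / len(labels)
    return w, b
```

The fitted weights are exactly the editable quantities described above: a partnership can read off how much each stage contributes and override the mixture if it looks wrong.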

How RRM fits into Vela's quant VC research program

RRM is one of four core architectures in Vela's Think-Reason-Learn research program. Each attacks the quant VC prediction problem from a different methodological angle:

  • GPTree, the foundational paper, which introduced LLM-powered decision trees and established that LLMs could generate the structure of a decision system.
  • Random Rule Forest (RRF), which treats LLMs as feature generators and aggregates their YES/NO outputs through a deliberately simple unit-weight vote.
  • Reasoned Rule Mining (RRM), this paper, which adds Bayesian calibration, log-odds fusion, and selective cascade inference.
  • Policy Induction, which moves the decision logic into editable natural-language policies embedded in prompts.

RRM is the most mathematically principled of the four. It is the paper that answers the question: what would it take to turn an LLM into a quant VC classifier whose outputs you could actually calibrate, defend, and stake LP capital on? The answer: treat it as a calibration problem, do the rule mining carefully, build a cascade, fuse on the odds scale, and optimize the threshold for your actual utility function.

All four architectures are implemented as modules inside Think-Reason-Learn, the open-source framework Vela built to generalize these methods beyond venture capital.

Limitations

The paper identifies three limitations.

Monotonic calibration assumptions. The Platt scaling step assumes a sigmoidal, monotone link between the raw score and the true log-odds. When the true relationship departs from that shape, isotonic regression (which fits an arbitrary monotone map) or constrained splines (which can also accommodate non-monotone links) would be better suited.
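For reference, isotonic regression can be fitted with the pool-adjacent-violators algorithm in a few lines of pure Python; this sketch is illustrative, not the paper's code.

```python
def isotonic_fit(scores, labels):
    """Pool-adjacent-violators: fit a non-decreasing step function
    mapping raw score to calibrated probability (a nonparametric
    alternative to Platt's sigmoid)."""
    pairs = sorted(zip(scores, labels))
    # Each block: [weight, fitted value, leftmost score in the block].
    merged = []
    for s, y in pairs:
        merged.append([1.0, float(y), s])
        # Pool adjacent blocks while monotonicity is violated.
        while len(merged) > 1 and merged[-2][1] > merged[-1][1]:
            w2, m2, _ = merged.pop()
            w1, m1, s1 = merged.pop()
            merged.append([w1 + w2, (w1 * m1 + w2 * m2) / (w1 + w2), s1])
    def predict(score):
        # Step function: value of the last block whose left edge <= score.
        value = merged[0][1]
        for _, m, left in merged:
            if score >= left:
                value = m
        return value
    return predict
```

Because the fitted map is a step function rather than a parametric sigmoid, it tracks whatever monotone shape the score-to-outcome relationship actually has, at the cost of needing more calibration data.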

Stage dependence. The additive log-odds decomposition is exact only when the two stage scores are conditionally independent given the class. When they are correlated, the meta-logistic combiner compensates, but the correlation structure must be monitored.

Scope. The framework is binary and single-period. Multiclass outcomes and temporal prediction horizons are identified as promising but unaddressed directions.

Read the paper

Reasoned Rule Mining: Precision-Optimised Weighted LogProb Classification.
Jack Preuveneers, Yigit Ihlamur.
IEEE CSCloud / SecureFinAI, 2025 (shorter version): ieeexplore.ieee.org/document/11261481.
Extended version: Commendation Award, ICIM 2026, Oxford (in press).

RRM is part of Vela's quant VC research program, Think-Reason-Learn. Related papers: GPTree, Random Rule Forest, Policy Induction.

Authored by members of the Vela team. See the full roster of contributors.

For research collaboration in quant VC, LLM calibration, interpretable classification, or selective inference, email engage@vela.partners.
