Venture Capital Risk Scoring

Predict startup investment risk scores using panel data with 20 startups tracked over 18 time periods.

Dataset Exploration

The Venture Capital dataset contains investment data and startup performance metrics over time. Features include burn rate, revenue growth, customer churn, founder experience, and industry risk level.

Panel data structure: 20 startups tracked over 18 time periods (360 total rows, 9 columns).

Fit Data (first 5 rows)

TimeIndividualBurnrateRevenue GrowthCustomer ChurnBurnrate×Rev.GrowthFounder Exp.Industry RiskRisk Score
113522.059771.757124.42
213927.12131057.687124.63
314125.14171030.747132.27
414412.5119550.447136.07
514523.09211039.057131.46

Forecast Data (first 3 rows)

The forecast file covers future periods without the Risk Score target column.

TimeIndividualBurnrateRevenue GrowthCustomer ChurnBurnrate×Rev.GrowthFounder Exp.Industry Risk
1912721.1413570.78103
2012810.8814304.64103
2113214.6718469.44103

Download sample datasets from GitHub

Code Walkthrough

Step 1: Initialize

Python
import pandas as pd
from datfid import DATFIDClient

client = DATFIDClient(token="your_DATFID_token")

Step 2: Fit the Model

Python
url_fit = "https://raw.githubusercontent.com/datfid-valeriidashuk/sample-datasets/main/Venture_Capital.xlsx"
df = pd.read_excel(url_fit)

result = client.fit_model(
    df=df,
    id_col="Individual",
    time_col="Time",
    y="Risk Score",
    current_features="all",
    filter_by_significance=True
)

Step 3: Forecast

Python
url_forecast = "https://raw.githubusercontent.com/datfid-valeriidashuk/sample-datasets/main/Venture_Capital_forecast.xlsx"
df_forecast = pd.read_excel(url_forecast)

forecast = client.forecast_model(df_forecast=df_forecast)

Analysis Results (Model Fit)

Formula

Risk Score ~ α1*Intercept + α2*Industry Risk Level + α3*Founder Experience + β1*Burnrate + β2*Revenue Growth + β3*Customer Churn

Alpha Estimates (Time-Invariant)

VariableEstimateT-statInterpretation
Intercept12.415.5Baseline risk score before any company- or founder-specific factor (~12.4).
Industry Risk Level+0.1810.992Industry risk shifts the score by ~+0.18 per step, but the effect is not statistically significant once founder and financial drivers are in.
Founder Experience-0.2422.8Each additional unit of founder experience trims the risk score by ~0.24.

Beta Estimates (Time-Varying)

VariableEstimateT-statInterpretation
Burnrate+0.4615.7Each unit of burnrate raises the risk score by ~0.46 — the dominant time-varying risk driver.
Revenue Growth-0.35214.6Each unit of revenue growth lowers the risk score by ~0.35.
Customer Churn+0.4225.2Each unit of customer churn raises the risk score by ~0.42.

Model Performance

0.847
R² Overall
0.944
R² Between
0.833
R² Within
3.9
MAE
22.9
MSE

Try it yourself: Select "Venture Capital" in the Free Playground.